...

SQL for Marketers: Dominate Data Analytics, Data Science, and Big Data

This is an announcement along with free and discount coupons for my new course, SQL for Marketers: Dominate data analytics, data science, and big...

...

Why databases?

This article has been a long time coming. I wrote a shitty version years ago, but wanted to update it with new and current info, in a more organized...

...

Logistic Regression in Python video course

Hi all! Do you ever get tired of reading walls of text, and just want a nice video or 10 to explain to you the magic of logistic regression and...

...

How to run distributed machine learning jobs using Apache Spark and EC2 (and Python)

This is the age of big data. Sometimes scikit-learn doesn’t cut it. In order to make your operations and data-driven decisions scalable -...

...

Bayes classifier and Naive Bayes tutorial (using the MNIST dataset)

The Naive Bayes classifier is a simple classifier that is often used as a baseline for comparison with more complex classifiers. It is also...

...

Tutorial: How to use Linux Screen

A problem that often arises when you’re dealing with lots of data is that it takes forever to process. So you SSH into your Amazon EC2 machine,...