...

How to Install SQLite on Windows: A Visual Walkthrough

Quite a few students in my SQL course have asked me about how to install SQLite on Windows. On the surface, it seems pretty simple: download some...

...

ALL Courses on Udemy $10 Again!

Big Surprise! Earlier this week, I mentioned Udemy was doing a promotion on Tech courses only (if you were signed up for my newsletter you would...

...

SQL for Marketers: Dominate Data Analytics, Data Science, and Big Data

This is an annoucement along with free and discount coupons for my new course, SQL for Marketers: Dominate data analytics, data science, and big...

...

Why databases?

This article has been a long time coming. I wrote a shitty version years ago, but wanted to update it with new and current info, in a more organized...

...

How to run distributed machine learning jobs using Apache Spark and EC2 (and Python)

This is the age of big data. Sometimes sci-kit learn doesn’t cut it. In order to make your operations and data-driven decisions scalable -...

...

Automation: For loops in bash (for loops on the command line)

If you have to run a script that processes data for a particular file for a particular day, i.e. your file is on hadoop with the date in the path,...

...

Pig Error -> ERROR 1070: Could not resolve count using imports

Or: ERROR 1070: Could not resolve sum using imports COUNT() and SUM() are case sensitive, you need to capitalize...

...

How to kill a hadoop job

Hadoop job -list List running hadoop jobs and their job ids. Hadoop job –kill...