Talk @ PyData NYC 2015: Querying 1.6 billion reddit comments with python

I had the luck to go to beautiful NYC in the fall to give a talk at PyData NYC 2015.

The talk was about how to query around 1.6 billion reddit comments with Python while leveraging big data tools like Impala and Hive.

Some of the content can also be found on the Continuum developer blog.

Below you can find the video and slides of the presentation.

PS: There are a couple of good jokes at 35:05, if you like bad jokes.