Presented at PyData Silicon Valley 2014 while at ZEFR. The talk walked through building a scalable data collection framework in Python — from a simple script to a distributed system using Elasticsearch, Redis queues, and Heroku workers.

Watch the talk