Hadoop, MapReduce and processing large Twitter datasets for fun and profit

I’m back in school for my PhD and getting ready to conduct research on a HUGE Twitter dataset on the 2012 US presidential election, collected by the Social Media and Democracy research team at UW-Madison. We’ve been brushing up on Python, Hadoop and MapReduce. As part of our training, Alex Hanna, a sociology PhD student at UW-Madison, put together an excellent series of workshops on Twitter (or, as he’s aptly named them, “Tworkshops”) to get us started. Check them out!
