Category Archives: Hadoop and related

Big Data processing with Scalding on Amazon EMR

Published at: https://blog.softwaremill.com/big-data-processing-with-scalding-on-amazon-emr-707c94dd56e1


Big Data analysis with Hadoop, Pig and Stack Exchange data dump

I have finally finished my latest pet project. I was learning Pig and Hadoop, but most importantly I was trying to learn how to finish things. As with most of my pet projects, I start small, hoping I will finish it, get something useful and have something to write about, but usually I never stop […]
