Scalding: Powerful & Concise MapReduce Programming



In this presentation to the San Francisco Scala User Group on April 19, 2012, Dr. Oscar Boykin and Dr. Argyris Zymnis from Twitter offer some insight into the Scalding DSL and walk through example jobs for common use cases.
 
Twitter uses Scalding for data analysis and machine learning, particularly in cases that need more than SQL-like queries on the logs, such as fitting models and matrix processing. It scales beautifully from simple, grep-like jobs all the way up to jobs with hundreds of map-reduce pairs.
 
 
Familiar with Java, but new to Scala? Check out this presentation from Dan Rosen for a brief introduction and roadmap to the Scala universe.
Published May 10, 2012