Scalding is a Scala DSL built on Cascading that simplifies writing Hadoop MapReduce jobs. It lets users describe data transformations using Scala’s functional abstractions, while abstracting away low-level Hadoop boilerplate. It enables expressive and testable pipeline definitions and integrates with various input/output formats.
Features
- High-level Scala DSL for Hadoop MapReduce using Cascading
- Works with TextLine, Tsv, Json, JDBC, and custom sources
- Supports grouping, joins, aggregations, and typed case class pipelines
- Easily testable locally or on Hadoop clusters
- Compatible with sbt, Maven, Gradle, Mill, Ammonite, and Scala CLI
- Allows parameterized jobs and configurable reducers/settings
Categories
FrameworksLicense
Apache License V2.0Follow Scalding
You Might Also Like
Gen AI apps are built with MongoDB Atlas
MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Scalding!