Cascalog is a powerful Clojure (and Java) data processing and querying library built atop Hadoop (via Cascading), providing a high-level, Datalog-inspired abstraction for both big data processing and local computation. Cascalog is hosted at Clojars, and some of its dependencies are hosted at Conjars. Both Clo/Con-jars are maven repos that's easy to use with maven or leiningen. The Cascalog website contains more information and links to Various articles and tutorials. The best way to get started with Cascalog is experiment with the toy datasets that ship with the project.
Features
- Expressive, Datalog-like query language that runs on Hadoop or locally
- Simplified abstraction over Cascading to avoid low-level Hadoop complexity
- Seamless handling of distributed Big Data workflows
- Pure Java API (JCascalog) available for Java integration and experimentation
- Useful for prototyping data flows that scale from local tests to production clusters
- Draws inspiration from existing tools like Pig, Hive, and Cascading while providing richer abstraction
Categories
Data ManagementLicense
MIT LicenseFollow Cascalog
Other Useful Business Software
Houzz Pro is the #1 business management software for home construction and design professionals.
Get an all-in-one solution that spans the full customer lifecycle, including marketing, CRM, estimation & proposal building, project management, a 3D Floor Plan builder, an online invoicing and payment portal, as well as a client portal and collaboration tools. Start a free trial today to see why thousands of Pros run their business on Houzz Pro. Plans available for all business sizes.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Cascalog!