Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.

Project Activity

See All Activity >

License

GNU Library or Lesser General Public License version 2.0 (LGPLv2)

Follow Crawl-By-Example (Heritrix plugin)

Crawl-By-Example (Heritrix plugin) Web Site

Other Useful Business Software
Get full visibility and control over your tasks and projects with Wrike. Icon
Get full visibility and control over your tasks and projects with Wrike.

A cloud-based collaboration, work management, and project management software

Wrike offers world-class features that empower cross-functional, distributed, or growing teams take their projects from the initial request stage all the way to tracking work progress and reporting results.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Crawl-By-Example (Heritrix plugin)!

Additional Project Details

Languages

English

Intended Audience

Advanced End Users, Developers, Science/Research

User Interface

Web-based

Programming Language

Java

Related Categories

Java Search Engines, Java Information Analysis Software

Registered

2007-02-12