
We've compiled a list of 8 free and paid alternatives to Heritrix. The primary competitors are Algolia and Mixnode. Users also compare Heritrix with Expertrec Search Engine, Apache Nutch, and StormCrawler.


Algolia
Free Subscription

Algolia is an AI search and discovery platform that integrates natural language processing and vector search through a single API.

Apache Nutch
Free Open Source

Apache Nutch is a highly extensible and scalable open source web crawler software project.

StormCrawler
Free Open Source

StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.

Apisearch
Open Source

Search over millions of documents, and give to your users unique, amazing and unforgettable...

Heritrix Platforms

Windows
Linux
Mac

Heritrix Overview

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Heritrix (sometimes spelled heretrix, or misspelled or mis-said as heratrix/heritix/heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.

Heritrix Tags

web-data-crawling web-crawling web-crawler
