finds you similar websites
auto-suggest    top sites

Nov 22nd, 2024

17 Popular Sites Like Archive Crawler

Our technology has scanned through the web and turned up several first-class crawler and java sites like Archive Crawler. So come and explore more sites that are alternatives to Archive Crawler.

Displaying 1 to 10 of 500 alternatives to Archive Crawler. (Updated: Nov 22nd, 2024)     [about these results]
Advanced Options
? Sort by:
popularity similarity
? Must Include:
? Cannot Include:
? Look For


Sponsored Links
 
You're looking for other sites like Archive Crawler:
  Heritrix - Home Page
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival ... Since our crawler seeks to collect and preserve the digital artifacts ...
http://crawler.archive.org/
popularity:
crawler
java
opensource
spider
web
search
software
programming
heritrix
webcrawler
new search by a custom tag signature
  80legs
No information avaiable
similarity:
popularity:
crawler
search
spider
web
cloud
tools
distributed
api
startup
software
  Welcome to Lucene!
Jakarta Lucene is a full-featured text search engine written entirely in Java, and it is an open ... See http://lucene.apache.org/openrelevance for more info ...
similarity:
popularity:
search
java
lucene
apache
opensource
software
programming
development
searchengine
web
  Web-Harvest Project Home Page
First of all, now it supports plugin development and has several new processors: ... GUI brings better editing, simple debugging and bug fixes. Check the ...
similarity:
popularity:
java
web
opensource
datamining
scraping
tools
extraction
programming
crawler
html
  Web Crawler, spider, ant, bot... how to make one?
No information avaiable
similarity:
popularity:
spider
crawler
programming
howto
web
webcrawler
asp.net
search
projects
webdev
  Open Source Software in Java(tm)
A directory of open source software focused on java. ... See all Open Source Security & Cryptography Tools in Java. Source Control Tools in Java ...
similarity:
popularity:
java
opensource
software
programming
development
tools
open-source
source
reference
library
  The Open Software Wiki - SWiK
No information avaiable
similarity:
popularity:
opensource
wiki
software
programming
search
web
ajax
development
web2.0
community
  Grub's Distributed Web Crawling Project
Open source, distributed Internet crawler. ... Changed to new grub.org dispatcher address. Fixed compressed responce data bug ...
similarity:
popularity:
search
opensource
web
distributed
crawling
software
internet
crawler
searchengine
grub
  Apache Lucene - Overview
26 February 2010 - Lucene Java 3.0.1 and 2.9.2 available. 25 November ... 6 November 2009 - Lucene Java 2.9.1 available. 07 Oct. 2009 - Lucene at US ApacheCon ...
similarity:
popularity:
search
java
lucene
apache
opensource
programming
engine
software
searchengine
development
  Hyper Estraier: a full-text search system for communities
Hyper Estraier is a full-text search system. You can search lots of ... If you run a web site, it is useful as your own search engine for pages in your site. ...
similarity:
popularity:
search
searchengine
opensource
ruby
software
programming
fulltext
java
web
tools
  OpenSymphony - Welcome To OpenSymphony
Download: http://www.opensymphony.com/sitemesh/download.action. Changes: http://jira.opensymphony.com/secure/IssueNavigator.jspa?reset=true&pid=1000 0&fixfor=21683 ...
similarity:
popularity:
java
opensource
framework
j2ee
programming
development
software
tools
components
web
1 2 3 4 5 ... 50 next >
Sorting Results
  • This slider determines how the matched sites are sorted.
  • If you want to see the most popular sites that are somewhat related to your search, slide this more towards "popularity."
  • If you want to see the sites that best matched your search, regardless of popularity, slide this towards "similarity."
Must Include Tags
  • Matched sites will not be shown unless they have all of the tags on this list.
  • This feature is useful for when you require a site to have been tagged as something.
  • To add a tag to this list, click "add tag" or click on any tag in a result.
Must Not Include Tags
  • Matched sites that have any tag on this list will not be shown.
  • This feature is useful for filtering out results that have tags you are absolutely not interested in.
  • To add a tag to this list, click "add tag" or click on any tag in a result.
Types of Results
  • This option lets you specify the types of sites to show.
  • If you want to only see domains (www..com), select "domains only."
  • If you want to only see articles (www..com/something/here), select "articles only."
  • If you don't care, or care so much about both, select "Both".
About The Results
an example search result
How moreofit Searches
Each website has a unique tag signature -- a set of words that users have described the website as. Moreofit searches for websites that have similar tag signatures and displays the results.
1: Similarity
A site's "similarity" is determined by how well its tag signature matches the tag signature that is being searched for. A 100% match means that it has the exact same tags in the exact same order, while a 0% match means it has no tags in common.
2: Popularity
The popularity of a website is, well, pretty much self explanatory.
3: Tag Signature
The tag signatures show how a site is described. The deeper the color of the tag, the more frequently the website is tagged as this. Tags underlined blue denote a tag that is in common with the search's tag signature.