Crawlers
Heritrix
Other Resources
A stop list (also known as a list of stop words)
IR resources (Mark Sanderson)
Cross-language information retrieval (CLIR)
WebIR
Search Engine Watch
Open Directory: Information Retrieval
Chris Manning's NLP resources