Please feel free to contribute by suggesting new tools or by pointing out mistakes in the data.
Tool | Description | Categories | Platform | Pricing |
---|---|---|---|---|
BootCat | Tool for crawling and compiling data from the web with a list of seed words. | crawler, compilation | ||
ICEweb | A tool for compiling, downloading, and analyzing web corpora in accordance with the ICE | ICE, compilation, crawler | Windows | Free |
SpiderLing | Software for obtaining text from the web useful for building text corpora | crawler | Free |