Information harvesting in MoK: information retrieval using web crawlers
Authors
- Giulio Crestani
- Gianluca Spadazzi
References
- Terrier IR platform
- crawler4j web crawler
- Apache Nutch scalable web crawler
- RSS-based sources of knowledge in MoK-News
Material
- repo — Maven module 'retrieval'