- Courses
- Sistemi distribuiti 2014/2015
- Projects
- Information harvesting in MoK: information retrieval using web crawlers
Information harvesting in MoK: information retrieval using web crawlers
Information harvesting in MoK: information retrieval using web crawlers
Authors
- Giulio Crestani
- Gianluca Spadazzi
References
- Terrier IR platform
- crawler4j web crawler
- Apache Nutch scalable web crawler
- RSS-based sources of knowledge in MoK-News
Outcomes
- repo — Maven module 'retrieval'