Scrapinghub in GSoC 2016
Scrapinghub is applying!
Scrapinghub is a company focused on information retrieval and the subsequent processing of that information, and is deeply involved in developing and contributing to Open Source projects for web crawling and data processing.
This year we are applying with four of our best-known projects: Scrapy, Portia, Splash, and Frontera. You can learn more about these projects in their respective repositories: https://github.com/scrapy/scrapy, https://github.com/scrapinghub/portia, https://github.com/scrapinghub/splash, and https://github.com/scrapinghub/frontera
Frontera is a web crawling framework consisting of a crawl frontier and distribution/scaling primitives, which together let you build a large-scale online web crawler. Check the Frontera ideas.