Portia uses https://github.com/scrapy/scrapely library for data extraction. It d...

		kmike84 on April 1, 2014 \| parent \| context \| favorite \| on: Portia, an open-source visual web scraper Portia uses https://github.com/scrapy/scrapely library for data extraction. It doesn't use XPaths for learning. There are some links to papers in scrapely README; scrapely is largely based on ideas from these papers, but there are many improvements. In short - yes, this is taken care of.