Scrapy project
Repositories
-
scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
-
parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
-
xtractmime
https://mimesniff.spec.whatwg.org/ implementation for Python
-
itemadapter
Common interface for data container classes
-
-
queuelib
Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python
-
itemloaders
Library to populate items using XPath and CSS with a convenient API
-
scrapyd-client
Command line client for Scrapyd server
-
scrapyd
A service daemon to run Scrapy spiders
-
scrapy-bench
A CLI for benchmarking Scrapy.
-
loginform
Fill HTML login forms automatically
-
protego
A pure-Python robots.txt parser with support for modern conventions.
-
-
quotesbot
This is a sample Scrapy project for educational purposes
-
scrapy-itemloader Archived
[Archived] Library to populate Scrapy items using XPath and CSS with a convenient API
-
scrapely
A pure-python HTML screen-scraping library
-
url-chromium
url component from Chromium source code, forked from https://chromium.googlesource.com/chromium/src/url
-
base-chromium
base component forked from Chromium source https://chromium.googlesource.com/chromium/src/base/
-
dirbot
Scrapy project to scrape public web directories (educational) [DEPRECATED]
-
scrapy-bench-speedcenter
Forked from Parth-Vader/scrapy-bench-speedcenterCodespeed for scrapy-bench
-
pypydispatcher
A fork of http://pydispatcher.sourceforge.net/ with PyPy support
-
gsoc2014-integration-tests
GSoC2014 - Scrapy Integration tests project