DotnetCrawler is a straightforward, lightweight web crawling/scraping library built on .NET Core, with Entity Framework Core output. It is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible to fit your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

crawler csharp dotnetcore scraping crawling webscraper scrapy entity-framework-core webcrawler webscraping scrapy-crawler ddd-architecture htmlagilitypack webcrawling webcrawler-htmlagilitypack

Updated Nov 13, 2019
C#

DwarfThief / Raspagem-de-dados-para-iniciantes

Data scraping for beginners using Scrapy and other basic libraries

python opensource web-crawler jupyter-notebook scrapy spyder estudo datascraping webcrawling raspagem-de-dados

Updated Feb 8, 2022
Python

DedSecInside / gotor

This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.

go cli golang osint command-line service rest-api tor information-extraction http-server command-line-tool webcrawler webscraping hacktoberfest golang-server webcrawling torbot osint-tools

Updated Oct 18, 2021
Go

datawizard1337 / ARGUS

ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

python scraping crawling scrapy webscraping scrapyd webcrawling

Updated Jan 13, 2022
Python

andersonkrs / malheatmap

An extension for tracking your activities on myanimelist.net

ruby rails myanimelist webcrawling

Updated Jul 6, 2022
Ruby

kafagy / fifa-FUT-Data

Web-scraping script that writes the data of all players from FutHead and FutBin to a CSV file or a DB

mysql python csv database video-game soccer dataset webscraping fifa fifa-ultimate-team webcrawling fifa18 futhead fifa19 futbin-prices futbin player-data

Updated Nov 26, 2019
Python

flickz / newspaperjs

News extraction, scraping, and article parsing

nodejs crawler scraper news news-aggregator webscraping webcrawling

Updated Jun 4, 2022
HTML

Skumarr53 / Stock-Fundamental-data-scraping-and-analysis

A project that builds a web crawler to collect stock fundamentals and review their performance in one go

automation selenium python3 web-scraping webcrawling datacollection stock-fundamentalplots

Updated Mar 8, 2021
Jupyter Notebook

rootVIII / proxy_web_crawler

Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords

bot ssl firefox scraper webdriver regex selenium proxies python3 urls selenium-webdriver geckodriver scraping-websites ssl-proxy webcrawling python-selenium

Updated Jun 9, 2020
Python

lorien / ioweb

Web Scraping Framework

crawler framework scraping web-scraping webscraping web-crawling webcrawling webmining web-scraping-python

Updated Mar 1, 2022
Python

zcrawl / zcrawl

An open source web crawling platform

golang scraping crawling crawlers web-crawling webcrawling

Updated May 6, 2018
Go

Galarzaa90 / tibia.py

API to parse tibia.com content into python objects.

python python3 beautifulsoup tibia webcrawling crawling-python

Updated Mar 1, 2022
Python

crawler-commons / url-frontier

API definition, resources and reference implementation of URL Frontiers

grpc webcrawling web-crawlers url-frontier

Updated Jul 4, 2022
Java

science-math-guy / Ultimate-Guide-to-Sneaker-Bot-Creation

The Ultimate Guide to Sneaker Bot 🤖 Creation using JavaScript and Node.js ☣️. Learn how to get the most out of tools like the Chrome DevTools and JS libraries like Puppeteer or Axios.

nodejs javascript bot node webdriver bots bot-framework bot-api requests axios auto webscraping sneakers sneakerbot webcrawling puppeteer sneakermonitor playwright

Updated May 10, 2021

kkyon / inparse

Open, collaborative, AI-driven parser builder for web scraping, data extraction, crawling, and knowledge graphs

python data-structures knowledge-graph data-extraction webscraping webcrawling

Updated Mar 20, 2019
Python

adbar / courlan

Clean, filter and sample URLs to optimize data collection – includes spam, content type and language filters

url crawler validation url-parsing cleaner preprocessing url-manipulation webcrawling

Updated Jul 4, 2022
Python

colmex / frontera_example

Example frontera project

python example webcrawling frontera

Updated Aug 10, 2017
Python

dhyeythumar / Search-Engine

Application made with Node.js and Python.

python nltk node-js express-js express-session natural webspider lemmatization textblob mysql2 webcrawling beautifulsoup4

Updated Mar 27, 2021
HTML

joao2391 / DotNetExpose

A package that helps you scrape web pages. It shows you a lot of information about the page.

c-sharp dotnetcore webscraper webcrawler webscraping c-sharp-library webcrawling dotnet5

Updated Jun 23, 2022
C#

sunil-sandhu / scrawly

Package wrapper around Node.js and Puppeteer for web crawling/scraping. Originally put together to accompany an article that can be found here: https://sunilsandhu.com/posts/how-to-scrape-data-from-a-website-with-javascript

web-scraping webscraping web-crawling webcrawling puppeteer

Updated Jul 16, 2021
JavaScript

mincloud1501 / Python

Time-series data analysis and crawling techniques using Jupyter Notebook, plus implementation and study of visualization techniques using D3

python jupyter-notebook d3js webcrawling deck-gl pycharm-edu pydeck

Updated Feb 14, 2020
Jupyter Notebook

michaelradu / web-crawler

A Web Crawler developed in Python.

python crawler web script scripting web-crawler scripts python-script scripting-language python3 python-3 crawlers webcrawler web-crawling web-crawler-python webcrawling webcrawl crawler-python web-crawlers

Updated Aug 17, 2020
Python

yashrajkakkad / AUtomate

Scrapes attendance and marks data from AURIS (Ahmedabad University Resource Information System) and notifies users so that they do not have to check their data repeatedly

python selenium chromedriver selenium-webdriver webscraping hacktoberfest webcrawling

Updated Jul 6, 2022
Python

cjf8899 / WebCrawler_exe

👻 Web crawling and conversion to an executable with PyInstaller

python webcrawler pyinstaller webcrawling

Updated Oct 20, 2021
Python

webcrawling

Here are 195 public repositories matching this topic...

internetarchive / heritrix3

scrapinghub / scrapyrt

DemonDamon / Listed-company-news-crawl-and-text-analysis

jaeksoft / opensearchserver

feddelegrand7 / ralger

voliveirajr / seleniumcrawler

mehmetozkaya / DotnetCrawler

