Here are
74 public repositories
matching this topic...
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Updated
Jul 19, 2021
Python
Collect and revisit web pages.
Updated
Jul 9, 2021
Python
Core Python Web Archiving Toolkit for replay and recording of web archives
Updated
Jul 19, 2021
Python
Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)
Updated
Sep 17, 2020
JavaScript
InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS
Updated
Jun 30, 2021
Python
Updated
Jul 19, 2021
JavaScript
A Tool To Push Web Resources Into Web Archives
Updated
Feb 14, 2021
Python
Archiveror will help you preserve the webpages you love. 💾
Updated
Oct 18, 2019
JavaScript
🐋 Web Archiving Integration Layer: One-Click User Instigated Preservation
Updated
Jul 19, 2021
Roff
Streaming WARC/ARC library for fast web archive IO
Updated
Nov 3, 2020
Python
Chrome extension to "Create WARC files from any webpage"
Updated
Jun 28, 2021
JavaScript
Social Feed Manager user interface application.
Updated
Jul 7, 2021
Python
Serverless Web Archive Replay directly in the browser
Updated
Jul 16, 2021
JavaScript
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Updated
May 6, 2021
Scala
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
Updated
Jun 25, 2021
JavaScript
🐋 One-Click User Instigated Preservation
Updated
Feb 3, 2019
JavaScript
Perpetual Access To The Scholarly Record
Updated
Jul 13, 2021
Python
A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine
Updated
Jul 3, 2021
Python
Recover lost websites from the Web Infrastructure
Updated
Feb 10, 2021
HTML
Parse And Create Web ARChive (WARC) files with node.js
Updated
Jun 4, 2021
JavaScript
A server to collect & archive websites, also supports video downloads
Updated
Jun 12, 2021
TypeScript
A Memento Aggregator CLI and Server in Go
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
Updated
Jul 19, 2021
JavaScript
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
Updated
Oct 19, 2020
JavaScript
Web archive index server based on RocksDB
CDXJ Indexing of WARC/ARCs
Updated
Jul 15, 2021
Python
A prototype server to swarm multiple DATs for Webrecorder
Updated
Apr 27, 2019
JavaScript
A PDF classifier ensemble with REST API service
Updated
Mar 5, 2021
Python
Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in
https://fatcat.wiki
Updated
Apr 20, 2021
HTML
Conifer setup and deployment via Ansible
Updated
Jun 15, 2020
Shell
Improve this page
Add a description, image, and links to the
web-archiving
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
web-archiving
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.