From 83d3540bf9ff39fdb5580eb36996686b6f9d408b Mon Sep 17 00:00:00 2001 From: nbats <44333466+nbats@users.noreply.github.com> Date: Fri, 3 May 2024 21:00:18 -0700 Subject: [PATCH] Updated Storage (markdown) --- Storage.md | 97 +++++++++++++++++++++++++++--------------------------- 1 file changed, 48 insertions(+), 49 deletions(-) diff --git a/Storage.md b/Storage.md index bb1db7c7..fdad11bd 100644 --- a/Storage.md +++ b/Storage.md @@ -115,6 +115,52 @@ *** +## Archiving + +### Archive Services + +* ⭐ **[Archive.org](https://archive.org/)** - Internet Archive +* ⭐ **[Wayback Machine](https://web.archive.org/)** or **[Archive.is](https://archive.is/)** / [.li](https://archive.li/) / [.ph](https://archive.ph/) / [.vn](https://archive.vn/) / [.fo](https://archive.fo/) / [.md](https://archive.md/) - Archive Web Pages +* ⭐ **Wayback Machine Tools** - [Downloader](https://github.com/jsvine/waybackpack) / [Browser Extension](https://github.com/internetarchive/wayback-machine-webextension), [2](https://vegetableman.github.io/vandal/) / [Script](https://github.com/overcast07/wayback-machine-spn-scripts) / [Auto Load](https://gitlab.com/gkrishnaks/WaybackEverywhere-Firefox) +* ⭐ **[Web Archives](https://github.com/dessant/web-archives)** or [Resurrect Pages Fork](https://github.com/Albirew/resurrect-pages-isup-edition) - Browser Extensions +* ⭐ **[CachedView](https://cachedview.nl/)** or [Quick Cache](https://cybdetective.com/quickcacheandarhivesearch.html) - Aggregate Cache Results +* [ArchiveTeam](https://wiki.archiveteam.org/index.php/Main_Page) - Archive Projects +* [Perma.cc](https://perma.cc/) - Create Permalinks + +### Web Archiving Tools + +* 🌐 **[Awesome Web Archiving](https://github.com/iipc/awesome-web-archiving)** - Web Archiving Tools +* 🌐 **[Webrecorder](https://webrecorder.net/)** - Open-source Archiving Tools +* ⭐ **[ArchiveBox](https://archivebox.io)** - Self-hosted Web Archiving +* ⭐ **[MarkDownload](https://github.com/deathau/markdownload)** - Download Web Pages as Markdown Files +* ⭐ **[HTTrack](https://www.httrack.com/)** / [Guide](https://rentry.co/cloneasite) - Website Downloader +* ⭐ **[datahoarder-website-to-markdown](https://github.com/evilsh3ll/datahoarder-website-to-markdown)** - Index to Markdown Tool +* [WAIL](https://matkelly.com/wail) / [GitHub](https://github.com/machawk1/wail) - GUI For Archiving Tools +* [ReplayWeb.page](https://replayweb.page/) - View Web Archive Files +* [ArchiveWeb.page](https://archiveweb.page/) - Browser Extension +* [WikiTeam](https://github.com/WikiTeam/wikiteam) - Archive Wikis +* [Wayback](https://github.com/wabarc/wayback) - Web Archiving Tool +* [DownloadNet](https://github.com/dosyago/DownloadNet) or [Kiwix](https://kiwix.org/en/) / [Wiki DL Guide](https://practicalbetterments.com/download-all-of-wikipedia-on-your-phone/) - Offline Website Readers +* [Wget2](https://gitlab.com/gnuwget/wget2) / [Commands](https://www.whatismybrowser.com/developers/tools/wget-wizard/), [SuckIT](https://github.com/skallwar/suckit), [Cyotek WebCopy](https://www.cyotek.com/cyotek-webcopy) or [Website Downloader](https://github.com/AhmadIbrahiim/Website-downloader) - Website Downloaders +* [Archivematica](https://www.archivematica.org/) - Digital Preservation System +* [wallabag](https://wallabag.org/) - Save Articles +* [CopySite](https://xdan.ru/copysite/) - Copy Websites +* [Scoop](https://github.com/harvard-lil/scoop) - Capture Engine + +### Web Scraping / Crawling + +* 🌐 **[Awesome Web Scraping](https://github.com/lorien/awesome-web-scraping)** - Web Scraping Tools +* 🌐 **[Awesome-crawler](https://github.com/BruceDone/awesome-crawler)** - Crawling Resources +* ⭐ **[Instant Data Scraper](https://chromewebstore.google.com/detail/instant-data-scraper/ofaokhiedipichpaobibbnahnkdoiiah)** - Browser Extension +* [Heritrix](https://heritrix.readthedocs.io/) / [GitHub](https://github.com/internetarchive/heritrix3) - Internet Archive's Web Crawler +* [80legs](https://80legs.com/) - Cloud-Based +* [Crawly](https://crawly.diffbot.com/) - Online Scraper +* [web.scraper.workers.dev](https://web.scraper.workers.dev/) - Web Scraper +* [grab-site](https://github.com/ArchiveTeam/grab-site) - ArchiveTeam Web Crawler +* [brozzler](https://github.com/internetarchive/brozzler) - Web Crawler + +*** + ## Browser eBook Readers * ⭐ **[Reader View](https://webextension.org/listing/chrome-reader-view.html)**, [2](https://mybrowseraddon.com/reader-view.html) @@ -330,6 +376,7 @@ * ⭐ **[Wolfram Alpha](https://www.wolframalpha.com/)** - Searchable Knowledgebase / [API Access](https://wolfreealpha.gitlab.io) * [EncycloReader](https://encycloreader.org/) - Encyclopedia Search * [Omniglot](https://www.omniglot.com/index.htm) - Writing Systems & Languages Encyclopedia +* [Archivy](https://github.com/archivy/archivy/) - Self-hosted Wiki [Britannica](https://www.britannica.com/),[EverybodyWiki](https://en.everybodywiki.com/), [Encyclopedia](https://www.encyclopedia.com/), [NewWorldEncyclopedia](https://www.newworldencyclopedia.org/), [Citizendium](https://citizendium.org/), [Wikitia](https://wikitia.com/), [Conze.pt](https://conze.pt/), [InfoPlease](https://www.infoplease.com/), [Refdesk](https://www.refdesk.com/factency.html) @@ -1151,54 +1198,6 @@ *** -## Web Archiving - -* 🌐 **[awesome-web-scraping](https://github.com/lorien/awesome-web-scraping)** / [2](https://github.com/iipc/awesome-web-archiving) / [3](https://github.com/BruceDone/awesome-crawler) -* ⭐ **[datahoarder-website-to-markdown](https://github.com/evilsh3ll/datahoarder-website-to-markdown)** - Index to Markdown Archiving Tool -* [webrecorder](https://webrecorder.net/) -* [Heritrix](https://heritrix.readthedocs.io/) / [GitHub](https://github.com/internetarchive/heritrix3) -* [wail](https://matkelly.com/wail) / [GitHub](https://github.com/machawk1/wail) -* [80legs](https://80legs.com/) -* [crawly](https://crawly.diffbot.com/) -* [replayweb](https://replayweb.page/) - View Archive Format Files - -### Archiving Services - -* ⭐ **[Wayback Machine](https://web.archive.org/)** -* ⭐ **Wayback Machine Tools** - [ArchiveTeam Contribute](https://tracker.archiveteam.org/) / [Downloader](https://github.com/hartator/wayback-machine-downloader), [2](https://github.com/jsvine/waybackpack) / [Classic Frontend](https://wayback-classic.net/) / [Extension](https://github.com/internetarchive/wayback-machine-webextension), [2](https://vegetableman.github.io/vandal/) / [Addon](https://www.reddit.com/r/FREEMEDIAHECKYEAH/wiki/storage#wiki_wayback_machine_extension) / [Script](https://github.com/overcast07/wayback-machine-spn-scripts) / [Toolkit](https://docs.wabarc.eu.org/) / [Multi-URL](https://liamswayne.github.io/Super-Archiver/) / [Auto Load](https://gitlab.com/gkrishnaks/WaybackEverywhere-Firefox) -* ⭐ **[Archive.is](https://archive.is/)** / [.li](https://archive.li/) / [.ph](https://archive.ph/) / [.vn](https://archive.vn/) / [.fo](https://archive.fo/) / [.md](https://archive.md/) -* ⭐ **[cachedview](https://cachedview.nl/)**, **[Web Archives](https://github.com/dessant/web-archives)**, [quickcache](https://cipher387.github.io/quickcacheandarchivesearch/), [resurrect-pages](https://github.com/Albirew/resurrect-pages-isup-edition) - Aggregate Cache Results -* [Perma.cc](https://perma.cc/) -* [archiveforever](https://www.archiveforever.xyz/) -* [ghostarchive](https://ghostarchive.org/) -* [hozon](https://hozon.site/) -* [Arquivo.pt](https://arquivo.pt/?l=en) - -### Local Archiving - -* ⭐ **[ArchiveBox](https://archivebox.io)** -* ⭐ **[HTTrack](https://www.httrack.com/)** / [Guide](https://rentry.co/cloneasite) -* ⭐ **[MarkDownload](https://github.com/deathau/markdownload)** - Get Markdown of a page -* ⭐ **[Instant Data](https://chromewebstore.google.com/detail/instant-data-scraper/ofaokhiedipichpaobibbnahnkdoiiah)** -* [Kiwix](https://kiwix.org/en/) / [Wiki DL Guide](https://practicalbetterments.com/download-all-of-wikipedia-on-your-phone/) -* [cyotek-webcopy](https://www.cyotek.com/cyotek-webcopy) -* [Website-downloader](https://github.com/AhmadIbrahiim/Website-downloader) -* [archiveweb](https://archiveweb.page/) -* [archivematica](https://www.archivematica.org/) -* [suckit](https://github.com/skallwar/suckit) -* [DownloadNet](https://github.com/dosyago/DownloadNet) -* [wget2](https://gitlab.com/gnuwget/wget2) / [Commands](https://www.whatismybrowser.com/developers/tools/wget-wizard/) -* [archivy](https://github.com/archivy/archivy/) -* [web.scraper](https://web.scraper.workers.dev/) -* [WikiTeam](https://github.com/WikiTeam/wikiteam) -* [grab-site](https://github.com/ArchiveTeam/grab-site) -* [wallabag](https://github.com/wallabag/docker) -* [brozzler](https://github.com/internetarchive/brozzler) -* [Scoop](https://github.com/harvard-lil/scoop) -* [CopySite](https://xdan.ru/copysite/) - -*** - ## WordPress Themes [gpldl](https://gpldl.com/), [wplocker](https://www.wplocker.com/), [Weadown](https://weadown.com/), [crackthemes](https://www.crackthemes.com/), [Mega Drive](https://rentry.co/FMHYBase64#wordpress-themes), [babiato](https://babia.to/), [newtemplate](https://newtemplate.net/), [justfreewpthemes](https://justfreewpthemes.com/), [themesplugins](https://themesplugins.club/), [wpthemesandplugins](https://t.me/wpthemesandplugins) @@ -1215,4 +1214,4 @@ * [Video Dictionary](https://videodictionary.kwebpia.net/?m=Full_Movies) * [MoviesFoundOnline](https://moviesfoundonline.com/) * [FREEMOVIESNOW](https://www.youtube.com/c/FREEMOVIESNOW/featured) -* [FreeGreatMovies](https://www.freegreatmovies.com/) \ No newline at end of file +* [FreeGreatMovies](https://www.freegreatmovies.com/)