ArchiveBox

You are viewing an old revision of this post, from March 23, 2019 @ 21:15:59.

ArchiveBox is a free, open-source tool that lets users create archived versions of web pages.

ArchiveBox takes a list of website URLs you want to archive, and creates a local, static, browsable HTML clone of the content from those websites (it saves HTML, JS, media files, PDFs, images and more).

You can use it to preserve access to websites you care about by storing them locally, where they remain available offline. ArchiveBox imports lists of URLs, renders the pages in a headless, authenticated, user-scriptable browser, and then archives the content in multiple redundant common formats (HTML, PDF, PNG, WARC) that will last long after the originals disappear from the internet. It automatically extracts assets and media from pages and saves them in easily accessible folders, with out-of-the-box support for extracting git repositories, audio, video, subtitles, images, PDFs, and more.
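To make the "list of URLs in, local browsable folders out" idea concrete, here is a minimal Python sketch of that workflow. It is not ArchiveBox's code or its API: the function names (`parse_urls`, `fetch_html`, `archive_urls`) and the numbered folder layout are assumptions made up for illustration, and it only saves the raw HTML response, with no headless browser rendering and no PDF, PNG, or WARC output.

```python
import re
from pathlib import Path
from typing import Callable, Dict, Iterable, List
from urllib.request import urlopen

# Hypothetical helper pattern: matches http(s) URLs up to whitespace or a quote.
# Trailing punctuation such as "." or "," is not stripped in this sketch.
URL_RE = re.compile(r"https?://[^\s<>\"']+")


def parse_urls(text: str) -> List[str]:
    """Extract http(s) URLs from free-form text, keeping first-seen order."""
    seen = set()
    urls = []
    for match in URL_RE.findall(text):
        if match not in seen:
            seen.add(match)
            urls.append(match)
    return urls


def fetch_html(url: str) -> bytes:
    """Download the raw response body (no JavaScript rendering)."""
    with urlopen(url, timeout=30) as response:
        return response.read()


def archive_urls(
    urls: Iterable[str],
    out_dir: str,
    fetch: Callable[[str], bytes] = fetch_html,
) -> Dict[str, Path]:
    """Save each URL into its own numbered folder under out_dir.

    Returns a mapping from URL to the saved index.html path.
    The folder layout is an assumption for illustration only.
    """
    root = Path(out_dir)
    root.mkdir(parents=True, exist_ok=True)
    saved: Dict[str, Path] = {}
    for number, url in enumerate(urls, start=1):
        snapshot_dir = root / f"{number:04d}"
        snapshot_dir.mkdir(exist_ok=True)
        # Record the source URL next to the saved copy.
        (snapshot_dir / "url.txt").write_text(url + "\n", encoding="utf-8")
        page = snapshot_dir / "index.html"
        page.write_bytes(fetch(url))
        saved[url] = page
    return saved
```

Calling `archive_urls(parse_urls(open("bookmarks.txt").read()), "archive")` would, under these assumptions, write one folder per unique URL found in a hypothetical `bookmarks.txt`. The `fetch` parameter is there so the download step can be swapped out, for example for a headless browser, which is the part that a real archiver does far more thoroughly.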
