Ask HN: What is the best way to archive a webpage
I’ve enjoyed great success with various archiving proxies, including https://github.com/internetarchive/warcprox#readme and https://github.com/zaproxy/zaproxy#readme (which saves the content to an embedded database, and can be easier to work with than warc files). The benefit of those approaches over just save-as from the browser is that almost by definition the proxy will save all the components required to re-render the page, whereas save will only grab the parts it sees at that time.
Brewster Kahle on Innovation
Interview with Daniel Erasmus
CSPAN: Internet Archive: Brewster Kahle
Founder and digital librarian Brewster Kahle, scanning supervisor Jesse Bell, and board memberRick Prelinger talked about the Internet Archive in San… read more