- Advertisement -

- Advertisement -

OHIO WEATHER

List of Web archiving initiatives: Difference between revisions


Name Country Creation Year Technologies Number of Employees Comments Full-time Part-time End of Term Web Archive United States 2008 Heritrix, Wayback 6–10 The End of Term Web Archive captures and saves U.S. Government federal government websites (.gov, .mil, etc) in the Legislative, Executive, or Judicial branches of the government at the end of presidential administrations. Beginning in 2008, the EOT has thus far preserved websites from administration changes in 2008, 2012 and 2016, and is currently gearing up for the 2020 transition. Project partners include CA Digital Library, Internet Archive, Library of Congress, George Washington University, Stanford University, University of North Texas, and the US Government Publishing Office. Archive.st United States 2017 Archive.st custom programming provided by US Support LLC 1 0 Archive.st provides free online web archiving in the form of a .JPG and HTML archive. EU Web archive European Union 2013 Archive-it service 1 The EU Web archive compiles the captures of the websites of the EU institutions, which are hosted on the europa.eu domain and subdomains. Its aim is to preserve EU web content in the long term and to keep it accessible for the public. The archive was created in 2013 by the Historical Archives of the European Union and in 2018, the Publications Office of the EU took over this task and created the EU Web archive service. The collection of archived websites is covered by the EU legal deposit schema, which collects all the material produced by the European bodies in a comprehensive bibliography. Alabama State Government and Politics Web Site and Social Media Archives[3] United States 2005 Archive-it service Australia’s Web Archive[4] Australia 1996 PANDORA Digital Archiving System (PANDAS), Heritrix, Bamboo, NLA Trove, HTTrack, Webrecorder, outbackCDX. 4 10 The National Library of Australia leads the ‘PANDORA’ component of the Australian Web Archive which takes a selective approach and is a collaborative program of 10 agencies providing curatorial input. PANDORA uses the PANDAS workflow system (developed by the NLA in the late 1990s) with HTTrack as the default harvester. The National Library of Australia also conducts bulk harvesting of Australian government (the Australian Government Web Archive) websites using the Heritrix harvester and Webrecorder with a backend infrastructure (referred to as ‘Bamboo’) to organise content and the NLA developed outbackCDX tool to manage indexing access restrictions for content. In addition to these approaches the National Library also conducts annual harvests of the whole .au domain which is donein collaboration with the Internet Archive using Heritrix and Wayback. In 2019, PANDORA, the Australian Government Web Archive and the whole domain harvests were integrated into a new single discovery and delivery portal through the NLA’s Trove discovery service. PROMISE project[5] Belgium 2017 Heritrix, PyWB 7 The PROMISE project was a two-year project (2017–2019) that explored the policy-related, legal, technical and scientific issues related to archiving the Belgian web. The aim of the project was to a) identify best practices in the field of web-archiving b) develop a strategy for preserving the Belgian web c) set up a pilot for preserving and providing access to the archived Belgian web and d) make recommendations for the implementation of a sustainable web-archiving service. The project was launched by the Royal Library of Belgium[6] and the State Archives of Belgium[7] in collaboration with Ghent University (Research Group for Media, Innovation and Communication[8] and Ghent Centre for Digital Humanities),[9] Université de Namur (Research Centre in Information, Law and Society)[10] and Haute-École Bruxelles-Brabant[11] (Unité de Recherche et de Formation en Sciences de l’Information et de la Documentation). In October 2019 the concluding colloquium ‘Saving the web: the promise of a Belgian web archive’)[12] took place at KBR. The main research findings were presented during this colloquium. KBR web archive[13] Belgium 2020 1 KBR[14] or the Belgian Royal Library is developing an operational web archive based on the findings of the PROMISE research project[5] (2017–2019). Operational policies and technical infrastructure will be developed based on the strategy outlined in the PROMISE project. MT.GOV Connect United States 2007 Archive-It Service 1 Montana State Library collection of state agency websites dating from 1996 in partial fulfillment of statutory mandate[15] to identify, acquire, describe, and provide permanent public access to state publications. Digitized historic state publications available at https://archive.org/details/MontanaStateLibrary Stillio[16] Worldwide 2011 Puppeteer, V8 engine, Gecko, WebKit, Amazon Web Services 3 4 SaaS solution for periodical website & social media archiving….



Read More: List of Web archiving initiatives: Difference between revisions

This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Accept Read More

Privacy & Cookies Policy

Get more stuff like this
in your inbox

Subscribe to our mailing list and get interesting stuff and updates to your email inbox.

Thank you for subscribing.

Something went wrong.