Infrastructures and Services for the Scholarly Use of Web Archives
An Overview
DOI:
https://doi.org/10.5282/o-bib/5821
Keywords:
Web archiving
Abstract
This article first gives a brief overview of the current state of web archiving in German libraries and outlines the legal framework. Building on this, it describes current practice in indexing and using web archives, as well as the requirements for documenting web archiving processes. The focus of the article is a comprehensive analysis of additional forms of data provision from web archives and of supporting services for scholarly use with computer-aided analysis methods, drawing on examples from the international web archiving community.
License
Copyright (c) 2022 Tobias Beinert, Katharina Schmid, Konstanze Weimer
This work is licensed under a Creative Commons Attribution 4.0 International License.