Blog topic: Web archiving
Since our collaboration with political science researchers using web archives to understand the 2014 U.S. congressional elections, we've seen (and, hopefully, helped foster) growing interest in web archives as primary source material. This trend parallels a similar refocusing by other web archiving programs toward enhancing access services and facilitating research use. The maturity and the variety of these efforts, as well as the accumulating body of resulting research, provide an expanding list of references with which to orient and entice prospective researchers to the potential of working with web archives.
A welcome complement to the lately growing number of web archiving-specific events, the inaugural Web Archives: Capture, Curate, Analyze conference (tweet stream) brought together an eclectic crowd of researchers, instructors, students, archivists, librarians, developers, and others interested in web archiving. A novel mixture of institutions was also represented - some active principally through IIPC, many more associated with the SAA Web Archiving Roundtable and/or Archive-It Partner communities, and still others who I'd not yet encountered in these more established, practitioner-centric fora.
Echoing the sentiments of other participants, I was impressed and inspired both by the diversity of perspectives and the excitement for moving web archiving forward. As befitting such a group, the schedule and hallway conversations crossed a wide array of topics. Running through it all, though, questions of ethics seemed to be a persistent subject. I'll highlight three areas of ethical concern that stood out for me.
The world's first websites were built for very different rendering and navigation interfaces than the comparatively advanced browsers available today. Thanks to the work of web archivists (e.g., CERN, SLAC), we can celebrate the incongruity of accessing some of these ancient websites using modern browsers. While a traditional goal of web archiving has been to preserve the "canonical" user experience of a website, this has been persistently impaired by (among other challenges) accessing web archives using software other than would've been available at the time content was archived.
"What does it take to archive a linear foot of the Web?," Anna Perricci posed rhetorically to our web archiving metrics breakout discussion group two weeks ago. I don't yet have a good answer for what the question's getting at, but I was gratified by the level of interest and engagement in web archiving as archiving at the just-concluded Society of American Archivists (SAA) Annual Meeting and inaugurally coscheduled Archive-It Partner Meeting.
A couple of weeks have passed since the successful conclusion of the annual IIPC General Assembly, hosted this year by Stanford University Libraries and Internet Archive. The meeting has been well summarized already in posts by Sawood Alam, Jefferson Bailey, Emmanuelle Bermes, Tom Cramer, Carlos Eduardo Entini, and Ian Milligan. Rather than contributing another retrospective, I'd like to instead look ahead to 2016 and consider what the web archiving community might accomplish together in the coming year, highlighting some of the opportunities discussed and presented two weeks ago.
This past week saw the 2015 General Assembly of the IIPC, International Internet Preservation Coalition--probably the biggest week and biggest event of the year in the web archiving world. The IIPC has 50 members from 30 countries, and comprises the leading web archiving institutions in the world, including tons of national libraries, the Internet Archive, and a growing number of research institutions.