Once each year, the international web archiving community represented by the International Internet Preservation Consortium meets for a week-long "General Assembly". As alluded to in my recap of the 2014 meeting, I'm pleased to belatedly announce that Stanford University is the confirmed host for the 2015 IIPC General Assembly as well as more promptly announce that registration is now open!
On Tuesday, Dec 16 2014, the SearchWorks team added a new set of features supporting the display and use of digital content in SearchWorks:
- An updated image viewer that includes a styled header and footer, three different viewing modes (single image, thumbnail gallery and horizontal scroll), file download links and improved navigation and full-screen mode. You can see an example at http://searchworks.stanford.edu/view/tv206kh7995
- Direct download access to non-image file content, such as deposited datasets, media files and objects from born-digital collections. Users no longer need to click a link to purl.stanford.edu to access file-type content that is available in SearchWorks. For an example see http://searchworks.stanford.edu/view/rq635hh7400
- For both the new image and file viewer, users now have the ability to embed these digital objects in other web pages, such as blog posts. By clicking the embed icon </> in the footer of the viewer, the user is shown embed code that they can copy and paste into html when authoring a blog post or other web page. The fully functional embedded version of the viewer will also include in the footer a link that shows the use and reproduction statement, and copyright statement (if applicable). See an example of a digital object embedded in a blog post at http://stanford.io/1zICRVe
This latest release (SearchWorks version 3.0.8), also includes several other bug fixes and enhancements.
We are pleased to announce the acceptance of our bid to join the IIPC Steering Committee, based on a vote by the IIPC membership. SUL joins the 15-member group as one of two currently-serving university library members (the other being the University of North Texas Libraries) and as the third university library to ever serve on the body (the other being the California Digital Library).
"We had no idea that we were making history and were just trying to get the job done in our 'spare' time',” Louise Addis, one of the WWWizards team who developed the SLAC website from 1991, said during our conversation about the restoration of SLAC's earliest website. Last May, Nicholas Taylor, web archiving service manager, told me, "SLAC has a historical collection of webpages that may be the first website in the US. Can we help them to find a home for this archive?” As Web archivist, I felt that I found a treasure. I replied, "Of course, Stanford Web Archive Portal should be the home."
One of the major use cases for the Web Archiving Service is preserving Stanford University web content. The earliest SLAC website represent the oldest such content we could find; it is the first website in the US dated to 1991, so we started there. Stanford Web Archiving Service launched its portal this week which featured SLAC's earliest website that was kept on SLAC servers for many years. This Halloween, it comes back to life. Our task was to convert the original list of scattered files into an accessible, browsable website with temporal navigation. In this post, I will discuss the technical challenges of and lessons learned from restoration process.
In the course of creating a browsable archive of the SLAC earliest websites, we discovered a number of interesting facts and features that might not be readily apparent on casual browsing. While surely not an exhaustive catalog, we hope that these observations will help you to quickly get into the archive and discover some of what it has to offer.
At a microscopic level, web archives document the evolution of individual websites. At a macroscopic level, they document the evolution of the Web itself. In the case of web archives for the period when the entire Web consisted of only a handful of individual websites, changes to even a single website reflect changes to the Web itself. We are pleased to announce the availability of such an archive, notably featuring the oldest U.S. website, dating to December 21, 1991.
A couple of weeks ago, Stanford University Libraries hosted Dame Wendy Hall, Jim Hendler, and other web scientists affiliated with the Web Science Trust for a briefing on the Web Observatory initiative and a follow-on workshop organized by Lisa Green from Common Crawl. The notion of a Web Observatory implies a center proferring scientific instruments, but for the analysis of web data rather than natural phenomena. Indeed, the group's vision is that Web Observatories provide access to web datasets, projects, and tools. Eventually, a network of Web Observatories might offer both an interoperable architecture and distributed infrastructures for sharing and analysis of web datasets. The initiative touches on several areas of interest and investment by Stanford University Libraries, including data curation, web archiving, and supporting social science research.
A major challenge for web archivists is the low visibility that downstream archiving has on upstream web content creation. And, yet, deliberate and inadvertent architectural decisions made by web content creators strongly impact the ease or difficulty with which their websites can be captured and faithfully re-presented. A non-trivial byproduct of webmasters helping to ensure their content is archived for their own later use is that the Web itself becomes more archivable, to everyone's benefit.