We've written before on our restoration of the oldest U.S. website, covering in detail how we did it and some interesting discoveries we made along the way. More recently, Web Archiving Engineer Ahmed AlSum prepared a visual diagram (see below) of the steps involved in packaging, indexing, and making accessible the legacy web content in a poster for the Joint Conference on Digital Libraries (JCDL), an annual meeting sponsored by the Association for Computing Machinery (ACM) and the Institute of Electrical and Electronics Engineers (IEEE) focused on research and development for digital libraries. Notably, the display won the Best Poster Award! We celebrate the continued community interest in Ahmed's innovative work.
A couple of weeks have passed since the successful conclusion of the annual IIPC General Assembly, hosted this year by Stanford University Libraries and Internet Archive. The meeting has been pretty well summarized already in posts by Sawood Alam, Jefferson Bailey, Emmanuelle Bermes, Tom Cramer, Carlos Eduardo Entini, and Ian Milligan. Rather than contributing another retrospective, I'd like to instead look ahead to 2016 and consider what the web archiving community might accomplish together in the coming year, highlighting some of the opportunities discussed and presented two weeks ago.
Stanford University Libraries is happy to introduce EarthWorks, our new geospatial data discovery application. EarthWorks is a discovery tool for geospatial (a.k.a. GIS) data. It allows users to search and browse the GIS collections owned by Stanford University Libraries, as well as data collections from many other institutions. Data can be searched spatially, by manipulating a map; by keyword search; by selecting search limiting facets (e.g., limit to a given format type); or by combining these options.
A few weeks ago, Jerry McBride, Mimi Tashiro, Jon Manton, Casey Mullin (our musical colleague in Lathrop Library) and I traveled to the Mile High City of Denver Colorado for the 84th annual meeting of the Music Library Association. We were greeted by a swirling snowstorm and single-digit temperatures: novel to us, not so much for the East Coast attendees!
Once each year, the international web archiving community represented by the International Internet Preservation Consortium meets for a week-long "General Assembly". As alluded to in my recap of the 2014 meeting, I'm pleased to belatedly announce that Stanford University is the confirmed host for the 2015 IIPC General Assembly as well as more promptly announce that registration is now open!
Prospective attendees may register for 1-3 segments making up the week-long event:
- Monday, April 27th, is the Open Conference day. Registration is open to anyone, as the intended audience — researchers, web professionals, digital humanists, digital library specialists, and other interested parties — is broadest. The agenda hasn't been finalized yet but tentatively includes talks on large-scale, longitudinal analyses of web data; insights and differences in personal-scale web content preservation; and experimentation with derivative web archive datasets.
- Tuesday, April 28th, is the Open Workshops day. Registration is again open to anyone, though the program has a slightly more web archiving community-centric focus. The agenda currently features, in parallel, a file formats hackathon and sessions on web archive information retrieval, content analysis, and promising new tools.
- Wednesday through Friday, April 29th to May 1st, are IIPC working days. Registration is open to staff of IIPC member institutions only. These days will feature tracks for each of the working groups and sessions proposed by IIPC members focused on collaborative projects and exchange of best practices.
I encourage anyone interested in a survey of the work happening in the field to sign up for one or both of the "open" days. If you belong to an IIPC member institution (Stanford University employees are eligible) and are currently or prospectively engaged in web archiving on an ongoing basis, you may also want to consider the IIPC days.
The General Assembly location rotates every year, typically between Europe, North America, and Oceania, so this is an infrequent opportunity for nearby interested individuals and institutions to plug in. There's also an attendee cap, so please register sooner as versus later, if you plan to attend.
We hope to see you there!
On Tuesday, Dec 16 2014, the SearchWorks team added a new set of features supporting the display and use of digital content in SearchWorks:
- An updated image viewer that includes a styled header and footer, three different viewing modes (single image, thumbnail gallery and horizontal scroll), file download links and improved navigation and full-screen mode. You can see an example at http://searchworks.stanford.edu/view/tv206kh7995
- Direct download access to non-image file content, such as deposited datasets, media files and objects from born-digital collections. Users no longer need to click a link to purl.stanford.edu to access file-type content that is available in SearchWorks. For an example see http://searchworks.stanford.edu/view/rq635hh7400
- For both the new image and file viewer, users now have the ability to embed these digital objects in other web pages, such as blog posts. By clicking the embed icon </> in the footer of the viewer, the user is shown embed code that they can copy and paste into html when authoring a blog post or other web page. The fully functional embedded version of the viewer will also include in the footer a link that shows the use and reproduction statement, and copyright statement (if applicable). See an example of a digital object embedded in a blog post at http://stanford.io/1zICRVe
This latest release (SearchWorks version 3.0.8), also includes several other bug fixes and enhancements.
We are pleased to announce the acceptance of our bid to join the IIPC Steering Committee, based on a vote by the IIPC membership. SUL joins the 15-member group as one of two currently-serving university library members (the other being the University of North Texas Libraries) and as the third university library to ever serve on the body (the other being the California Digital Library).
"We had no idea that we were making history and were just trying to get the job done in our 'spare' time',” Louise Addis, one of the WWWizards team who developed the SLAC website from 1991, said during our conversation about the restoration of SLAC's earliest website. Last May, Nicholas Taylor, web archiving service manager, told me, "SLAC has a historical collection of webpages that may be the first website in the US. Can we help them to find a home for this archive?” As Web archivist, I felt that I found a treasure. I replied, "Of course, Stanford Web Archive Portal should be the home."
One of the major use cases for the Web Archiving Service is preserving Stanford University web content. The earliest SLAC website represent the oldest such content we could find; it is the first website in the US dated to 1991, so we started there. Stanford Web Archiving Service launched its portal this week which featured SLAC's earliest website that was kept on SLAC servers for many years. This Halloween, it comes back to life. Our task was to convert the original list of scattered files into an accessible, browsable website with temporal navigation. In this post, I will discuss the technical challenges of and lessons learned from restoration process.