logo of the International Internet Preservation Consortium

A couple of weeks have passed since the successful conclusion of the annual IIPC General Assembly, hosted this year by Stanford University Libraries and the Internet Archive. The meeting has already been well summarized in posts by Sawood Alam, Jefferson Bailey, Emmanuelle Bermes, Tom Cramer, Carlos Eduardo Entini, and Ian Milligan. Rather than contributing another retrospective, I'd like to look ahead to 2016 and consider what the web archiving community might accomplish together in the coming year, highlighting some of the opportunities discussed and presented two weeks ago.

Continue the "mainstreaming" of web archives as primary research materials

It was gratifying to see the breadth of both research disciplines and research support initiatives represented at the General Assembly. I hope to see brilliant new scholarship in the coming year from the maturing community of researchers working with web archives and believe that we're also well-positioned to make inroads with many who haven't worked with web archives before. Continued experimentation is needed not just in tools and interfaces but also in service and engagement models. We should mind, and then mine, local models of success for replicable access and research services.

Salmon data in EarthWorks

Stanford University Libraries is happy to introduce EarthWorks, our new discovery tool for geospatial (a.k.a. GIS) data. It allows users to search and browse the GIS collections owned by Stanford University Libraries, as well as data collections from many other institutions. Data can be searched spatially, by manipulating a map; by keyword search; by selecting search-limiting facets (e.g., limit to a given format type); or by combining these options.

The blogger enjoying Denver's weather

A few weeks ago, Jerry McBride, Mimi Tashiro, Jon Manton, Casey Mullin (our musical colleague in Lathrop Library) and I traveled to the Mile High City of Denver, Colorado, for the 84th annual meeting of the Music Library Association. We were greeted by a swirling snowstorm and single-digit temperatures: novel to us, not so much for the East Coast attendees!

Once each year, the international web archiving community represented by the International Internet Preservation Consortium meets for a week-long "General Assembly". As alluded to in my recap of the 2014 meeting, I'm pleased to belatedly announce that Stanford University is the confirmed host for the 2015 IIPC General Assembly and, more promptly, to announce that registration is now open!

Prospective attendees may register for 1-3 segments making up the week-long event:

  • Monday, April 27th, is the Open Conference day. Registration is open to anyone, as the intended audience — researchers, web professionals, digital humanists, digital library specialists, and other interested parties — is broadest. The agenda hasn't been finalized yet but tentatively includes talks on large-scale, longitudinal analyses of web data; insights and differences in personal-scale web content preservation; and experimentation with derivative web archive datasets.
  • Tuesday, April 28th, is the Open Workshops day. Registration is again open to anyone, though the program has a slightly more web archiving community-centric focus. The agenda currently features, in parallel, a file formats hackathon and sessions on web archive information retrieval, content analysis, and promising new tools.
  • Wednesday through Friday, April 29th to May 1st, are IIPC working days. Registration is open to staff of IIPC member institutions only. These days will feature tracks for each of the working groups and sessions proposed by IIPC members focused on collaborative projects and exchange of best practices.

I encourage anyone interested in a survey of the work happening in the field to sign up for one or both of the "open" days. If you belong to an IIPC member institution (Stanford University employees are eligible) and are currently or prospectively engaged in web archiving on an ongoing basis, you may also want to consider the IIPC days.

The General Assembly location rotates every year, typically between Europe, North America, and Oceania, so this is an infrequent opportunity for nearby interested individuals and institutions to plug in. There's also an attendee cap, so please register sooner rather than later if you plan to attend.

We hope to see you there!

a screenshot of the enhanced image viewer in SearchWorks

On Tuesday, Dec 16 2014, the SearchWorks team added a new set of features supporting the display and use of digital content in SearchWorks:

  • An updated image viewer that includes a styled header and footer, three different viewing modes (single image, thumbnail gallery and horizontal scroll), file download links, improved navigation and a full-screen mode. You can see an example at http://searchworks.stanford.edu/view/tv206kh7995
  • Direct download access to non-image file content, such as deposited datasets, media files and objects from born-digital collections. Users no longer need to click a link to purl.stanford.edu to access file-type content that is available in SearchWorks. For an example see http://searchworks.stanford.edu/view/rq635hh7400
  • For both the new image viewer and the file viewer, users can now embed these digital objects in other web pages, such as blog posts. Clicking the embed icon </> in the footer of the viewer shows embed code that the user can copy and paste into HTML when authoring a blog post or other web page. The fully functional embedded version of the viewer also includes a footer link that shows the use and reproduction statement and the copyright statement (if applicable). See an example of a digital object embedded in a blog post at http://stanford.io/1zICRVe

This latest release (SearchWorks version 3.0.8) also includes several other bug fixes and enhancements.

We are pleased to announce the acceptance of our bid to join the IIPC Steering Committee, based on a vote by the IIPC membership. SUL joins the 15-member group as one of two currently serving university library members (the other being the University of North Texas Libraries) and as the third university library ever to serve on the body (the other being the California Digital Library).

The Steering Committee provides strategic direction for the IIPC, defining the structures for successful projects and collaborations, overseeing sponsored initiatives and partnerships, discussing and approving the annual budget, and vetting new member applications. This particular Steering Committee will have the additional role of guiding the creation of the 2016-2018 membership agreement, which informs the IIPC mission and goals.

Recent years have seen an especial increase in the number of IIPC members that are both research university libraries and based in North America; SUL exemplifies this category. And though the percentage of U.S. web archiving organizations that are IIPC members is in the single digits, the work of the IIPC strongly affects the field as a whole.

We look forward to participating in the IIPC Steering Committee starting in 2015 and, through this opportunity, advancing both the SUL web archiving program and web archiving generally.

SLAC Early website

"We had no idea that we were making history and were just trying to get the job done in our 'spare' time," Louise Addis, one of the WWWizards team who developed the SLAC website from 1991, said during our conversation about the restoration of SLAC's earliest website. Last May, Nicholas Taylor, web archiving service manager, told me, "SLAC has a historical collection of webpages that may be the first website in the US. Can we help them to find a home for this archive?" As a web archivist, I felt that I had found a treasure. I replied, "Of course, the Stanford Web Archive Portal should be the home."

One of the major use cases for the Web Archiving Service is preserving Stanford University web content. The earliest SLAC website represents the oldest such content we could find; it is the first website in the US, dating to 1991, so we started there. The Stanford Web Archiving Service launched its portal this week, featuring SLAC's earliest website, which had been kept on SLAC servers for many years. This Halloween, it comes back to life. Our task was to convert the original list of scattered files into an accessible, browsable website with temporal navigation. In this post, I will discuss the technical challenges of, and lessons learned from, the restoration process.

logo graphic appearing on the "SLAC Home Page" 1994-1995

In the course of creating a browsable archive of SLAC's earliest websites, we discovered a number of interesting facts and features that might not be readily apparent on casual browsing. While surely not an exhaustive catalog, we hope that these observations will help you to quickly get into the archive and discover some of what it has to offer.
