Blog topic: Web archiving

The Official Senate CIA Torture Report

December 11, 2014
by Mr. James R. (Librarian) Jacobs

Yesterday the U.S. Senate Select Committee on Intelligence released its "Study of the CIA's Detention and Interrogation Program - Foreword, Findings, and Conclusions, and Executive Summary." (BIG PDF!) The report is 525 pages, heavily redacted, and includes graphic details about the torture techniques used by the CIA. The study found that American torture was not confined to a handful of aberrational cases or techniques, nor was it the work of rogue CIA agents. It was an officially sanctioned, worldwide (over 1/4 of the world's countries participated in some way!) regime of torture that had the acquiescence, if not explicit approval, of the top members of both political parties in Congress.

Web Halloween, secrets of resurrecting the SLAC dead website from the computer cemetery to live Web.

SLAC Early website "We had no idea that we were making history and were just trying to get the job done in our 'spare' time',”  Louise Addis, one of the WWWizards team who developed the SLAC website from 1991, said during our conversation about the restoration of SLAC's earliest website. Last May, Nicholas Taylor, web archiving service manager, told me, "SLAC has a historical collection of webpages that may be the first website in the US. Can we help them to find a home for this archive?” As Web archivist, I felt that I found a treasure. I replied, "Of course, Stanford Web Archive Portal should be the home."

One of the major use cases for the Web Archiving Service is preserving Stanford University web content. The earliest SLAC website represent the oldest such content we could find; it is the first website in the US dated to 1991, so we started there. Stanford Web Archiving Service launched its portal this week which featured SLAC's earliest website that was kept on SLAC servers for many years. This Halloween, it comes back to life. Our task was to convert the original list of scattered files into an accessible, browsable website with temporal navigation. In this post, I will discuss the technical challenges of and lessons learned from restoration process.

logo graphic appearing on the "SLAC Home Page" 1994-1995

Explore the oldest U.S. website

October 28, 2014
by Nicholas Taylor

At a microscopic level, web archives document the evolution of individual websites. At a macroscopic level, they document the evolution of the Web itself. In the case of web archives for the period when the entire Web consisted of only a handful of individual websites, changes to even a single website reflect changes to the Web itself. We are pleased to announce the availability of such an archive, notably featuring the oldest U.S. website, dating to December 21, 1991.

'The Dish (HDR)' by Flickr user whsieh78 under CC BY-NC-SA 2.0

Web Observatory briefing and workshop

October 9, 2014
by Nicholas Taylor

A couple of weeks ago, Stanford University Libraries hosted Dame Wendy Hall, Jim Hendler, and other web scientists affiliated with the Web Science Trust for a briefing on the Web Observatory initiative and a follow-on workshop organized by Lisa Green from Common Crawl. The notion of a Web Observatory implies a center proferring scientific instruments, but for the analysis of web data rather than natural phenomena. Indeed, the group's vision is that Web Observatories provide access to web datasets, projects, and tools. Eventually, a network of Web Observatories might offer both an interoperable architecture and distributed infrastructures for sharing and analysis of web datasets. The initiative touches on several areas of interest and investment by Stanford University Libraries, including data curation, web archiving, and supporting social science research.

'Step 7' by Flickr user george under CC BY-NC-ND 2.0

Guidance on building archivable websites

September 29, 2014
by Nicholas Taylor

A major challenge for web archivists is the low visibility that downstream archiving has on upstream web content creation. And, yet, deliberate and inadvertent architectural decisions made by web content creators strongly impact the ease or difficulty with which their websites can be captured and faithfully re-presented. A non-trivial byproduct of webmasters helping to ensure their content is archived for their own later use is that the Web itself becomes more archivable, to everyone's benefit.

logo of the International Internet Preservation Consortium

Recap of international web archiving community meeting

June 12, 2014
by Nicholas Taylor

Web archivists Ahmed AlSum and Nicholas Taylor and LOCKSS Chief Scientist David Rosenthal recently attended the International Internet Preservation Consortium (IIPC) General Assembly, an annual meeting of national libraries, research universities, non-profits, and service providers engaged in web archiving. This was the first General Assembly we all attended since Stanford University Libraries (SUL) joined the IIPC, though we had all previously attended meetings under the auspices of other organizations.

Pages

RSS
accessibilityaccessprivsarrow-circle-rightaskus-chataskus-librarianbarsblogsclosecoffeecomputercomputersulcontactsconversationcopierelectricaloutleteventsexternal-linkfacebook-circlegroupstudyhoursindividualinterlibrarynewsnextoffcampusopenlateoutdoorpeoplepolicypreviousprinterprojectsquietreservesscannersearchstudysupportingtabletourstwitter-circleworking