You are here

Digital library



Three new digital collections were added to SearchWorks during the month of July. This brings the total number of digital collections available in SearchWorks to 93. The collections recently added are:

Title: A tide prediction and tide height control system for laboratory mesocosms

Abstract: Software code to accompany the manuscript "A tide prediction and tide height control system for laboratory aquaria" by L.P. Miller and J. D. Long

Collection contact: Amy Hodge

Two new digital collections were added to SearchWorks during the month of June. This brings the total number of digital collections available in SearchWorks to 90. The collections recently added are:

A novel computational method of metabolic network analysis for genetic discovery

Abstract: Java scripts for re-production of the computational method being published in Pharmacogenetics and Genomics 2012, 22:877–886

Collection contact: Amy Hodge

Undergraduate Theses, Department of Biology, 2014-2015 

Abstract: Honors theses written by undergraduates in the Stanford University Department of Biology, 2014-2015.

Collection contact: Michael Newman

The integration of digital collections into SearchWorks means that items from collections containing digital material can be discovered in the course of searching and browsing through the totality of Stanford's vast library catalog.

For more information on depositing materials into the Stanford Digital Repository, visit our website. For questions or additional information about the Stanford Digital Repository service, please email us at

This project is a spin-off from the ongoing GAMECIP (Game Metadata and Citation Project) funded by the Institute for Museum and Library Services (IMLS).  We are planning to spin off and expand our work on controlled platform vocabularies with additional research and photography, then publish a definitive terminology and ontology for computer game platforms in a manner that will be useful for libraries, museums, collectors and scholars. We are beginning with the most used set as an initial set.

The two core components of the project are (1) review and photography of hardware in Stanford’s Stephen M. Cabrinety Collection in the History of Microcomputing, in order to complete our assessment and provide visual aids in the published terminology guide; and (2) collation, editing and design of platform and media description sheets that will be made available online and, eventually, be available for publication in a print format.  

Shu-Wen Lin

The Stanford Media Preservation Lab and the Department of Special Collections are delighted that Shu-Wen Lin is spending the summer with us in Redwood City interning as a media archivist. Leveraging her interest and background in the arts, Shu-Wen will help to process and preserve several media-based collections, including the archives of visual artist Carolee Schneeman and the archives of Telling Pictures, a prominent Bay Area film production company. Shu-Wen's first day is June 1.

Three new digital collections are now available in SearchWorks. These collections take advantage of SearchWorks' ability to provide users with rich discovery and access capabilities for finding and working with digital collection content.

Stanford Geospatial Center Teaching Data

Abstract: These items are intended for use in Stanford Geospatial Center teaching materials.

Collection contact: Amy Hodge

This past week saw the 2015 General Assembly of the IIPC, International Internet Preservation Coalition--probably the biggest week and biggest event of the year in the web archiving world. The IIPC has 50 members from 30 countries, and comprises the leading web archiving institutions in the world, including tons of national libraries, the Internet Archive, and a growing number of research institutions.

Here are the five key lessons that I am taking away from this year’s IIPC.  

More is More. Less is Less. Avoid Monocultures.

  1. “More is More. Less is Less. Avoid Monoculture.” Abby Smith Rumsey articulated the soul and mission of web archiving in 8 words during her Wednesday talk on Memory in the Digital Age. Speaking from her perspectives as an historian, a member of the NSF’s Blue Ribbon Task Force on Sustainable Digital Preservation, and as a longtime leader and practitioner in digital libraries, Abby outlined both the needs and opportunities for archiving the modern web. The essence of her talk (from my perspective): capture as much as possible, don’t over-invest in curation today (let future users make their own calls), demonstrate value, and get as many players as possible active in the effort.  

  2. APIs, APIs, APIs. The fundamentals of web archiving are now (pretty well) understood, and it’s a (somewhat) mature space. We now know enough to create standard definitions for interactions across the major functions of web archiving--selection, capture, preservation, indexing, playback, mining. Lets define these interactions formally via APIs; once we do, we’ll see all sorts of associated benefits. Institutional software stacks will be componentized, modular, and swappable allowing us to assemble best-of-breed systems. Archives and their associated functions will become interoperable--allowing for the reuse and exchange of content, software and services across institutions, time and place. Developers will be able to swarm an individual component (say playback with Open Wayback; crawling with Open Heretrix; indexing with solr) with the confidence it will plug in to their local stack.

    If there is a single opportunity for IIPC to advance the technology-scape supporting Web Archiving in the next year, it’s defining these community-standard APIs among layers of the end-to-end web archiving stack.

  3. It’s time to intentionally cultivate a community of Web Archiving developers. If each of the 50 institutions in the IIPC has .5 of a developer allocated to web archiving, that makes 25 FTE developers. If each site allocates 1 software engineer, that makes 50 developers. I don’t know what the average size of an IIPC member’s dev team is, but it’s a BIG POOL of talent, and one that collectively could get a LOT done if they work in concert. The seeds of a robust development community have been sown. The development of Open Wayback, while slow to pick up community traction at first, is now looks healthy and even thriving, with 10 committers from as many institutions.

    If there are two opportunities for IIPC to advance the technology-scape supporting Web Archiving in the coming year, the second is to foster a thriving, collaborative developer community engaged in building tools for web archiving--one that is intensely collaborative, and subscribes to best practices in open source, collaborative software development.

  4. Reaffirming the mission and focus of IIPC. This is the third and final year of the IIPC’s three year consortial agreement. Going into 2016 means the organization must renew the consortial agreement, and provides a rare chance to reaffirm its mission, specify current goals, and update its operating practices. This is a fantastic opportunity for IIPC members to double down on the areas of greatest value for advancing web archiving internationally. The IIPC Steering Committee is primed to take this on; I am very much looking forward to the next few months of discussions among the Steering Committee and members as we re-express the core mission and objectives of IIIPC for the next five years. Personally, I hope and expect that the objectives of tool development, training, advocacy and collaborative collecting/preservation/access remain the fundamental objectives of the organization.

  5. Networked archives are the future of archiving the ‘net. Brewster Kahle gave a provocative speech on the heels of Abby Smith Rumsey’s talk on Wednesday. In essence, he called for “collective collection building; distributed preservation; local/cloud-based access” for web archives. While there are a lot of details still to be sorted, I think Brewster has hit a key theme and need for web archives of the future--given the scale and nature of the Web, it’s no longer sufficient to think about global Web Archiving as a series of independent archives each pursuing its own selection, capture, preservation, access and use objectives. Rather, web archiving in the future needs to reflect the distributed nature of the very thing we’re seeking to preserve--a collective, distributed, networked effort to capture, archive and give access to the Web, with many, many players invested and specializing in the functions they do best. Scale and specialization are both watchwords here. APIs and Abby Smith Rumsey’s council (more is more; avoid monoculture) are also both critical here. This is going to be very interesting to observe in the next three to five years.

This was my first IIPC General Assembly, and overall I am impressed. The IIPC is a serious, substantial and dedicated group of practitioners. There is a broad front of international concern and engagement in preserving the Internet and its contents for future generations; at the same time, there are deep pockets of innovation and progress in advancing the state of the art in practice. For all the substance of IIPC in its current form, though, there needs to be MORE--the Internet is the largest, most significant, most far reaching, and most interconnected vehicle for human communication and history, EVER. Capturing its contents over time is critical to capturing the history, substance and technology of humanity.

IIPC is the vanguard for memory and research institutions in the world for capturing this critical content, defining the technology and policies and practices that will capture this critical period of humanity’s development. If there were ever a group and cause that merits national and international investment to capture a unique period of history, this is it. After this week, I’m happy to report that--as large and critical as the need is--IIPC seems like is in a unique position to help meet it, and define the history of the present and future.

Mappa [sic] geologico do Brazil, Geological Society of America, 1991 Branner, John Casper, Contributor.

The Branner Earth Sciences Library is named after John Casper Branner (1850-1922). Branner was, among many things, a geologist, an academic and a founding member of the faculty at Stanford and went on to become Stanford’s second president. He was also president of the Geological Society of America and served as the president of the Seismological Society of America.


Mappa [sic] geologico do Brazil, Geological Society of America, 1919. Branner, John Casper, Contributor.

Salmon data in EarthWorks

Stanford University Libraries is happy to introduce EarthWorks, our new geospatial data discovery application. EarthWorks is a discovery tool for geospatial (a.k.a. GIS) data. It allows users to search and browse the GIS collections owned by Stanford University Libraries, as well as data collections from many other institutions. Data can be searched spatially, by manipulating a map; by keyword search; by selecting search limiting facets (e.g., limit to a given format type); or by combining these options.