CIDR Python Introduction to Text Analysis

RSS

Date and Time 
May 10, 2022
2:00pm to 4:00pm
Location 
Zoom
Audience 
Faculty/Staff
Students
Event Sponsor 
Stanford University Libraries
Contact 
muzzall@stanford.edu

This workshop will introduce you to working with text data using the spaCy and textacy Python libraries. You will learn to effectively streamline text preprocessing, understand word tokenization, sentence segmentation, part-of-speech tagging, and named entity recognition using a single document (a short excerpt from H.G. Wells's A Short History of the World) and across a corpus of legal documents. 

Prerequisites:

  • CIDR Introduction to Python workshop (or similar experience)

Before the workshop, be sure to download and install:

Workshop materials:

accessibilityaccessprivsarrow-circle-rightaskus-chataskus-librarianbarsblogsclosecoffeecomputercomputersulcontactsconversationcopierelectricaloutleteventsexternal-linkfacebook-circlegroupstudyhoursindividualinterlibrarynewsnextoffcampusopenlateoutdoorpeoplepolicypreviousprinterprojectsquietreservesscannersearchstudysupportingtabletourstwitter-circleworking