CIDR Python Text Analysis

RSS

Center for Interdisciplinary Research logo
Date and Time 
October 24, 2022
9:00am to 12:00pm
Location 
Hybrid
Audience 
Faculty/Staff
Students
Event Sponsor 
Stanford University Libraries

This workshop will introduce you to working with text data using the spaCy and textacy Python libraries. You will learn to effectively streamline text preprocessing, understand word tokenization, sentence segmentation, part-of-speech tagging, and named entity recognition using a single document (a short excerpt from H.G. Wells's A Short History of the World) and across a corpus of legal documents. 

This is a Hybrid workshop! Upon registering you can select to attend in-person (Velma Denning Room, 120F Green Library) or via Zoom. 

Prerequisites:

  • CIDR Introduction to Python workshop (or similar experience)

Workshop materials:

accessibilityaccessprivsarrow-circle-rightaskus-chataskus-librarianbarsblogsclosecoffeecomputercomputersulcontactsconversationcopierelectricaloutleteventsexternal-linkfacebook-circlegroupstudyhoursindividualinterlibrarynewsnextoffcampusopenlateoutdoorpeoplepolicypreviousprinterprojectsquietreservesscannersearchstudysupportingtabletourstwitter-circleworking