TAML Bootcamp Part 4/6

Date and Time 
September 21, 2022
1:00pm to 3:00pm
Location 
Hybrid
Audience 
Faculty/Staff
Students
Event Sponsor 
Stanford University Libraries

This is a hybrid event. Please register for each part separately.

In Part 4, we will focus on standardizing text prior to analysis, both for a single document and for a corpus of United Nations Human Rights Council documents. Topics include:

  • Standardizing casing and spacing
  • Removing punctuation and stop words
  • Tokenizing text
  • Lemmatizing/stemming
  • Tagging parts of speech
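
As a rough illustration of the first three steps listed above (casing and spacing, punctuation and stop words, tokenizing), here is a minimal sketch using only the Python standard library. It is not taken from the workshop materials: the function name `standardize`, the sample sentence, and the tiny stop word list are all assumptions made for illustration. Lemmatizing, stemming, and part-of-speech tagging generally rely on a dedicated NLP library and are not shown here.

```python
import re
import string

# Tiny illustrative stop word list (an assumption; real lists are much longer).
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "on", "for", "is"}


def standardize(text):
    """Lowercase, strip punctuation, collapse whitespace, tokenize, drop stop words."""
    # Standardize casing.
    text = text.lower()
    # Remove ASCII punctuation characters.
    text = text.translate(str.maketrans("", "", string.punctuation))
    # Standardize spacing: collapse runs of whitespace into single spaces.
    text = re.sub(r"\s+", " ", text).strip()
    # Tokenize on spaces (a deliberately simple tokenizer).
    tokens = text.split(" ") if text else []
    # Remove stop words.
    return [t for t in tokens if t not in STOP_WORDS]


# Hypothetical sample sentence, not drawn from the actual corpus.
sample = "The  Council adopted the Resolution, on   Human Rights!"
print(standardize(sample))
# → ['council', 'adopted', 'resolution', 'human', 'rights']
```

Deleting punctuation outright will fuse hyphenated words (for example, "rights-based" becomes "rightsbased"), so for a real corpus it may be preferable to replace punctuation with a space instead.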

Workshop materials:
