TAML Bootcamp Part 4/6
Date and Time
September 21, 2022
1:00pm to 3:00pm
Location
Hybrid
Audience
Faculty/Staff
Students
Event Sponsor
Stanford University Libraries
This is a hybrid event. Please register for each part separately.
In Part 4, we will focus on standardizing text prior to analysis, both for a single document and for a corpus of United Nations Human Rights Council documents. Topics include:
- Standardizing casing and spacing
- Removing punctuation and stop words
- Tokenizing text
- Lemmatizing/stemming
- Part-of-speech tagging
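As a rough illustration of how the steps listed above can fit together, here is a minimal Python sketch that uses only the standard library. It is not taken from the workshop materials: the function names, the tiny stop-word list, the sample sentence, and the suffix-stripping stemmer are all assumptions made for illustration. Lemmatization and part-of-speech tagging are not shown, since they generally need a dedicated NLP library such as NLTK or spaCy.

```python
import re
import string

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "on", "for", "is", "are"}


def standardize(text):
    """Lowercase the text and collapse runs of whitespace into single spaces."""
    return re.sub(r"\s+", " ", text.lower()).strip()


def remove_punctuation(text):
    """Delete ASCII punctuation characters."""
    return text.translate(str.maketrans("", "", string.punctuation))


def tokenize(text):
    """Split on whitespace; a deliberately simple tokenizer."""
    return text.split()


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def naive_stem(token):
    """Strip a few common English suffixes; a crude stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Run the steps in order: casing/spacing, punctuation, tokens, stop words, stems."""
    tokens = tokenize(remove_punctuation(standardize(text)))
    return [naive_stem(t) for t in remove_stop_words(tokens)]


# Hypothetical sample sentence, not drawn from the actual corpus.
sample = "The  Council ADOPTED the resolutions,\n and members are voting."
print(preprocess(sample))
# ['council', 'adopt', 'resolution', 'member', 'vot']
```

The output "vot" for "voting" shows why a naive stemmer is only a starting point: stems are not always real words, which is one reason lemmatization is often preferred when readable output matters.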
Workshop materials: