Part-of-Speech Tagging
Important Notions
- The n-class model
- Lexical probabilities: P(w|t)
- Contextual probabilities:
P(ti|ti-n+1...ti-1)
- Hidden Markov Models (HMM)
- Parameter estimation
- Tagged training data (MLE)
- Untagged training data (EM)
- Sparse data and smoothing
- Algorithms for HMMs
- String probability: Forward, Backward, Forward-Backward
- Optimal state sequence: Viterbi
- Unsupervised parameter estimation: Baum-Welch
Slides for lecture 6
Suggested Reading
- Krenn, B. & Samuelsson, C. (1997) The Linguist's
Guide to Statistics. Sections 2.1 and 5.1.
- Manning, C. D. & Schütze, H. (1999) Foundations of Statistical
Natural Language Processing. MIT Press. Chapter 9, sections
10.1-10.2.
Project