Lecture Notes and Assigned Readings
| Date | Lecture Topic and Handouts | Readings | Assignments |
| Mon 1/26 | Introduction (.pdf) | Jurafsky and Martin, Chapter 1. | |
| Weds 1/28 | Ambiguity in Language; a Bit o' History (.pdf) | L. Lee. "I'm sorry Dave, I'm afraid I can't do that": Linguistics, Statistics, and Natural Language Processing circa 2001. To appear in the National Academies' Study on Fundamentals of Computer Science. | |
| Mon 2/2 | Lexical semantics I (.pdf) | J&M: 16.1. | Paper critiques due (2). |
| Weds 2/4 | Lexical semantics II; WordNet (.pdf) | J&M: 16.2; 16.4; 17.1. | |
| Mon 2/9 | Word sense disambiguation: supervised methods (.pdf) | J&M: 17.2. For a description of the relevant background material on probability theory, see Manning and Schutze 2.1.1-2.1.3. | |
| Weds 2/11 | Word sense disambiguation: weakly supervised, unsupervised, dictionary-based methods (.pdf) | Paper critique due. | |
| Mon 2/16 | SENSEVAL; Spelling correction, pronunciation variation (.pdf) | SENSEVAL: The evaluation of word sense disambiguation systems. J&M, Chapter 5.1-5.5. | |
| Weds 2/18 | Noisy channel model (.pdf) | | Paper critique due. |
| Mon 2/23 | Pronunciation variation via the noisy channel model (see notes from last class); Project discussion. | J&M, Chapter 5.7-5.8 | |
| Weds 2/25 | N-gram models and the statistics of natural language (.pdf) | | |
| Mon 3/1 | Zipf's law; unsmoothed n-gram models (.pdf) | J&M, Chapter 6.1-6.2. Additional material from Manning & Schutze. | Paper critique due. |
| Weds 3/3 | Smoothing (.pdf) | J&M, Chapter 6.3 | |
| Mon 3/8 | Linear interpolation and backoff (.pdf) | J&M, Chapter 6.4-6.6 | Paper critique due. |
| Weds 3/10 | Part-of-speech tagging, TBL (.pdf) | J&M, Chapter 8.1-8.4, 8.6 | |
| Mon 3/15 | HMM tagger (.pdf) | J&M, Chapter 8.5. The Viterbi algorithm is explained in Chapter 7. Alternatively, see this HMM tutorial. | Project proposals due. |
| Weds 3/17 | Parsing with CFGs (.pdf) | J&M, Chapter 9; 10.1-10.3 | |
| Mon 3/22 | NO CLASS: spring break | | |
| Weds 3/24 | NO CLASS: spring break | | |
| Mon 3/29 | Bottom-up chart parsing (.pdf) | J&M, Chapter 10.4 | Paper critique due. Similarity-Based Estimation of Word Cooccurrence Probabilities. |
| Weds 3/31 | Earley algorithm (see notes from last class); partial parsing (.pdf) | J&M, Chapter 10.5-10.6 | |
| Mon 4/5 | Question-answering systems (.pdf). Guest lecture: Eric Breck | | |
| Weds 4/7 | Class canceled (sick) | | |
| Mon 4/12 | Lexicalized and probabilistic parsing (.pdf) | J&M, Chapter 12 | Project literature survey due. |
| Weds 4/14 | Information extraction (.pdf) | | |
| Mon 4/19 | Learning extraction patterns (.pdf) | J&M, Chapter 15.5 | |
| Weds 4/21 | EM (.pdf). Guest lecture: Vincent Ng | | |
| Mon 4/26 | Theories of Syntax (.pdf). Guest lecture: Veselin Stoyanov | | |
| Weds 4/28 | Finish IE lecture. Pragmatics and the problem of Inference (.pdf) | | |
| Mon 5/3, Weds 5/5 | Likely to be NO CLASS | | |
| Study week: Mon 5/10 and Tues 5/11 | Mon: 10-11:30 | | Project presentations. |
| Mon 5/17 | | | Project report due. |