Michael Elhadad
Natural Language Processing (202-2-5211)
Meets:
Sun 10-12 Bdg 34 Room 105
Mon 10-12 Bdg 34 Room 003
News:
- March 22nd: Assignment 1 is available
- May 31st: Assignment 2 is available
Lecture Notes
- General Intro to NLP - Linguistic Concepts
- Parts of speech Tagging
- Context Free Grammars Parsing
- Automatic Text Summarization
Topics covered in assignments include:
- Language Models and n-grams -- Statistical Models of Unseen Data (Smoothing)
- Information Extraction / Named Entity Recognition
- Using Machine Learning Tools: Classification, Sequence Labeling / Supervised Methods / SVM
- Compositional Semantic from CFG Parsing
- Sentence Simplification
Assignments
Software
- NLTK Installation:
Nltk is a Python based toolkit with wide coverage of NLP techniques -
both statistical and knowledge-based.
- SISC Scheme Interpreter: we use
Scheme examples to demonstrate algorithms in parsing, generation and
some semantic analysis. This interpreter is very small and convenient
to use on any platform supporting Java (full version is 2.4MB with
full doc - jar is 300KB).
Resources
Last modified March 22, 2010
Michael Elhadad