General Intro to NLP


Structural Analysis of Text

Infinite occurrences / finite processing.

Text is structured, but structure is not manifest. Therefore, problems of ambiguity (determining the intended structure). Local vs global ambiguity:

Problem of parsing linear structure into constituent tree. The questions to address are: Why is syntactic structure important:

Structural vs Functional

Map meanings to linguistic forms Language is intentional - used for a purpose Several parallel levels to language

Representation of meaning

  1. Option1: meaning = procedure - set of instructions to achieve what the speaker wants. Simplest form: Problem: what is a procedure? opaque?
  2. Option2: network based representations. concept = nodes, links = relationships. facilitate certain type of inferences (inheritance, propagation). intuitive.
  3. Option3: logic based representation. Compositionality principle (Frege 1890, Montague).

Use of knowledge

   "City officials refused the demonstrators a permit because"
   --- they feared violence
   --- they advocated violence
Difficult in terms of simple selectional markers. Need to reason about goals and plans of the participants to generate expectations about meaning.
   --- Excuse me, do you know if there's a bank hapoalim near here?
   --- Today is Monday!
Why is he telling me this? Regard utterances as actions given certain preconditions will achieve an Effect. Plan intended not marked in syntax:
   Can you pass the salt?
   It's rather cold!

Parts of Speech and Morphology

Syntactic analysis starts with the following questions: Linguists define groups of words which "behave in a similar manner" in syntactic contexts. The basic test is substitution:
  A {large, small, blue, enormous, ...} box is in the room.
Basic classes are: verb, noun, adjective. These are called parts of speech. For basic Parts of speech we can provide a semantic interpretation: Distinguish: Morphological features: Inflection is the process of modifying a root form by combining prefixes and suffixes to indicate the presence of morphological features.

Derivation is the process of creating a new lexical item from an existing, more basic one. For example, the derivation of the noun of action destruction from the verb destroy, or the chain of derivations luck/noun, lucky/adjective, luckily/adverb.

Compounding is the process of combining several lexical items into a new one, whose properties can be derived from the compounded elements, or can be independent. For example, in Hebrew beit sefer is derived from bayit and sefer. For more information on morphology, refer to this excellent Introduction to morphology.


Phrase Structure / Dependency

Constraints on word order. Words occur in groups, with dependencies among them. Constituent / Phrase.

Common Phrases:

Phrase Structure Grammars / (PSG)

Context Free Grammars (CFG)



Last modified Mar 18, 2007