ANNA UNIVERSITY TIRUCHIRAPPALLI
Tiruchirappalli - 620 024
Regulations 2007
Syllabus
B.E. COMPUTER SCIENCE AND ENGINEERING
SEMESTER VIII
NATURAL LANGUAGE PROCESSING
L T P
3 0 0
UNIT I FUNDAMENTALS 6
Basics − Knowledge in speech and language processing − Ambiguity − Models and algorithms −
Language − Thought and understanding − Regular expressions and automata − Regular expressions −
Finite state automata. morphology and finite − State transducers − Survey of english morphology −
Finite state morphological parsing − Combining FST lexicon and rules − Lexicon free FSTs − The
Porter stammer − Human morphological processing.
UNIT II SYNTAX 10
Word classes and part of speech tagging − English word classes − Tagsets for english − Part of speech
tagging − Rule-based part of speech tagging − Stochastic part of speech tagging − Transformation-
based tagging − Other issues − Context-free grammars for english: Constituency − Context-free rules
and trees − Sentence-level constructions − Noun phrase − Coordination − Agreement − Verb phase and
sub categorization − Auxiliaries − Spoken language syntax − Grammars equivalence and normal form
− Finite state and context-free grammars − Grammars and human processing − Parsing with context-
free grammars − Parsing as search − Basic top-down parser − Problems with the basic top-down parser
− Early algorithm − Finite-state parsing methods.
UNIT III ADVANCED FEATURES AND SYNTAX 11
Features and unification − Feature structures − Unification of feature structures − Features structures in
the grammar − Implementing unification − Parsing with unification constraints − Types and inheritance
− Lexicalized and probabilistic parsing − Probabilistic context-free grammar − Problems with PCFGS
− Probabilistic lexicalized CFGS − Dependency grammars − Human parsing.
UNIT IV SEMANTIC 10
Representing meaning − Computational desiderata for representations − Meaning structure of language
− First order predicate calculus − Some linguistically relevant concepts − Related representational
approaches − Alternative approaches to meaning − Semantic analysis − Syntax driven semantic
analysis − Attachments for a fragment of english − Integrating semantic analysis into the early parser −
Idioms and compositionality − Robust semantic analysis − Lexical semantics − Relational among
lexemes and their senses − Word net − Database of lexical relations − Internal structure of words −
Creativity and the lexicon.
UNIT V APPLICATIONS 8
Word sense disambiguation and information retrieval − Selectional restriction − Based disambiguation
− Robust word sense disambiguation − Information retrieval − Other information retrieval tasks −
Natural language generation − Introduction to language generation − Architecture for generation −
Surface realization − Discourse planning − Other issues − Machine translation − Language similarities
and differences − Transfer metaphor − Interlingua idea: using meaning − Direct translation − Using
statistical techniques − Usability and system development.
Total: 45
TEXT BOOK
1. Daniel Jurafsky and James H. Martin, “Speech and Language Processing”, Pearson Education
Pvt. Ltd., 2002.
REFERENCES
1. James Allen, “Natural Language Understanding”, Pearson Education, 2003.
2. Akshar Bharathi, Chaitanya and Sangal, “Natural Language Processing : A Paninian
Approach”, PHI, 2004.
0 comments :
Post a Comment