Natural Language Processing - CS60075

Spring Semester - 2020-21


Pawan Goyal

Course Timings


Monday - 14:00 - 15:00 (Online)

Tuesday - 16:00 - 18:00 (Online)

Teaching Assistants

Bishal Santra -

Rajdeep Mukherjee -

Ankan Mullick -

Samir Ghui -

Rajpratim Mitra -

Lecture Material

The lecture material of the course will be uploaded every week on Piazza.


Course Registration: I have received a large number of requests for this course. I have accommodated as many as I could, but since the class strength is already beyond 100, I will not be able to approve any further requests. In particular, I have not approved requests from 2nd year students, since they will have more chances to take this course in the future. The course will start on Jan 4th on MS Teams, and registered students will receive an invite.

Reference Books

  1. Daniel Jurafsky and James H. Martin. 2020. Speech and Language Processing. 3rd Edition (draft).
  2. Christopher D. Manning and Hinrich Schütze. 1999. Foundations of Statistical Natural Language Processing. MIT Press.
  3. Sowmya Vajjala, Bodhisattwa Majumder, Anuj Gupta, Harshit Surana. 2020. Practical Natural Language Processing. O'Reilly.
  4. Hobson Lane, Cole Howard, Hannes Hapke. 2019. Natural Language Processing in Action. Live Book.

Course Contents

Major components of the course include (tentative):
  1. Basic Text Processing: Tokenization, Stemming
  2. Language Modeling: N-grams, smoothing
  3. Morphology, Part-of-Speech Tagging
  4. Syntax: PCFGs, Dependency Parsing
  5. Topic Models
  6. Distributional Semantics
  7. Lexical Semantics, Word Sense Disambiguation
  8. Information Extraction: Relation extraction
  9. Text Classification, Sentiment Analysis
  10. Deep Learning for NLP: Basic and Advanced Models; Analysis