Natural Language Processing - CS60075

Spring Semester - 2020-21

Instructor

Pawan Goyal

Course Timings

Lectures

Monday - 14:00 - 15:00 (Online)

Tuesday - 16:00 - 18:00 (Online)

Teaching Assistants

Dr. Tapas Nayak - tnk02.05@gmail.com

Bishal Santra - bsantraigi@gmail.com

Rajdeep Mukherjee - rajdeep1989.iitkgp@gmail.com

Ankan Mullick - aankanmullick@gmail.com

Samir Ghui - ghuimilsamir2303@gmail.com

Rajpratim Mitra - jonty.mitra@gmail.com

Lecture Material

Lecture material for the course will be uploaded to Piazza every week.

Announcements

Course Registration: There have been many requests to register for the course. I have accommodated as many as I could, but since the class strength is already beyond 100, I will not be able to approve any further requests. In particular, I have not approved requests from 2nd-year students, since they will have more chances to take this course in the future. The course will start on Jan 4th on MS Teams. Registered students will receive an invite.

Reference Books

  1. Daniel Jurafsky and James H. Martin. 2020. Speech and Language Processing. 3rd Edition (draft).
  2. Christopher D. Manning and Hinrich Schütze. 1999. Foundations of Statistical Natural Language Processing. MIT Press.
  3. Sowmya Vajjala, Bodhisattwa Majumder, Anuj Gupta, Harshit Surana. 2020. Practical Natural Language Processing. O'Reilly.
  4. Hobson Lane, Cole Howard, Hannes Hapke. 2019. Natural Language Processing in Action. Live Book.

Course Contents

Major components of the course (tentative):
  1. Basic Text Processing: Tokenization, Stemming
  2. Language Modeling: N-grams, Smoothing
  3. Morphology, Part-of-Speech Tagging
  4. Syntax: PCFGs, Dependency Parsing
  5. Topic Models
  6. Distributional Semantics
  7. Lexical Semantics, Word Sense Disambiguation
  8. Information Extraction: Relation Extraction
  9. Text Classification, Sentiment Analysis
  10. Deep Learning for NLP: Basic and Advanced Models; Analysis