Pawan Goyal

Associate Professor

Department of Computer Science and Engineering

Indian Institute of Technology, Kharagpur, India -- 721302

Phone: +91-3222282370 (Office)

Email: pawang AT cse DOT iitkgp DOT ac DOT in

My Google Scholar page

My CV

Brief Bio

I joined the Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur as an Assistant Professor on July 30th, 2013. Prior to that, I was working at INRIA Paris-Rocquencourt as a postdoctoral fellow with Prof. Gérard Huet on The Sanskrit Heritage Site.

I received my B. Tech. in Electrical Engineering from the Indian Institute of Technology, Kanpur, and my Ph. D. from the Intelligent Systems Research Centre, Faculty of Computing and Engineering, University of Ulster, UK. My PhD advisors were Prof. Laxmidhar Behera and Prof. T. M. McGinnity. The topic of my PhD dissertation was "Analytic Knowledge Discovery Techniques for Ad-Hoc information Retrieval and Text Summarization".

My main research interests include Text Mining, Natural Language Processing, Information Retrieval and Sanskrit Computational Linguistics.

Professional Activities

We organized the Sixth International Conference on Sanskrit Computational Linguistics on October 23-25, 2019 at IIT Kharagpur. For more details, please visit this link.

Senior PC / Meta-Reviewer: AAAI 2021

Area Chair: ICON 2021, ICON 2020 (Syntax and Semantics)

Reviewer for Conferences: EMNLP 2021, SIGIR 2021, ACL 2021, NAACL 2021, EMNLP 2020, SIGIR 2020, ACL 2020, IJCAI 2020, LREC 2020, SDM 2020, ECIR 2020, AAAI 2020, EMNLP 2019, ACL 2019, ACL Demo 2019, NAACL 2019, ECIR 2019, AAAI 2019, EMNLP 2018, EMNLP Demo 2018, ACL Demo 2018, NAACL 2018, Coling 2018, SIGIR 2018, EMNLP 2017, ACL 2017, NAACL 2017, Coling 2016.

Journals: TACL, AI Journal, IEEE TCSS, ACM TALLIP, IPM, ACM TWEB.

News

September 3rd, 2021: Our paper, "Network Embeddings from Distributional Thesauri for Improving Static Word Representations" got accepted in Expert Systems with Applications.

September 2nd, 2021: Our paper received the Hypertext Ted Nelson Best Newcomer Paper Award.

August 26th, 2021: Our papers, "PASTE: A Tagging-free Decoding Framework using Pointer Networks for Aspect Sentiment Triplet Extraction" and "Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA" got accepted in EMNLP'21 (main and findings, respectively).

June 23rd, 2021: Our proposal, "Multilingual Dialogue as a Novel Framework for AutoSuggest" got selected as part of the Microsoft Academic Partnership Grant 2021 program.

May 6th, 2021: Our paper, "Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights" got accepted in Findings of ACL'21.

March 11th, 2021: Our paper, "Hierarchical Transformer for Task Oriented Dialog Systems" got accepted in NAACL-HLT'21.

January 19th, 2021: Our paper, "MatScIE: An automated tool for the generation of databases of methods and parameters used in the computational materials science literature" got accepted in Computational Materials Science, Elsevier.

December 31st, 2020: Abhilash Nandy and Ankan Mullick received the Prime Minister's Research Fellowship (PMRF).

December 15th, 2020: One Full paper and One Reproducibility Track paper got accepted in ECIR'21.

December 2nd, 2020: Our paper, "HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection" got accepted in AAAI'21.

October 3rd, 2020: Our paper, "A Graph Based Framework for Structured Prediction Tasks in Sanskrit" got accepted in Computational Linguistics Journal.

September 30th, 2020: Our paper, "Automatic Crime Identification from Facts: A Few Sentence-Level Crime Annotations is All You Need" got accepted in COLING.

September 15th, 2020: Our paper, "Keep it Surprisingly Simple: A Simple First Order Graph Based Parsing Model for Joint Morphosyntactic Parsing in Sanskrit" got accepted in EMNLP as a short paper.

September 10th, 2020: Received INAE Young Engineer Awards 2020.

July 24th, 2020: Our paper, "Hate begets Hate: A Temporal Study of Hate Speech" got accepted in ACM CSCW.

June 10th, 2020: Received Google India AI/ML Research Awards 2020.

April 22nd, 2020: Our paper, "Read what you need: Controllable Aspect-based Opinion Summarization of Tourist Reviews" got accepted in SIGIR 2020 as a short paper.

April 20th, 2020: Our paper, "Logic Constrained Pointer Networks for Interpretable Textual Similarity" got accepted in IJCAI 2020.

March 7th, 2020: Our paper, "Aspect-based Sentiment Analysis of Scientific Reviews" got accepted in JCDL 2020.

February 11th, 2020: Two papers got accepted in LREC 2020.

August 13th, 2019: Our paper, "Incorporating Domain Knowledge into Medical NLI using Knowledge Graphs" got accepted in EMNLP 2019 as a short paper.

July 3rd, 2019: Our paper, "Spread of hate speech in online social media" received the best paper award (honorable mention) at WebSci 2019.

May 14th, 2019: Our long paper, "On the Compositionality Prediction of Noun Phrases using Poincaré embeddings" and a short paper, "Poetry to Prose Conversion in Sanskrit as a Linearisation Task: A case for Low-Resource Languages" got accepted in ACL 2019.

April 14th, 2019: Our paper, "Addressing Vocabulary Gap in E-commerce Search" got accepted in SIGIR 2019 as a short paper.

April 6th, 2019: Our paper, "Spread of hate speech in online social media" got accepted in WebSci 2019.

March 16th, 2019: Our paper, "Thou shalt not hate: Countering online hate speech" got accepted in ICWSM 2019.

December 5th, 2018: One long paper, "Automated Early Leaderboard Generation From Comparative Tables" and three short papers got accepted in ECIR 2019.

August 11th, 2018: Our paper, "Free as in Free Word Order: An Energy Based Model for Word Segmentation and Morphological Tagging in Sanskrit" got accepted in EMNLP.

August 9th, 2018: Our paper, "Opinion Conflicts: An Effective Route to Detect Incivility in Twitter" got accepted in CSCW.

July 27th, 2018: Our paper, "Upcycle Your OCR: Reusing OCRs for Post-OCR Text Correction in Romanised Sanskrit" got accepted in CoNLL.

May 16th, 2018: Our paper, "WikiRef: Wikilinks as a route to recommending appropriate references for scientific Wikipedia pages" got accepted in Coling.

April 12th, 2018: Our paper, "Identifying Sub-events and Summarizing Information during Disasters" got accepted in SIGIR.

February 15th, 2018: Our paper, "Can Network Embedding of Distributional Thesaurus be Combined with Word Vectors for Better Representation?" got accepted in NAACL-HLT.

December 30th, 2017: Our paper, "Extracting and Summarizing Situational Information from the Twitter Social Media during Disasters" got accepted in ACM Transactions on the Web.

December 13th, 2017: Two full papers, "Building a Word Segmenter for Sanskrit Overnight" and "Network Features Based Co-hyponymy Detection" got accepted in LREC 2018 for oral presentations.

December 11th, 2017: Our paper, "Automated Assistance in E-commerce: An Approach based on Category-Sensitive Retrieval" got accepted in ECIR 2018 as a short paper.

August 21st, 2017: I will be chairing the Young Researchers' Symposium at CODS-COMAD 2018 along with Dr. Amit Awekar from IIT Guwahati. Please consider submitting. You can find more details here.

August 5th, 2017: Our paper, "Extracting Entities of Interest from Comparative Product Reviews" got accepted in CIKM 2017 as a short paper.

May 29th, 2017: We are organizing an ACM summer school on NLP and Machine Learning from June 1st to June 21st, 2017. More details can be found here.

May 16th, 2017: Our paper, "Relay-Linking Models for Prominence and Obsolescence in Evolving Networks" got accepted in KDD 2017 for a poster presentation.

March 21st, 2017: Two full papers, "Understanding the Impact of Early Citers on Long-Term Scientific Impact" and "WikiM: Metapaths based Wikification of Scientific Abstracts", and one short paper, "Citation sentence reuse behavior of scientists: A case study on massive bibliographic text dataset of computer science" got accepted in JCDL 2017.

February 25th, 2017: We are organizing a workshop on Complex and Social Networks on March 15th, 2017 in Gargi Auditorium. Prof. Frank Schweitzer (ETH, Zurich), Prof. Laxmidhar Behera (IIT Kanpur) and Dr. Manish Gupta (Microsoft Bing) are the speakers. There will also be a panel discussion on "How to sell your thesis to industry". For more details, visit the website here.

February 23rd, 2017: OCR++ got selected for the Gandhian Young Technological Innovation (GYTI) Award/Appreciation 2017.

February 11th, 2017: Our paper, "A Generic Opinion-Fact Classifier with Application in Understanding Opinionatedness in Various News Section" got accepted as a poster in WWW 2017.

September 21st, 2016: Our papers, "Word Segmentation in Sanskrit Using Path Constrained Random Walks" and "OCR++: A Robust Framework For Information Extraction from Scholarly Articles" got accepted in Coling 2016.

July 19th, 2016: Our paper, "peq : An explainable, specification-based, aspect-oriented product comparator for e-commerce" got accepted in ACM CIKM, 2016 as a short paper.

April 1st, 2016: Our paper, "Summarizing Situational Tweets in Crisis Scenario" got accepted in ACM HyperText, 2016.

December 11th, 2015: Our paper, "FeRoSA: A Faceted Recommendation System for Scientific Articles" got accepted in PAKDD, 2016.

July 4th, 2015: Our papers, "Extracting Situational Information from Microblogs during Disaster Events: A Classification-Summarization Approach" and "The role of citation context in predicting long-term citation profiles: an experimental study based on a massive bibliographic text dataset" got accepted in ACM CIKM, 2015.

May 13th, 2015: Our paper, "On the formation of circles in co-authorship networks" got accepted in ACM SIGKDD, 2015.

January 12th, 2015: Our paper, "An automatic approach to identify word sense changes in text media across timescales" got accepted in JNLE special issue on Graph Methods for NLP.

December 21st, 2014: Our paper, "On the categorization of scientific citation profiles in computer sciences" got accepted in Communications of the ACM.

October 1st, 2014: Our proposal, "IndicView: because language is no more a barrier" has been accepted as part of the Google - IIT Pilot program.

September 8th, 2014: Received a grant of USD 1000 from Yahoo! Labs towards encouraging student participation in the SNLP course projects this semester.