Pawan Goyal

Assistant Professor

Department of Computer Science and Engineering

Indian Institute of Technology, Kharagpur, India -- 721302

Phone: +91-3222282370 (Office)

Email: pawang AT cse DOT iitkgp DOT ac DOT in

Brief Bio

I joined the Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur as an Assistant Professor on July 30th, 2013. Prior to that, I was working at INRIA Paris-Rocquencourt as a post doctoral fellow with Prof. Gérard Huet on The Sanskrit Heritage Site.

I did my B. Tech. in Electrical Engineering from Indian Institute of Technology, Kanpur. I received my Ph. D. from Intelligent Systems Research Centre, Faculty of Computing and Engineering, University of Ulster, UK. My PhD advisors were Prof. Laxmidhar Behera and Prof. T. M. McGinnity. The topic of my PhD dissertation was "Analytic Knowledge Discovery Techniques for Ad-Hoc information Retrieval and Text Summarization".

My main research interests include Text Mining, Natural Language Processing, Information Retrieval and Sanskrit Computational Linguistics.

Tools and Projects

Cl Scholar, the ACL Anthology knowledge graph miner tool is available here.

You can check our tool OCR++ for extracting metadata information frpm Scientific Articles here.

Project IndicView

Professional Activities: Conference Organization

We are organizing sixth international conference on Sanskrit Computational Linguistics from October 23-25, 2019 at IIT Kharagpur. For more details, please visit this link.

Professional Activities: Reviewer

Conferences: Reviewer for EMNLP 2019, ACL 2019, ACL Demo 2019, NAACL 2019, ECIR 2019, AAAI 2019, EMNLP 2018, EMNLP Demo 2018, ACL Demo 2018, NAACL 2018, Coling 2018, SIGIR 2018, EMNLP 2017, ACL 2017, NAACL 2017, Coling 2016.



August 13th, 2019: Our paper, "Incorporating Domain Knowledge into Medical NLI using Knowledge Graphs" got accepted in EMNLP 2019 as a short paper.

July 3rd, 2019: Our paper, "Spread of hate speech in online social media" get the best paper award (honorable mention) at WebSci 2019.

May 14th, 2019: Our long paper, "On the Compositionality Prediction of Noun Phrases using Poincaré embeddings" and a short paper, "Poetry to Prose Conversion in Sanskrit as a Linearisation Task: A case for Low-Resource Languages" got accepted in ACL 2019.

April 14th, 2019: Our paper, "Addressing Vocabulary Gap in E-commerce Search" got accepted in SIGIR 2019 as a short paper.

April 6th, 2019: Our paper, "Spread of hate speech in online social media" got accepted in WebSci 2019.

March 16th, 2019: Our paper, "Thou shalt not hate: Countering online hate speech" got accepted in ICWSM 2019.

December 5th, 2018: One long paper, "Automated Early Leaderboard Generation From Comparative Tables" and three short papers got accepted in ECIR 2019.

August 11th, 2018: Our paper, "Free as in Free Word Order: An Energy Based Model for Word Segmentation and Morphological Tagging in Sanskrit" got accepted in EMNLP.

August 9th, 2018: Our paper, "Opinion Conflicts: An Effective Route to Detect Incivility in Twitter" got accepted in CSCW.

July 27th, 2018: Our paper, "Upcycle Your OCR: Reusing OCRs for Post-OCR Text Correction in Romanised Sanskrit" got accepted in CoNLL.

May 16th, 2018: Our paper, "WikiRef: Wikilinks as a route to recommending appropriate references for scientific Wikipedia pages" got accepted in Coling.

April 12th, 2018: Our paper, "Identifying Sub-events and Summarizing Information during Disasters" got accepted in SIGIR.

February 15th, 2018: Our paper, "Can Network Embedding of Distributional Thesaurus be Combined with Word Vectors for Better Representation?" got accepted in NAACL-HLT.

December 30th, 2017: Our paper, "Extracting and Summarizing Situational Information from the Twitter Social Media during Disasters" got accepted in ACM Transactions on the Web.

December 13th, 2017: Two full papers, "Building a Word Segmenter for Sanskrit Overnight" and "Network Features Based Co-hyponymy Detection" got accepted in LREC 2018 for oral presentations.

December 11th, 2017: Our Paper, "Automated Assistance in E-commerce: An Approach based on Category-Sensitive Retrieval" got accepted in ECIR 2018 as a short paper.

August 21st, 2017: I will be chairing the Young Researchers' Symposium at CODS-COMAD 2018 along with Dr. Amit Awekar from IIT Guwahati. Please consider submitting. You can find more details here.

August 5th, 2017: Our Paper, "Extracting Entities of Interest from Comparative Product Reviews" got accepted in CIKM 2017 as a short paper.

May 29th, 2017:We are organizing ACM summer school on NLP and Machine Learning from June 1st to June 21st, 2017. More details acan be found here.

May 16th, 2017: Our Paper, "Relay-Linking Models for Prominence and Obsolescence in Evolving Networks" got accepted in KDD 2017 for a poster presentation.

March 21st, 2017: Two full papers, "Understanding the Impact of Early Citers on Long-Term Scientific Impact", "WikiM: Metapaths based Wikification of Scientific Abstracts" and one short paper, "Citation sentence reuse behavior of scientists: A case study on massive bibliographic text dataset of computer science" accepted in JCDL 2017.

February 25th, 2017: We are organizing workshop on Complex and Social Networks on March 15th, 2017 in Gargi Auditorium. Prof. Frank Schweitzer (ETH, Zurich), Prof. Laxmidhar Behera (IIT Kanpur) and Dr. Manish Gupta (Microsoft Bing) are the speakers. There will also be a panel discussion on "How to sell your thesis to industry". For more details, visit the website here.

February 23rd, 2017: OCR++ got selected for the Gandhian Young Technological Innovation (GYTI) Award/Appreciation 2017.

February 11th, 2017: Our paper, "A Generic Opinion-Fact Classifier with Application in Understanding Opinionatedness in Various News Section" got accepted as a poster in WWW 2017.

September 21st, 2016: Our papers, "Word Segmentation in Sanskrit Using Path Constrained Random Walks" and "OCR++: A Robust Framework For Information Extraction from Scholarly Articles" got accepted in Coling 2016.

July 19th, 2016: Our paper, "peq : An explainable, specification-based, aspect-oriented product comparator for e-commerce" got accepted in ACM CIKM, 2016 as a short paper.

April 1st, 2016: Our paper, "Summarizing Situational Tweets in Crisis Scenario" got accepted in ACM HyperText, 2016.

December 11th, 2015: Our paper, "FeRoSA: A Faceted Recommendation System for Scientific Articles" got accepted in PAKDD, 2016.

July 4th, 2015: Our papers, "Extracting Situational Information from Microblogs during Disaster Events: A Classification-Summarization Approach" and "The role of citation context in predicting long-term citation profiles: an experimental study based on a massive bibliographic text dataset" got accepted in ACM CIKM, 2015.

May 13th, 2015: Our paper, "On the formation of circles in co-authorship networks" got accepted in ACM SIGKDD, 2015.

January 12th, 2015: Our paper, "An automatic approach to identify word sense changes in text media across timescales" got accepted in JNLE special issue on Graph Methods for NLP.

December 21st, 2014: Our paper, "On the categorization of scientific citation profiles in computer sciences" got accepted in Communications of the ACM.

October 1st, 2014: Our proposal, "IndicView: because language is no more a barrier" has been accepted as part of the Google - IIT Pilot program.

September 8th, 2014: Received a grant of USD 1000 from Yahoo! Labs towards encouraging student participation in the SNLP course projects this semester.