Journal Papers

Roy, Kalyani, Goyal, Pawan and Pandey, Manish (2024). Exploring generative frameworks for product attribute value extraction. Expert Systems with Applications, Vol. 243, 122850.

Mullick, Ankan, Ghosh, Akash, Chaitanya, G Sai, Ghui, Samir, Nayak, Tapas, Lee, Seung-Cheol, Bhattacharjee, Satadeep and Goyal, Pawan (2024). MatSciRE: Leveraging Pointer Networks to Automate Entity and Relation Extraction for Material Science Knowledge-base Construction. Computational Materials Science, Elsevier, Vol. 233, 112659.

Hazra, Rima, Singh, Mayank, Goyal, Pawan, Adhikari, Bibhas and Mukherjee, Animesh (2023). Modeling interdisciplinary interactions among Physics, Mathematics & Computer Science. Journal of Physics: Complexity, Vol. 4, No. 4, 045001.

Roy, Aniruddha, Sharma, Isha, Sarkar, Sudeshna and Goyal, Pawan (2022). Meta-ED: Cross-lingual Event Detection using Meta-learning for Indian Languages. ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 22, No. 2, Feb 2023.

Das, Kishalay, Samanta, Bidisha, Goyal, Pawan, Lee, Seung-Cheol, Bhattacharjee, Satadeep and Ganguly, Niloy (2022). CrysXPP: An Explainable Property Predictor for Crystalline Materials. npj Computational Materials, Vol. 8, No. 1.

Jana, Abhik, Haldar, Siddhant and Goyal, Pawan (2021). Network Embeddings from Distributional Thesauri for Improving Static Word Representations. Expert Systems with Applications, Vol. 187.

Nayak, Tapas, Majumder, Navonil, Goyal, Pawan and Poria, Soujanya (2021). Deep Neural Approaches to Relation Triplets Extraction: A Comprehensive Survey. Cognitive Computation, Vol. 13, pp. 1215-1232.

Ghosh, Krishnendu, Nangi, Sharmila Reddy, Kanchugantla, Yashasvi, Rayapati, Pavan Gopal, Bhowmick, Plaban Kumar and Goyal, Pawan (2021). Augmenting Video Lectures: Predicting Off-topic Concepts and Linking to Relevant Video Lecture Segments. International Journal of Artificial Intelligence in Education, Springer.

Guha, Souradip, Mullick, Ankan, Agrawal, Jatin, Ram, Swetarekha, Ghui, Samir, Lee, Seung-Cheol, Bhattacharjee, Satadeep and Goyal, Pawan (2021). MatScIE: An automated tool for the generation of databases of methods and parameters used in the computational materials science literature. Computational Materials Science, Elsevier.

Das, Mithun, Mathew, Binny, Saha, Punyajoy, Goyal, Pawan and Mukherjee, Animesh (2020). Hate speech in online social media. ACM SigWeb Newsletter, Nov 2020.

Krishna, Amrith, Santra, Bishal, Gupta, Ashim, Satuluri, Pavankumar and Goyal, Pawan (2020). A Graph Based Framework for Structured Prediction Tasks in Sanskrit. Computational Linguistics (PDF).

Jana, Abhik, Mukherjee, Animesh and Goyal, Pawan (2020). Network Measures: A New Paradigm Towards Reliable Novel Word Sense Detection. Information Processing and Management, Elsevier, Vol. 57, No. 6 (PDF).

Pandey, Pradumn Kumar, Singh, Mayank, Goyal, Pawan, Mukherjee, Animesh and Chakrabarti, Soumen (2020). Analysis of Reference and Citation Copying in Evolving Bibliographic Networks. Journal of Informetrics, Elsevier, Vol. 14, No. 1 (PDF).

Rudra, Koustav, Goyal, Pawan, Ganguly, Niloy, Imran, Mohammad and Mitra, Prasenjit (2019). Summarizing Situational Tweets in Crisis Scenarios: An Extractive-Abstractive Approach. IEEE Transactions on Computational Social Systems, Vol. 6, No. 5, pp. 981-993.

Bhattacharya, Paheli, Goyal, Pawan and Sarkar, Sudeshna (2018). Using Communities of Words Derived from Multilingual Word Vectors for Cross-Language Information Retrieval in Indian Languages. ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 18, No. 1, Article 1, Dec 2018. (PDF)

Mullick, Ankan, Goyal, Pawan, Ganguly, Niloy and Gupta, Manish (2018). Harnessing Twitter for Answering Opinion List Queries. IEEE Transactions on Computational Social Systems, Vol. 5, No. 4, pp. 1083-1095. (PDF)

Rudra, Koustav, Ganguly, Niloy, Goyal, Pawan and Ghosh, Saptarshi (2018). Extracting and Summarizing Situational Information from the Twitter Social Media during Disasters. ACM Transactions on the Web, Vol. 12, No. 3, Article 17. (PDF)

Singh, Mayank, Chakraborty, Tanmoy, Mukherjee, Animesh and Goyal, Pawan (2016). Is this conference a top-tier? ConfAssist: An assistive conflict resolution framework for conference categorization. Journal of Informetrics, Vol. 10, No. 4, pp. 1005-1022. (Arxiv)

Goyal, Pawan and Huet, Gérard (2016). Design and analysis of a lean interface for Sanskrit corpus annotation. Journal of Language Modeling, Vol. 4, No. 2, pp. 145-182. (Access Online)

Chakraborty, Tanmoy, Kumar, Suhansanu, Goyal, Pawan, Ganguly, Niloy and Mukherjee, Animesh (2015). On the categorization of scientific citation profiles in computer sciences. Communications of the ACM, Vol. 58, No. 9, pp. 82-90 (Access Online).

Jonnalagadda, Siddhartha R, Goyal, Pawan and Huffman, Mark D (2015). Automating Data Extraction in Systematic Reviews: A Systematic Review. Systematic Reviews, 4:78. (Access Online).

Mitra, Sunny, Mitra, Ritwik, Maity, Suman Kalyan, Riedl, Martin, Biemann, Chris, Goyal, Pawan and Mukherjee, Animesh (2015). An automatic approach to identify word sense changes in text media across timescales. JNLE special issue on Graph Methods for NLP, Vol. 21, No. 5, pp. 773-798. (PDF)

Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM (2013). A Context based Word Indexing Model for Document Summarization. IEEE Transactions on Knowledge and Data Engineering, 25 (8). pp. 1693-1705.

Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM (2013). A Novel Neighborhood Based Document Smoothing Model for Information Retrieval. Information Retrieval, Springer, 16 (3). pp. 391-425.

Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM (2012). Query Representation through Lexical Association for Information Retrieval. IEEE Transactions on Knowledge and Data Engineering, 24 (12). pp. 2260-2273.

Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM (2008). Application of Bayesian Framework In Natural Language Understanding. IETE Technical Review, 25 (5). pp. 251-269.

Conference Papers

Khatuya, Subhendu, Mukherjee, Rajdeep, Ghosh, Akash, Hegde, Manjunath, Dasgupta, Koustuv, Ganguly, Niloy, Ghosh, Saptarshi and Goyal, Pawan (2024). Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling. NAACL.

Nandy, Abhilash, Kulkarni, Yash, Goyal, Pawan and Ganguly, Niloy (2024). Order-Based Pre-training Strategies for Procedural Text Understanding. NAACL (short).

Ghosh, Akash, Bathini, Venkata Sahith, Ganguly, Niloy, Goyal, Pawan and Singh, Mayank (2024). How Robust are the QA Models for Hybrid Scientific Tabular Data? A Study using Customized Dataset. LREC-Coling (short).

Sandhan, Jivnesh, Narsupalli, Yaswanth, Muppirala, Sreevatsa, Krishnan, Sriram, Satuluri, Pavankumar, Kulkarni, Amba and Goyal, Pawan (2023). DepNeCTI: Dependency-based Nested Compound Type Identification for Sanskrit. EMNLP Findings.

Santra, Bishal, Basak, Sakya, De, Abhinandan, Gupta, Manish and Goyal, Pawan (2023). Frugal Prompting for Dialog Models. EMNLP Findings.

Mukherjee, Rajdeep, Kannen, Nithish, Pandey, Saurabh Kumar and Goyal, Pawan (2023). CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction. EMNLP Findings.

Nandy, Abhilash, Kapadnis, Manav Nitin, Goyal, Pawan and Ganguly, Niloy (2023). CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural Text. EMNLP Findings.

Sandhan, Jivnesh, Agarwal, Anshul, Behera, Laxmidhar, Sandhan, Tushar and Goyal, Pawan (2023). SanskritShala: A Neural Sanskrit NLP Toolkit with Web-Based Interface for Pedagogical and Annotation Purposes. ACL (Demo).

Das, Kishalay, Goyal, Pawan, Lee, Seung-Cheol, Bhattacharjee, Satadeep and Ganguly, Niloy (2023). CrysMMNet: Multimodal Representation for Crystal Property Prediction. UAI.

Sharma, Soumya, Khatuya, Subhendu, Hegde, Manjunath, Shaikh, Afreen, Dasgupta, Koustuv, Goyal, Pawan and Ganguly, Niloy (2023). Financial Numeric Extreme Labelling: A dataset and benchmarking. Findings of ACL (short).

Paul, Shounak, Mandal, Arpan, Goyal, Pawan and Ghosh, Saptarshi (2023). Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law. ICAIL.

Sandhan, Jivnesh, Behera, Laxmidhar and Goyal, Pawan (2023). Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency Parsing. EACL (short).

Mullick, Ankan, Mondal, Ishani, Ray, Sourjyadip, R, Raghav, Chaitanya, G Sai and Goyal, Pawan (2023). Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages. EACL findings.

Das, Kishalay, Samanta, Bidisha, Goyal, Pawan, Lee, Seung-Cheol, Bhattacharjee, Satadeep and Ganguly, Niloy (2023). CrysGNN: Distilling pre-trained knowledge to enhance property prediction for crystalline materials. AAAI.

Mukherjee, Rajdeep, Bohra, Abhinav, Banerjee, Akash, Sharma, Soumya, Hegde, Manjunath, Shaikh, Afreen, Shrivastava, Shivani, Dasgupta, Koustuv, Ganguly, Niloy, Ghosh, Saptarshi and Goyal, Pawan (2022). ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts. EMNLP.

Sandhan, Jivnesh, Singla, Rathin, Rao, Narein, Samanta, Suvendu, Behera, Laxmidhar and Goyal, Pawan (2022). TransLIST: A Transformer-Based Linguistically Informed Sanskrit Tokenizer. EMNLP findings.

Shukla, Abhay, Bhattacharya, Paheli, Poddar, Soham, Mukherjee, Rajdeep, Ghosh, Kripabandhu, Goyal, Pawan and Ghosh, Saptarshi (2022). Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation. AACL.

Kar, Debanjana, Sarkar, Sudeshna and Goyal, Pawan (2022). ArgGen: Prompting Text Generation Models for Document-Level Event-Argument Aggregation. AACL findings (short).

Chakraborty, Souvic, Goyal, Pawan and Mukherjee, Animesh (2022). Fast Few shot Self-attentive Semi-supervised Political Inclination Prediction. ICADL.

Sandhan, Jivnesh, Gupta, Ashish, Terdalkar, Hrishikesh, Sandhan, Tushar, Samanta, Suvendu, Behera, Laxmidhar and Goyal, Pawan (2022). A Novel Multi-Task Learning Approach for Context-Sensitive Compound Type Identification in Sanskrit. Coling.

Roy, Aniruddha, Thakur, Rupak Kumar, Sharma, Isha, Gupta, Ashim, Krishna, Amrith, Sarkar, Sudeshna and Goyal, Pawan (2022). Does Meta-learning Help mBERT for Few-shot Question Generation in a Cross-lingual Transfer Setting for Indic Languages? Coling (short paper).

Chakraborty, Souvic, Goyal, Pawan and Mukherjee, Animesh (2022). (Im)balance in the Representation of News? An Extensive Study on a Decade Long Dataset from India. SocInfo.

Kumar, Rishabh, Adiga, Devaraja, Ranjan, Rishav, Krishna, Amrith, Ramakrishnan, Ganesh, Goyal, Pawan and Jyothi, Preethi (2022). Linguistically Informed Post-processing for ASR Error correction in Sanskrit. Interspeech.

Santra, Bishal, Roychowdhury, Sumegh, Mandal, Aishik, Gurram, Vasu, Naik, Atharva, Gupta, Manish and Goyal, Pawan (2022). Representation Learning for Conversational Data using Discourse Mutual Information Maximization. NAACL.

Mullick, Ankan, Purkayastha, Sukannya, Goyal, Pawan and Ganguly, Niloy (2022). A Framework to Generate High-quality Datapoints for Multiple Novel Intent Detection. NAACL findings.

Mullick, Ankan, Pal, Shubhraneel, Nayak, Tapas, Lee, Seung-Cheol, Bhattacharjee, Satadeep and Goyal, Pawan (2022). Using Sentence-level Classification Helps Entity Extraction from Material Science Literature. LREC.

Roy, Kalyani, Goel, Avani and Goyal, Pawan (2022). Using Data Augmentation to Identify Relevant Reviews for Product Question Answering. The ACM Web Conference (poster).

Paul, Shounak, Goyal, Pawan and Ghosh, Saptarshi (2022). LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Legal Documents. AAAI.

Mukherjee, Rajdeep, Vishnu, Uppada, Peruri, Hari Chandana, Ganguly, Niloy, Bhattacharya, Sourangshu, Goyal, Pawan and Rudra, Koustav (2022). MTLVS: A Multi-Task Framework to Verify and Summarize Crisis-Related Microblogs. WSDM.

Mukherjee, Rajdeep, Nayak, Tapas, Butala, Yash, Bhattacharya, Sourangshu and Goyal, Pawan (2021). PASTE: A Tagging-free Decoding Framework using Pointer Networks for Aspect Sentiment Triplet Extraction. EMNLP.

Nandy, Abhilash, Sharma, Soumya, Maddhashiya, Shubham, Sachdeva, Kapil, Goyal, Pawan and Ganguly, Niloy (2021). Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework. EMNLP Findings.

Sandhan, Jivnesh, Adideva, Om, Komal, Digumarthi, Behera, Laxmidhar and Goyal, Pawan (2021). Evaluating Neural Word Embeddings for Sanskrit. World Sanskrit Conference, Computational Sanskrit and Digital Humanities Track.

Krishna, Amrith, Gupta, Ashim, Garasangi, Deepak, Sandhan, Jivnesh, Satuluri, Pavan Kumar and Goyal, Pawan (2021). Neural Approaches for Data Driven Dependency Parsing in Sanskrit. World Sanskrit Conference, Computational Sanskrit and Digital Humanities Track.

Sarkar, Sujoy, Krishna, Amrith and Goyal, Pawan (2021). Pre-annotation Based Approach for Development of a Sanskrit Named Entity Recognition Dataset. World Sanskrit Conference, Computational Sanskrit and Digital Humanities Track.

Das, Mithun, Saha, Punyajoy, Dutt, Ritam, Goyal, Pawan, Mukherjee, Animesh and Mathew, Binny (2021). You too Brutus! Trapping Hateful Users in Social Media: Challenges, Solutions & Insights. ACM HyperText. (Received HyperText Ted Nelson Best Newcomer Paper Award)

Singh, Shruti, Singh, Mayank and Goyal, Pawan (2021). COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews. JCDL (short paper).

Adiga, Devaraja, Kumar, Rishabh, Krishna, Amrith, Jyothi, Preethi, Ramakrishnan, Ganesh and Goyal, Pawan (2021). Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights. Findings of ACL.

Santra, Bishal, Potnuru, Anusha and Goyal, Pawan (2021). Hierarchical Transformer for Task Oriented Dialog Systems. NAACL-HLT.

Mathew, Binny, Saha, Punyajoy, Yimam, Seid Muhie, Biemann, Chris, Goyal, Pawan and Mukherjee, Animesh (2021). HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection. AAAI.

Hazra, Rima, Aggarwal, Hardik, Goyal, Pawan, Mukherjee, Animesh and Chakrabarti, Soumen (2021). Joint Autoregressive and Graph Models for Software and Developer Social Networks. ECIR.

Mukherjee, Rajdeep, Shetty, Shreyas, Chattopadhyay, Subrata, Maji, Subhadeep, Datta, Samik and Goyal, Pawan (2021). Reproducibility, Replicability and Beyond: Assessing Production Readiness of Aspect Based Sentiment Analysis in the Wild. ECIR (Reproducibility Track).

Paul, Shounak, Goyal, Pawan and Ghosh, Saptarshi (2020). Automatic Crime Identification from Facts: A Few Sentence-Level Crime Annotations is All You Need. COLING.

Krishna, Amrith, Gupta, Ashim, Garasangi, Deepak, Satuluri, Pavankumar and Goyal, Pawan (2020). Keep it Surprisingly Simple: A Simple First Order Graph Based Parsing Model for Joint Morphosyntactic Parsing in Sanskrit. EMNLP (short paper, PDF, Supplementary).

Mathew, Binny, Illendula, Anurag, Saha, Punyajoy, Sarkar, Soumya, Goyal, Pawan and Mukherjee, Animesh (2020). Hate begets Hate: A Temporal Study of Hate Speech. CSCW.

Mukherjee, Rajdeep, Peruri, Hari Chandana, Vishnu, Uppada, Goyal, Pawan, Bhattacharya, Sourangshu and Ganguly, Niloy (2020). Read what you need: Controllable Aspect-based Opinion Summarization of Tourist Reviews. SIGIR, July 25-30, 2020, Xian, China.

Maji, Subhadeep, Kumar, Rohan, Bansal, Manish, Roy, Kalyani and Goyal, Pawan (2020). Logic Constrained Pointer Networks for Interpretable Textual Similarity. IJCAI-PRICAI, July 11-17, 2020, Yokohama, Japan. (PDF)

Chakraborty, Souvic, Goyal, Pawan and Mukherjee, Animesh (2020). Aspect-based Sentiment Analysis of Scientific Reviews. JCDL, August 1-5, Xian, China.

Krishna, Amrith, Vidhyut, Shiv, Chawla, Dilpreet, Sambhavi, Sruti and Goyal, Pawan (2020). SHR++: An Interface for Morpho-syntactic annotation of Sanskrit Corpora. LREC, May 11-16, Marseille, France.

Jana, Abhik, Varimala, Nikhil and Goyal, Pawan (2020). Using Distributional Thesaurus Embedding for Co-hyponymy Detection. LREC, May 11-16, Marseille, France.

Mathew, Binny, Kumar, Navish, Goyal, Pawan and Mukherjee, Animesh (2020). Interaction dynamics between hate and counter users on Twitter. CODS-COMAD, January 5-7, Hyderabad, India.

Mathew, Binny, Maity, Suman Kalyan, Goyal, Pawan and Mukherjee, Animesh (2020). Competing Topic Naming Conventions in Quora: Predicting Appropriate Topic Merges and Winning Topics from Millions of Topic Pairs. CODS-COMAD, January 5-7, Hyderabad, India.

Kayal, Pratik, Singh, Mayank and Goyal, Pawan (2020). Weakly-Supervised Deep Learning for Domain Invariant Sentiment Classification. CODS-COMAD, January 5-7, Hyderabad, India (short paper).

Sharma, Soumya, Santra, Bishal, Jana, Abhik, Tokala, Santosh, Ganguly, Niloy and Goyal, Pawan (2019). Incorporating Domain Knowledge into Medical NLI using Knowledge Graphs. EMNLP-IJCNLP, November 3-7, Hong Kong (short paper).

Hazra, Rima, Singh, Mayank, Goyal, Pawan, Adhikari, Bibhas and Mukherjee, Animesh (2019). The rise and rise of interdisciplinary research: Understanding the interaction dynamics of three major fields -- Physics, Mathematics & Computer Science. ICADL, November 4-7, Kuala Lumpur (short paper).

Sandhan, Jivnesh, Krishna, Amrith, Goyal, Pawan and Behera, Laxmidhar (2019). Revisiting the Role of Feature Engineering for Compound Type Identification in Sanskrit. 6th ISCLS, October 23-25, IIT Kharagpur, WB, India.

Mathew, Binny, Dutt, Ritam, Maity, Suman Kalyan, Goyal, Pawan and Mukherjee, Animesh (2019). Deep Dive into Anonymity: Large Scale Analysis of Quora Questions. SocInfo, November 18-21, Doha, Qatar (Received the Best Paper Award Nomination).

Patro, Jasabanta, Baruah, Sabyasachee, Gupta, Vivek, Choudhury, Monojit, Goyal, Pawan and Mukherjee, Animesh (2019). Characterizing the spread of exaggerated health news content over social media. ACM HyperText (HT), September 17 - 20, Hof, Germany (poster).

Jana, Abhik, Puzyrev, Dima, Panchenko, Alexander, Goyal, Pawan, Biemann, Chris and Mukherjee, Animesh (2019). On the Compositionality Prediction of Noun Phrases using Poincaré embeddings. ACL, July 28th - August 3rd, Florence, Italy. (PDF)

Krishna, Amrith, Sharma, Vishnu Dutt, Santra, Bishal, Satuluri, Pavan Kumar and Goyal, Pawan (2019). Poetry to Prose Conversion in Sanskrit as a Linearisation Task: A case for Low-Resource Languages. ACL, July 28th - August 3rd, Florence, Italy (short paper). (PDF)

Maji, Subhadeep, Kumar, Rohan, Bansal, Manish, Roy, Kalyani, Kumar, Mohit and Goyal, Pawan (2019). Addressing Vocabulary Gap in E-commerce Search. ACM SIGIR, July 21 - July 25, Paris, France (short paper). (PDF)

Mathew, Binny, Dutt, Ritam, Goyal, Pawan and Mukherjee, Animesh (2019). Spread of hate speech in online social media. ACM WebSci, June 30 - July 3, Boston, MA, USA. (PDF, Best paper Award (honorable mention))

Mathew, Binny, Saha, Punyajoy, Tharad, Hardik, Rajgaria, Subham, Singhania, Prajwal, Maity, Suman Kalyan, Goyal, Pawan and Mukherjee, Animesh (2019). Thou shalt not hate: Countering online hate speech. ICWSM, June 11-14, Munich, Germany. (PDF)

Singh, Mayank, Sarkar, Rajdeep, Vyas, Atharva, Goyal, Pawan, Mukherjee, Animesh and Chakrabarti, Soumen (2019). Automated Early Leaderboard Generation From Comparative Tables. ECIR, April 14-18, Cologne, Germany. (PDF)

Palod, Priyank, Patwari, Ayush, Bahety, Sudhanshu, Bagchi, Saurabh and Goyal, Pawan (2019). Misleading Metadata Detection on YouTube. ECIR, April 14-18, Cologne, Germany (short paper). (PDF, Poster)

Gupta, Ashim, Goyal, Pawan, Sarkar, Sudeshna and Gattu, Nandeshwar (2019). Fully Contextualized Biomedical NER. ECIR, April 14-18, Cologne, Germany (short paper). (PDF)

Maity, Suman Kalyan, Panigrahi, Abhishek, Ghosh, Sayan, Banerjee, Arundhati, Goyal, Pawan and Mukherjee, Animesh (2019). DeepTagRec: A Content-cum-User based Tag Recommendation Framework for Stack Overflow. ECIR, April 14-18, Cologne, Germany (short paper). (PDF)

Jana, Abhik, Mukherjee, Animesh and Goyal, Pawan (2019). Detecting Reliable Novel Word Senses: A Network-Centric Approach. ACM SAC Knowledge and Language Processing Track, April 8-12, Limassol, Cyprus. (PDF)

Krishna, Amrith, Santra, Bishal, Bandaru, Sasi Prasanth, Sahu, Gaurav, Sharma, Vishnu Dutt, Satuluri, Pavan Kumar and Goyal, Pawan (2018). Free as in Free Word Order: An Energy Based Model for Word Segmentation and Morphological Tagging in Sanskrit. EMNLP, Brussels, Belgium, October 31-November 4, 2018. (PDF)

Maity, Suman Kalyan, Chakraborty, Aishik, Goyal, Pawan and Mukherjee, Animesh (2018). Opinion Conflicts: An Effective Route to Detect Incivility in Twitter. ACM CSCW, November 3 - 7, New York City's Hudson River. (PDF)

Krishna, Amrith, Majumder, Bodhisattwa Prasad, Bhat, Rajesh and Goyal, Pawan (2018). Upcycle Your OCR: Reusing OCRs for Post-OCR Text Correction in Romanised Sanskrit. CoNLL, October 31 - November 1, Brussels, Belgium. (PDF)

Jana, Abhik, Kanojia, Pranjal, Goyal, Pawan and Mukherjee, Animesh (2018). WikiRef: Wikilinks as a route to recommending appropriate references for scientific Wikipedia pages. Coling, August 20-26, Santa Fe, New Mexico, USA. (PDF)

Rudra, Koustav, Goyal, Pawan, Ganguly, Niloy, Mitra, Prasenjit and Imran, Mohammad (2018). Identifying Sub-events and Summarizing Information during Disasters. SIGIR, July 8-12, Ann Arbor, Michigan. (PDF)

Singh, Mayank, Dogga, Pradeep, Patro, Sohan, Barnwal, Dhiraj, Dutt, Ritam, Haldar, Rajarshi, Goyal, Pawan and Mukherjee, Animesh (2018). CL Scholar: The ACL Anthology Knowledge Graph Miner. NAACL-HLT (Demo), June 1-6, New Orleans, Louisiana. (PDF)

Pani, Sandeep Kumar, R, Naresh, Goyal, Pawan and Bhowmick, Plaban Kumar (2018). Learning to Extract Comparison Points of Entity Pairs from Wikipedia Articles. JCDL (Poster), June 3-6, Fort Worth, Texas. (PDF)

Jana, Abhik and Goyal, Pawan (2018). Can Network Embedding of Distributional Thesaurus be Combined with Word Vectors for Better Representation? NAACL-HLT, June 1-6, New Orleans, Louisiana. (PDF)

Jana, Abhik and Goyal, Pawan (2018). Network Features Based Co-hyponymy Detection. 11th International Conference on Language Resources and Evaluation (LREC), May 7-12, Miyazaki, Japan. (PDF)

Reddy, Vikas, Krishna, Amrith, Sharma, Vishnu Dutt, Gupta, Prateek, R, Vineeth M and Goyal, Pawan (2018). Building a Word Segmenter for Sanskrit Overnight. 11th International Conference on Language Resources and Evaluation (LREC), May 7-12, Miyazaki, Japan. (PDF)

Majumder, Anirban, Pande, Abhay, Vonteru, Kondalarao, Gangwar, Abhishek, Maji, Subhadeep, Bhatia, Pankaj and Goyal, Pawan (2018). Automated Assistance in E-commerce: An Approach based on Category-Sensitive Retrieval. 40th European Conference on Information Retrieval (ECIR), March 26-29, Grenoble, France (short paper). (PDF)

Krishna, Amrith, Majumder, Bodhisattwa Prasad and Goyal, Pawan (2018). An 'Ekalavya' Approach to Learning Context Free Grammar Rules for Sanskrit Using Adaptor Grammar. 17th World Sanskrit Conference (WSC), section on Computational Sanskrit & Digital Humanities, July 9-13, Vancouver, Canada. (PDF)

Mullick, Ankan, Dastider, Surjodoy G, Maheshwari, Shivam, Sahoo, Srotaswini, Maity, Suman Kalyan, C, Soumya and Goyal, Pawan (2018). Identifying Opinion and Fact Subcategories from the Social Web. ACM International Conference on Supporting Group Work (Group), January 7-10, Florida, USA (poster). (PDF)

Arora, Jatin, Agrawal, Sumit, Goyal, Pawan and Pathak, Sayan (2017). Extracting Entities of Interest from Comparative Product Reviews. CIKM, Singapore, November 6-10, 2017 (short paper). (PDF)

Singh, Mayank, Sarkar, Rajdeep, Goyal, Pawan, Mukherjee, Animesh and Chakrabarti, Soumen (2017). Relay-Linking Models for Prominence and Obsolescence in Evolving Networks. KDD, Halifax, Canada, August 10-13, 2017. (Arxiv)

Ghosh, Krishnendu, Bhowmick, Plaban and Goyal, Pawan (2017). Using Re-ranking to Boost Deep Learning based Community Question Retrieval. Web Intelligence (WI), Leipzig, Germany, August 23-26, 2017. (PDF)

Mullick, Ankan, Goyal, Pawan, Ganguly, Niloy and Gupta, Manish (2017). Extracting Social Lists from Twitter. ASONAM, Sydney, Australia, July 31-August 3, 2017 (short paper). (PDF)

Singh, Mayank, Jaiswal, Ajay, Shree, Priya, Pal, Arindam, Mukherjee, Animesh and Goyal, Pawan (2017). Understanding the Impact of Early Citers on Long-Term Scientific Impact. ACM/IEEE Joint Conference on Digital Libraries (JCDL), Toronto, Ontario, Canada, June 19-23, 2017. (Arxiv)

Jana, Abhik, Mooriyath, Sruthi, Mukherjee, Animesh and Goyal, Pawan (2017). WikiM: Metapaths based Wikification of Scientific Abstracts. ACM/IEEE Joint Conference on Digital Libraries (JCDL), Toronto, Ontario, Canada, June 19-23, 2017. (Arxiv)

Singh, Mayank, Niranjan, Abhishek, Gupta, Divyansh, Bakshi, Nikhil Angad, Mukherjee, Animesh and Goyal, Pawan (2017). Citation sentence reuse behavior of scientists: A case study on massive bibliographic text dataset of computer science. ACM/IEEE Joint Conference on Digital Libraries (JCDL), Toronto, Ontario, Canada, June 19-23, 2017 (short paper). (Arxiv)

Mullick, Ankan, Maheshwari, Shivam, C, Soumya, Goyal, Pawan and Ganguly, Niloy (2017). A Generic Opinion-Fact Classifier with Application in Understanding Opinionatedness in Various News Sections. 26th World Wide Web (WWW), April 3rd - 7th, Perth, Australia (poster). (PDF)

Maity, Suman Kalyan, Chakraborty, Aishik, Goyal, Pawan and Mukherjee, Animesh (2017). Detection of Sockpuppets in Social Media. 20th ACM CSCW, Portland, OR, Feb 25th - March 1st, 2017 (poster highlights). (PDF)

Krishna, Amrith, Santra, Bishal, Satuluri, Pavan Kumar, Bandaru, Sasi Prasanth, Faldu, Bhumi, Singh, Yajuvendra and Goyal, Pawan (2016). Word Segmentation in Sanskrit Using Path Constrained Random Walk. 26th International Conference on Computational Linguistics (Coling), Osaka, Japan, December 11-16, 2016 (Poster). (PDF)

Singh, Mayank, Barua, Barnopriyo, Palod, Priyank, Garg, Manvi, Satapathy, Sidhartha, Bushi, Samuel, Ayush, Kumar, Rohith, Krishna Sai, Gamidi, Tulasi, Goyal, Pawan and Mukherjee, Animesh (2016). OCR++: A Robust Framework For Information Extraction from Scholarly Articles. 26th International Conference on Computational Linguistics (Coling), Osaka, Japan, December 11-16, 2016 (Poster). (PDF)

Sikchi, Abhishek, Goyal, Pawan and Datta, Samik (2016). peq: An explainable, specification-based, aspect-oriented product comparator for e-commerce. 25th ACM Conference on Information and Knowledge Management (CIKM), Indianapolis, October 24-28, 2016 (short paper). (PDF)

Rudra, Koustav, Banerjee, Siddhartha, Ganguly, Niloy, Goyal, Pawan, Imran, Mohammad and Mitra, Prasenjit (2016). Summarizing Situational Tweets in Crisis Scenario. 27th ACM Conference on Hypertext and Social Media, Halifax, Canada, July 10-13. (PDF)

Bhattacharya, Paheli, Goyal, Pawan and Sarkar, Sudeshna (2016). Using Word Embeddings for Query Translation for Hindi to English Cross Language Information Retrieval. 17th International Conference on Intelligent Text Processing and Computational Linguistics, (CICLing), Konya, Turkey, April 3-9. (Arxiv)

Chakraborty, Tanmoy, Krishna, Amrith, Singh, Mayank, Ganguly, Niloy, Goyal, Pawan and Mukherjee, Animesh (2016). FeRoSA: A Faceted Recommendation System for Scientific Articles. 20th Pacific Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Auckland, New Zealand, April 19-22. (PDF)

Raja, Kalpana, Dasot, Naman, Goyal, Pawan and Jonnalagadda, Siddhartha R (2016). Towards Evidence-Based Precision Medicine: Extracting Population Information from Biomedical Text using Binary Classifiers and Syntactic Patterns. AMIA 2016 Joint Summits on Translational Science, San Francisco, USA, March 21-24. (PDF)

Singh, Mayank, Patidar, Vikas, Kumar, Suhansanu, Chakraborty, Tanmoy, Mukherjee, Animesh and Goyal, Pawan (2015). The role of citation context in predicting long-term citation profiles: an experimental study based on a massive bibliographic text dataset. 24th ACM Conference on Information and Knowledge Management (CIKM), Melbourne, Australia, October 19-23. (PDF)

Rudra, Koustav, Ghosh, Shubham, Ganguly, Niloy, Goyal, Pawan and Ghosh, Saptarshi (2015). Extracting Situational Information from Microblogs during Disaster Events: A Classification-Summarization Approach. 24th ACM Conference on Information and Knowledge Management (CIKM), Melbourne, Australia, October 19-23. (PDF)

Chakraborty, Tanmoy, Patranabis, Sikhar, Goyal, Pawan and Mukherjee, Animesh (2015). On the formation of circles in co-authorship networks. 21st ACM SIGKDD, Sydney, Australia, August 10-13. (Arxiv)

Singh, Mayank, Chakraborty, Tanmoy, Mukherjee, Animesh and Goyal, Pawan (2015). ConfAssist: A Conflict resolution framework for assisting the categorization of Computer Science conferences. ACM/IEEE Joint Conference on Digital Libraries (JCDL), Knoxville, Tennessee, USA, June 21-25 (poster). (PDF)

Maity, Suman Kalyan, Gupta, Abhishek, Goyal, Pawan and Mukherjee, Animesh (2015). A stratified learning approach for predicting the popularity of Twitter Idioms. The 9th AAAI Conference on Web and Social Media (ICWSM), Oxford, UK, May 26-29 (poster). (PDF)

Krishna, Amrith and Goyal, Pawan (2015). Towards automating the generation of derivative nouns in Sanskrit by simulating Panini. 16th World Sanskrit conference, Sanskrit and the IT world, Bangkok, Thailand, June 27 - July 02. (Arxiv)

Chakraborty, Tanmoy, Kumar, Suhansanu, Goyal, Pawan, Ganguly, Niloy and Mukherjee, Animesh (2014). Towards a Stratified Learning Approach to Predict Future Citation Counts. ACM/IEEE Joint Conference on Digital Libraries (JCDL), London, UK, pp. 351 - 360. (PDF)

Goyal, Pawan and Kulkarni, Amba (2014). Converting Phrase Structures to Dependency Structures in Sanskrit. 25th International Conference on Computational Linguistics (COLING), August 23-29, Dublin, Ireland, pp. 1834 - 1843.

Mitra, Sunny, Mitra, Ritwik, Riedl, Martin, Biemann, Chris, Mukherjee, Animesh and Goyal, Pawan (2014). That’s sick dude!: Automatic identification of word sense change across different timescales. 52nd Annual Meeting of the Association for Computational Linguistics (ACL), June 22-27, Baltimore, USA, pp. 1020 - 1029.

Huet, Gérard and Goyal, Pawan (2013). Design of a Lean Interface for Sanskrit Corpus Annotation. In the proceedings of ICON 2013.

Goyal, Pawan, Huet, Gérard, Kulkarni, Amba, Scharf, Peter and Bunker, Ralph (2012). A Distributed Platform for Sanskrit Processing. In Proceedings of COLING, IIT Bombay, INDIA, December 2012. pp. 1011-1028. (PDF)

Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM (2009). Entailment of Causal Queries in Narratives Using Action Language. Proceedings of the International Conference on Knowledge Discovery and Information Retrieval, October 04-06, Funchal, Portugal. pp. 112-118.

Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM (2009). An Information Retrieval Model Based On Automatically Learnt Concept Hierarchies. Proceedings of the IEEE International Conference on Semantic Computing, Berkeley, CA, USA. pp. 458-465.

Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM (2009). An Information Retrieval Approach Based on Semantically Adapted Vector Space Model. Proceedings of the 2009 International Conference on Natural Language Processing, Hyderabad, INDIA, December 14-17.

Goyal, Pawan, Arora, Vipul, Behera, Laxmidhar and McGinnity, TM (2008). Tagging of Text with Emotion for Emotional Speech Synthesis. Proceedings of 8th Information Technology and Telecommunications Conference, GMIT, Galway. pp. 111-118.

Workshop Papers

Sandhan, Jivnesh, Daksh, Ayush, Paranjay, Om Adideva, Behera, Laxmidhar and Goyal, Pawan (2022). Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages. LaTeCH-CLfL, workshop at Coling'22.

Roy, Kalyani, Balapanuru, Vineeth Kumar, Nayak, Tapas and Goyal, Pawan (2022). Investigating the Generative Approach for Question Answering in E-Commerce. E-commerce and NLP (ECNLP), workshop at ACL 2022 (short paper).

Soumya Sharma, Tapas Nayak, Yash Butala, Koustuv Dasgupta, Goyal, Pawan and Niloy Ganguly (2022). A Generative Approach for Financial Causality Extraction. FinWeb, workshop at The Web Conference 2022 (poster paper).

Tapas Nayak, Soumya Sharma, Arusarka Bose, Ajay Kumar Meena, Koustuv Dasgupta, Niloy Ganguly and Goyal, Pawan (2022). FinRED: A Dataset for Relation Extraction in Financial Domain. FinWeb, workshop at The Web Conference 2022 (poster paper).

Kalyani Roy, Goyal, Pawan and Pandey, Manish (2021). Attribute Value Generation from Product Title using Language Models. E-commerce and NLP (ECNLP), workshop at ACL 2021 (short paper).

Kar, Debanjana, Sarkar, Sudeshna and Goyal, Pawan (2021). ArgFuse: A Weakly-Supervised Framework for Document-Level Event Argument Aggregation. Challenges and Applications of Automated Extraction of Socio-political Events from Text, Workshop at ACL 2021.

Sandhan, Jivnesh, Krishna, Amrith, Gupta, Ashim, Behera, Laxmidhar and Goyal, Pawan (2021). A Little Pretraining Goes a Long Way: A Case Study on Dependency Parsing Task for Low-resource Morphologically Rich Languages. EACL SRW.

Kalyani Roy, Smit Shah, Nithish Pai, Jaidam Ramtej, Prajit Nadkarni, Jyotirmoy Banerjee, Goyal, Pawan and Surender Kumar (2020). Using Large Pretrained Language Models for Answering User Queries from Product Specifications. E-commerce and NLP, workshop at ACL 2020 (short paper).

Gupta, Ashim, Krishna, Amrith, Goyal, Pawan and Hellwig, Oliver (2020). Evaluating Neural Morphological Taggers for Sanskrit. SIGMORPHON, workshop at ACL 2020 (short paper).

Mondal, Ishani, Purkayastha, Sukannya, Sarkar, Sudeshna, Goyal, Pawan, Pillai, Jitesh, Bhattacharyya, Amitava and Gattu, Mahanandeeshwar (2019). Medical Entity Linking using Triplet Network. ClinicalNLP, workshop at NAACL 2019, Minneapolis, USA, June 2-7, 2019.

Krishna, Amrith, Satuluri, Pavankumar, Ponnada, Harshavardhan, Ahmed, Muneeb, Arora, Gulab, Hiware, Kaustubh and Goyal, Pawan (2017). A Graph Based Semi-Supervised Approach for Analysis of Derivational Nouns in Sanskrit. TextGraphs, workshop at ACL 2017, Vancouver, Canada, July 30-August 4, 2017.

Mathew, Binny, Maity, Suman Kalyan, Sarkar, Pratip, Mukherjee, Animesh and Goyal, Pawan (2017). Adapting predominant and novel sense discovery algorithms for identifying corpus-specific sense differences. TextGraphs, workshop at ACL 2017, Vancouver, Canada, July 30-August 4, 2017.

Krishna, Amrith, Satuluri, Pavankumar and Goyal, Pawan (2017). A Dataset for Sanskrit Word Segmentation. LaTeCH-CLfL, workshop at ACL 2017, Vancouver, Canada, July 30-August 4, 2017.

Singh, Mayank, Dan, Soham, Agarwal, Sanyam, Goyal, Pawan and Mukherjee, Animesh (2017). AppTechMiner: Mining Applications and Techniques from Scientific Articles. 6th International Workshop on Mining Scientific Publications, workshop at JCDL 2017, Toronto, Ontario, Canada, June 19-23, 2017.

Krishna, Amrith, Satuluri, Pavankumar, Sharma, Shubham, Kumar, Apurv and Goyal, Pawan (2016). Compound Type Identification in Sanskrit: What Roles do the Corpus and Grammar Play? WSSANLP, Workshop at Coling 2016, Osaka, Japan, December 11-16.

Bhattacharya, Paheli, Goyal, Pawan and Sarkar, Sudeshna (2016). Query Translation for Cross-Language Information Retrieval using Multilingual Word Clusters. WSSANLP, Workshop at Coling 2016, Osaka, Japan, December 11-16.

Mullick, Ankan, Goyal, Pawan and Ganguly, Niloy (2016). A graphical framework to detect and categorize diverse opinions from online news. Computational Modeling of People's Opinions, Personality, and Emotions in Social Media (PEOPLES), Workshop at Coling 2016, Osaka, Japan, December 11-16 (Poster).

Pathak, Arkanath, Goyal, Pawan and Bhowmick, Plaban (2016). A Two-Phase Approach Towards Identifying Argument Structure in Natural Language. Natural Language Processing Techniques for Educational Applications (NLP-TEA-3), Workshop at Coling 2016, Osaka, Japan, December 11-16.

Rudra, Koustav, Banerjee, Siddhartha, Ganguly, Niloy, Goyal, Pawan, Imran, Muhammad and Mitra, Prasenjit (2016). Summarizing Situational and Topical Information During Crises. Social Web for Disaster Management, Workshop at CIKM 2016, Indianapolis, USA, October 24-28.

Rajkumar, Pujari, Desai, Swara, Ganguly, Niloy and Goyal, Pawan (2014). A Novel Two-stage Framework for Extracting Opinionated Sentences from News Articles. In the proceedings of TextGraphs-9, workshop at EMNLP 2014, Doha, Qatar, October 25-29, pp. 25 - 33.

Quasthoff, Uwe, Mitra, Ritwik, Mitra, Sunny, Eckart, Thomas, Goldhahn, Dirk, Goyal, Pawan and Mukherjee, Animesh (2014). Large Web Corpora of High Quality for Indian Languages. LREC Workshop on Indian Language Data: Resources and Evaluation, Reykjavik, Iceland (Poster).

Book Chapters

Rudra, Koustav, Goyal, Pawan, Ganguly, Niloy, Mitra, Prasenjit and Imran, Muhammad (2023). Role of Crisis Information Summarization Through Microblogs in Disaster Management. International Handbook of Disaster Research, pp. 1 - 21.

Scharf, Peter, Goyal, Pawan, Ajotikar, Anuja and Ajotikar, Tanuja (2015). Voice, preverb, and transitivity restrictions in Sanskrit verb use. Sanskrit Syntax, pp. 157 - 202.

Scharf, Peter, Ajotikar, Anuja, Savardekar, Sampada and Goyal, Pawan (2015). Distinctive features of poetic syntax: preliminary results. Sanskrit Syntax, pp. 305 - 324.

Melnad, Keshav, Goyal, Pawan and Scharf, Peter (2015). Meter identification of Sanskrit verse. Sanskrit Syntax, pp. 325 - 346.

Goyal, Pawan and Huet, Gérard (2013). Completeness Analysis of a Sanskrit Reader. In Recent Researches in Sanskrit Computational Linguistics, DK Publisher. pp. 130-171.

Goyal, Pawan and Sinha, RMK (2009). Translation Divergence Between English-Sanskrit-Hindi Language Pairs. Proceedings of Third Sanskrit Computational Linguistic Symposium, Springer-Verlag. pp. 134-143.

Goyal, Pawan and Sinha, RMK (2009). A Study towards Design of an English to Sanskrit Machine Translation System. Sanskrit Computational Linguistics: Revised, Selected and Invited Papers, Springer-Verlag. pp. 287-305.

Goyal, Pawan, Kulkarni, Amba and Behera, Laxmidhar (2009). Computer Simulation of Ashtadhyayi: Some insights. Sanskrit Computational Linguistics: Revised, Selected and Invited Papers, Springer-Verlag. pp. 139-160.

Goyal, Pawan, Arora, Vipul and Behera, Laxmidhar (2009). Analysis of Sanskrit Text: Parsing and Semantic Relations. Sanskrit Computational Linguistics: Revised, Selected and Invited Papers, Springer-Verlag, pp. 200 - 218.