Present Affiliation: Associate Professor (A K Singh Chair)
Department of Computer Science and Engineering,
Indian Institute of Technology, Kharagpur, Poschimbongo, India -- 721302,
Humboldt Fellow (Experienced Researchers) -- (2018 - 2022)
Phone: +91-3222283472 (Office) +91-3222283473 (Residence)
Presently I am an Associate Professor in the Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur. Prior to this, I was working as a post doctoral researcher in the Complex Systems Lagrange Lab, ISI Foundation, Italy. I received my PhD from the Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur with a thesis on self-organization of human speech sound inventories. My main research interests center around content governance which includes (i) content moderation (harmful content analysis, detection, and mitigation), (ii) content dissemination (fairness issues in e-commerce platforms and interfaced systems like facial recognition, automatic speech recognition etc.), and (iii) content maintenance (quality analysis and improvement of encyclopaedia like Wikipedia and large software systems like Ubuntu releases). In all these applications, I extensively use concepts from NLP, IR, and network science.
Note to aspiring applicants:
- For Post-Doc: The Institute now has a regularized procedure to appoint post-docs. Interested candidates may directly write to me with a copy of their recent resume and one letter of recommendation. The formal process for application is outlined here. You can apply here.
- For PhD: The Institute has a centralized policy for PhD entrance. You can apply here. The syllabus for the entrance test is here.
- For MS: For MS you have to first get through some project. For project advertisements from CNeRG see here. Once you are already a project staff you can apply for MS here. The syllabus for the entrance test is here. However, these days I am reluctant of taking MS students since I am more interested in longer commitments, e.g., sponsored/direct PhDs.
- For internships: We only offer summer internships; therefore, do not apply for internships in fall. In general, your internship requests directly to my mail-box will never be acknowledged! Therefore, rather apply online.
New paper accepted in ICWSM 2023: Dummy Grandpa, do you know anything?”: Identifying and Characterizing Ad hominem Fallacy Usage in the Wild.
I shall serve as an Area Chair of AAAI 2023.
I shall serve as an Associate Chair of CSCW 2023.
New paper accepted in NeurIPS 2022 Datasets and Benchmarks: MACD: Multilingual Abusive Comment Detection at Scale for Indic Languages.
New paper accepted in ICADL 2022: Fast Few shot Self-attentive Semi-supervised Political Inclination Prediction.
New paper accepted in AACL-IJCNLP 2022: Hate Speech and Offensive Language Detection in Bengali.
New paper accepted in ACM TALLIP: Transfer Learning for Low Resource Multilingual Relation Classification.
New paper accepted in ECML-PKDD 2022: Is this bug severe? A text-cum-graph based model for bug severity prediction.
New paper accepted in ECML-PKDD 2022: Placing (Historical) Facts on a Timeline: A Classification cum Co-ref Resolution Approach.
New paper accepted in ECAI-IJCAI 2022: CounterGEDI: a controllable approach to generate polite, detoxified and emotional counterspeech.
New late breaking paper accepted in SocInfo 2022: Decoding Demographic un-fairness from Indian Names.
New paper accepted in SocInfo 2022: (Im)balance in the Representation of News? An Extensive Study on a Decade Long Dataset from India.
New paper accepted in ACM Hypertext 2022: Data Bootstrapping Approaches to Improve Low Resource Abusive Language Detection for Indic Languages.
New paper accepted in SNAM: Constant Community Identification in Million Scale Networks.
Siddharth receives the very prestigious Prime Minister Research Fellowship (PMRF).
New paper accepted in NAACL 2022 (Findings): CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection.
New paper accepted in LREC 2022: HateCheckHIn: Evaluating Hindi Hate Speech Detection Models.
I shall serve as the Tutorial Co-Chair of CoDS-COMAD 2022.
I shall serve as the Track Co-Chair (Algorithm and Theory track) of IEEE MASS 2022.
I shall serve as the Associate Editor of Advances in Complex systems for the next 2 years. Please consider submitting your paper to this excellent journal!
New paper accepted in The WebConf 2022: Alexa, in you, I trust! Fairness and Interpretability Issues in E-commerce Search through Smart Speakers.
Wikimedia foundation invites us to present our work on "patterns of quality changes in Wikipedia articles" in their research showcase series.
New case study paper accepted in CHI 2022: Marching with the Pink Parade: Evaluating Visual Search Recommendations for Non-binary Clothing Items.
New paper accepted in ICWSM 2022: Two-Face: Adversarial Audit of Commercial Face Recognition Systems.
New paper accepted in CSCW 2022: Quality Change: norm or exception? Measurement, Analysis and Detection of Quality Change in Wikipedia.
New paper accepted in IEEE TCSS: FaiRIR: Mitigating Exposure Bias from Related Item Recommendations in Two-Sided Platforms.
We shall be delivering a tutorial titled Hate speech: Detection, Mitigation and Beyond at AAAI 2022.
We receive this year's Best Student Paper Award at ICADL for our paper: When expertise gone missing: Uncovering the loss of prolific contributors in Wikipedia.
New short paper accepted in ASONAM 2021: Constant Community Identification in Million Scale Networks Using Image Thresholding Algorithms.
We receive this year's Ted Nelson Best Newcomer Paper Award at ACM Hypertext for our paper: You too Brutus! Trapping Hateful Users in Social Media: Challenges, Solutions & Insights.
New paper accepted in CoNLL 2021: A Data Bootstrapping Recipe for Low-Resource Multilingual Relation Classification.
New paper accepted in ICADL 2021: When expertise gone missing: Uncovering the loss of prolific contributors in Wikipedia.
New survey paper accepted in WIREs Data Mining and Knowledge Discovery journal: Mining the Online Inphosphere: A survey.
New paper accepted in The Hypertext 2021: Debiasing Multilingual Word Embeddings: A Case Study of Three IndianLanguages.
New paper accepted in The Hypertext 2021: You too Brutus! Trapping Hateful Users in Social Media: Challenges, Solutions & Insights.
Our ICWSM 2021 tutorial on Hate speech: Detection, Mitigation and Beyond can be accessed here.
Team Hate-Alert has been invited to showcase their CSCW 2020 work on Hate begets Hate: A Temporal Study of Hate Speech at the India HCI 2021.
Team Hate-Alert to set the scene at "Incitement to violence in E2EE platforms" workshop, organised by the Stanford Internet Observatory.
Punyajoy receives the very prestigious Prime Minister Research Fellowship (PMRF).
New abstract (peer-reviewed) accepted in IC2S2 2021: "Short is the Road that Leads from Fear to Hate'': Fear Speech in Indian WhatsApp Groups.
New paper accepted in The SNAM Journal: A core-periphery structure based network embedding approach.
We shall be delivering a tutorial titled Hate speech: Detection, Mitigation and Beyond at ICWSM 2021.
New paper accepted in The WebConf 2021: "Short is the Road that Leads from Fear to Hate'': Fear Speech in Indian WhatsApp Groups.
New paper accepted in AAAI 2021: HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection.
New paper accepted in ACM FAccT 2021: When the Umpire is also a Player: Bias in Private Label Product Recommendations on E-commerce Marketplaces.
New paper accepted in ECIR 2021: Joint Autoregressive and Graph Models for Software and Developer Social Networks.
New invited article in ACM SIGWEB Newsletters: Hatespeech in online social media.
We finish at world rank 18 in the Facebook+Drivendata Hateful Memes competition.
New paper accepted in EMNLP 2020: NwQM: A neural quality assessment framework for Wikipedia.
New paper accepted in PLoS ONE: Unsupervised Ranking of Clustering Algorithms by INFOMAX.
New paper accepted in CSCW 2020: Hate begets Hate: A Temporal Study of Hate Speech.
New paper accepted in ECML-PKDD 2020: A Deep Dive into Multilingual Hate Speech Classification.
We win the very competitive worldwide Facebook+Social Science One data grant. This means "Unprecedented (a trillion!) Facebook URLs Dataset" now available to us with an access license of 12 months.
New medium article on #COVID-19 test: Aggressive and widespread: On systematically estimating the number of tests to detect COVID-19 infections.
New demo paper accepted in JCDL 2020: Gandhipedia: A one-stop AI-enabled portal for browsing Gandhian literature, life-events and his social network.
New short paper accepted in ACL 2020: Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection.
Due to the COVID-19 outbreak and the current administrative circular issued by IIT Kharagpur, AI and Ethics classes will be held in online mode through Zoom until further notices.
New paper accepted in JCDL 2020: Aspect-based Sentiment Analysis of Scientific Reviews.
New paper accepted in JCDL 2020: Characterising authors on the extent of their paper acceptance: A case study of the Journal of High Energy Physics.
New paper accepted in JCDL 2020: Identification, Tracking and Impact: Understanding the trade secret of catchphrases.
New paper accepted in Journal of Informetrics: Analysis of Reference and Citation Copying in Evolving Bibliographic Networks.
New paper accepted in Journal of Information Processing and Management: Network Measures: A New Paradigm Towards Reliable Novel Word Sense Detection.
New research track paper accepted in CoDS-COMAD 2020: Competing Topic Naming Conventions in Quora: Predicting Appropriate Topic Merges and Winning Topics from Millions of Topic Pairs.
New research track paper accepted in CoDS-COMAD 2020: Interaction dynamics between hate and counter users on Twitter.
New industry track paper accepted in CoDS-COMAD 2020: Innovation and Revenue: Deep Diving into the Temporal Rank-shifts of Fortune 500 Companies.