I am a Ph.D. student in the Department of Computer Science and Engineering, IIT Kharagpur, since Jan 2015. My supervisor is Dr. Pawan Goyal. My basic interest lies in the study of Natural Language Processing, Cognitive Computing.
Research | Publications | Academics |Professional Experience| Contact |
RESEARCH ABSTRACT [Back to top]
Word sense change detection
Word Sense Induction (WSI) methods induce word senses from raw text by clustering word occurrences on the basis of the distributional hypothesis. Approaches based on context clustering either use a context vector for each word and cluster it into various groups denoting the senses or build a word co-occurrence graph and cluster the open neighborhood to obtain the word senses. Word sense discovery methods, on the other hand, attempt to discover novel senses by comparing sense clusters across two time-periods. For sense induction, one can use a distributional thesauri based network from a large dataset such as Google syntactic n-grams. After clustering the network for each target word, different clusters for the target word are considered to denote various senses. Such sense clusters can be constructed across various time-points and new senses can be discovered by comparing the two sets of clusters. On manual inspection, however, it appears that each “new sense” cluster does not always necessarily indicate a sense . Our proposal is to use network properties to enhance the existing framework to detect the word sense change more accurately. We take the words in the sense cluster of a particular target word as an ego network for that word, and measure the network properties across different time points for this ego network. We see that it helps to improve the accuracy of word sense change detection.
Predicting references for Wikipedia pages
Wikipedia is a free encyclopedia, written collaboratively by the people who use it. It consists of millions of articles in more than 270 languages. So it is a huge knowledge base, which is evolving every moment. Researchers have also been working on enriching this knowledge base automatically. Each Wikipedia page usually has several sections like introduction, history, references, external links etc. Our plan is to enrich the reference section of Wikipedia pages, so that it helps the reader to refer to specific document to get more information about the article. Our objective is to predict the reference documents and add them to the reference section of Wikipedia article. In order to do that we consider Computer Science related articles (Wikipedia pages) of a particular timestamp and try to predict computer science related papers, which can be added to the reference section in future.
PUBLICATIONS [Back to top]
A Jana, P Kanojiya, P Goyal, A Mukherjee, WikiRef: Wikilinks as a route to recommending appropriate references for scientific Wikipedia pages, in COLING 2018.(Accepted, yet to appear)
A Jana, P Goyal, Can Network Embedding of Distributional Thesaurus be Combined with Word Vectors for Better Representation?, in NAACL 2018. [PDF]
A Jana, P Goyal, Network Features Based Co-hyponymy Detection, in LREC 2018. [PDF]
A Jana, S Mooriyath, A Mukherjee, P Goyal, WikiM: Metapaths based Wikification of Scientific Abstracts, in JCDL 2017. [PDF]
M Sinha, T Dasgupta, A Jana, A Basu, Design and Development of a Bangla Semantic Lexicon and Semantic Similarity Measure, in International Journal of Computer Applications June 2014 Edition. [PDF]
M Sinha, A Jana, T Dasgupta, A Basu, A New Semantic Lexicon and Similarity Measure in Bangla, in Proceedings of the 3rd Workshop on Cognitive Aspects of the Lexicon (CogALex-III), pages 171-182, COLING 2012, Mumbai,December 2012. [PDF]
M Biswas, A Jana, R Paul, HN Saha, A Secure Routing Protocol Based on Fidelity, in International Conference on Scientific Paradigm Shift In Information Technology & Management. Science City, Kolkata. January, 2011. [PDF]
ACADEMICS [Back to top]
B.Tech. in Computer Science & Engg. from Institute of Engineering & Management, Kolkata [2007 - 2011]
Schooling from Bagnan High School, Bagnan, Howrah (Class V to XII) [1999 - 2007]
Spring 2018: Information Retrieval, CSE, IIT Kharagpur
Autumn 2017: Algorithm-I, CSE, IIT Kharagpur
Spring 2017: Deep Learning, CSE, IIT Kharagpur
Spring 2016: Complex Network, CSE, IIT Kharagpur
Autumn 2015: Data Analytics, CSE, IIT Kharagpur
Spring 2015: Information Retrieval, CSE, IIT Kharagpur
PROFESSIONAL EXPERIENCE [Back to top]
Senior Research Fellow in Department of Computer Science and Engineering (CSE), IIT Kharagpur [Aug 2014 - Dec 2014]
Member of Technical Staff II in Advanced Technology Group(Research) at NetApp India Private Limited, Bangalore, India [July 2013 - July 2014]
CONTACT ME [Back to top]
Permanent Address: Kali Krishna Dham, Khalore, Bagnan, Howrah, West Bengal-711303, India.
Email addresses: abhikjana1 [AT] gmail [DOT] com; abhik [DOT] jana [AT] iitkgp [DOT] ac [DOT] in
Date modified: June 15, 2018