Browsing NTNU Open by author "Jamatia, Anupam"
-
Deep Learning-Based Language Identification in English-Hindi-Bengali Code-Mixed Social Media Corpora
Jamatia, Anupam; Das, Amitava; Gambäck, Björn (Peer reviewed; Journal article, 2018) This article addresses language identification at the word level in Indian social media corpora taken from Facebook, Twitter and WhatsApp posts that exhibit code-mixing between English-Hindi, English-Bengali, as well as a ... -
NIT_Agartala_NLP_Team at SemEval-2019 Task 6: An Ensemble Approach to Identifying and Categorizing Offensive Language in Twitter Social Media Corpora
Swamy, Steve Durairaj; Jamatia, Anupam; Gambäck, Björn; Das, Amitava (Chapter, 2019) The paper describes the systems submitted to OffensEval (SemEval 2019, Task 6) on ‘Identifying and Categorizing Offensive Language in Social Media’ by the ‘NIT_Agartala_NLP_Team’. A Twitter annotated dataset of 13,240 ... -
Part-of-Speech Tagging for Code-Mixed English-Hindi Twitter and Facebook Chat Messages
Jamatia, Anupam; Gambäck, Björn; Das, Amitava (Proceedings of the International Conference Recent Advances in Natural Language Processing;33, Chapter, 2015) The paper reports work on collecting and annotating code-mixed English-Hindi social media text (Twitter and Facebook messages), and experiments on automatic tagging of these corpora, using both a coarse-grained ... -
Sentence Boundary Detection for Social Media Text
Rudrapal, Dwijen; Jamatia, Anupam; Chakma, Kunal; Das, Amitava; Gambäck, Björn (ICON;2015, Chapter, 2015) -
Studying Generalisability across Abusive Language Detection Datasets
Swamy, Steve Durairaj; Jamatia, Anupam; Gambäck, Björn (Chapter, 2019) Work on Abusive Language Detection has tackled a wide range of subtasks and domains. As a result of this, there exists a great deal of redundancy and non-generalisability between datasets. Through experiments on cross-dataset ...