PITS@Dravidian-CodeMix-FIRE2020: Traditional approach to noisy code-mixed sentiment analysis
| dc.contributor.author | Kanwar N.; Agarwal M.; Mundotiya R.K. | |
| dc.date.accessioned | 2025-05-23T11:30:49Z | |
| dc.description.abstract | Sentiment Analysis (SA) is a process for characterizing the response or opinion by which sentiment polarity of the text is decided. Nowadays, social media is a common platform to convey opinions, suggestions and much more in a user’s native language or multilingual in Roman script (for ease). In this task, Malayalam-English and Tamil-English code mixed dataset in the Roman script has provided for SA. To solve this task, we have generated syntax-based features and used trained logistic regression with as an under-sampling technique. We have obtained best F1-score of 0.71 and 0.62 on the blind test set of Malayalam-English and Tamil-English code mixed datasets, respectively. The code is available at Github1 © 2020 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). | |
| dc.identifier.doi | DOI not available | |
| dc.identifier.uri | http://172.23.0.11:4000/handle/123456789/12604 | |
| dc.relation.ispartofseries | CEUR Workshop Proceedings | |
| dc.title | PITS@Dravidian-CodeMix-FIRE2020: Traditional approach to noisy code-mixed sentiment analysis |