Posts

Showing posts from July, 2021

Sarcasm Detection in Telugu Language

Image
SARCASM DETECTION IN TELUGU LANGUAGE     1)P roblem Statement: The Main Aim of the project is to detect the whether the given statement or sentence is sarcastic or not. 2) Dataset: The dataset is of Telugu sarcastic sentences collected from the Telugu comedy shows and annotated as sarcastic or non-sarcastic. 3) Data Preprocessing:  In the Data Preprocessing 3.1 Removal of Stop Words, 3.2 Removal of Punctuation marks, 3.3 Tokenization and  POS Tagging 3.1 Removal of Stop Words: The Stops words are identified from the dataset and removed them. the stop words are words which do not contribute for the identification or classification of the sentences. some of the stop words are downloaded from the StandfordNLP telugu stop words data. 3.2. Removal of Punctuation Marks Punctuation marks are removed from the sentences the punctuation , where it is a pattern based approach punctuation marks are not much necessary and removed the punctuation marks like ”./$&*()!¿:¡” ˆ 3....