Sentiment Analysis

Analyzing The Textual Data Of Hindi Movie Reviews

Methodology

Step 1: Preprocessing

    a) Sentence Segmentation - Since the sentences have been separated 
       by a dollar sign, we will separate them and store them in a 
       different array
    b) Tokenization
    c) Removal of Stop Words

Step 2: Feature Extraction

    a) TF-IDF (Term Frequency- Inverse Document Frequency)
       - Compute the TF-IDF score of unigram, bigram and trigram.
       - Make a feature matrix
       - Split the data into training and testing. 
    b) Lexicon Based Approach
       - Compare with Hindi SentiWordNet and find the polarity
          and the POS tag as well.

Step 3: Sentiment Score Computation

    a) Machine Learning Algorithms
       - k-Nearest Neighbors
    
    b) Lexicon-Based Approach
       - Use Hindi SentiWordNet(HSWN) to compute the Sentiment Score
       the sentence.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
src		src
FINAL.xlsx		FINAL.xlsx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis

Methodology

Step 1: Preprocessing

Step 2: Feature Extraction

Step 3: Sentiment Score Computation

About

Releases

Packages

Languages

sharmaachintya/SentimentAnalysis

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis

Methodology

Step 1: Preprocessing

Step 2: Feature Extraction

Step 3: Sentiment Score Computation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages