I have a manually labelled set of ~120K tweets. If I use VADER's compound score, it matches the manual labelling for only ~24% of the records; TextBlob matches ~35% of the manually labelled records. If I add VADER's compound score and TextBlob's polarity score together and divide by 2, the resulting sentiment matches the manual labelling ~70% of the time. Is there a reason why it's more accurate, or is it just coincidence?
Using an average of VADER's and TextBlob's sentiment polarity gives me a more accurate result, why?
354 Views Asked by July Jones At
I think you're stumbling upon the idea behind ensemble learning. More often than not, combining the predictions of multiple models leads to better results, because models that make different mistakes tend to partly cancel each other's errors when their scores are averaged. Your implementation can be thought of as an equally weighted soft-voting ensemble. For more examples and additional implementations, the scikit-learn Voting Classifier docs are great.
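To make the soft-voting idea concrete, here is a minimal sketch in plain Python. It assumes you already have each tweet's VADER compound score and TextBlob polarity as floats in [-1, 1]. The helper names (`soft_vote`, `to_label`, `accuracy`), the ±0.05 cutoff for "neutral", and the four toy rows are all assumptions made up for illustration; they are not taken from your data or from either library.

```python
def soft_vote(scores, weights=None):
    """Weighted average of per-model polarity scores (equal weights by default)."""
    if weights is None:
        weights = [1.0] * len(scores)
    return sum(s * w for s, w in zip(scores, weights)) / sum(weights)


def to_label(score, threshold=0.05):
    """Map a polarity score to a class label; the threshold is an assumption."""
    if score >= threshold:
        return "positive"
    if score <= -threshold:
        return "negative"
    return "neutral"


def accuracy(predicted, actual):
    """Fraction of predictions that match the manual labels."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


# Hand-made toy rows: (vader_compound, textblob_polarity, manual_label).
# These are illustrative values, not real results.
rows = [
    (0.60, -0.10, "positive"),
    (-0.02, 0.40, "positive"),
    (0.30, -0.50, "negative"),
    (0.00, 0.00, "neutral"),
]

manual = [label for _, _, label in rows]
vader_only = [to_label(v) for v, _, _ in rows]
textblob_only = [to_label(t) for _, t, _ in rows]
combined = [to_label(soft_vote([v, t])) for v, t, _ in rows]

print("VADER only:   ", accuracy(vader_only, manual))
print("TextBlob only:", accuracy(textblob_only, manual))
print("Averaged:     ", accuracy(combined, manual))
```

In these toy rows each model is wrong on different tweets, so the average lands on the right side of the threshold more often than either score alone. Passing unequal `weights` to `soft_vote` would let you favour whichever model does better on a held-out part of your labelled set.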