Using an average of VADER and textBlob's sentiment polarity gives me a more accurate result, why?

354 Views Asked by July Jones At 30 September 2021 at 14:44

I have a manually labelled set of ~120K tweets. If I use VADER's compound score it only matches the manual labelling for ~24% of the records, textblob matches ~35% of the manually labelled record. If I take Vaders compound score and textblobs score and add then together and divide by 2 the resulting sentiment result matches the manual labelling ~70% of the time. Is there any reason for why its more accurate or is it just coincidence?

Original Q&A

There are 1 best solutions below

pmbaumgartner On 03 October 2021 at 00:03

I think you're stumbling upon the idea behind ensemble learning. More often than not, putting multiple models together and combining their predictions leads to better results. Your implementation could be thought of as an equally weighted soft-voting ensemble. For more examples and additional implementations, the scikit-learn Voting Classifier docs are great.

Using an average of VADER and textBlob's sentiment polarity gives me a more accurate result, why?

There are 1 best solutions below

Related Questions in PYTHON

Related Questions in NLP

Related Questions in SENTIMENT-ANALYSIS

Related Questions in TEXTBLOB

Related Questions in VADER

Trending Questions

Popular # Hahtags

Popular Questions