Sentiment Accuracy: Explaining the Baseline and How to Test It - Lexalytics (2024)

I often hesitate to offer benchmarks and general metrics for sentiment accuracy. Why? Because sentiment accuracy depends on so many factors: the type of data you’re dealing with, the people who hand-tagged your sentiment library, how much sleep they each got the night before, the complexity of the language that your industry uses (financial and medical data is particularly arcane)… the list goes on.

That said, it’s natural to want to know how something will perform. And I often talk with prospects and customers who’d like to hear a baseline number. So, in this article I’ll try to break things down.

Setting a baseline sentiment accuracy rate

When evaluating the sentiment (positive, negative, neutral) of a given text document, research shows that human analysts tend to agree around 80-85% of the time. This is the baseline we (usually) try to meet or beat when we’re training a sentiment scoring system. But this does mean that you’ll always find some text documents that even two humans can’t agree on, even with their wealth of experience and knowledge.

But when you’re running automated sentiment analysis through natural language processing, you want to be certain that the results are reliable. So, how accurate can we get, and how can we ensure the best-possible sentiment accuracy?

How to test sentiment accuracy (an example)

For a quick test of baseline sentiment accuracy, I built a new sentiment scoring model. As recommended on an oldYahoo text analytics mailing list, I used thisMovie Review Data (link since removed) put together by Pang and Lee for their various sentiment papers.

This data consisted of 2000 documents (1000 positive, 1000 negative). I further divided it into a training set consisting of 1800 documents (900 positive and 900 negative), and a test set of the remaining 200.

It took me about 45 seconds to train a sentiment scoring model using the training set. Then I used a quick PHP script to run it against the test set.

The results

Now, remember that I built this sentiment model for speed as much as for accuracy. Even so, the results surprised me (pleasantly).

Of the 200-document test set, the model correctly identified 81 of the positive documents and 82 of the negative ones. This is sentiment accuracy score of 81.5%. That means that right off the bat, our basic sentiment scoring model already matched human agreement levels.

Next, I ran the same 200 test set documents against our phrase-based sentiment system. To be honest, I expected a far lower score. But I was pleasantly surprised.

Our simplest sentiment scoring models, trained on very general sentiment libraries, performed admirably, reaching 70.5% accuracy. With a domain-specific dictionary, I’m sure we couldreach 80% accuracy or more.

What does this tell us?

So, what can we learn from this quick sentiment accuracy test?

Well, for one thing, this shows how automated sentiment scoring accuracy can easily reach or exceed the 80-85% human agreementbaseline.

Of course, the best results will always come from analyzing domain-specific content with a sentiment scoring model trained on similar content.

For example, if you analyze a data set of financial content using a model trained on movie reviews, the results won’t be nearly so good. But try analyzing the same data set using a system that’s configured to understand financial language. You’ll find that you can achieve very high sentiment accuracy without much extra effort.

That said, this test shows how phrase-based sentiment scoring can produce good results, even in its most basic state.

Further reading on sentiment accuracy

Research paper: SentiBench – a benchmark comparison of state-of-the-practice sentiment analysis methods

Explainer: What is Sentiment Analysis, How Does it Work, and How is it Used?

Documentation: Lexalytics, an InMoment company, NLP Glossary

Sentiment Accuracy: Explaining the Baseline and How to Test It - Lexalytics (2024)
Top Articles
How to Turn Off 2-Step Verification in Gmail (2023)
Which Cryptocurrencies are ISO 20022 Compatible? - Ledgernomic
Riverrun Rv Park Middletown Photos
Warren Ohio Craigslist
I Make $36,000 a Year, How Much House Can I Afford | SoFi
Terrorist Usually Avoid Tourist Locations
New Slayer Boss - The Araxyte
Chalupp's Pizza Taos Menu
Www Thechristhospital Billpay
What Happened To Father Anthony Mary Ewtn
shopping.drugsourceinc.com/imperial | Imperial Health TX AZ
Craigslist/Phx
Sport Clip Hours
5808 W 110Th St Overland Park Ks 66211 Directions
My.doculivery.com/Crowncork
Caliber Collision Burnsville
Alejos Hut Henderson Tx
Nba Rotogrinders Starting Lineups
Vanessa West Tripod Jeffrey Dahmer
Straight Talk Phones With 7 Inch Screen
Nail Salon Goodman Plaza
3476405416
CVS Near Me | Columbus, NE
Graphic Look Inside Jeffrey Dahmer
Quick Answer: When Is The Zellwood Corn Festival - BikeHike
A Cup of Cozy – Podcast
Masterbuilt Gravity Fan Not Working
Reserve A Room Ucla
Ts Modesto
How rich were the McCallisters in 'Home Alone'? Family's income unveiled
Elanco Rebates.com 2022
My Dog Ate A 5Mg Flexeril
Craigslist Scottsdale Arizona Cars
WOODSTOCK CELEBRATES 50 YEARS WITH COMPREHENSIVE 38-CD DELUXE BOXED SET | Rhino
Roadtoutopiasweepstakes.con
The Ultimate Guide to Obtaining Bark in Conan Exiles: Tips and Tricks for the Best Results
Edward Walk In Clinic Plainfield Il
Aliciabibs
The Vélodrome d'Hiver (Vél d'Hiv) Roundup
Shih Tzu dogs for sale in Ireland
9781644854013
Priscilla 2023 Showtimes Near Consolidated Theatres Ward With Titan Luxe
Taylor University Baseball Roster
Google Flights Orlando
Gifford Christmas Craft Show 2022
Prior Authorization Requirements for Health Insurance Marketplace
Lonely Wife Dating Club בקורות וחוות דעת משתמשים 2021
Walmart 24 Hrs Pharmacy
Heat Wave and Summer Temperature Data for Oklahoma City, Oklahoma
Wild Fork Foods Login
Latest Posts
Article information

Author: Stevie Stamm

Last Updated:

Views: 5454

Rating: 5 / 5 (60 voted)

Reviews: 91% of readers found this page helpful

Author information

Name: Stevie Stamm

Birthday: 1996-06-22

Address: Apt. 419 4200 Sipes Estate, East Delmerview, WY 05617

Phone: +342332224300

Job: Future Advertising Analyst

Hobby: Leather crafting, Puzzles, Leather crafting, scrapbook, Urban exploration, Cabaret, Skateboarding

Introduction: My name is Stevie Stamm, I am a colorful, sparkling, splendid, vast, open, hilarious, tender person who loves writing and wants to share my knowledge and understanding with you.