Sentiment Accuracy: Explaining the Baseline and How to Test It - Lexalytics (2024)

I often hesitate to offer benchmarks and general metrics for sentiment accuracy. Why? Because sentiment accuracy depends on so many factors: the type of data you’re dealing with, the people who hand-tagged your sentiment library, how much sleep they each got the night before, the complexity of the language that your industry uses (financial and medical data is particularly arcane)… the list goes on.

That said, it’s natural to want to know how something will perform. And I often talk with prospects and customers who’d like to hear a baseline number. So, in this article I’ll try to break things down.

Setting a baseline sentiment accuracy rate

When evaluating the sentiment (positive, negative, neutral) of a given text document, research shows that human analysts tend to agree around 80-85% of the time. This is the baseline we (usually) try to meet or beat when we’re training a sentiment scoring system. But this does mean that you’ll always find some text documents that even two humans can’t agree on, even with their wealth of experience and knowledge.
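To make the agreement figure concrete, here is a minimal sketch of how a percent-agreement number between two human annotators could be computed. The function name and the five example labels are hypothetical and only illustrate the arithmetic; they are not taken from any study.

```python
def percent_agreement(labels_a, labels_b):
    """Share of documents on which two annotators chose the same label."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both annotators must label the same documents")
    if not labels_a:
        raise ValueError("need at least one labeled document")
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return matches / len(labels_a)


# Hypothetical labels from two annotators for the same five documents.
annotator_1 = ["positive", "negative", "neutral", "positive", "negative"]
annotator_2 = ["positive", "negative", "positive", "positive", "negative"]

agreement = percent_agreement(annotator_1, annotator_2)
print(f"{agreement:.0%}")  # prints "80%"
```

Raw percent agreement does not correct for agreement that would happen by chance, so it is best read as a rough baseline rather than a precise measure.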

But when you’re running automated sentiment analysis through natural language processing, you want to be certain that the results are reliable. So, how accurate can we get, and how can we ensure the best-possible sentiment accuracy?

How to test sentiment accuracy (an example)

For a quick test of baseline sentiment accuracy, I built a new sentiment scoring model. As recommended on an old Yahoo text analytics mailing list, I used this Movie Review Data (link since removed) put together by Pang and Lee for their various sentiment papers.

This data consisted of 2000 documents (1000 positive, 1000 negative). I further divided it into a training set consisting of 1800 documents (900 positive and 900 negative), and a test set of the remaining 200 (100 positive and 100 negative).
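A split like this can be sketched in a few lines. The example below is an assumption about how one might do it, not the script used for this test: it shuffles each class separately and holds out the same number of documents from each, so both sets stay balanced. The document names are placeholders standing in for the real review texts.

```python
import random


def stratified_split(positive_docs, negative_docs, test_per_class, seed=0):
    """Hold out test_per_class documents from each class; the rest is training data."""
    rng = random.Random(seed)  # fixed seed so the split is repeatable
    pos = list(positive_docs)
    neg = list(negative_docs)
    rng.shuffle(pos)
    rng.shuffle(neg)
    test = ([(doc, "positive") for doc in pos[:test_per_class]]
            + [(doc, "negative") for doc in neg[:test_per_class]])
    train = ([(doc, "positive") for doc in pos[test_per_class:]]
             + [(doc, "negative") for doc in neg[test_per_class:]])
    return train, test


# Placeholder documents standing in for the 1000 positive and 1000 negative reviews.
positive_docs = [f"pos_{i}" for i in range(1000)]
negative_docs = [f"neg_{i}" for i in range(1000)]

train, test = stratified_split(positive_docs, negative_docs, test_per_class=100)
print(len(train), len(test))  # prints "1800 200"
```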

It took me about 45 seconds to train a sentiment scoring model using the training set. Then I used a quick PHP script to run it against the test set.
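The article does not say which kind of model was trained, and the original script was written in PHP. Purely as an illustration of the train-then-test workflow, here is a small Python sketch that swaps in a multinomial Naive Bayes classifier with add-one smoothing. The tokenizer, function names, and the toy documents are all hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Very rough tokenizer: lowercase and split on whitespace."""
    return text.lower().split()


def train_naive_bayes(labeled_docs):
    """Count words per label. labeled_docs is an iterable of (text, label) pairs."""
    word_counts = {}
    doc_counts = Counter()
    vocab = set()
    for text, label in labeled_docs:
        doc_counts[label] += 1
        counts = word_counts.setdefault(label, Counter())
        for token in tokenize(text):
            counts[token] += 1
            vocab.add(token)
    return {"word_counts": word_counts, "doc_counts": doc_counts, "vocab": vocab}


def predict(model, text):
    """Return the label with the highest log-probability (add-one smoothing)."""
    total_docs = sum(model["doc_counts"].values())
    vocab_size = len(model["vocab"])
    best_label, best_score = None, -math.inf
    for label, n_docs in model["doc_counts"].items():
        counts = model["word_counts"][label]
        total_words = sum(counts.values())
        score = math.log(n_docs / total_docs)
        for token in tokenize(text):
            if token in model["vocab"]:  # ignore words never seen in training
                score += math.log((counts[token] + 1) / (total_words + vocab_size))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy data standing in for the real training and test sets.
train_docs = [
    ("a wonderful moving film", "positive"),
    ("great acting and a great script", "positive"),
    ("a dull boring mess", "negative"),
    ("terrible script and dull acting", "negative"),
]
test_docs = [("wonderful script", "positive"), ("boring and terrible", "negative")]

model = train_naive_bayes(train_docs)
n_correct = sum(predict(model, text) == label for text, label in test_docs)
print(f"{n_correct} of {len(test_docs)} test documents classified correctly")
```

On a real corpus you would want a better tokenizer and far more data; the point here is only the shape of the loop: fit on the training set, then count correct predictions on documents the model has not seen.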

The results

Now, remember that I built this sentiment model for speed as much as for accuracy. Even so, the results surprised me (pleasantly).

Of the 200-document test set, the model correctly identified 81 of the 100 positive documents and 82 of the 100 negative ones. That is 163 correct out of 200, or a sentiment accuracy score of 81.5%. That means that right off the bat, our basic sentiment scoring model already matched human agreement levels.
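The accuracy figure is simple arithmetic over the per-class counts. A short sketch, using the numbers reported above and assuming the test set held 100 documents of each class (which follows from the 1000/900 split):

```python
def accuracy(correct_per_class, total_per_class):
    """Overall accuracy: all correct predictions divided by all test documents."""
    correct = sum(correct_per_class.values())
    total = sum(total_per_class.values())
    return correct / total


correct = {"positive": 81, "negative": 82}
total = {"positive": 100, "negative": 100}

print(f"{accuracy(correct, total):.1%}")  # prints "81.5%"
```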

Next, I ran the same 200 test set documents against our phrase-based sentiment system. To be honest, I expected a far lower score. But I was pleasantly surprised.

Our simplest sentiment scoring models, trained on very general sentiment libraries, performed admirably, reaching 70.5% accuracy. With a domain-specific dictionary, I’m sure we could reach 80% accuracy or more.
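For readers unfamiliar with the idea, phrase-based scoring in its simplest form looks up known phrases in a dictionary and adds up their weights. The sketch below is a generic illustration of that idea and not the Lexalytics implementation; the phrases and weights in the dictionary are invented for the example.

```python
import re

# Hypothetical phrase weights; a real sentiment library would be far larger.
LEXICON = {
    "pleasantly surprised": 0.8,
    "great": 0.6,
    "boring": -0.6,
    "waste of time": -0.9,
}


def score_text(text, lexicon=LEXICON):
    """Sum the weights of lexicon phrases found in the text, longest phrases first."""
    remaining = text.lower()
    total = 0.0
    for phrase in sorted(lexicon, key=len, reverse=True):
        pattern = r"\b" + re.escape(phrase) + r"\b"
        hits = len(re.findall(pattern, remaining))
        total += hits * lexicon[phrase]
        # Blank out matched phrases so shorter entries cannot match inside them.
        remaining = re.sub(pattern, " ", remaining)
    return total


def label(text):
    """Map the summed score to positive, negative, or neutral."""
    score = score_text(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


print(label("A great cast, and I was pleasantly surprised by the ending."))  # prints "positive"
print(label("Boring from start to finish, a total waste of time."))  # prints "negative"
```

Swapping the general dictionary for one built from the domain's own vocabulary is the kind of change the paragraph above has in mind.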

What does this tell us?

So, what can we learn from this quick sentiment accuracy test?

Well, for one thing, this shows how automated sentiment scoring accuracy can easily reach or exceed the 80-85% human agreement baseline.

Of course, the best results will always come from analyzing domain-specific content with a sentiment scoring model trained on similar content.

For example, if you analyze a data set of financial content using a model trained on movie reviews, the results won’t be nearly so good. But try analyzing the same data set using a system that’s configured to understand financial language. You’ll find that you can achieve very high sentiment accuracy without much extra effort.

That said, this test shows how phrase-based sentiment scoring can produce good results, even in its most basic state.

Further reading on sentiment accuracy

Research paper: SentiBench – a benchmark comparison of state-of-the-practice sentiment analysis methods

Explainer: What is Sentiment Analysis, How Does it Work, and How is it Used?

Documentation: Lexalytics, an InMoment company, NLP Glossary

Article information

Author: Stevie Stamm
