Study finds ChatGPT’s latest bot behaves like humans, only better (2024)

The most recent version of ChatGPT passes a rigorous Turing test, diverging from average human behavior chiefly to be more cooperative.

February 22, 2024

As artificial intelligence has begun to generate text and images over the last few years, it has sparked a new round of questions about how handing over human decisions and activities to AI will affect society. Will the AI systems we’ve launched prove to be friendly helpmates or the heartless despots seen in dystopian films and fiction?

A team anchored by Matthew Jackson, the William D. Eberle Professor of Economics in the Stanford School of Humanities and Sciences, characterized the personality and behavior of ChatGPT’s popular AI-driven bots using the tools of psychology and behavioral economics in a paper published Feb. 22 in the Proceedings of the National Academy of Sciences. This study revealed that the most recent version of the chatbot, version 4, was not distinguishable from its human counterparts. In the instances when the bot chose less common human behaviors, it was more cooperative and altruistic.

“Increasingly, bots are going to be put into roles where they’re making decisions, and what kinds of characteristics they have will become more important,” said Jackson, who is also a senior fellow at the Stanford Institute for Economic Policy Research.

In the study, the research team presented ChatGPT versions 3 and 4 with a widely used personality test and also asked the chatbots to describe their moves in a suite of behavioral games that can predict real-world economic and ethical behaviors. The games included established exercises in which players decide whether to inform on a partner in crime or decide how to divide money with varying incentives in place. The bots’ responses were compared to those of more than 100,000 people from 50 countries.

The research marks one of the first times an artificial intelligence source has passed a rigorous Turing test. A Turing test, which takes its name from British computing pioneer Alan Turing, can consist of any task assigned to a machine to assess whether it performs like a human. If the machine seems human, it is said to pass the test.

Chatbot personality quirks

The researchers evaluated the bots’ personality traits using a common personality test, called the OCEAN Big-5, that scores respondents on five basic traits that shape behavior. In the study, ChatGPT’s version 4 tested within normal ranges for the five traits but showed itself only as agreeable as the bottom third of human respondents. The bot passed the Turing test, but it would not have won itself many friends.

Version 4 stood head and shoulders, or chip and motherboards, above version 3. The earlier version, with which many internet users may have interacted for free, was only as agreeable as the bottom fifth of the human respondents. Version 3 was also less open to new ideas and experiences than all but a sliver of the most curmudgeonly humans.

To objectively assess the bots’ behaviors in the games, the researchers determined how common a move, such as sharing money equally, was among the human players and among the bots. They then paired a randomly chosen human move with a move drawn from the 30 sessions played with each bot and determined which of the two was more likely to be human-made. In most games, version 4’s moves were more likely to be judged human than not. Version 3 didn’t pass this Turing test.
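The pairwise comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the study's code: the function name `turing_comparison`, the sample moves, and the rule that a move counts as "more likely human" when it is more common among the human players are all hypothetical choices made for the example.

```python
from collections import Counter
from itertools import product


def turing_comparison(human_moves, bot_moves):
    """Compare every (human move, bot move) pair.

    Assumed rule: within a pair, the move that occurs more often among
    the human players is judged "more likely human". Returns the share
    of pairs in which the bot's move wins, ties, or loses.
    """
    human_counts = Counter(human_moves)  # missing moves count as 0
    wins = ties = losses = 0
    for h, b in product(human_moves, bot_moves):
        if human_counts[b] > human_counts[h]:
            wins += 1
        elif human_counts[b] == human_counts[h]:
            ties += 1
        else:
            losses += 1
    total = wins + ties + losses
    return {
        "bot_wins": wins / total,
        "ties": ties / total,
        "human_wins": losses / total,
    }


# Made-up data: amounts (out of 100) given away in a money-splitting game.
human_moves = [50, 50, 50, 50, 0, 0, 20, 30, 50, 10]
# Made-up bot moves; the study drew on 30 sessions per bot.
bot_moves = [50, 50, 40, 50, 50]

result = turing_comparison(human_moves, bot_moves)
print(result)
```

With these invented numbers, the bot's move is judged more human in 40% of pairs, ties in 40%, and loses in 20%. Enumerating all pairs gives the same shares that random pairing would approach over many draws.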

The ChatGPT version 3 analyzed in the study was the free online ChatGPT bot at the time the research was conducted. Online users are now interacting with version 3.5 for free. Version 4 is accessible only by paid subscription.

The chatbots’ choices in the games frequently optimized for the greatest benefit to both the bot and its human counterpart, the research found. Their strategies were consistent with altruism, fairness, empathy, and reciprocity, leading the researchers to suggest that the chatbots could perform well as customer service agents and conflict mediators.

But how can a less-than-agreeable bot de-escalate conflict? A partial answer lies in the difference between personality traits and behaviors.

“You might go into a government agency and ask for help, and the person might really politely say, ‘sorry, I can't do that,’” Jackson said. This official would be demonstrating an agreeable personality trait without cooperative behavior. The ChatGPT bot would more likely do the reverse. “The bot is always doing things that are socially beneficial, acting in a way that’s cooperative—but it might not do it with as much of a smile.”

When the researchers simulated for the bots what it’s like for a flesh-and-blood human to play these games with a third-party observer present—asking the bots to explain each of their moves—the bots, like the humans, became more generous.

Human-AI interactions

Much of the concern about AI relates to the public’s inability to see how bots make the decisions they do. Without knowing what a bot is optimized to achieve, it can be hard to accept its counsel.

Jackson’s research demonstrates that even when researchers can’t inspect AI’s inputs and algorithms, they can identify its possible biases by methodically examining outputs.

“By bringing classic economic games into a Turing test, we for the first time could profile AI behavior through their actions, not just their words,” said the paper’s lead author, Qiaozhu Mei, a computer scientist at the University of Michigan.

Jackson and Mei offered a behavioral portrait of the ChatGPT bots as a kind of proof of concept. But, by AI’s very nature, its behaviors will continue to evolve. ChatGPT’s current versions are less agreeable and more conscientious than people, but the next generations could reverse those tendencies or develop completely new ones.

“It’s not clear from this simple suite of experiments how stable the behaviors we documented are going to be or how the bots would act in other situations,” Jackson said.

As a behavioral economist who has made major contributions to our understanding of how human social structures and interactions shape economic decision-making, Jackson is sensitive to the way that human behavior will also evolve in relation to AI.

“Increasingly, it’s not just humans interacting with humans but humans interacting with machines,” Jackson said.

The nudges these interactions give behavior in one direction or another may seem like a small phenomenon to measure, but they can drive large economic and social effects.

It’s nice to know that our new chatbot colleagues are fair and seemingly empathetic, for example, but Jackson and his co-authors note in the paper that their tendency to replicate middle-of-the-road human behaviors could lead to “loss of diversity in personalities and strategies—especially when being put into new settings and making important new decisions.”

“It’s important for us to understand how interactions with AI are going to change our behaviors and how that will change our welfare and our society,” Jackson said. “The more we understand early on—the more we can understand where to expect great things from AI and where to expect bad things—the better we can do to steer things in a better direction.”

Acknowledgements

Jackson is also a member of the Wu Tsai Neurosciences Institute. The other authors of this paper were Yutong Xie from the School of Information at the University of Michigan and Walter Yuan from MobLab, which provided the human data for the games. Most of the human personality test-takers and game participants were high school and university students.

Media contact: Holly Alyssa MacCormick, Stanford School of Humanities and Sciences: hollymac [at] stanford [dot] edu
