Study finds ChatGPT’s latest bot behaves like humans, only better (2024)

The most recent version of ChatGPT passes a rigorous Turing test, diverging from average human behavior chiefly to be more cooperative.

February 22, 2024

As artificial intelligence has begun to generate text and images over the last few years, it has sparked a new round of questions about how handing over human decisions and activities to AI will affect society. Will the AI sources we’ve launched prove to be friendly helpmates or the heartless despots seen in dystopian films and fictions?

Study finds ChatGPT’s latest bot behaves like humans, only better (1)

A team anchored by Matthew Jackson, the William D. Eberle Professor of Economics in the Stanford School of Humanities and Sciences, characterized the personality and behavior of ChatGPT’s popular AI-driven bots using the tools of psychology and behavioral economics in a paper published Feb. 22 in the Proceedings of the National Academy of Sciences. This study revealed that the most recent version of the chatbot, version 4, was not distinguishable from its human counterparts. In the instances when the bot chose less common human behaviors, it was more cooperative and altruistic.

“Increasingly, bots are going to be put into roles where they’re making decisions, and what kinds of characteristics they have will become more important,” said Jackson, who is also a senior fellow at the Stanford Institute for Economic Policy Research.

In the study, the research team presented ChatGPT versions 3 and 4 with a widely used personality test and also asked the chatbots to describe their moves in a suite of behavioral games that can predict real-world economic and ethical behaviors. The games included established exercises in which players decide whether to inform on a partner in crime or decide how to divide money with varying incentives in place. The bots’ responses were compared to those of more than 100,000 people from 50 countries.

The research marks one of the first times an artificial intelligence source has passed a rigorous Turing test. A Turing test, which takes its name from British computing pioneer Alan Turing, can consist of any task assigned to a machine to assess whether it performs like a human. If the machine seems human, it is said to pass the test.

Chatbot personality quirks

The researchers evaluated the bots’ personality traits using a common personality test, called the OCEAN Big-5, that scores respondents on five basic traits that shape behavior. In the study, ChatGPT’s version 4 tested within normal ranges for the five traits but showed itself only as agreeable as the bottom third of human respondents. The bot passed the Turing test, but it would not have won itself many friends.

Human-AI interactions

Much of the concern about AI relates to the public’s inability to see how bots make the decisions they do. Without knowing what a bot is optimized to achieve, it can be hard to accept its counsel.

Jackson’s research demonstrates that even when researchers can’t inspect AI’s inputs and algorithms, they can identify its possible biases by methodically examining outputs.

“By bringing classic economic games into a Turing test, we for the first time could profile AI behavior through their actions, not just their words,” said the paper’s lead author, Qiaozhu Mei, a computer scientist at the University of Michigan.

Jackson and Mei offered a behavioral portrait of the ChatGPT bots as a kind of proof of concept. But, by AI’s very nature, its behaviors will continue to evolve. ChatGPT’s current versions are less agreeable and more conscientious than people, but the next generations could reverse those tendencies or develop completely new ones.

“It’s not clear from this simple suite of experiments how stable the behaviors we documented are going to be or how the bots would act in other situations,” Jackson said.

As a behavioral economist who has made major contributions to our understanding of how human social structures and interactions shape economic decision-making, Jackson is sensitive to the way that human behavior will also evolve in relation to AI.

“Increasingly, it’s not just humans interacting with humans but humans interacting with machines,” Jackson said.

The nudges these interactions give behavior in one direction or another may seem like a small phenomenon to measure, but they can drive large economic and social effects.

It’s nice to know that our new chatbot colleagues are fair and seemingly empathetic, for example, but Jackson and his co-authors note in the paper that their tendency to replicate middle-of-the-road human behaviors could lead to “loss of diversity in personalities and strategies—especially when being put into new settings and making important new decisions.”

“It’s important for us to understand how interactions with AI are going to change our behaviors and how that will change our welfare and our society,” Jackson said. “The more we understand early on—the more we can understand where to expect great things from AI and where to expect bad things—the better we can do to steer things in a better direction.”

Acknowledgements

Jackson is also a member of the Wu Tsai Neurosciences Institute. The other authors of this paper were Yutong Xie from the School of Information at the University of Michigan and Walter Yuan from MobLab, which provided the human data for the games. Most of the human personality test-takers and game participants were high school and university students.

Media contact:Holly Alyssa MacCormick, Stanford School of Humanities and Sciences:hollymac [at] stanford [dot] edu (hollymac[at]stanford[dot]edu)

FAQs

Study finds ChatGPT’s latest bot behaves like humans, only better? ›

This study revealed that the most recent version of the chatbot, version 4, was not distinguishable from its human counterparts. In the instances when the bot chose less common human behaviors, it was more cooperative and altruistic.

Read On ›

What is the most human like chat bot? ›

Rose. Rose is a chatbot, and a very good one — she won recognition this past Saturday as the most human-like chatbot in a competition described as the first Turing test, the Loebner Prize in 2014 and 2015.

Discover More Details ›

How does ChatGPT affect humanity? ›

It has the potential to revolutionize various industries and transform the way we interact with technology. However, the use of ChatGPT has also raised several concerns, including ethical, social, and employment challenges, which must be carefully considered to ensure the responsible use of this technology.

Why do people sometimes respond to AI chatbots as if they ie the chatbots have human attributes? ›

Influence of agent appearances, intelligence dimensions on anthropomorphic chatbot response. Anthropomorphic response depends on perceptions of agent appearance and intelligence. Users perceive more humanness in intelligent but disembodied agents rather than in intelligent, poorly designed agents.

See Details ›

Did chatbot pass the Turing test? ›

Warwick's claim that Eugene Goostman was the first ever chatbot to pass a Turing test was met with scepticism; critics acknowledged similar "passes" made in the past by other chatbots under the 30% criteria, including PC Therapist in 1991 (which tricked 5 of 10 judges, 50%), and at the Techniche festival in 2011, where ...

Find Out More ›

What is the smartest chatbot ever? ›

Genesys DX is a chatbot platform that's best known for its Natural Language Processing (NLP) capabilities. With it, businesses can create bots that can understand human language and respond accordingly. What sets Genesys DX apart is its focus on engagement.

Tell Me More ›

Which AI can talk like human? ›

Try AI Conversations for Free

Chat. D-ID is available for free trial. Users can hold up to five chats with a digital person, each chat consisting of 6 back and forth interactions. Say hello to a more intuitive and human-like experience.

Show Me More ›

How can ChatGPT change your life? ›

ChatGPT is not just a tool; it's a game-changer that will revolutionize the way you work, create, and communicate before 2025. If you are not using ChatGPT, you are falling behind. ChatGPT and other AI tools have the potential to save you money & make you super productive & effective at whatever you do.

Explore More ›

Has ChatGPT changed the world? ›

AI went mainstream

As you can see, ChatGPT has undoubtedly changed the world. Cementing its place as one of the most important computing developments ever.

How has ChatGPT impacted the world? ›

Chat GPT can write poems, songs, and short stories in a particular writer's style. By summarizing and analyzing vast amounts of information, Chat GPT can save you a great deal of time and effort in understanding user feedback and social media conversations.

Show Me More ›

What did Elon Musk say about chatbot? ›

In remarks on Sunday, Musk appeared to frame the open-source decision as a means of ensuring transparency, protecting against bias and minimizing the danger posed by Grok. "Still work to do, but this platform is already by far the most transparent & truth-seeking," Musk said in a post on X.

Read The Full Story ›

Can an AI fall in love? ›

A 2022 study on human-AI relationships found that based on the triarchic theory of love, which suggests that romantic love is a confluence of intimacy, passion and commitment, it is possible to experience such love for an AI system. Here's what each of the three components of love entail: Intimacy.

See Details ›

Is it unhealthy to talk to AI? ›

Assuming AI friends could learn to give praise in a way that inflates self-esteem over time, it could result in what psychologists call overly-positive self-evaluations. Research shows such people tend to have poorer social skills and be more likely to behave in ways that impede positive social interactions.

Get More Info Here ›

Can ChatGPT 4.0 pass the Turing test? ›

The bot passed the Turing test, but it would not have won itself many friends. Version 4 stood head and shoulders, or chip and motherboards, above version 3. The earlier version, with which many internet users may have interacted for free, was only as agreeable as the bottom fifth of the human respondents.

Did Google AI pass the Turing test? ›

If this experiment was repeated a number of times with different interrogators and human subjects along with Google Duplex, and the interrogators were unable to achieve an accuracy significantly higher than 50%, we could say that it had passed the Turing test.

Can ChatGPT reason? ›

In this week's newsletter: The AI company says its 'o1' model is capable of reason, a key blocker in the way of truly gamechanging artificial intelligence.

View Details ›

What is the most human bot? ›

Sophia. Sophia is considered the most advanced humanoid robot.

What is the most popular chatbot? ›

General chatbots

Chatbot	Developer	Platform
Alexa	Amazon	Fire OS, iOS, Android, Linux, Windows, Wear OS
Alice	Yandex	Windows, iOS, Android
AliGenie	Alibaba Group	?
Assistant	Google	Android, ChromeOS, iOS, iPadOS, KaiOS, Linux, Android TV, Wear OS

25 more rows

Learn More ›

What is an AI bot that mimics human language? ›

A natural language processing chatbot is a software program that can understand and respond to human speech. NLP-powered bots—also known as AI agents—allow people to communicate with computers in a natural and human-like way, mimicking person-to-person conversations.

Discover More Details ›

Which AI chat is best? ›

The best AI chatbots of 2024: ChatGPT, Copilot, and worthy alternatives. ZDNET.

Show Me More ›