Is the Turing Test Dead?

IEEE SpectrumFOR THE TECHNOLOGY INSIDER
TopicsAerospaceArtificial IntelligenceBiomedicalClimate TechComputingConsumer ElectronicsEnergyHistory of TechnologyRoboticsSemiconductorsTelecommunicationsTransportation
SectionsFeaturesNewsOpinionCareersDIYEngineering Resources
MoreNewslettersPodcastsSpecial ReportsCollectionsExplainersTop Programming LanguagesRobots Guide ↗IEEE Job Site ↗
For IEEE MembersCurrent IssueMagazine ArchiveThe InstituteThe Institute Archive
For IEEE MembersCurrent IssueMagazine ArchiveThe InstituteThe Institute Archive
IEEE SpectrumAbout UsContact UsReprints & Permissions ↗Advertising ↗
Follow IEEE Spectrum
Support IEEE SpectrumIEEE Spectrum is the flagship publication of the IEEE — the world’s largest professional organization devoted to engineering and applied sciences. Our articles, podcasts, and infographics inform our readers about developments in technology, engineering, and science.
Join IEEE
Subscribe
About IEEEContact & SupportAccessibilityNondiscrimination PolicyTermsIEEE Privacy PolicyCookie PreferencesAd Privacy Options
© Copyright 2024 IEEE — All rights reserved. A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.

When in 1950 Alan Turing first proposed an approach to distinguish the “minds” of machines from those of human beings, the idea that a machine could ever achieve human-level intelligence was almost laughable.

In the Turing test—which Turing himself originally called the “imitation game“—human participants conduct a conversation with unknown users to determine if they’re talking to a human or a computer. In 2014, a chatbot masquerading as a Ukrainian teenager named Eugene Goostman seemed to put one of the first nails in the Turing test’s coffin by fooling more than one-third of human interrogators into thinking they were talking to another human, although some researchers dispute the claim that the chatbot passed the test.

Today, we run into seemingly intelligent machines all day long. Our smart speakers tell us to bring umbrellas on our way out the door and large language models (LLMs) like ChatGPT can write promotion-worthy emails. Stacked up against a human, these machines might be easy to confuse with the real thing.

Does this mean the Turing test is a thing of the past?

In a new paper published 10 November in the journal Intelligent Computing, a pair of researchers have proposed a new kind of intelligence test that treats machines as participants of a psychological study to determine how closely their reasoning skills match those of human beings. The researchers are Philip Johnson-Laird, a Princeton psychology professor and pioneer of the mental model of human reasoning, and Marco Ragni, a professor of predictive analytics at Chemnitz University of Technology, in Germany.

“As chatbots have approached and succeeded at the Turing test, it has quietly slipped away from importance.” —Anders Sandberg, University of Oxford

In their paper, Johnson-Laird and Ragni argue that the Turing test was never a good measure of machine intelligence in the first place, as it fails to address the process of human thinking.

“Given that such algorithms do not reason in the way that humans do, the Turing test and any others it has inspired are obsolete,” they write.

This assertion is one that Anders Sandberg, a senior research fellow at the University of Oxford’s Future of Humanity Institute, says he agrees with. That said, he’s not convinced that a human-reasoning assessment will be the ultimate test of intelligence either.

“As chatbots have approached and succeeded at the Turing test, it has quietly slipped away from importance,” Sandberg says. “This paper tries to see if a program reasons the way humans reason. That is both interesting and useful, but will of course only tell us if there is human-style intelligence, not some other form of potentially valuable intelligence.”

Likewise, even though Turing tests may be going out of fashion, Huma Shah, an assistant professor of computing at the University of Coventry, in England, whose research has focused on the Turing test and machine intelligence, says that doesn’t necessarily mean they’re no longer useful.

“In terms of indistinguishability, no, [the Turing test is not obsolete],” Shah says. “You can apply indistinguishability to other areas where we would want a machine’s performance to be as good as or better than a human carrying out that task efficiently and ethically. For example, in facial recognition, or the ability to drive safely while avoiding hurting passengers and pedestrians.”

As for Johnson-Laird and Ragni’s test, it would be carried out in three steps. First, machines would be asked a number of questions to test their own reasoning—for example, they could be asked, “If Ann is intelligent, does it follow that Ann is intelligent or she is rich, or both?” They would then be tested on whether or not they understood their own reasoning, such as with the response “Nothing in the premise supports the possibility that Ann is rich.” Finally, researchers would take a look under the hoods of the machines to determine whether the neural networks are built to simulate human cognition.

This last step is where Sandberg worries there could be complications.

“The last step can be very hard,” he says. “Most LLMs are vast neural networks that are not particularly inspectable, despite much research on how to do this.”

Translating a machine’s internal representation of reasoning into a form that humans can understand may even ultimately distort the original nature of the machine’s thought process, Sandberg says. In other words, would we recognize a machine’s interpretation of human reasoning if we saw it?

This question is especially complicated, as the science of human cognition itself isn’t yet set in stone.

While replacing the Turing test may not be a simple process, Shah says that alternatives like this reasoning test have the opportunity to advance how we think about these big questions, like what it means to be human. They may also help shed light on what it means to be a computer, such as what processes take place inside a neural network’s black box.

“If new tests for human-machine indistinguishability progress machine ‘explainability’—for example, the ‘reasoning’ in algorithms that render their decision-making comprehensible to the general public, such as in financial algorithms for insurance, mortgages, loans, etc., then this objective is an invaluable contribution to progressing intelligent machinery,” Shah says.

From Your Site Articles

Topics

Sections

More

For IEEE Members

For IEEE Members

IEEE Spectrum

Follow IEEE Spectrum

Support IEEE Spectrum

Is the Turing Test Dead?

Researchers wonder whether improved large language models require new tests for machine intelligence

Will Human Soldiers Ever Trust Their Robot Comrades?

Video Friday: RACER Heavy

As Ukraine Builds New Reactors, Renewables Beckon

Related Stories

Llama 3 Establishes Meta as the Leader in “Open” AI

AI Chip Trims Energy Budget Back by 99+ Percent

Faster, More Secure Photonic Chip Boosts AI Training

Topics

Sections

More

For IEEE Members

For IEEE Members

IEEE Spectrum

Follow IEEE Spectrum

Support IEEE Spectrum

Enjoy more free content and benefits by creating an account

Saving articles to read later requires an IEEE Spectrum account

The Institute content is only available for members

Downloading full PDF issues is exclusive for IEEE Members

Downloading this e-book is exclusive for IEEE Members

Access to Spectrum 's Digital Edition is exclusive for IEEE Members

Following topics is a feature exclusive for IEEE Members

Adding your response to an article requires an IEEE Spectrum account

Create an account to access more content and features on IEEE Spectrum , including the ability to save articles to read later, download Spectrum Collections, and participate in conversations with readers and editors. For more exclusive content and features, consider Joining IEEE .

Join the world’s largest professional organization devoted to engineering and applied sciences and get access to all of Spectrum’s articles, archives, PDF downloads, and other benefits. Learn more →

Join the world’s largest professional organization devoted to engineering and applied sciences and get access to this e-book plus all of IEEE Spectrum’s articles, archives, PDF downloads, and other benefits. Learn more →

Access Thousands of Articles — Completely Free

Create an account and get exclusive content and features: Save articles, download collections, and talk to tech insiders — all free! For full access and benefits, join IEEE as a paying member.

Is the Turing Test Dead?

Researchers wonder whether improved large language models require new tests for machine intelligence

Will Human Soldiers Ever Trust Their Robot Comrades?

Video Friday: RACER Heavy

As Ukraine Builds New Reactors, Renewables Beckon

Related Stories

Llama 3 Establishes Meta as the Leader in “Open” AI

AI Chip Trims Energy Budget Back by 99+ Percent

Faster, More Secure Photonic Chip Boosts AI Training