AI models struggle to identify nonsense, says study

AI models struggle to differentiate between nonsense and natural language, highlighting limitations and raising concerns about their use in legal or medical settings.

By: AFP | Updated on: Sep 15 2023, 07:45 IST
The study found that all models tested made mistakes in differentiating between meaningful and gibberish sentences. (Pexels)

The AI models that power chatbots and other applications still have difficulty distinguishing between nonsense and natural language, according to a study released on Thursday.

The researchers at Columbia University in the United States said their work revealed the limitations of current AI models and suggested it was too early to let them loose in legal or medical settings.

They put nine AI models through their paces, firing hundreds of pairs of sentences at them and asking which were likely to be heard in everyday speech.

They asked 100 people to make the same judgement on pairs of sentences like: "A buyer can own a genuine product also / One versed in circumference of highschool I rambled."

The research, published in the Nature Machine Intelligence journal, then weighed the AI answers against the human answers and found dramatic differences.
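The comparison described above can be pictured with a rough sketch. It is not the researchers' code: the scoring function is a hypothetical stand-in for a real language model, the tiny reference text is a made-up placeholder, and the function names are invented for illustration. It only shows the general idea of having a model pick the more likely sentence in each pair and then counting how often that pick matches the human one.

```python
import math
from collections import Counter


def toy_log_score(sentence, counts, total, vocab_size):
    """Length-normalized, add-one-smoothed unigram log-probability.

    Hypothetical stand-in for a real language model's sentence score.
    """
    words = sentence.lower().split()
    logp = sum(math.log((counts[w] + 1) / (total + vocab_size)) for w in words)
    return logp / max(len(words), 1)


def model_choice(pair, score):
    """Return 0 or 1: the index of the sentence the scorer rates as more likely."""
    first, second = pair
    return 0 if score(first) >= score(second) else 1


def agreement_rate(model_choices, human_choices):
    """Fraction of sentence pairs on which the model and the humans picked the same sentence."""
    if not model_choices or len(model_choices) != len(human_choices):
        raise ValueError("need two non-empty lists of equal length")
    matches = sum(m == h for m, h in zip(model_choices, human_choices))
    return matches / len(model_choices)


# Placeholder reference text (an assumption, not data from the study).
reference_text = "a buyer can own a product and a seller can sell a genuine product"
counts = Counter(reference_text.split())
total = sum(counts.values())
vocab_size = len(counts) + 1  # +1 leaves room for unseen words

# The sentence pair quoted in the article.
pair = (
    "A buyer can own a genuine product also",
    "One versed in circumference of highschool I rambled",
)
choice = model_choice(pair, lambda s: toy_log_score(s, counts, total, vocab_size))
print("Toy scorer picks sentence", choice + 1)
```

With one human pick per pair collected the same way, `agreement_rate` would give the share of pairs on which a model and the human participants agree.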

Sophisticated models like GPT-2, an earlier version of the model that powers viral chatbot ChatGPT, generally matched the human answers.

Other simpler models did less well.

But the researchers highlighted that all the models made mistakes.

"Every model exhibited blind spots, labelling some sentences as meaningful that human participants thought were gibberish," said psychology professor Christopher Baldassano, an author of the report.

"That should give us pause about the extent to which we want AI systems making important decisions, at least for now."

Tal Golan, another of the paper's authors, told AFP that the models were "an exciting technology that can complement human productivity dramatically".

However, he argued that "letting these models replace human decision-making in domains such as law, medicine, or student evaluation may be premature".

Among the pitfalls, he said, was the possibility that people might intentionally exploit the blind spots to manipulate the models.

AI models burst into public consciousness with the release of ChatGPT last year, which has since been credited with passing various exams and has been touted as a possible aide to doctors, lawyers and other professionals.

First Published Date: 15 Sep, 07:45 IST