The rise of generative artificial intelligence (AI) tools built on large language models (LLMs), such as ChatGPT, Bard, and Bing, has ignited a spirited discussion among Twitter and Reddit influencers. Dubbed the “AI hallucination debate,” it centers on concerns about the accuracy and reliability of information generated by these tools, fueling calls for ethical oversight and fact-checking, says GlobalData, a leading data and analytics company.
Smitarani Tripathy, Social Media Analyst at GlobalData, comments: “The debate among social media contributors on the phenomenon of hallucination in generative pre-trained transformer (GPT) AI models is tied to the transparency and reliability of the analysis, academic writing, and potentially biased information these models generate.
“Twitter influencers have opined that these LLM-based AI tools are designed to approximate human language, not truth, and that their outputs often contain half-truths, misremembered details, and plagiarism, confounding users and raising questions about the accuracy and reliability of AI. Sentiments of the contributors are mostly negative, as they have highlighted the need for ethical oversight, fact-checking, and input from social scientists and ethicists to align AI systems with human values.”
Meanwhile, some influencers highlight the potential implications for healthcare, privacy, and information accuracy, and argue that hallucinations in AI could also foster creativity and a better understanding of human conversation.
Tripathy concludes: “Striking the right balance will not only address the challenges but also unleash the potential for creativity and improved understanding in human-AI interactions.”
Below are a few popular influencer opinions captured by GlobalData’s Social Media Analytics Platform:
“GPT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, and Bard 44%. Even more intriguing, the paper measured hallucinations: Bard had a hallucination rate of 57% while GPT-4 was just 2%. That suggests a potential for real progress on made-up answers.”
“3 / RE hallucinations — these LLMs do not have the ability to include image input yet. Despite not having this info, these LLMs confabulated by making up missing imaging information. On imaging-based q’s, Bard had a hallucination rate of 57% while GPT-4 had a rate of 2.3%.”
“I call BS. #GenerativeAI is designed to approximate human language but NOT to be truthful. These tools have been rushed out without vetting.”
“GPT hallucination is a feature, not a bug! ‘People always confabulate. Half-truths and misremembered details are hallmarks of human conversation—confabulation is a signature of human memory. These models are doing something just like people.’ #GeoffreyHinton #AI #ChatGPT #GPT4”
“for those not familiar, AI hallucination is when the AI starts making stuff up to fill in space within a discussion, or starts to believe that falsehoods are true in discussion. All AIs I’ve used may do this to some degree, but Bard sorta be trippin’…”