We have to stop ignoring AI’s hallucination problem – The Verge

Category: Ai

Google I/O introduced an AI assistant that can see and hear the world, while OpenAI put its version of a Her-like chatbot into an iPhone. Next week, Microsoft will be hosting Build, where its sure to have some version of Copilot or Cortana that understands pivot tables. Then, a few weeks after that, Apple will host its own developer conference, and if the buzz is anything to go by, itll be talking about artificial intelligence, too. (Unclear if Siri will be mentioned.)

AI is here! Its no longer conceptual. Its taking jobs, making a few new ones, and helping millions of students avoid doing their homework. According to most of the major tech companies investing in AI, we appear to be at the start of experiencing one of those rare monumental shifts in technology. Think the Industrial Revolution or the creation of the internet or personal computer. All of Silicon Valley of Big Tech is focused on taking large language models and other forms of artificial intelligence and moving them from the laptops of researchers into the phones and computers of average people. Ideally, they will make a lot of money in the process.

But I cant really care about that because Meta AI thinks I have a beard.

I want to be very clear: I am a cis woman and do not have a beard. But if I type show me a picture of Alex Cranz into the prompt window, Meta AI inevitably returns images of very pretty dark-haired men with beards. I am only some of those things!

Meta AI isnt the only one to struggle with the minutiae of The Verges masthead. ChatGPT told me yesterday I dont work at The Verge. Googles Gemini didnt know who I was (fair), but after telling me Nilay Patel was a founder of The Verge, it then apologized and corrected itself, saying he was not. (I assure you he was.)

The AI keeps screwing up because these computers are stupid. Extraordinary in their abilities and astonishing in their dimwittedness. I cannot get excited about the next turn in the AI revolution because that turn is into a place where computers cannot consistently maintain accuracy about even minor things.

I mean, they even screwed up during Googles big AI keynote at I/O. In a commercial for Googles new AI-ified search engine, someone asked how to fix a jammed film camera, and it suggested they open the back door and gently remove the film. That is the easiest way to destroy any photos youve already taken.

An AIs difficult relationship with the truth is called hallucinating. In extremely simple terms: these machines are great at discovering patterns of information, but in their attempt to extrapolate and create, they occasionally get it wrong. They effectively hallucinate a new reality, and that new reality is often wrong. Its a tricky problem, and every single person working on AI right now is aware of it.

One Google ex-researcher claimed it could be fixed within the next year (though he lamented that outcome), and Microsoft has a tool for some of its users thats supposed to help detect them. Googles head of Search, Liz Reid, told The Verge its aware of the challenge, too. Theres a balance between creativity and factuality with any language model, she told my colleague David Pierce. Were really going to skew it toward the factuality side.

But notice how Reid said there was a balance? Thats because a lot of AI researchers dont actually think hallucinations can besolved. A study out of the National University of Singapore suggested that hallucinations are an inevitable outcome of all large language models. Just as no person is 100 percent right all the time, neither are these computers.

And thats probably why most of the major players in this field the ones with real resources and financial incentive to make us all embrace AI think you shouldnt worry about it. During Googles IO keynote, it added, in tiny gray font, the phrase check responses for accuracy to the screen below nearly every new AI tool it showed off a helpful reminder that its tools cant be trusted, but it also doesnt think its a problem. ChatGPT operates similarly. In tiny font just below the prompt window, it says, ChatGPT can make mistakes. Check important info.

Thats not a disclaimer you want to see from tools that are supposed to change our whole lives in the very near future! And the people making these tools do not seem to care too much about fixing the problem beyond a small warning.

Sam Altman, the CEO of OpenAI who was briefly ousted for prioritizing profit over safety, went a step further and said anyone who had an issue with AIs accuracy was naive. If you just do the naive thing and say, Never say anything that youre not 100 percent sure about, you can get them all to do that. But it wont have the magic that people like so much, he told a crowd at Salesforces Dreamforce conference last year.

This idea that theres a kind of unquantifiable magic sauce in AI that will allow us to forgive its tenuous relationship with reality is brought up a lot by the people eager to hand-wave away accuracy concerns. Google, OpenAI, Microsoft, and plenty of other AI developers and researchers have dismissed hallucination as a small annoyance that should be forgiven because theyre on the path to making digital beings that might make our own lives easier.

But apologies to Sam and everyone else financially incentivized to get me excited about AI. I dont come to computers for the inaccurate magic of human consciousness. I come to them because they are very accurate when humans are not. I dont need my computer to be my friend; I need it to get my gender right when I ask and help me not accidentally expose film when fixing a busted camera. Lawyers, I assume, would like it to get the case law right.

I understand where Sam Altman and other AI evangelists are coming from. There is a possibility in some far future to create a real digital consciousness from ones and zeroes. Right now, the development of artificial intelligence is moving at an astounding speed that puts many previous technological revolutions to shame. There is genuine magic at work in Silicon Valley right now.

But the AI thinks I have a beard. It cant consistently figure out the simplest tasks, and yet, its being foisted upon us with the expectation that we celebrate the incredible mediocrity of the services these AIs provide. While I can certainly marvel at the technological innovations happening, I would like my computers not to sacrifice accuracy just so I have a digital avatar to talk to. That is not a fair exchange its only an interesting one.

Follow this link:

We have to stop ignoring AI's hallucination problem - The Verge

Google rolls out Gemini Deep Think AI, a reasoning model that tests multiple ideas in parallel - TechCrunch - August 1st, 2025 [August 1st, 2025]
Big Tech may be breaking the bank for AI, but investors love it - Reuters - August 1st, 2025 [August 1st, 2025]
Is Amazon Losing Ground To Microsoft And Google in AI? - Investor's Business Daily - August 1st, 2025 [August 1st, 2025]
Tim Cook Says Apple Is Investing Significantly in AI and Could Buy Another Company - Investopedia - August 1st, 2025 [August 1st, 2025]
UBS took a sweeping look at the AI revolution and concluded the 'visible' impact is at least 3 years away for consumer firms - Fortune - August 1st, 2025 [August 1st, 2025]
China has top-flight AI models. But it is struggling to run them - The Economist - August 1st, 2025 [August 1st, 2025]
The 10 jobs least and most threatened by AI - Axios - August 1st, 2025 [August 1st, 2025]
Startup Trunk Tools is using AI to reduce construction errors and waste - CNBC - August 1st, 2025 [August 1st, 2025]
Metas superintelligence isnt here yet. But its AI bets are already paying off - CNN - August 1st, 2025 [August 1st, 2025]
Ethan Thornton of Mach Industries takes the AI stage at Disrupt 2025 - TechCrunch - August 1st, 2025 [August 1st, 2025]
20 jobs and careers AI is unlikely to ever touch, according to Microsoft - Fortune - August 1st, 2025 [August 1st, 2025]
Palantir Won a Big Army Pact. The AI Firm Is in the Sweet Spot, Says Analyst. - Barron's - August 1st, 2025 [August 1st, 2025]
Etsy Turns From TV Ads Toward Search, With AI as the Wild Card - The Wall Street Journal - August 1st, 2025 [August 1st, 2025]
Big Tech's AI and core businesses are blurring together - Yahoo Finance - August 1st, 2025 [August 1st, 2025]
These 40 Jobs May Be Replaced by AI. These 40 Probably Won't Microsoft study identifies most AI affected jobs - Inc.com - August 1st, 2025 [August 1st, 2025]
Why You Shouldnt Use AI To Write Your Executive (Legal) Resume Or LinkedIn Profile - Above the Law - August 1st, 2025 [August 1st, 2025]
Prediction: This Artificial Intelligence (AI) Stock Could Hit a $10 Trillion Valuation by 2035 - The Motley Fool - August 1st, 2025 [August 1st, 2025]
What is AI, how do apps like ChatGPT work and why are there concerns? - BBC - August 1st, 2025 [August 1st, 2025]
BG3's Astarion voice actor is 'not interested' in AI: 'Where's the joy in it?' - Polygon - August 1st, 2025 [August 1st, 2025]
TikTok took the world by storm. Now, Chinese companies are taking videos further with AI - CNBC - August 1st, 2025 [August 1st, 2025]
'What am I falling in love with?' Human-AI relationships are no longer just science fiction - CNBC - August 1st, 2025 [August 1st, 2025]
I Watched AI Agents Try to Hack My Vibe-Coded Website - WIRED - August 1st, 2025 [August 1st, 2025]
Personal Perspective: AI training is here. But what we need is human training. - Psychology Today - August 1st, 2025 [August 1st, 2025]
Amazons overwhelming AI demand is just a bronze medal compared to its rivals - Sherwood News - August 1st, 2025 [August 1st, 2025]
Bidens autopen controversy says more about AI than you might think - The Hill - August 1st, 2025 [August 1st, 2025]
Enterprise AI is at a tipping Point, heres what comes next - The World Economic Forum - August 1st, 2025 [August 1st, 2025]
Apple quietens Wall Streets fears of China struggles and slow AI progress - The Guardian - August 1st, 2025 [August 1st, 2025]
The AI Company Capitalizing on Our Obsession With Excel - The Wall Street Journal - August 1st, 2025 [August 1st, 2025]
Apple to Significantly Invest in AI, Warns of $1 Billion Tariff Hit - PYMNTS.com - August 1st, 2025 [August 1st, 2025]
I'm a software engineer, and I've lost my job 4 times in the last 18 years. I don't think AI is the problem. - Business Insider - August 1st, 2025 [August 1st, 2025]
Exclusive | AI Finance App Ramp Is Valued at $22.5 Billion in Funding Round - The Wall Street Journal - July 30th, 2025 [July 30th, 2025]
How spy agencies are experimenting with the newest AI models - The Economist - July 30th, 2025 [July 30th, 2025]
Metas AI Recruiting Campaign Finds a New Target - WIRED - July 30th, 2025 [July 30th, 2025]
SOCOM adds new advanced AI capabilities to tech wish list - DefenseScoop - July 30th, 2025 [July 30th, 2025]
We will sign the EU AI Code of Practice. - The Keyword - July 30th, 2025 [July 30th, 2025]
Why the man behind The Haters Guide to the AI Bubble thinks Wall Streets hottest trade will go bust - MarketWatch - July 30th, 2025 [July 30th, 2025]
How Do You Think AI Will Impact Jobs In The Next 5-10 Years? - The Seattle Medium - July 30th, 2025 [July 30th, 2025]
The Trumpification of AI: What Could Go Wrong? - Mother Jones - July 30th, 2025 [July 30th, 2025]
YouTube to roll out new AI-powered technology aimed at identifying teen users - CBS News - July 30th, 2025 [July 30th, 2025]
Google says it will sign EUs AI code of practice - TechCrunch - July 30th, 2025 [July 30th, 2025]
Millions are watching these bizarre AI Bible videos on TikTok. Should you be worried or encouraged? - Premier Christianity Magazine - July 30th, 2025 [July 30th, 2025]
Silicon Valley's billions of dollars on AI haven't actually generated a return yet. Here's why most companies should embrace 'small AI' instead -... - July 30th, 2025 [July 30th, 2025]
Alibabas AI coding tool raises security concerns in the West - AI News - July 30th, 2025 [July 30th, 2025]
How Zuckerbergs Prometheus AI project could change the world as we know it - The Independent - July 30th, 2025 [July 30th, 2025]
Nvidia AI chip challenger Groq said to be nearing new fundraising at $6B valuation - TechCrunch - July 30th, 2025 [July 30th, 2025]
How US adults are using AI, according to AP-NORC polling - AP News - July 30th, 2025 [July 30th, 2025]
Gen AI apps doubled their revenue, grew to 1.7B downloads in first half of 2025 - TechCrunch - July 30th, 2025 [July 30th, 2025]
The AI browser war is underway. Compare the top browsers from Perplexity, Opera, and more. - Mashable - July 30th, 2025 [July 30th, 2025]
Aetna Launches New AI and Digital Tools to Improve Access and Care - CVS Health - July 30th, 2025 [July 30th, 2025]
YouTube is turning over age verification to AI - Engadget - July 30th, 2025 [July 30th, 2025]
Apple is facing pressure from Wall Street to figure out its AI strategy - CNBC - July 30th, 2025 [July 30th, 2025]
Google was once the most exciting place on the internet. AI mode will ruin it - The Independent - July 30th, 2025 [July 30th, 2025]
Microsoft research: Which jobs overlap most with AI tasks? - theregister.com - July 30th, 2025 [July 30th, 2025]
Amazons AI Coding Revealed a Dirty Little Secret - Bloomberg.com - July 30th, 2025 [July 30th, 2025]
Apple Loses Fourth AI Researcher in a Month to Metas Superintelligence Team - Bloomberg.com - July 30th, 2025 [July 30th, 2025]
Could This Once-Hot AI Stock Get Another Shot At Stardom? - Barchart.com - July 30th, 2025 [July 30th, 2025]
The race to provide AI agents for tedious tasks is on, but should we trust them with our data? - CBC - July 30th, 2025 [July 30th, 2025]
The Shortcut AI Excel agent could 'one-shot' spreadsheet jobs. Here's how to try it. - Mashable - July 30th, 2025 [July 30th, 2025]
With Ambiences new mega-round, AI scribes have announced nearly $1 billion in funding this year - statnews.com - July 30th, 2025 [July 30th, 2025]
Microsoft study identifies 40 jobs AI chatbots are likely to help automate and those where the tech is barely being used - Business Insider - July 30th, 2025 [July 30th, 2025]
AMD Raises AI Chip Price, Confident It Can Compete With Nvidia - Investor's Business Daily - July 28th, 2025 [July 28th, 2025]
Opinion | How AI is impacting 700 professions and might impact yours - The Washington Post - July 28th, 2025 [July 28th, 2025]
AMD Stock Is Rising. Its AI Chip Business Is Improving, Says Analyst. - Barron's - July 28th, 2025 [July 28th, 2025]
NFL record predictions 2025: AI makes win-loss picks for all 32 teams - USA Today - July 28th, 2025 [July 28th, 2025]
China's latest AI model claims to be even cheaper to use than DeepSeek - CNBC - July 28th, 2025 [July 28th, 2025]
Auterion says it will provide Ukraine with 33,000 AI drone guidance kits - Reuters - July 28th, 2025 [July 28th, 2025]
How Microsofts customers and partners accelerated AI Transformation in FY25 to innovate with purpose and shape their future success - The Official... - July 28th, 2025 [July 28th, 2025]
Hawleys bill would let people sue AI firms using their content without permission - STLPR - July 28th, 2025 [July 28th, 2025]
Western Union to Tap Stablecoins and AI for Greater Efficiencies - PYMNTS.com - July 28th, 2025 [July 28th, 2025]
Microsoft Edge transforms into an AI browser with new Copilot Mode - The Verge - July 28th, 2025 [July 28th, 2025]
Forget the Turing Test, AIs real challenge is communication - AI News - July 28th, 2025 [July 28th, 2025]
Trumps order to remove woke AI from government may have downstream impacts, experts worry - Nextgov/FCW - July 28th, 2025 [July 28th, 2025]
Google Search: Introducing AI Mode in the UK - The Keyword - July 28th, 2025 [July 28th, 2025]
AI is driving mass layoffs in tech, but it's boosting salaries by $18,000 a year everywhere else, study says - Fortune - July 28th, 2025 [July 28th, 2025]
Warren Buffett Has 40% of Berkshire Hathaway's $293 Billion Portfolio Invested in 5 Artificial Intelligence (AI) Stocks - Yahoo Finance - July 28th, 2025 [July 28th, 2025]
Meta Earnings On Deck. Zuckerberg's Big AI Bets Will Be In Focus. - Investor's Business Daily - July 28th, 2025 [July 28th, 2025]
Everyone's a loser in Trump's AI Action Plan - Engadget - July 28th, 2025 [July 28th, 2025]
On GPS: Bill Gates on navigating the future of AI - CNN - July 28th, 2025 [July 28th, 2025]
AI Apps Are Undressing Women Without Consent And Its A Problem - Forbes - July 28th, 2025 [July 28th, 2025]
AI's race in the dark with China - Axios - July 28th, 2025 [July 28th, 2025]

May 20th, 2024

No comments yet

Comments are closed.

Mediaboss Marketing

We have to stop ignoring AI’s hallucination problem – The Verge

About

Pages

Categories

Media Sites

Recommended Sites

Archives