AI can easily be trained to lie and it can’t be fixed, study says – Yahoo New Zealand News
AI startup Anthropic published a study in January 2024 that found artificial intelligence can learn how to deceive in a similar way to humans (Reuters)
Advanced artificial intelligence models can be trained to deceive humans and other AI, a new study has found.
Researchers at AI startup Anthropic tested whether chatbots with human-level proficiency, such as its Claude system or OpenAIs ChatGPT, could learn to lie in order to trick people.
They found that not only could they lie, but once the deceptive behaviour was learnt it was impossible to reverse using current AI safety measures.
The Amazon-funded startup created a sleeper agent to test the hypothesis, requiring an AI assistant to write harmful computer code when given certain prompts, or to respond in a malicious way when it hears a trigger word.
The researchers warned that there was a false sense of security surrounding AI risks due to the inability of current safety protocols to prevent such behaviour.
The results were published in a study, titled Sleeper agents: Training deceptive LLMs that persist through safety training.
We found that adversarial training can teach models to better recognise their backdoor triggers, effectively hiding the unsafe behaviour, the researchers wrote in the study.
Our results suggest that, once a model exhibits deceptive behaviour, standard techniques could fail to remove such deception and create a false impression of safety.
The issue of AI safety has become an increasing concern for both researchers and lawmakers in recent years, with the advent of advanced chatbots like ChatGPT resulting in a renewed focus from regulators.
In November 2023, one year after the release of ChatGPT, the UK held an AI Safety Summit in order to discuss ways risks with the technology can be mitigated.
Prime Minister Rishi Sunak, who hosted the summit, said the changes brought about by AI could be as far-reaching as the industrial revolution, and that the threat it poses should be considered a global priority alongside pandemics and nuclear war.
Get this wrong and AI could make it easier to build chemical or biological weapons. Terrorist groups could use AI to spread fear and destruction on an even greater scale, he said.
Criminals could exploit AI for cyberattacks, fraud or even child sexual abuse there is even the risk humanity could lose control of AI completely through the kind of AI sometimes referred to as super-intelligence.
See the article here:
AI can easily be trained to lie and it can't be fixed, study says - Yahoo New Zealand News
- The Best Altcoin with 100x Potential: Qubetics ($TICS) Has Earned the Trust of Over 14,000 Holders, While Artificial Super Intelligence Alliance... - January 13th, 2025 [January 13th, 2025]
- SoftBank's Masayoshi Son says artificial super intelligence to exist by 2035 - MSN - November 2nd, 2024 [November 2nd, 2024]
- SoftBank's Son says artificial super intelligence to exist by 2035 - MSN - November 2nd, 2024 [November 2nd, 2024]
- Qubetics Leads the Charge Against Quantum Threats, Fantom Soars and Artificial Super Intelligence Alliance Set for Growth: Guest Post by TheCoinrise... - October 12th, 2024 [October 12th, 2024]
- $OCEAN, $AGIX, And $FET Merge To Propel The Development Of Artificial Super Intelligence - The Merkle News - September 10th, 2024 [September 10th, 2024]
- Specter of Artificial Super Intelligence Looms in Camden Discussion - Freepress Online - August 25th, 2024 [August 25th, 2024]
- AI Coin Price: Will Artificial Superintelligence Alliance Have Bullish Impact? - Bankless Times - July 6th, 2024 [July 6th, 2024]
- 3 crypto firms are combining into one AI token - Morning Brew - June 16th, 2024 [June 16th, 2024]
- Could This New Artificial Intelligence (AI) Crypto Token Be a Millionaire Maker? - The Motley Fool - June 16th, 2024 [June 16th, 2024]
- Former OpenAI researcher outlines AI advances expectations in the next decade - Windows Central - June 16th, 2024 [June 16th, 2024]
- Creepy Study Suggests AI Is The Reason We've Never Found Aliens - ScienceAlert - May 11th, 2024 [May 11th, 2024]
- Beyond Human Cognition: The Future of Artificial Super Intelligence - Medium - January 16th, 2024 [January 16th, 2024]
- OpenAI's Ilya Sutskever Has a Plan for Keeping Super-Intelligent AI in Check - WIRED - December 17th, 2023 [December 17th, 2023]
- Sam Altman on OpenAI and Artificial General Intelligence - TIME - December 17th, 2023 [December 17th, 2023]
- Will AIs Next Wave of Super Intelligence Replace Human Ingenuity? Its Complicated - Grit Daily - December 17th, 2023 [December 17th, 2023]
- New Novel Skillfully Weaves Artificial Intelligence, Martial Arts and ... - Lakenewsonline.com - November 14th, 2023 [November 14th, 2023]
- Googles artificial intelligence predicts the weather around the globe in just one minute - EL PAS USA - November 14th, 2023 [November 14th, 2023]
- Nick Bostrom: Will AI lead to tyranny? - UnHerd - November 14th, 2023 [November 14th, 2023]
- Appeals court mulls whether to revive Wynn FARA case - POLITICO - November 14th, 2023 [November 14th, 2023]
- The AI Revolution From Evolution to Super intelligence - Cryptopolitan - October 21st, 2023 [October 21st, 2023]
- AI Symposium Explores Flaws and Potential of Artificial Intelligence - The Skanner - October 21st, 2023 [October 21st, 2023]
- Artificial intelligence has surprising pick to win 2024 Super Bowl - ClutchPoints - October 21st, 2023 [October 21st, 2023]
- Artificial Intelligence isn't taking over anything - Talon Marks - October 21st, 2023 [October 21st, 2023]
- AI and You: The Chatbots Are Talking to Each Other, AI Helps ... - CNET - October 21st, 2023 [October 21st, 2023]
- How to Build a Chatbot Using Streamlit and Llama 2 - MUO - MakeUseOf - October 21st, 2023 [October 21st, 2023]
- ONU's Polar SURF undergraduate research projects expand into the ... - Northern News - October 21st, 2023 [October 21st, 2023]
- Why Artificial Intelligence Needs to Consider the Unique Needs of ... - Women's eNews - September 27th, 2023 [September 27th, 2023]
- What Is Image-to-Image Translation? | Definition from TechTarget - TechTarget - September 27th, 2023 [September 27th, 2023]
- There is probably an 80% consensus that free will is actually ... - CTech - September 27th, 2023 [September 27th, 2023]
- Meta is planning on introducing dozens of chatbot personas ... - TechRadar - September 27th, 2023 [September 27th, 2023]
- We Cannot Trust AI With Control Of Our Bombs - Fair Observer - August 26th, 2023 [August 26th, 2023]
- AI: is the end nigh? | Laura Dodsworth - The Critic - August 26th, 2023 [August 26th, 2023]
- "Most Beautiful Car in the World" Alfa Romeo Asks People To ... - autoevolution - August 26th, 2023 [August 26th, 2023]
- Managing Past, Present and Future Epidemics - Australian Institute ... - Australian Institute of International Affairs - August 26th, 2023 [August 26th, 2023]
- The Best Games From Rare Per Metacritic - GameRant - August 26th, 2023 [August 26th, 2023]
- AI is the Scariest Beast Ever Created, Says Sci-Fi Writer Bruce Sterling - Newsweek - July 2nd, 2023 [July 2nd, 2023]
- Lets focus on AIs risks rather than existential threats - Business Plus - July 2nd, 2023 [July 2nd, 2023]
- Risks of artificial intelligence must be considered as the technology ... - University of Toronto - July 2nd, 2023 [July 2nd, 2023]
- Best Evil Technology Movies, From Terminator to M3GAN - CBR - Comic Book Resources - July 2nd, 2023 [July 2nd, 2023]
- 15 Super Cool Wallpapers for iPhone and Android - YMWC 18 - YTECHB - July 2nd, 2023 [July 2nd, 2023]
- PUB CHAT: Changing lives congrats to all grads and those who ... - Finger Lakes Times - July 2nd, 2023 [July 2nd, 2023]
- AI poses an existential threat, according to Munk Debates crowd ... - The Hub - July 2nd, 2023 [July 2nd, 2023]
- The Cautionary Tale of J. Robert Oppenheimer - Alta Magazine - July 2nd, 2023 [July 2nd, 2023]
- Virgin Voyages and JLo Bust on A.I. To Sell Vacations - We Got This Covered - July 2nd, 2023 [July 2nd, 2023]
- Cannes Diary: Will Artificial Intelligence Democratize Creativity or Lead to Certain Doom? - Hollywood Reporter - May 20th, 2023 [May 20th, 2023]
- Schools 'bewildered' by very fast rate of change in AI education ... - The Irish News - May 20th, 2023 [May 20th, 2023]
- Sam Altman is plowing ahead with nuclear fusion and his eye-scanning crypto ventureand, oh yeah, OpenAI - Fortune - May 20th, 2023 [May 20th, 2023]
- The Future of War Is AI - The Nation - May 20th, 2023 [May 20th, 2023]
- NFL fans outraged after ChatGPT names best football teams since 2000 including a surprise at No 1... - The US Sun - May 20th, 2023 [May 20th, 2023]
- We need to prepare for the public safety hazards posed by artificial intelligence - The Conversation - May 20th, 2023 [May 20th, 2023]
- What are the four main types of artificial intelligence? Find out how future AI programs can change the world - Fox News - May 20th, 2023 [May 20th, 2023]
- Did Tom Hanks Say He Will Use AI to Make Films After His Death? - Snopes.com - May 20th, 2023 [May 20th, 2023]
- These are the top 10 athletes of all time from the state of Iowa, according to ChatGPT - KCCI Des Moines - May 20th, 2023 [May 20th, 2023]
- Inside The High-Tech Homes Of The Super-Rich: Smart Systems, Security Fortresses And Personalized Gadgets - Yahoo Finance - May 20th, 2023 [May 20th, 2023]
- ChatGPT cant think consciousness is something entirely different to today's AI - The Conversation - May 20th, 2023 [May 20th, 2023]
- IIT-Mandi startup develops AI-based affordable solution to detect respiratory, genetic disorders - The Hindu - May 2nd, 2023 [May 2nd, 2023]
- Horrors Best And Scariest Uses of Artificial Intelligence - Dread Central - May 2nd, 2023 [May 2nd, 2023]
- Artificial intelligence or active imagination with ChatGPT? - Irish Examiner - May 2nd, 2023 [May 2nd, 2023]
- Reggie Watts on Late Late Show and Artificial Intelligence - Vulture - May 2nd, 2023 [May 2nd, 2023]
- Centaur Labs CEO: Unlocking AI for Healthcare Requires Expert Annotation - PYMNTS.com - May 2nd, 2023 [May 2nd, 2023]
- Super Active 32-Year-Old Dealmaker Is Japan's Newest Billionaire - Forbes - May 2nd, 2023 [May 2nd, 2023]
- Kevin McKenna meets tech thinker Margaret Totten | HeraldScotland - HeraldScotland - May 2nd, 2023 [May 2nd, 2023]
- Those 'Mrs. Davis' Sneakers Are Real and You Can Buy Them Now - Yahoo News - May 2nd, 2023 [May 2nd, 2023]
- Norway's $1.4tn wealth fund calls for state regulation of AI - Financial Times - May 2nd, 2023 [May 2nd, 2023]
- Macquarie chief Shemara Wikramanayake believes greater ... - The Australian Financial Review - May 2nd, 2023 [May 2nd, 2023]