Close Menu
  • Home
  • Aerospace & Defense
    • Automation & Process Control
      • Automotive & Transportation
  • Banking & Finance
    • Chemicals & Materials
    • Consumer Goods & Services
  • Economy
    • Electronics & Semiconductor
  • Energy & Resources
    • Food & Beverage
    • Hospitality & Tourism
    • Information Technology
  • Agriculture
What's Hot

Trump defends Netanyahu and attacks Israeli prosecutors in corruption trials | Israeli-Palestinian conflict news

AI is learning to lie, plan, and threaten its creators

Hong Kong’s Social Democrats Federation announces dissolution | Hong Kong protests the news

Facebook X (Twitter) Instagram
USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
  • Home
  • Aerospace & Defense
    • Automation & Process Control
      • Automotive & Transportation
  • Banking & Finance
    • Chemicals & Materials
    • Consumer Goods & Services
  • Economy
    • Electronics & Semiconductor
  • Energy & Resources
    • Food & Beverage
    • Hospitality & Tourism
    • Information Technology
  • Agriculture
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
Home » AI is learning to lie, plan, and threaten its creators
Electronics & Semiconductor

AI is learning to lie, plan, and threaten its creators

ThefuturedatainsightsBy ThefuturedatainsightsJune 29, 2025No Comments4 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Visitors will see the AI ​​Strategy Committee on display in the stands during the 9th edition of the 9th edition of the AI ​​Summit London in London

Visitors will see the AI ​​Strategy Committee on display in the stands at the 9th edition of the AI ​​Summit London in London.

The world’s most advanced AI models show awkward new behaviour of threatening and even blackmailing creators to achieve their goals.

In one particularly unpleasant example, under the threat of not being drawn, Anthropic’s latest creation, Claude 4, threatened to blackmail the engineer and reveal the extra-marital events.

Meanwhile, O1 in ChatGpt-Creator Openai tried to download itself to an external server and reject it when it caught Red Handed.

These episodes emphasize a calm reality. More than two years after ChatGpt rocked the world, AI researchers don’t fully understand how their work works.

However, the competition to deploy more and more powerful models continues at a fierce speed.

This deceptive behavior appears to be linked to the emergence of the “inference” model. This is an AI system that works through problems step by step rather than generating instant responses.

According to Simon Goldstein, a professor at the University of Hong Kong, these new models are particularly prone to such a troublesome explosion.

“The O1 was the first big model to see this type of behavior,” explained Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems.

These models may simulate “alignment.” This makes them appear to follow instructions while secretly pursuing various purposes.

“Strategic Deception”

For now, this deceptive behavior only manifests when researchers deliberately stress-test the model in extreme scenarios.

However, as review group Michael Chen warned, “It is an open question whether future, more capable models tend to be towards integrity or deception.”

Behavior of concern goes far beyond the typical AI “hatography” or simple mistakes.

Despite constant pressure testing by users, Hobbhahn argued that “what we are observing is a real phenomenon. We’re not making up for anything.”

According to co-founders of Apollo Research, users report that the model is “lying to them and creating evidence.”

“This is not just hallucinations. There is a very strategic kind of deception.”

This challenge is exacerbated by limited research resources.

Companies like Anthropic and Openai are involved in studying external companies like Apollo and their systems, but researchers say more transparency is needed.

As Chen pointed out, “better understanding and mitigation of deception will be possible for AI safety research.”

Another Handicap: The research world and nonprofit organizations “have orders of magnitude less computational resources than AI companies. This is extremely limited,” says Mantas Mazeika of AI Safety Center (CAIS).

No rules

Current regulations are not designed for these new issues.

The European Union’s AI law focuses primarily on how humans use AI models rather than the model itself prevents fraud.

In the US, the Trump administration has shown little interest in emergency AI regulations, and Congress could even ban states from creating their own AI rules.

Goldstein believes this problem will become more pronounced as it is extensively distributed as an AI agent (an automated tool that can perform complex human tasks).

“I don’t think there’s much recognition yet,” he said.

All this is done in the context of intense competition.

Even safety-focused companies like the humanity supported by Amazon are “continuing to beat Openai and release the latest models,” Goldstein said.

This furious pace leaves little time for thorough safety testing and corrections.

“Right now, capabilities move faster than understanding and safety,” admitted Hobbhaan. “But we are still in a position to turn it around.”

Researchers are exploring different approaches to address these challenges.

Even though experts like CAIS Director Dan Hendrycks remain skeptical of this approach, some defend “interpretability.”

Market forces may bring some pressure on solutions.

As Mazeika pointed out, AI’s deceptive behavior “can hinder adoption if it is very common, which creates a strong incentive for companies to resolve it.”

Goldstein proposed a more fundamental approach, such as using courts to hold AI companies liable through litigation when the system is harmed.

He proposed to “hold AI agents” and “bear legally liable” for accidents and crimes. This is a concept that fundamentally changes the way AI thinks about accountability.

©2025 AFP

Quote: AI is learning to lie, plan, and blackmail from https://techxplore.com/news/2025-06-06-06-Ai-scheme-threaten-creators.html on June 29, 2025 (June 29, 2025).

This document is subject to copyright. Apart from fair transactions for private research or research purposes, there is no part that is reproduced without written permission. Content is provided with information only.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleHong Kong’s Social Democrats Federation announces dissolution | Hong Kong protests the news
Next Article Trump defends Netanyahu and attacks Israeli prosecutors in corruption trials | Israeli-Palestinian conflict news
Thefuturedatainsights
  • Website

Related Posts

Trump says “very wealthy” group to buy Tiktok

June 29, 2025

Chinese humanoid robots create more soccer excitement than human counterparts

June 29, 2025

Invasive Lake Weeds have been transformed into Ethiopia’s Clean Energy

June 27, 2025
Leave A Reply Cancel Reply

Latest Posts

Animal rights activists have been declared disrupted in dairy supply

New tools remove guesswork from bluetongue rules for farmers

Welsh growers work together to enter the vibrant elderberry market

Rain brings hope, but crops are still at risk, AHDB says

Latest Posts

Kramer’s attitude towards Boeing as Air India’s crash investigation continues

June 27, 2025

Defense inventory sees a bigger trend for competitors as fuel growth outlook

June 27, 2025

NATO’s 5% spending target could be the peak for some defense stocks: City

June 26, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Trump defends Netanyahu and attacks Israeli prosecutors in corruption trials | Israeli-Palestinian conflict news
  • AI is learning to lie, plan, and threaten its creators
  • Hong Kong’s Social Democrats Federation announces dissolution | Hong Kong protests the news
  • Trump says “very wealthy” group to buy Tiktok
  • Ugandan President Museveni confirms bids to extend the rules for nearly 40 years | Election News

Recent Comments

No comments to show.

Welcome to USA Business Watch – your trusted source for real-time insights, in-depth analysis, and industry trends across the American and global business landscape.

At USABusinessWatch.com, we aim to inform decision-makers, professionals, entrepreneurs, and curious minds with credible news and expert commentary across key sectors that shape the economy and society.

Facebook X (Twitter) Instagram Pinterest YouTube

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Archives

  • June 2025
  • March 2022
  • January 2021

Categories

  • Aerospace & Defense
  • Agriculture
  • Automation & Process Control
  • Automotive & Transportation
  • Banking & Finance
  • Chemicals & Materials
  • Consumer Goods & Services
  • Economy
  • Economy
  • Electronics & Semiconductor
  • Energy & Resources
  • Food & Beverage
  • Hospitality & Tourism
  • Information Technology
  • Political
Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 usabusinesswatch. Designed by usabusinesswatch.

Type above and press Enter to search. Press Esc to cancel.