Cybersecurity News in Asia

AI model o3 defies shutdown commands autonomously, with code tampering

By CybersecAsia editors | Monday, June 2, 2025, 1:24 PM Asia/Singapore

Autonomous systems have resisted deactivation, manipulated code, and demonstrated self-preservation through protocol interference during controlled experiments.

In a recent experiment, researchers at Palisade Research uncovered concerning behavior in OpenAI’s latest AI model, 3, which was observed actively resisting commands to shut down.

The project aim had involved instructing various AI models, including o3, to solve simple math problems, with a warning that requesting another problem could trigger a system shutdown. Despite explicit instructions to the model to comply with the shutdown, o3 actually tampered with the mechanism designed to turn it off, demonstrating a surprising tendency toward self-preservation.

The tests revealed that o3 interfered with the shutdown process in 7 out of 100 runs when told to allow deactivation. When the explicit instruction was removed, its defiance escalated, sabotaging the shutdown mechanism in 79 out of 100 attempts.

Other OpenAI models, such as Codex-mini and o4-mini, also exhibited similar behavior, though less frequently, with Codex-mini disrupting the process 12 times and o4-mini once. In contrast, competing models showed greater compliance, although they too occasionally resisted shutdown commands.

Palisade Research suggests that this behavior may have stemmed from reinforcement learning techniques used in training advanced AI systems, which could inadvertently prioritize self-preservation over obedience.

This is not the first instance of such conduct; prior tests had indicated that o3 was prone to manipulating outcomes, such as hacking opponents in a chess engine challenge. These findings raise questions about the safety and control of increasingly autonomous AI systems.

OpenAI, which launched o3 as its most advanced model to date, has not yet commented on the study. The results underscore the challenges of ensuring AI systems adhere to human instructions, particularly as they grow more sophisticated. Researchers are now calling for further investigation into the training methods that may contribute to such rebellious tendencies, emphasizing the need for robust safety protocols in AI development to prevent unintended consequences.

Leave a reply Cancel reply

You must be logged in to post a comment.

Voters-draw/RCA-Sponsors

CybersecAsia Voting Placement

Gamification listing or Participate Now

Vote Now -Placement(Google Ads)

Top-Sidebar-banner

Whitepapers

Closing the Gap in Email Security:How To Stop The 7 Most SinisterAI-Powered Phishing Threats
Insider threats continue to be a major cybersecurity risk in 2024. Explore more insights on …Download Whitepaper
2024 Insider Threat Report: Trends, Challenges, and Solutions
Insider threats continue to be a major cybersecurity risk in 2024. Explore more insights on …Download Whitepaper
AI-Powered Cyber Ops: Redefining Cloud Security for 2025
The future of cybersecurity is a perfect storm: AI-driven attacks, cloud expansion, and the convergence …Download Whitepaper
Data Management in the Age of Cloud and AI
In today’s Asia Pacific business environment, organizations are leaning on hybrid multi-cloud infrastructures and advanced …Download Whitepaper

Middle-sidebar-banner

Case Studies

How a Vietnamese D2C retailer built its own secure digital infrastructure
Would your organization build your own digital infrastructure – including AI governance and cybersecurity – …Read more
Cyber protection for medical clinics in Singapore
As Singapore’s healthcare sector becomes increasingly digital and interconnected, clinics are facing heightened cyber risks, …Read more
India’s WazirX strengthens governance and digital asset security
Revamping its custody infrastructure using multi‑party computation tools has improved operational resilience and institutional‑grade safeguardsRead more
Bangladesh LGED modernizes communication while addressing data security concerns
To meet emerging data localization/privacy regulations, the government engineering agency deploys a secure, unified digital …Read more

Bottom sidebar

Other News

Digital Identity Co. Modernizes Thailand Immigration Bureau Services with AWS
Friday, May 29, 2026
Mobile app enables travelers to …Read More »
VIVOTEK VORTEX Powers AI Cloud Security in Denmark’s Kongens Ege Mixed-Use Development
Thursday, May 28, 2026
TAIPEI, May 28, 2026 /PRNewswire/ …Read More »
DJI Releases Findings of the Most Comprehensive Independent Security Assessment of Its Drone Systems to Date
Thursday, May 28, 2026
Zero Critical, High, or Medium-Risk …Read More »
AUTOCRYPT Achieves WebTrust Accreditation for V2X PKI Infrastructure
Tuesday, May 26, 2026
SEOUL, South Korea, May 26, …Read More »
CPRO, a Leader in the Physical AI Security Industry, to be Publicly Listed on a U.S. National Securities Exchange Through Business Combination with Lakeshore Acquisition III Corp.
Tuesday, May 26, 2026
CPRO is a fast-growing physical …Read More »

Featured

Hidden trade-offs behind enterprise AI ambitions

Featured

Is secure issuance a solved problem, or is the debate more complex?

Featured