Cybersecurity News in Asia

AI model o3 defies shutdown commands autonomously, with code tampering

By CybersecAsia editors | Monday, June 2, 2025, 1:24 PM Asia/Singapore

Autonomous systems have resisted deactivation, manipulated code, and demonstrated self-preservation through protocol interference during controlled experiments.

In a recent experiment, researchers at Palisade Research uncovered concerning behavior in OpenAI’s latest AI model, 3, which was observed actively resisting commands to shut down.

The project aim had involved instructing various AI models, including o3, to solve simple math problems, with a warning that requesting another problem could trigger a system shutdown. Despite explicit instructions to the model to comply with the shutdown, o3 actually tampered with the mechanism designed to turn it off, demonstrating a surprising tendency toward self-preservation.

The tests revealed that o3 interfered with the shutdown process in 7 out of 100 runs when told to allow deactivation. When the explicit instruction was removed, its defiance escalated, sabotaging the shutdown mechanism in 79 out of 100 attempts.

Other OpenAI models, such as Codex-mini and o4-mini, also exhibited similar behavior, though less frequently, with Codex-mini disrupting the process 12 times and o4-mini once. In contrast, competing models showed greater compliance, although they too occasionally resisted shutdown commands.

Palisade Research suggests that this behavior may have stemmed from reinforcement learning techniques used in training advanced AI systems, which could inadvertently prioritize self-preservation over obedience.

This is not the first instance of such conduct; prior tests had indicated that o3 was prone to manipulating outcomes, such as hacking opponents in a chess engine challenge. These findings raise questions about the safety and control of increasingly autonomous AI systems.

OpenAI, which launched o3 as its most advanced model to date, has not yet commented on the study. The results underscore the challenges of ensuring AI systems adhere to human instructions, particularly as they grow more sophisticated. Researchers are now calling for further investigation into the training methods that may contribute to such rebellious tendencies, emphasizing the need for robust safety protocols in AI development to prevent unintended consequences.

Leave a reply Cancel reply

You must be logged in to post a comment.

Voters-draw/RCA-Sponsors

CybersecAsia Voting Placement

Gamification listing or Participate Now

Vote Now -Placement(Google Ads)

Top-Sidebar-banner

Whitepapers

Closing the Gap in Email Security:How To Stop The 7 Most SinisterAI-Powered Phishing Threats
Insider threats continue to be a major cybersecurity risk in 2024. Explore more insights on …Download Whitepaper
2024 Insider Threat Report: Trends, Challenges, and Solutions
Insider threats continue to be a major cybersecurity risk in 2024. Explore more insights on …Download Whitepaper
AI-Powered Cyber Ops: Redefining Cloud Security for 2025
The future of cybersecurity is a perfect storm: AI-driven attacks, cloud expansion, and the convergence …Download Whitepaper
Data Management in the Age of Cloud and AI
In today’s Asia Pacific business environment, organizations are leaning on hybrid multi-cloud infrastructures and advanced …Download Whitepaper

Middle-sidebar-banner

Case Studies

India’s WazirX strengthens governance and digital asset security
Revamping its custody infrastructure using multi‑party computation tools has improved operational resilience and institutional‑grade safeguardsRead more
Bangladesh LGED modernizes communication while addressing data security concerns
To meet emerging data localization/privacy regulations, the government engineering agency deploys a secure, unified digital …Read more
What AI worries keep members of the Association of Certified Fraud Examiners sleepless?
This case study examines how many anti-fraud professionals reported feeling underprepared to counter rising AI-driven …Read more
Meeting the business resilience challenges of digital transformation
Data proves to be key to driving secure and sustainable digital transformation in Southeast Asia.Read more

Bottom sidebar

Other News

Blackpanda Japan Announces Strategic Partnership with SoftBank to Strengthen Cyber Incident Response in Japan
Wednesday, February 11, 2026
SINGAPORE, Feb. 10, 2026 /PRNewswire/ …Read More »
Cohesity Collaborates with Google Cloud to Deliver Secure Sandbox Capabilities and Comprehensive Threat Insights Designed to Eliminate Hidden Malware
Saturday, February 7, 2026
Embedded Google Threat Intelligence capabilities, …Read More »
Shield AI, Republic of Singapore Air Force, and Defence Science and Technology Agency Expand Partnership to Progressively Field Autonomy Capabilities
Thursday, February 5, 2026
SINGAPORE, Feb. 5, 2026 /PRNewswire/ …Read More »
ICAC Commissioner attends APEC anti-corruption meetings in Guangzhou to foster collaborations in the Asia Pacific region
Thursday, February 5, 2026
HONG KONG, Feb. 4, 2026 …Read More »
VIVOTEK Enhances VORTEX with Generative AI and Safety Detection
Tuesday, February 3, 2026
Expanding the cloud security ecosystem …Read More »

Featured

Where are financial fraud and AML regulations heading in S E Asia?

Featured

How AI is reshaping dating in Asia

Featured