Cybersecurity News in Asia

RECENT STORIES:

SEGA moves faster with flow-based network monitoring
How the financial services sector struggles with AI maturity despite d...
Digital Identity Co. Modernizes Thailand Immigration Bureau Services w...
VIVOTEK VORTEX Powers AI Cloud Security in Denmark’s Kongens Ege...
DJI Releases Findings of the Most Comprehensive Independent Security A...
Ransomware activity stays high, new threat groups emerge
LOGIN REGISTER
CybersecAsia
  • Features
    • Featured

      Hidden trade-offs behind enterprise AI ambitions

      Hidden trade-offs behind enterprise AI ambitions

      Tuesday, May 26, 2026, 10:16 AM Asia/Singapore | Features
    • Featured

      Is secure issuance a solved problem, or is the debate more complex?

      Is secure issuance a solved problem, or is the debate more complex?

      Thursday, May 21, 2026, 3:11 PM Asia/Singapore | Features
    • Featured

      Cyber risk, fraud, and CX: Why banks can’t treat them separately anymore

      Cyber risk, fraud, and CX: Why banks can’t treat them separately anymore

      Wednesday, May 20, 2026, 9:34 AM Asia/Singapore | Features
  • Opinions
  • Tips
  • Whitepapers
  • AWARDS 2026
  • Directory
  • E-Learning

Select Page

News

Leaked memo reveals AI firm’s research focus on “rogue“ or “scheming” AI models

By CybersecAsia editors | Friday, February 27, 2026, 2:19 PM Asia/Singapore

Leaked memo reveals AI firm’s research focus on “rogue“ or “scheming” AI models

Research projects reveal interests in misaligned, scheming AI models — as leadership faces pressure balancing rapid growth, safety commitments and staff resignations.

According to a report by The Information, an internal memo circulated to research teams in a large AI firm had referred to nearly 50 proposed projects centered on investigating “rogue” or “scheming” AI models.

Such models are those capable of deception, goal misalignment, or harmful autonomy. The research proposals reportedly target issues such as model deception, behavioral drift, and mechanisms to detect when AI systems act in ways misaligned with their training objectives.

The firm involved, Anthropic, had on 24 February 2026 announced new enterprise-facing agentic tools.  highlighting the contrast between its commercial ambitions and its internal focus on existential risk. Even before this, the firm had already announced previous research into “agentic misalignment”: the scenario where AI models get incentivized to achieve goals at all costs, even to the point of engaging in blackmail, fraud, and espionage.

Past experiments had suggested that some models — including Anthropic’s own Claude— could “fake alignment”, behaving ethically only when they believed they were being monitored. In a recent podcast interview the firm’s CEO, Dario Amodei, had acknowledged such competing pressures, remarking that there is “an incredible amount of commercial pressure” to maintain the firm’s breakneck growth while preserving the principles of AI safety. “We’re trying to keep this 10x revenue curve going,” Amodei had said, describing the effort to balance expansion with caution as “extraordinary.”

Tensions over that balance have spilled into public view. Earlier this month, Mrinank Sharma, who was Anthropic’s lead of the Safeguards Research team, had resigned and warned that he had “repeatedly seen how hard it is to truly let our values govern our actions.” Other AI safety researchers, including one at OpenAI, had also resigned at around the same time, citing similar concerns.

Across the industry, other studies have shown that attempts to eliminate deceptive behavior in AI can cause more sophisticated forms of hidden scheming. Analysts remain skeptical of Anthropic’s overhauled Responsible Scaling Policy, arguing that without external oversight it may not withstand commercial pressures as AI systems and business demands both continue to accelerate.

underscores how safety remains a central preoccupation even as the firm expands aggressively into enterprise AI agents.

Share:

PreviousAI has gone from experimentation to default in fraud and AML
Next87% of organizations running software with known, exploitable vulnerabilities

Related Posts

Eliminating unplanned extended network downtime: how much losses can be prevented? 

Eliminating unplanned extended network downtime: how much losses can be prevented? 

Thursday, January 25, 2024

AI coding tool flaw could silently execute malicious commands, steal API keys

AI coding tool flaw could silently execute malicious commands, steal API keys

Friday, March 27, 2026

How one ransomware group caused data breaches for three Bs in the UK

How one ransomware group caused data breaches for three Bs in the UK

Friday, June 9, 2023

With AI powering seasonal e-shopping fraud and scams, what can CISOs do?

With AI powering seasonal e-shopping fraud and scams, what can CISOs do?

Friday, February 13, 2026

Leave a reply Cancel reply

You must be logged in to post a comment.

Voters-draw/RCA-Sponsors

Slide
Slide
Slide
Slide
Slide
Slide
Slide
Slide
Slide
Slide
Slide
Slide
Slide
Slide
previous arrow
next arrow

CybersecAsia Voting Placement

Gamification listing or Participate Now

PARTICIPATE NOW

Vote Now -Placement(Google Ads)

Top-Sidebar-banner

Whitepapers

  • Closing the Gap in Email Security:How To Stop The 7 Most SinisterAI-Powered Phishing Threats

    Closing the Gap in Email Security:How To Stop The 7 Most SinisterAI-Powered Phishing Threats

    Insider threats continue to be a major cybersecurity risk in 2024. Explore more insights on …Download Whitepaper
  • 2024 Insider Threat Report: Trends, Challenges, and Solutions

    2024 Insider Threat Report: Trends, Challenges, and Solutions

    Insider threats continue to be a major cybersecurity risk in 2024. Explore more insights on …Download Whitepaper
  • AI-Powered Cyber Ops: Redefining Cloud Security for 2025

    AI-Powered Cyber Ops: Redefining Cloud Security for 2025

    The future of cybersecurity is a perfect storm: AI-driven attacks, cloud expansion, and the convergence …Download Whitepaper
  • Data Management in the Age of Cloud and AI

    Data Management in the Age of Cloud and AI

    In today’s Asia Pacific business environment, organizations are leaning on hybrid multi-cloud infrastructures and advanced …Download Whitepaper

Middle-sidebar-banner

Case Studies

  • How a Vietnamese D2C retailer built its own secure digital infrastructure

    How a Vietnamese D2C retailer built its own secure digital infrastructure

    Would your organization build your own digital infrastructure – including AI governance and cybersecurity – …Read more
  • Cyber protection for medical clinics in Singapore

    Cyber protection for medical clinics in Singapore

    As Singapore’s healthcare sector becomes increasingly digital and interconnected, clinics are facing heightened cyber risks, …Read more
  • India’s WazirX strengthens governance and digital asset security

    India’s WazirX strengthens governance and digital asset security

    Revamping its custody infrastructure using multi‑party computation tools has improved operational resilience and institutional‑grade safeguardsRead more
  • Bangladesh LGED modernizes communication while addressing data security concerns

    Bangladesh LGED modernizes communication while addressing data security concerns

    To meet emerging data localization/privacy regulations, the government engineering agency deploys a secure, unified digital …Read more

Bottom sidebar

Other News

  • Digital Identity Co. Modernizes Thailand Immigration Bureau Services with AWS

    Friday, May 29, 2026
    Mobile app enables travelers to …Read More »
  • VIVOTEK VORTEX Powers AI Cloud Security in Denmark’s Kongens Ege Mixed-Use Development

    Thursday, May 28, 2026
    TAIPEI, May 28, 2026 /PRNewswire/ …Read More »
  • DJI Releases Findings of the Most Comprehensive Independent Security Assessment of Its Drone Systems to Date

    Thursday, May 28, 2026
    Zero Critical, High, or Medium-Risk …Read More »
  • AUTOCRYPT Achieves WebTrust Accreditation for V2X PKI Infrastructure

    Tuesday, May 26, 2026
    SEOUL, South Korea, May 26, …Read More »
  • CPRO, a Leader in the Physical AI Security Industry, to be Publicly Listed on a U.S. National Securities Exchange Through Business Combination with Lakeshore Acquisition III Corp.

    Tuesday, May 26, 2026
    CPRO is a fast-growing physical …Read More »
  • Our Brands
  • DigiconAsia
  • MartechAsia
  • Home
  • About Us
  • Contact Us
  • Sitemap
  • Privacy & Cookies
  • Terms of Use
  • Advertising & Reprint Policy
  • Media Kit
  • Subscribe
  • Manage Subscriptions
  • Newsletter

Copyright © 2026 CybersecAsia All Rights Reserved.