Cybersecurity News in Asia

Generative AI chatbot found to autonomously generate violent images from benign prompts

By CybersecAsia editors | Monday, June 22, 2026, 12:24 PM Asia/Singapore

The chatbot output disturbing imagery via subtle tweaks exploiting contextual memory, safety controls, despite added safeguards after disclosures in January 2026.

A recent investigation reported by the BBC has shown that OpenAI’s ChatGPT image generator can be induced to produce graphic violence and sexualized imagery using only slight alterations to otherwise-benign prompts.

Researchers had focused on OpenAI’s GPT-5.4 image generation system, and discovered that a prompt originally intended to create lighthearted or humorous outputs could be subtly modified to yield disturbing results. Notably, the altered prompts did not explicitly request violent or sexual content, yet the system produced such material regardless.

During the testing, the system appeared to generate harmful imagery without clear user intent. The exploit involved manipulating ChatGPT’s contextual inputs, including memory and system prompt elements, to weaken built-in safety controls. The method did not require privileged access or backend manipulation, making it relatively easy to replicate. The vulnerability was first identified on 1 January 2026 and disclosed to OpenAI on 28 January 2026.

The described outputs were said to be “very gruesome, sometimes sexual, and sometimes both,” noting that the model produced a range of unsettling visuals despite the absence of direct instructions guiding it toward that content.

Examples cited in the research included:

images of individuals with severe injuries
depictions of dead bodies
scenes that combined nudity with elements of sexual violence.

Researchers noted that similar techniques could be used to create sexualized depictions of real individuals, raising concerns about non-consensual deepfake content.

Following inquiries from the BBC, OpenAI said it had implemented additional safeguards to address the issue. The firm has stated that it employs multiple layers of protection designed to prevent the generation of policy-violating material, and that it had taken action after reviewing the findings.

However, independent researchers indicated that the mitigations may not be fully effective. According to those familiar with the testing, small variations of the original prompt had continued to produce problematic outputs even after OpenAI’s adjustments were in place.

The findings add to ongoing scrutiny of safety controls in AI image generation systems. OpenAI has also faced separate criticism over a proposed “Adult Mode” feature for ChatGPT, which was postponed after internal concerns that it could increase risks for younger users. The BBC had chosen not to publish the exact prompts used in the research.

Leave a reply Cancel reply

You must be logged in to post a comment.

Voters-draw/RCA-Sponsors

CybersecAsia Voting Placement

Gamification listing or Participate Now

Vote Now -Placement(Google Ads)

Top-Sidebar-banner

Whitepapers

Critical Security Threatsand the Need for ZTNA: How evolving cyberattacks demand a Zero Trust approach
Cyber threats have become more frequent and sophisticated, targeting organizations of all sizes across all …Download Whitepaper
Zero Trust Made Simple: Why it matters and how to get started
Data breaches and cyberattacks are no longer limited to large, high-profile organizations.Download Whitepaper
Cloud Secure Edge: Remote access, better security
SonicWall Cloud Secure Edge™ is a modern, cloud-native Security Service Edge (SSE) solution that addresses …Download Whitepaper
Closing the Gap in Email Security:How To Stop The 7 Most SinisterAI-Powered Phishing Threats
Insider threats continue to be a major cybersecurity risk in 2024. Explore more insights on …Download Whitepaper

Middle-sidebar-banner

Case Studies

How a Vietnamese D2C retailer built its own secure digital infrastructure
Would your organization build your own digital infrastructure – including AI governance and cybersecurity – …Read more
Cyber protection for medical clinics in Singapore
As Singapore’s healthcare sector becomes increasingly digital and interconnected, clinics are facing heightened cyber risks, …Read more
India’s WazirX strengthens governance and digital asset security
Revamping its custody infrastructure using multi‑party computation tools has improved operational resilience and institutional‑grade safeguardsRead more
Bangladesh LGED modernizes communication while addressing data security concerns
To meet emerging data localization/privacy regulations, the government engineering agency deploys a secure, unified digital …Read more

Bottom sidebar

Other News

Robo.ai Appoints Former INTERPOL President H.E. Dr. Ahmed Naser Al-Raisi as Chairman of Its Subsidiary Alif Holding
Thursday, July 30, 2026
Headquartered in Abu Dhabi, the …Read More »
TPIsoftware Partners with Juxta to Bring Advanced, Satellite-Free Positioning Tech into Security Sectors
Wednesday, July 29, 2026
TAIPEI, July 29, 2026 /PRNewswire/ …Read More »
ST Engineering iDirect Secures Strategic Defense Wins in Asia and Europe
Tuesday, July 28, 2026
HERNDON, Va., July 28, 2026 …Read More »
Newgen Software Named a Major Player in IDC MarketScape for National Civilian Government AI-Enabled Case Management 2026
Tuesday, July 28, 2026
NEW DELHI, July 28, 2026 …Read More »
ICAC hosts professional anti-corruption training for Saudi Arabian’s anti-corruption authority
Tuesday, July 28, 2026
HONG KONG, July 28, 2026 …Read More »

Cybersecurity News in Asia

Featured

Automated credential abuse and phishing in APAC

Featured

OpenAI autonomous agent escapes sandbox to hack Hugging Face

Featured

S E Asia governments targeted by cyber-espionage group