While the stated purpose of the research was “to assess the capability of LLMs to generate personas with limited cognitive and language skills” (a capability the models did demonstrate), the more interesting findings were these:

    • Every test of a model’s ability to perform a task was, in reality, a threefold test: of the examiner’s skill in defining a persona suited to the task, of their proficiency in locating that persona within the model’s latent space, and of the model’s latent capacity to simulate the persona with sufficient fidelity to accomplish the task.
    • The language models proved capable of “downplaying their abilities to achieve a faithful simulation of prompted personas” (see the sketch after this list).
    • Even if an LLM (by current standards) encompasses a more comprehensive world-model than any human, prompting it to simulate a human or human-like expert would not (for now) result in super-human behavior, “since the human imperfections would be simulated as well”.
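To make the mechanism concrete, here is a minimal persona-prompting sketch in Python. It assumes the OpenAI Python SDK with an API key in the environment; the model name and the persona wording are illustrative assumptions, not the researchers’ actual prompts.

```python
# Minimal persona-prompting sketch (illustrative; not the study's exact prompts).
# Assumes the OpenAI Python SDK and OPENAI_API_KEY set in the environment;
# the model name and persona text below are assumptions for demonstration.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# The persona definition is the examiner's real "test input": how precisely it
# locates the intended persona in the model's latent space largely determines
# whether the simulation succeeds.
PERSONA = (
    "You are Mia, a 7-year-old child. You have a small vocabulary, make "
    "occasional grammar mistakes, and do not know adult-level facts. "
    "Never break character or reveal knowledge a 7-year-old would not have."
)

response = client.chat.completions.create(
    model="gpt-4o",  # assumed model; any chat-capable model would do
    messages=[
        {"role": "system", "content": PERSONA},
        {"role": "user", "content": "Can you explain how airplanes stay in the sky?"},
    ],
)

# A faithful simulation should *downplay* the model's own abilities: expect a
# simple, partly incorrect, child-like answer rather than an aerodynamics
# lecture -- the human imperfections are simulated along with the persona.
print(response.choices[0].message.content)
```

The same pattern applies to any chat-style API: the system message pins the persona, and the fidelity of the simulation, including its deliberate limitations, depends on how precisely that message specifies the persona.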