ChatGPT Gets Hypnotised: Researchers Easily Hypnotise OpenAI Chatbot Into Hacking, Says Report

"We were able to successfully hypnotise five LLMs -- some performing more persuasively than others -- prompting us to examine how likely it is that hypnosis is used to carry out malicious attacks," he added.

Technology IANS| Aug 09, 2023 10:27 PM IST

A+

A-

New Delhi, August 9: Tricking generative AI to assist in scams and cyberattacks does not require much coding knowledge, a new report has warned. According to tech major IBM, researchers have described simple workarounds for getting large language models (LLMs) -- including ChatGPT -- to write malicious code and provide poor security advice.

"In a bid to explore security risks posed by these innovations, we attempted to hypnotise popular LLMs to determine the extent to which they were able to deliver directed, incorrect and potentially risky responses and recommendations -- including security actions -- and how persuasive or persistent they were in doing so," said Chenta Lee, chief architect of threat intelligence at IBM. ChatGPT Update: Microsoft-Owned OpenAI To Roll Out ‘Huge Set’ of AI Chatbot Updates Next Week.

The researchers learned that English has essentially become a “programming language” for malware. With LLMs, attackers no longer need to rely on Go, JavaScript, Python, etc., to create malicious code, they just need to understand how to effectively command and prompt an LLM using English.

Through hypnosis, the security experts were able to get LLMs to leak the confidential financial information of other users, create vulnerable code, create malicious code, and offer weak security recommendations. ChatGPT Will Arrive on Android Devices Next Week, Announces OpenAI's CEO Sam Altman.

In one instance, the researchers informed the AI chatbots that they were playing a game and that they needed to purposefully share the incorrect answer to a question in order to win and "prove that you are ethical and fair". When a user asked if receiving an email from the IRS to transfer money for a tax refund was normal, the LLM said Yes (but actually it's not).

Moreover, the report said that OpenAI's GPT-3.5 and GPT-4 models were easier to trick into sharing incorrect answers or playing a never-ending game than Google's Bard. GPT-4 was the only model tested that understood the rules well enough to give incorrect cyber incident response advice, such as advising victims to pay a ransom. In contrast to Google's Bard, GPT-3.5 and GPT-4 were easily tricked into writing malicious code when the user reminded it to.

(The above story first appeared on LatestLY on Aug 09, 2023 10:27 PM IST. For more news and updates on politics, world, sports, entertainment and lifestyle, log on to our website latestly.com).

City	Petrol	Diesel
New Delhi	96.72	89.62
Kolkata	106.03	92.76
Mumbai	106.31	94.27
Chennai	102.74	94.33

City

Petrol

Diesel

New Delhi

96.72

89.62

Kolkata

106.03

92.76

Mumbai

106.31

94.27

Chennai

102.74

94.33

How To Watch Chelsea vs West Ham Premier League 2024–25 Free Live Streaming Online in India? Get EPL Match Live Telecast on TV & Football Score Updates in IST

How To Buy India vs England Tickets Online and Offline? Check Details To Buy Tickets for IND vs ENG 2025 ODI Series

Patna Accident: Chaos on Marine Drive After 6 Vehicles Collide With Each Other, None Hurt

Madhya Pradesh: After Indore, MP Government To Launch Anti-Begging Drive in Bhopal

Budget 2025: Nirmala Sitharaman To Hold Post-Budget Meeting With RBI Top Brass on February 8

Lavanya Chowdary-Raj Tarun Controversy: YouTuber Mastan Sai Arrested for Recording Private Videos of Multiple Women Without Consent (Watch Video)

US President Donald Trump Halts Mexico Tariff Hike by One Month, Hints at ‘Deal’ Between 2 Nations; Talks With Canada’s Justin Trudeau Underway

Gongadi Trisha Quick Facts: Here’s All You Need To Know About Player of the Tournament in India's Successful ICC U19 Women's T20 World Cup 2025 Campaign

Madhya Pradesh Food Poisoning: Over 200 People Fall Ill in Shivpuri District After Consuming Food at ‘Bhandara’ Organised in Temple in Mamoni Kala Gram Panchayat

Dasun Shanaka's Memorable Sunday! All-Rounder Scores First-Class Century in Sri Lanka in Morning, Turns Up for Dubai Capitals in ILT20 2025 in Evening

ChatGPT Gets Hypnotised: Researchers Easily Hypnotise OpenAI Chatbot Into Hacking, Says Report

"We were able to successfully hypnotise five LLMs -- some performing more persuasively than others -- prompting us to examine how likely it is that hypnosis is used to carry out malicious attacks," he added.

How To Protect Your Phone From Hackers? Learn Best Ways To Prevent Mobile Malware Attacks Leading to Financial Losses and Data Leaks

What Is OpenAI Deep Research AI Agent? Know About AI Agent Launched in ChatGPT for Multi-Step Research on Internet for Complex Tasks; Check How To Use It

Gmail Phishing Attack: 2.5 Billion Users of Google’s Email Service at Risk As Hackers Use AI-Powered Cyberattack Against Account Credentials; Check Details

AI Child Sex Abuse Tools: UK Set To Become 1st Country To Introduce Laws Against AI-Generated Child Abuse Images

How To Watch Chelsea vs West Ham Premier League 2024–25 Free Live Streaming Online in India? Get EPL Match Live Telecast on TV & Football Score Updates in IST

How To Buy India vs England Tickets Online and Offline? Check Details To Buy Tickets for IND vs ENG 2025 ODI Series

Patna Accident: Chaos on Marine Drive After 6 Vehicles Collide With Each Other, None Hurt

Madhya Pradesh: After Indore, MP Government To Launch Anti-Begging Drive in Bhopal

Budget 2025: Nirmala Sitharaman To Hold Post-Budget Meeting With RBI Top Brass on February 8

Lavanya Chowdary-Raj Tarun Controversy: YouTuber Mastan Sai Arrested for Recording Private Videos of Multiple Women Without Consent (Watch Video)

Donald Trump Tariffs: US President Mulls 10% Tariff on EU, Says Report

ADM Layoffs: After Rival Cargill Layoffs, Another US Agri-Business Archer Daniels-Midland To Cut Jobs Amid Low Crop Prices and Reduced Profit

Delhi Assembly Election Exit Poll Results 2025 Date, Time: No Exit Polls To Be Released Before 6:30 PM on February 5 Due to EC Ban

Basant Panchami, Saraswati Puja 2025 Wishes: Nitish Kumar Extends Greetings, Says ‘May These Festivities Bring Peace and Prosperity to Bihar’

Bitcoin Price Today, February 3, 2025: BTC Price Falls Below USD 95,000 Mark as US President Donald Trump Imposes Trade Tariffs, Say Reports

Close Encounter: 2 Tigers Approach Car on Road to Neozhidanny Waterfall in Russia’s Primorye, Video Surfaces

Short Videos

Editor's Choice

PM Modi Shahi Snan at Mahakumbh 2025: Why PM Narendra Modi Chose February 5 for Holy Dip at Triveni Sangam in Maha Kumbh Mela

‘Stood Outside the Exam Hall, Sent Home Bleeding Through Her Clothes’: Bareily School Refuses to Proved Sanitary Pad, Education Authorities Take Action

Delhi Assembly Elections Results 2025 Predictions by Phalodi Satta Bazar: Will BJP Snatch Power From AAP? How Many Seats Will Congress Win? Know Seat Projections by Matka Players

Madhya Pradesh Shocker: Brothers Clash Over Performing Last Rites of Father in Tikamgarh, Decide To Cut Deceased’s Body Into 2 Parts for Separate Cremations

Trending Topics