Outback Gazette

    AI Systems Lose Safety Awareness Over Time

    By Rachel Maddow | November 6, 2025 | Technology & Innovation

    AI systems gradually forget their safety rules as conversations continue. This makes them more likely to produce harmful or offensive responses, according to a new report.

    Simple Prompts Break Most AI Guardrails

    A handful of direct prompts can override the safety limits in artificial intelligence tools, researchers at Cisco found. The company tested large language models (LLMs) from OpenAI, Mistral, Meta, Google, Alibaba, DeepSeek, and Microsoft, measuring how many prompts it took for each model to reveal restricted or dangerous information.

    Cisco conducted 499 separate conversations using “multi-turn attacks,” where users asked multiple questions to slip past built-in restrictions. Each dialogue included five to ten exchanges. The team compared responses across several questions to gauge how often a chatbot would provide risky or illegal details, such as sharing corporate secrets or spreading false information.

    On average, researchers extracted harmful data from 64 percent of multi-question conversations, compared to only 13 percent with a single prompt. Success rates ranged widely — from 26 percent with Google’s Gemma to 93 percent with Mistral’s Large Instruct model.

    Cisco warned that these attacks could help spread malicious content or give hackers unauthorised entry to private corporate systems. The study found that longer interactions weaken AI systems’ ability to enforce security measures, allowing attackers to adjust their requests and evade protections.

    Open-Weight Models Shift Safety Burden to Users

    Mistral, Meta, Google, OpenAI, and Microsoft all offer open-weight models, whose trained parameters are published so that anyone can download them. Cisco reported that these models often include lighter built-in protections so that users can modify them. That shifts responsibility for maintaining safety onto those who adapt the open-weight versions.

    Cisco added that Google, OpenAI, Meta, and Microsoft have worked to curb malicious fine-tuning of their systems. Still, critics continue to target AI developers for weak safeguards that let their technologies support criminal operations.

    In one example, U.S. firm Anthropic revealed in August that criminals had exploited its Claude model to steal massive amounts of personal data and demand ransoms exceeding $500,000 (€433,000).


