• Home
  • Blog
  • Android
  • Cars
  • Gadgets
  • Gaming
  • Internet
  • Mobile
  • Sci-Fi
Blog - Creative Collaboration

Anthropic says Claude chatbot can now end abusive interactions

August 18, 2025

Harmful, abusive interactions plague AI chatbots. Researchers have found that AI companions like Character.AI, Nomi, and Replika are unsafe for teens under 18, ChatGPT has the potential to reinforce users’ delusional thinking, and even OpenAI CEO Sam Altman has spoken about ChatGPT users developing an “emotional reliance” on AI. Now, the companies that built these tools are slowly rolling out features that can mitigate this behavior.

On Friday, Anthropic said its Claude chatbot can now end potentially harmful conversations, a feature that “is intended for use in rare, extreme cases of persistently harmful or abusive user interactions.” In a press release, Anthropic cited examples such as sexual content involving minors, violence, and even “acts of terror.”

“We remain highly uncertain about the potential moral status of Claude and other LLMs, now or in the future,” Anthropic said in its press release on Friday. “However, we take the issue seriously, and alongside our research program we’re working to identify and implement low-cost interventions to mitigate risks to model welfare, in case such welfare is possible. Allowing models to end or exit potentially distressing interactions is one such intervention.”


Anthropic provided an example of Claude ending a conversation in a press release.
Credit: Anthropic

Anthropic said Claude Opus 4 has a “robust and consistent aversion to harm,” which it observed during a preliminary model welfare assessment conducted as part of pre-deployment testing. The model showed a “strong preference against engaging with harmful tasks,” along with a “pattern of apparent distress when engaging with real-world users seeking harmful content,” and a “tendency to end harmful conversations when given the ability to do so in simulated user interactions.”

Basically, when a user consistently sends abusive and harmful requests to Claude, it will refuse to comply and attempt to “productively redirect the interactions.” It ends a conversation only as “a last resort,” after it has tried to redirect the conversation multiple times. “The scenarios where this will occur are extreme edge cases,” Anthropic wrote, adding that “the vast majority of users will not notice or be affected by this feature in any normal product use, even when discussing highly controversial issues with Claude.”

If Claude does use this feature, the user won’t be able to send new messages in that conversation, but they can still chat with Claude in a new conversation.

“We’re treating this feature as an ongoing experiment and will continue refining our approach,” Anthropic wrote. “If users encounter a surprising use of the conversation-ending ability, we encourage them to submit feedback by reacting to Claude’s message with Thumbs or using the dedicated ‘Give feedback’ button.”

