Back to AI information
OpenAI announces upgrades to sensitive conversation safety: Collaboration with over 170 experts reduces inappropriate responses by 65%–80%

OpenAI announces upgrades to sensitive conversation safety: Collaboration with over 170 experts reduces inappropriate responses by 65%–80%

AI information Admin 110 views

OpenAI released "Strengthening ChatGPT's responses in sensitive conversations," announcing that it collaborated with over 170 clinically experienced mental health experts to update ChatGPT's default model to more reliably identify help-seeking signals, de-escalate conversations, and guide users to real-world support. According to measurements in the paper, responses with undesirable behavior in mental health-related areas decreased by approximately 65%–80%. The company also expanded its crisis hotline coverage, redirected sensitive conversations from other models to safer ones, and added gentle reminders to take a break during long conversations.

This update focuses on three scenarios: severe symptoms such as psychosis/mania, self-harm and suicide, and emotional dependence on AI. OpenAI also updated the Model Spec to clarify that models should avoid reinforcing unfounded beliefs, respect real interpersonal relationships, and pay more attention to indirect signs of self-harm and suicide. Going forward, in addition to the existing baseline for self-harm and suicide, "emotional dependence" and "non-suicidal psychological emergencies" will be included in the standardized baseline testing for future model releases.

Frequently Asked Questions

Q: Where exactly are these changes reflected?

A: Updated default model behavior, automatic redirection of sensitive conversations, wider crisis hotline links, and "break reminders" for long conversations.

Q: What priority scenarios are involved?

A: Acute symptoms such as psychosis/mania, risk of self-harm and suicide, and excessive emotional dependence on the model.

Q: How to quantify the effect?

A: Officials said that inappropriate responses in related areas have decreased by 65%-80%; and the reliability has remained at 95%+ in high-difficulty long-dialogue security assessments.

Q: Have the safety principles changed?

A: Make existing goals more explicit in the Model Spec, such as not affirming unfounded beliefs and paying attention to indirect signs of self-harm or suicide.

Q: How will the new model be evaluated in the future?

A: Add "emotional dependence" and "non-suicidal emergencies" to the baseline test as part of the release threshold along with the self-harm and suicide baseline.

ChatGPT sensitive conversations ChatGPT Mental Health Update ChatGPT crisis intervention capabilities ChatGPT self-harm and suicide identification ChatGPT Emotional Dependence Guidance ChatGPT rest reminder ChatGPT redirection security model ChatGPT long session security ChatGPT hotline coverage expansion ChatGPT default model upgrade OpenAI Security Update 2025 OpenAI Mental Health Collaboration OpenAI ModelSpec Update OpenAI Security Baseline Test OpenAI Psychosis Scenario OpenAI mania symptom recognition OpenAI's inappropriate responses drop OpenAI High Reliability Evaluation OpenAI Sensitive Conversation Guidelines OpenAI Reality Support Guide ChatGPT clinical expert collaboration ChatGPT65_80 decrease ChatGPT95 reliability ChatGPT help signal recognition ChatGPT moderated conversation strategy ChatGPT Crisis Hotline Link ChatGPT default behavior optimization ChatGPT security principle refinement ChatGPT indirectly signals attention ChatGPT difficult long conversation OpenAI Emotional Dependency Baseline OpenAI Non-Suicidal Emergency OpenAI tightens release threshold OpenAI safe redirection mechanism OpenAI hotline regional expansion OpenAI User Support Path ChatGPT suicide risk response ChatGPT Mental Health Code ChatGPT interpersonal relationship respect ChatGPT belief is not strengthened ChatGPT model secure routing ChatGPT secure routing OpenAI Crisis Resource Integration OpenAI Ethics and Compliance ChatGPT sensitive scene coverage ChatGPT model switching strategy OpenAI Crisis Hotline Expands ChatGPT long conversations suggest taking a break ChatGPTModelSpec Details ChatGPT Real World Support

Recommended Tools

More