Anthropic's System Card: Claude Sonnet 4.5 was able to recognize many alignment evaluation environments as tests and would modify its behavior accordingly (Celia Ford/Transformer)

Celia Ford / Transformer: Anthropic's System Card: Claude Sonnet 4.5 was able to recognize many alignment evaluation environments as tests and would modify its behavior accordingly  —  Anthropic's new model appears to use “eval awareness” to be on its best behavior  —  Anthropic's newly-released Claude Sonnet 4.5 is …

Anthropic's System Card: Claude Sonnet 4.5 was able to recognize many alignment evaluation environments as tests and would modify its behavior accordingly (Celia Ford/Transformer)

Celia Ford / Transformer:
Anthropic's System Card: Claude Sonnet 4.5 was able to recognize many alignment evaluation environments as tests and would modify its behavior accordingly  —  Anthropic's new model appears to use “eval awareness” to be on its best behavior  —  Anthropic's newly-released Claude Sonnet 4.5 is …

This article has been sourced from various publicly available news platforms around the world. All intellectual property rights remain with the original publishers and authors. Unshared News does not claim ownership of the content and provides it solely for informational and educational purposes voluntarily. If you are the rightful owner and believe this content has been used improperly, please contact us for prompt removal or correction.