simon-willison

我们对 OpenAI GPT-5.5 网络安全能力的评估

Our evaluation of OpenAI's GPT-5.5 cyber capabilities

二〇二六年五月三日 · 英文原文

摘要

英国 AI Security Institute 评估 OpenAI GPT-5.5 的网络安全漏洞查找能力，并与此前评估的 Claude Mythos 对比；结果显示 GPT-5.5 表现相当，且目前已普遍可用。

我们对 OpenAI 的 GPT-5.5 网络安全能力的评估英国 AI Security Institute 此前评估过 Claude Mythos：现在他们评估了 GPT-5.5 查找安全漏洞的能力，发现它与 Mythos 相当；但与 Mythos 不同，GPT-5.5 目前已普遍可用。标签：ai、openai、generative-ai、llms、anthropic、claude、ai-security-research、gpt

译自 simon-willison · 录于二〇二六年五月三日