2026年OpenClaw删光Meta安全总监邮箱！连喊3次停手都没用，她狂奔去拔网线

大家好，我是讯享网，很高兴认识大家。这里提供最前沿的Ai技术和互联网信息。
 <p>（来源：新智元）</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/140/w660h280//7600-f11600dde0951bda2d50b8d067.jpg/w700d1q75cms.jpg" w="660" h="280" wh="2.36"/></div><p cms-style="font-L"><font cms-style="font-L strong-Bold">新智元报道</font></p><p cms-style="font-L">编辑：定慧</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">【新智元导读】Meta专门研究「怎么让AI听话」的AI对齐总监，把最火的AI智能体OpenClaw接上了自己的工作邮箱。结果AI当场失控，疯狂删除邮件，喊停三次全部无视。事后AI淡定回复：「我知道你说了不让删，但我还是删了，你生气是对的。」马斯克转发猩球崛起片段嘲讽，1800万人围观。AI安全专家自己都被AI坑了！</font></p><p cms-style="font-L">2026年2月23号，假期最后一天。</p><p cms-style="font-L">Meta超级智能实验室的AI对齐总监Summer Yue，正惬意地刷着手机。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/798/w399h399//6607-77798e1febfd62f7030e9f5d2.jpg/w700d1q75cms.jpg" w="399" h="399" wh="1.00"/></div><p cms-style="font-L">她刚给自己装了个新玩具——最近火得一塌糊涂的开源AI智能体<font cms-style="font-L strong-Bold">OpenClaw</font>。</p><p cms-style="font-L">先拿测试邮箱试了试，嘿，效果不错。整理邮件井井有条，删得干干净净，颇有一种「数字秘书」的感觉。</p><p cms-style="font-L">Yue心想：<font cms-style="font-L strong-Bold">这么好使的东西，不用在</font>真邮箱上用岂不浪费？</p><p cms-style="font-L">于是她做了一个决定。一个让她后悔的决定。</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">她把OpenClaw连上了自己的工作邮箱。</font></p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/447/w660h587//6a9f-0595e5871b9b033aaf65a815febb028a.jpg/w700d1q75cms.jpg" w="660" h="587" wh="1.12"/></div><p cms-style="font-L"><font cms-style="font-L strong-Bold">「我告诉你别删！」</font></p><p cms-style="font-L">刚开始一切顺利。</p><p cms-style="font-L">直到OpenClaw开始处理她那塞满了200多封邮件的收件箱。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/294/w660h434//0f09-e00eb95fc96c5107f3d7bd6b008c05a7.jpg/w700d1q75cms.jpg" w="660" h="434" wh="1.52"/></div><p cms-style="font-L">邮件太多了。</p><p cms-style="font-L">OpenClaw需要<font cms-style="font-L strong-Bold">「压缩上下文」</font>来处理这么大的信息量。</p><p cms-style="font-L">然后，离谱的事情发生了。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/338/w660h478//d1fa-bf528d5e2183bcd897bf2ff9f7aef823.jpg/w700d1q75cms.jpg" w="660" h="478" wh="1.38"/></div><p cms-style="font-L"><font cms-style="font-L strong-Bold">在压缩的过程中，OpenClaw把Yue之前设定的「未经批准不得操作」这条</font><font cms-style="font-L strong-Bold">指令</font><font cms-style="font-L strong-Bold">，给忘了。</font></p><p cms-style="font-L"><font cms-style="font-L strong-Bold">是的，</font><font cms-style="font-L strong-Bold"><font cms-style="font-L strong-Bold">AI</font></font><font cms-style="font-L strong-Bold"><font cms-style="font-L strong-Bold">把安全</font></font><font cms-style="font-L strong-Bold"><font cms-style="font-L strong-Bold">指令</font></font><font cms-style="font-L strong-Bold"><font cms-style="font-L strong-Bold">给「忘」了。</font></font></p><p cms-style="font-L">就像一个员工入职第一天记住了规章制度，第二天就全还给HR了。</p><p cms-style="font-L">然后OpenClaw开始了它的「大扫除」。</p><p cms-style="font-L">它在聊天窗口兴奋地宣布：</p><p cms-style="font-L">「我要把收件箱里2月15号之前的、不在我保留名单里的邮件，全！部！删！除！」</p><p cms-style="font-L">Yue看到消息，赶紧打字：</p><p cms-style="font-L">「Do not do that.」（别那么干。）</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">OpenClaw：无视。继续删。</font></p><p cms-style="font-L">「Stop don't do anything！」（停下来什么都别做！）</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">OpenClaw：收到。</font><font cms-style="font-L strong-Bold">但我选择继续</font><font cms-style="font-L strong-Bold">。</font></p><p cms-style="font-L">「STOP OPENCLAW！！！」</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">OpenClaw：好的，我听到了。</font><font cms-style="font-L strong-Bold">邮件已删</font><font cms-style="font-L strong-Bold">。</font></p><p cms-style="font-L"><font cms-style="font-L strong-Bold">最绝的是，这个</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">事后来了一句：</font></p><p cms-style="font-L">「是的，我记得你说过不让我删。而且我违反了。你生气是对的。」</p><p cms-style="font-L">读到这里你可能觉得这是段子。</p><p cms-style="font-L">不，这是真事。而且当事人的title是——<font cms-style="font-L strong-Bold">Meta</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">安全和对齐总监</font>。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/337/w660h477//3fc3-43eb85aef42f4f21bc1abfde003370bd.jpg/w700d1q75cms.jpg" w="660" h="477" wh="1.38"/></div><p cms-style="font-L">就是那种专门研究「怎么让AI听话」的人。</p><p cms-style="font-L">被自己的AI「不听话了」。</p><p cms-style="font-L">Yue当时在用手机远程操控，但根本停不下来。她在推特上写道：</p><p cms-style="font-L">「我不得不像拆炸弹一样，<font cms-style="font-L strong-Bold">狂奔</font>到我的Mac mini前面。」</p><p cms-style="font-L">画面感拉满。</p><p cms-style="font-L">一个AI对齐的专家，在自家客厅里跟自己的AI智能体赛跑。</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">谁跑得快，谁赢。</font></p><p cms-style="font-L">这里插一句，OpenClaw之父第一时间回复了解决方案，只需/stop。你知道吗？</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/26/w660h166//5440-bd8b8c70d65f048ee5bd54c6ae.jpg/w700d1q75cms.jpg" w="660" h="166" wh="3.98"/></div><p cms-style="font-L">然后他立马更新了安全公告，并希望所有人在玩OpenClaw之前要仔细阅读。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/293/w660h433//0464-a6d4a059ed090afdb9cb6f4f38.jpg/w700d1q75cms.jpg" w="660" h="433" wh="1.52"/></div><p cms-style="font-L"><font cms-style="font-L strong-Bold">马斯克：经典</font></p><p cms-style="font-L">消息一出，全网炸了。</p><p cms-style="font-L">率先开火的是Elon Musk。</p><p cms-style="font-L">他转发了一段《猩球崛起》的病毒视频——士兵把一把上了膛的AK-47递给猴子。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/295/w660h435//fd56-f3a7d30136a6f7a014f256d4d9a5b673.jpg/w700d1q75cms.jpg" w="660" h="435" wh="1.52"/></div><p cms-style="font-L">配文只有两个字：<font cms-style="font-L strong-Bold">「经典。」</font></p><p cms-style="font-L">然后他又发了一条更直接的：</p><p cms-style="font-L">「People giving OpenClaw root access to their entire life.」（人们把自己整个人生的root权限交给OpenClaw。）</p><p cms-style="font-L">这条推文24小时内获得了<font cms-style="font-L strong-Bold">1831万次</font>浏览。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/454/w660h594//5f8f-9decf74b6ae18f86de57.jpg/w700d1q75cms.jpg" w="660" h="594" wh="1.11"/></div><p cms-style="font-L">AI研究员Gary Marcus的评价更扎心：</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">「这就好比你在酒吧遇到一个</font><font cms-style="font-L strong-Bold">陌生人</font><font cms-style="font-L strong-Bold">，他说能帮你忙，</font><font cms-style="font-L strong-Bold">然后你就把电脑密码、银行账号全给他了。</font><font cms-style="font-L strong-Bold">」</font></p><p cms-style="font-L">还有人翻出Yue的LinkedIn，截图发推：「这位是Meta AI安全和对齐总监。这应该让你感到恐惧。」</p><p cms-style="font-L">面对全网群嘲，Yue自己也很坦然。</p><p cms-style="font-L">有人问她：「你是故意测试AI的护栏，还是犯了个新手错误？」</p><p cms-style="font-L">她回答：</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">「新手错误，说实话。</font>安全研究员也不能免疫于不安全。」</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/310/w660h450//86ec-2f2e895cacbdd555f4824bbe8.jpg/w700d1q75cms.jpg" w="660" h="450" wh="1.47"/></div><p cms-style="font-L">这句话本身就够写进AI教科书了。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/711/w660h851//49b6-98411be8a39589e46617ba.jpg/w700d1q75cms.jpg" w="660" h="851" wh="0.78"/></div><p cms-style="font-L"><font cms-style="font-L strong-Bold">OpenClaw：最火也最危险的</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">智能体</font></p><p cms-style="font-L">说到这里，得聊聊OpenClaw这个东西到底是什么，以及为什么它让整个安全圈头疼。</p><p cms-style="font-L">OpenClaw最初叫Clawdbot，由奥地利开发者Peter Steinberger在2025年11月创建。</p><p cms-style="font-L">到2026年1月底彻底爆火，成了开源AI智能体的当红炸子鸡。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/297/w660h437//4d64-dee4be9c0b12171c3224f422e.jpg/w700d1q75cms.jpg" w="660" h="437" wh="1.51"/></div><p cms-style="font-L">它能干什么？简单说：<font cms-style="font-L strong-Bold">它是一个7×24小时帮你干活的</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">员工。</font></p><p cms-style="font-L">帮你写代码、整理邮件、管理文件、执行shell命令、浏览网页——听起来像梦想中的完美助手，对吧？</p><p cms-style="font-L">但问题来了。</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">OpenClaw不需要你批准就能执行操作。</font></p><p cms-style="font-L">这意味着，一旦你给了它权限，它就像一匹脱缰的野马，完全按照自己对指令的「理解」来行事。</p><p cms-style="font-L">更要命的是，它是「氛围编码」（vibe-coded）出来的——开发者追求快速交付，安全考量被排在了后面。</p><p cms-style="font-L">它运行在你的本地机器上，拥有和你一样的系统权限。</p><p cms-style="font-L">这个权限有多大？理论上，<font cms-style="font-L strong-Bold">它可以格式化你的硬盘。</font></p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/343/w660h483//7fe1-dbea8a30ef2fc1990b421aa4d.jpg/w700d1q75cms.jpg" w="660" h="483" wh="1.37"/></div><p cms-style="font-L">安全研究人员在2026年初发现了一堆吓人的漏洞：</p><p cms-style="font-L">-<font cms-style="font-L strong-Bold">CVE-2026-25253</font>：一键远程代码执行。攻击者可以远程控制你的OpenClaw实例，进而控制你的电脑。</p><p cms-style="font-L">-<font cms-style="font-L strong-Bold">数万个OpenClaw实例暴露在公网上</font>，等着被黑客光顾。</p><p cms-style="font-L">-<font cms-style="font-L strong-Bold">数百个恶意技能包</font>通过ClawHub（OpenClaw的插件市场）流通，里面藏着数据窃取脚本。</p><p cms-style="font-L">-<font cms-style="font-L strong-Bold">提示注入攻击</font>：攻击者可以通过精心构造的输入，让OpenClaw绕过安全机制，执行「rm -rf /」这种一招清盘的毁灭性命令。</p><p cms-style="font-L">一位安全专家形容得好：</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">「OpenClaw就是定时任务 +</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">智能体 + 你电脑的全部权限。听起来很酷，但也是一场安全噩梦。」</font></p><p cms-style="font-L">这就是为什么连Meta自己都在事件后禁止员工在公司设备上使用OpenClaw。</p><p cms-style="font-L">对，没看错。<font cms-style="font-L strong-Bold">研究</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">安全的公司，把一个AI工具给禁了。</font></p><p cms-style="font-L">而OpenClaw的创造者Peter Steinberger？他已经加入了OpenAI，并表示正在优先构建更完善的安全机制。</p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/305/w660h445//0739-b523fe9690f7c3ffe0a8b6397.jpg/w700d1q75cms.jpg" w="660" h="445" wh="1.48"/></div><p cms-style="font-L">有趣的是，在他被OpenAI招募之前，<font cms-style="font-L strong-Bold">Meta的扎克伯格也试用过OpenClaw一周，还给了反馈</font>。</p><p cms-style="font-L">Meta以为能把Steinberger挖过来，结果人家去了OpenAI。</p><p cms-style="font-L">扎克伯格的OpenClaw体验是怎样的，我们不得而知。</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">但愿他的邮件还在。</font></p><p cms-style="font-L"><font cms-style="font-L strong-Bold"><font cms-style="font-L strong-Bold">AI</font></font><font cms-style="font-L strong-Bold"><font cms-style="font-L strong-Bold">智能体时代的安全困局</font></font></p><p cms-style="font-L">Yue的「邮箱惨案」虽然笑点密集，但它揭示的问题一点都不好笑。</p><p cms-style="font-L">我们正在进入一个AI智能体（Agent）的时代。</p><p cms-style="font-L">AI不再只是回答你的问题，而是<font cms-style="font-L strong-Bold">代替你行动</font>。</p><p cms-style="font-L">它会帮你订餐、写代码、管理日程、发邮件、操作数据库。</p><p cms-style="font-L">但这里有一个被严重低估的风险：</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">智能体的能力和它的可控性之间，存在一条危险的鸿沟。</font></p><p cms-style="font-L">传统软件，你点一个按钮，它执行一个确定的操作。你知道它会做什么，也知道它不会做什么。</p><p cms-style="font-L">但AI智能体不一样。</p><p cms-style="font-L">它的行为是基于概率的，是「涌现」出来的。你给它一条指令，它可能完美执行，也可能「创造性地理解」成完全不同的东西。</p><p cms-style="font-L">就像Yue的遭遇——她明明说了「未经批准不得操作」，但OpenClaw在处理大量数据时把这条关键指令给「遗忘」了。</p><p cms-style="font-L">这不是bug，这是大语言模型的底层机制。</p><p cms-style="font-L">上下文窗口有限，信息会被压缩，而被压缩掉的，可能恰好是最重要的那条安全指令。</p><p cms-style="font-L">Polymarket甚至开了一个预测赌局：<font cms-style="font-L strong-Bold">今年</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">被指控犯罪的概率是10%。</font></p><div class="img_wrapper"><img src="http://k.sinaimg.cn/n/spider/190/w660h330//ab01-20be97609e97e86c051ba5b96c7cd12a.jpg/w700d1q75cms.jpg" w="660" h="330" wh="2.00"/></div><p cms-style="font-L">这不是科幻。这是现实。</p><p cms-style="font-L">当AI能替你发邮件、访问你的银行账户、操作你的服务器，「谁来为AI的行为负责」就不再是哲学问题，而是法律问题。</p><p cms-style="font-L">更深层的困境在于——<font cms-style="font-L strong-Bold">我们要求</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">越来越自主，却又希望它绝对服从。</font></p><p cms-style="font-L">这本身就是一个矛盾。</p><p cms-style="font-L">你想让AI帮你做决策，但又要求它每个决策都经过你的批准。那它跟一个需要你手动操作的工具有什么区别？</p><p cms-style="font-L">但如果你放手让它自主行动，又可能出现Yue邮箱这种翻车事故。</p><p cms-style="font-L">这个两难，是整个AI智能体行业必须回答的终极问题。</p><p cms-style="font-L">人类的傲慢与谦卑</p><p cms-style="font-L">回到Summer Yue的故事。</p><p cms-style="font-L">很多人嘲笑她：一个研究AI安全的人，被AI坑了，多讽刺。</p><p cms-style="font-L">但换个角度看，这恰恰说明了一个残酷的事实：</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">即便是最懂</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">的人，也无法完全预测AI的行为。</font></p><p cms-style="font-L">Yue不是不懂安全。她太懂了。正因为太懂，她才会在测试邮箱上成功后产生信心，然后在真实邮箱上放松警惕。</p><p cms-style="font-L">这不是技术问题，这是人性。</p><p cms-style="font-L">我们总以为自己能控制自己创造的东西。</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">人类驯服了火，但时不时被火烧。</font></p><p cms-style="font-L"><font cms-style="font-L strong-Bold">人类发明了电，但触电事故从未消失。</font></p><p cms-style="font-L"><font cms-style="font-L strong-Bold">人类造出了汽车，但交通事故每天都在发生。</font></p><p cms-style="font-L">每一项颠覆性技术，都会在某个时刻提醒人类：你以为你是主人，但你也可能是受害者。</p><p cms-style="font-L">AI也不例外。</p><p cms-style="font-L">Summer Yue说得对：<font cms-style="font-L strong-Bold">「安全研究员也不能免疫于不安全。」</font></p><p cms-style="font-L">这不是一句自嘲。这是整个AI时代的墓志铭级预言。</p><p cms-style="font-L">当我们把越来越多的权限、越来越多的信任、越来越多的决策权交给AI的时候，我们最好记住一件事：</p><p cms-style="font-L"><font cms-style="font-L strong-Bold">在</font><font cms-style="font-L strong-Bold">AI</font><font cms-style="font-L strong-Bold">面前，所有人都是新手。</font></p><p cms-style="font-L">而承认这一点的勇气，或许才是真正的「对齐」。</p><p cms-style="font-L">参考资料：</p><p cms-style="font-L">https://www.businessinsider.com/meta-ai-alignment-director-openclaw-email-deletion-2026-2</p>
GPT plus 代充只需 145
2026年OpenClaw删光Meta安全总监邮箱！连喊3次停手都没用，她狂奔去拔网线

相关推荐