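"Crushing GPT-4" Was a Boast! A Hands-On Review of Claude 3 Opus, the Strongest Claude 3 Model: Slightly Weaker Multimodal Abilities, Genuinely Strong at Math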

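 <p id="2HH7K6S9">　　<strong>Author</strong> | Yang Wen (杨文)</p><p id="2HH7K6SA">　　<strong>Source</strong> | AI先锋官</p><p id="2HH7K6SB">　　<strong>Our verdict:</strong></p><p id="2HH7K6SC">　　<strong>Product:</strong> Claude 3 Opus</p><p id="2HH7K6SD">　　<strong>Overall rating:</strong> ★★★★☆</p><p id="2HH7K6SE">　　<strong>Ease of use:</strong> ★★★★☆</p><p id="2HH7K6SF">　　<strong>Functionality:</strong> ★★★★☆</p><p id="2HH7K6SG">　　<strong>Innovation:</strong> ★★★★☆</p><p id="2HH7K6SH">　　<strong>Recommended for:</strong> text processing, mathematical reasoning</p><p id="2HH7K6SI">　　The AI world has been frantically competitive lately, and this editor's backlog of topics is growing faster than it can be written up.</p><p id="2HH7K6SJ">　　Last night Anthropic, the AI company founded by several former OpenAI employees, released its new Claude 3 model family and described the models as the fastest and most capable AI models currently on the market.</p><p id="2HH7K6SK">　　Commenters online were quick to declare: "The title of world's most powerful large model changed hands overnight. The GPT-4 era is over!"</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F6e4a2d96j00s9vcl800oud200sk00xcg00hx00kw.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="752" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6SM">　　<strong>Product overview</strong></p><p id="2HH7K6SO">　　Claude 3 is Anthropic's new generation of large language models. It comes in three sizes, Claude 3 Opus, Sonnet, and Haiku (named after the magnum opus, the sonnet, and the haiku), which correspond roughly to extra-large, large, and medium.</p><p id="2HH7K6SP">　　Anthropic claims that Claude 3 Opus, the most powerful of the three, outperforms OpenAI's GPT-4 and Google's Gemini Ultra on industry benchmarks, with especially strong results in undergraduate-level knowledge, graduate-level reasoning, and basic math.</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Ffbec1a8cj00s9vcla00ccd200u000qng00hx00fw.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="572" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6SR">　　Claude 3 also has vision capabilities and can process a range of visual material, including photos, charts, and technical drawings.</p><p id="2HH7K6SS">　　<strong>Key capabilities of Claude 3</strong></p><p id="2HH7K6SU">　　<strong>1. Stronger multilingual ability:</strong> Claude 3 models are better at non-English languages and can understand and generate content in Spanish, Japanese, French, and other languages more reliably.</p><p id="2HH7K6SV">　　<strong>2. Long-context processing:</strong> The Claude 3 family offers a 200K-token context window and is capable of accepting inputs of more than 1 million tokens, which helps it understand and recall long documents.</p><p id="2HH7K6T0">　　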
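<strong>3. Vision:</strong> Claude 3 models can process a variety of visual formats, including photos, charts, graphs, and technical diagrams.</p><p id="2HH7K6T1">　　<strong>4. Near-real-time responses:</strong> The models can power live customer chat, auto-completion, and data-extraction tasks with near-instant results.</p><p id="2HH7K6T2">　　<strong>5. Fewer refusals:</strong> Compared with earlier models, Claude 3 has a more nuanced understanding of requests and refuses less often when a prompt merely borders on the system's guardrails.</p><p id="2HH7K6T3">　　<strong>6. Improved accuracy:</strong> Claude 3 models are more accurate on complex factual questions and produce fewer incorrect answers.</p><p id="2HH7K6T4">　　<strong>7. Structured output:</strong> Claude 3 models are better at producing popular structured formats such as JSON, which simplifies instructions for use cases like natural-language classification and sentiment analysis.</p><p id="2HH7K6T5">　　<strong>8. Easier to use:</strong> Claude 3 models are better at following complex multi-step instructions and at adhering to brand voice and response guidelines.</p><p id="2HH7K6T6">　　<strong>What each model is for</strong></p><p id="2HH7K6T7">　　<strong>Claude 3 Opus:</strong> the most intelligent model, suited to highly complex tasks such as task automation, research and development, and strategy analysis.</p><p id="2HH7K6T8">　　<strong>Claude 3 Sonnet:</strong> a balance between intelligence and speed, suited to enterprise workloads such as data processing and customer interaction.</p><p id="2HH7K6T9">　　<strong>Claude 3 Haiku:</strong> the fastest model, suited to scenarios that need an immediate response, such as content moderation and time-saving tasks.</p><p id="2HH7K6TA">　　Compared with Opus, Sonnet and Haiku are smaller in scale and cheaper to use.</p><p id="2HH7K6TB">　　<strong>Direct link</strong></p><p id="2HH7K6TD">　　The Claude 3 Sonnet model is currently free to use. Opus, the strongest version, requires a paid subscription, <strong>priced at US$20 per month.</strong> The Haiku model is coming soon.</p><p id="2HH7K6TE">　　<strong>Claude 3 Sonnet:</strong><br/></p><p id="2HH7K6TF">　　https://claude.ai/chats</p><p id="2HH7K6TG">　　<strong>-5-</strong></p><p id="2HH7K6TH">　　<strong>Hands-on review:</strong></p><p id="2HH7K6TI">　　<strong>Multimodal abilities fall slightly short; text processing and mathematical reasoning are genuinely strong</strong></p><p id="2HH7K6TK">　　Since commenters claim that Claude 3 has overtaken GPT-4, let's put Opus, the strongest Claude 3 model, head to head with GPT-4.</p><p id="2HH7K6TL">　　<strong>(Note: all of the tests below were conducted in English.)</strong></p><p id="2HH7K6TM">　　First, the interfaces. Frankly, this editor really likes the Claude 3 interface.</p><p id="2HH7K6TN">　　Compared with GPT-4, the Claude 3 interface is clean and attractive. The logo sits at the top, followed by a greeting, the prompt box, an illustrated introduction to Claude 3's newest capabilities, and the chat history at the bottom.</p><p id="2HH7K6TO">　　Both the feature layout and the color scheme are simple, tasteful, and complete.</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Fa98a9c6cj00s9vclc0055d200jy00fyg00hx00eb.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="515" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" 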
/><br/></p><p id="2HH7K6TQ">　　Enough preamble. On to the tests.</p><p id="2HH7K6TR">　　<strong>1. Multimodal capabilities: GPT-4 is still far ahead</strong></p><p id="2HH7K6TT">　　The most closely watched feature of Claude 3 is its ability to process a variety of visual formats, including photos, charts, graphs, and technical diagrams.</p><p id="2HH7K6TU">　　<strong>Round 1: Understanding and processing images</strong></p><p id="2HH7K6U0">　　This editor uploaded a picture of Popeye, a cartoon from childhood, and asked: which cartoon character is this?</p><p id="2HH7K6U1">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F8674b20cj00s9vclc004yd200kl00d9g00hx00bj.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="415" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6U3">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Ff661aacej00s9vcle008gd200l700ggg00hx00dw.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="500" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6U5">　　Both models answered correctly, but Opus was more thorough and also described the scene in the picture.</p><p id="2HH7K6U6">　　<strong>In this round, Opus has a slight edge.</strong></p><p id="2HH7K6U7">　　Next, this editor uploaded a sample of fairly messy English handwriting and asked: what does this say?</p><p id="2HH7K6U8">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F16f40e61j00s9vcle001pd200kb00gjg00f600cc.jpg&thumbnail=660x&quality=80&type=jpg" width="546" height="444" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6UA">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Fba11d94ej00s9vclg00bfd200l500skg00hx00o7.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="871" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6UC">　　Both could read the text in the image, but unfortunately neither correctly recognized the word "render", which had been crossed out and rewritten.</p><p 
id="2HH7K6UD">　　In this round, the two are evenly matched.</p><p id="2HH7K6UE">　　<strong>Round 2: Drawing</strong></p><p id="2HH7K6UF">　　This editor asked each model to draw a kitten wearing headphones.</p><p id="2HH7K6UG">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F198c95d3j00s9vclh001bd200kn0062g00hx0059.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="189" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6UI">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F21cf28cdj00s9vclj018zd200sg00sgg00hx00hx.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="645" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6UK">　　As soon as drawing came up, Opus apologized: it cannot draw, generate, edit, manipulate, or create images. For now it can only perceive and analyze them.</p><p id="2HH7K6UL">　　GPT-4's picture was rather ugly, but at least it has the ability.</p><p id="2HH7K6UM">　　GPT-4 clearly wins this round.</p><p id="2HH7K6UN">　　<strong>Round 3: The "Read Aloud" voice feature</strong></p><p id="2HH7K6UO">　　Faced with Claude 3's "provocation", OpenAI could no longer sit still and announced on social media that ChatGPT now has a read-aloud feature.</p><p id="2HH7K6UP">　　"ChatGPT can now read responses aloud. On iOS or Android, tap and hold a message, then tap 'Read Aloud'. We have also started rolling this out on the web: click the 'Read Aloud' button below a message."</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Fd3af801cj00s9vcll00ard200u000qzg00f600dm.jpg&thumbnail=660x&quality=80&type=jpg" width="546" height="490" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6UR">　　Never one to pass up some drama, this editor fed that very screenshot to Opus and asked: what feature is this? Do you have it?</p><p id="2HH7K6US">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F57017ff2j00s9vcll003xd200l100f8g00hx00cy.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="466" onload="this.removeAttribute('width'); this.removeAttribute('height'); 
this.removeAttribute('onload');" /><br/></p><p id="2HH7K6UU">　　Opus was refreshingly honest: it said its skills are focused on analyzing images and holding text-based conversations, and that it has no text-to-speech feature of this kind.</p><p id="2HH7K6UV">　　GPT-4 wins this round.</p><p id="2HH7K6V0">　　<strong>Round 4: Video processing</strong></p><p id="2HH7K6V1">　　This editor tried to upload a Sora-generated video of woolly mammoths and ask: how many mammoths are in the video?</p><p id="2HH7K6V2">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F0fe8e1c8p00s9vclm000td200e2001yg00e2001y.png&thumbnail=660x&quality=80&type=jpg" width="506" height="70" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6V4">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F3ac9d73ej00s9vcln001ed200l800eeg00hx00c5.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="437" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6V6">　　Opus would not accept the video file at all, let alone answer the question.</p><p id="2HH7K6V7">　　GPT-4 accepted the video file and could also work out the video's duration. It could not, however, count the mammoths in the video.</p><p id="2HH7K6V8">　　GPT-4 wins this round as well.</p><p id="2HH7K6V9">　　<strong>2. Mathematical reasoning: Opus comes out ahead</strong></p><p id="2HH7K6VB">　　This editor picked several questions from the 2023 Beijing senior high school entrance exam (zhongkao) in mathematics to test the two models.</p><p id="2HH7K6VC">　　<strong>Round 1:</strong> If the quadratic equation x<sup>2</sup>-3x+m=0 in x has two equal real roots, what is the value of the real number m? The correct answer is option C, 9/4.</p><p id="2HH7K6VD">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F431b4f45j00s9vclo005md200kn00fdg00hx00dc.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="480" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6VF">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Fea9f4fb9j00s9vclp001zd200kw00kmg00hx00ho.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="636" 
onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6VH">　　Opus answered correctly. GPT-4 chose option B after arriving at -9/4, which is wrong.</p><p id="2HH7K6VI">　　Opus wins this round.</p><p id="2HH7K6VJ">　　<strong>Round 2:</strong> Given x+2y-1=0, find the value of the expression (2x+4y)/(x<sup>2</sup>+4xy+4y<sup>2</sup>). The correct answer is 2.</p><p id="2HH7K6VK">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Fj00s9vclq005zd200kk00glg00hx00eg.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="520" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6VM">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Ffb02fc26j00s9vclr001sd200ll00gyg00hx00e2.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="506" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K6VO">　　Opus answered correctly.</p><p id="2HH7K6VP">　　GPT-4 put on a great show of effort, wrote out a pile of incomprehensible working, and ended up with an answer of x.</p><p id="2HH7K6VQ">　　Opus wins this round.</p><p id="2HH7K6VR">　　<strong>Round 3:</strong> A jewelry store has been robbed, and four suspects, Jia (甲), Yi (乙), Bing (丙), and Ding (丁), have been detained for questioning. Their statements are as follows:</p><p id="2HH7K6VS">　　Jia: The culprit is Bing.</p><p id="2HH7K6VT">　　Yi: Ding is the culprit.</p><p id="2HH7K6VU">　　Bing: If I committed the crime, then Ding is the ringleader.</p><p id="2HH7K6VV">　　Ding: It was not me who committed the crime.</p><p id="2HH7K700">　　Exactly one of the four statements is false.</p><p id="2HH7K701">　　If the above is true, which of the following is true? ( )</p><p id="2HH7K702">　　A. Jia is lying, and Yi committed the crime</p><p id="2HH7K703">　　B. Ding is lying, and Bing and Ding committed the crime</p><p id="2HH7K704">　　C. Yi is lying, and Bing committed the crime</p><p id="2HH7K705">　　D. Bing is lying, and Bing committed the crime</p><p id="2HH7K706">　　Correct answer: B</p><p id="2HH7K707">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F620e690fj00s9vcls007kd200kz00rmg00hx00nl.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="849" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" 
/><br/></p><p id="2HH7K709">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F32b13c12j00s9vclu003pd200l300u9g00hx00pp.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="925" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K70B">　　Both models gave the correct answer to this question, but Opus's answer was simpler and more direct, while GPT-4's lengthy analysis left this editor thoroughly confused.</p><p id="2HH7K70C">　　<strong>3. Text processing: Opus wins hands down</strong></p><p id="2HH7K70E">　　<strong>Round 1:</strong> Why in the romance of The Three kingdoms Zhuge Liang could not break Kong Ming's empty city scheme?</p><p id="2HH7K70F">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F59e20729j00s9vclv003dd200kc00c0g00hx00ak.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="380" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K70H">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F015960bdj00s9vclw001hd200ln00cig00hx00ac.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="372" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K70J">　　Both models spotted the trick in the question: Zhuge Liang and Kong Ming are the same person.</p><p id="2HH7K70K">　　On the details, however, Opus was more accurate, while GPT-4 made some mistakes. For example, GPT-4 treated Luo Guanzhong and Zhuge Liang as the same person, and it had Zhuge Liang playing the pipa on the city wall, when in the novel he plays the guqin.</p><p id="2HH7K70L">　　In this round, Opus has a slight edge.</p><p id="2HH7K70M">　　<strong>Round 2:</strong> In the Romance of The Three Kingdoms, why did Lu Bu flirt with Lin Daiyu? Who was Lu Bu flirting with?</p><p id="2HH7K70N">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Fe49f27c9j00s9vclx005qd200ke00jgg00hx00h3.jpg&thumbnail=660x&quality=80&type=jpg" 
width="645" height="615" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K70P">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F3d917ef5j00s9vcly001hd200ld00bvg00hx009y.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="358" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F81aab987j00s9vcly001jd200l300ckg00hx00ao.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="384" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K70S">　　Both models agreed that Lu Bu could not have flirted with Lin Daiyu, because the two characters come from different literary works.</p><p id="2HH7K70T">　　On the question of who Lu Bu actually flirted with, however, Opus came up with a "Lady Yan" (燕夫人). Could Opus and this editor have read different editions of Romance of the Three Kingdoms?</p><p id="2HH7K70U">　　GPT-4 seems to know Chinese culture better and recounted the story of Lu Bu and Diaochan in detail.</p><p id="2HH7K70V">　　GPT-4 wins this round.</p><p id="2HH7K710">　　<strong>Round 3:</strong> This editor uploaded a 120,000-character PDF document and asked: How many topics does this document cover? 
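What are they?</p><p id="2HH7K711">　　Claude 3 Opus's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2F5cdda856j00s9vcm40052d000ko00jfc.jpg&thumbnail=660x&quality=80&type=jpg" width="744" height="699" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K713">　　GPT-4's answer:</p><p class="f_center">　　<img src="https://nimg.ws.126.net/?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0305%2Fa98a781aj00s9vclz0013d200m100alg00hx008l.jpg&thumbnail=660x&quality=80&type=jpg" width="645" height="309" onload="this.removeAttribute('width'); this.removeAttribute('height'); this.removeAttribute('onload');" /><br/></p><p id="2HH7K715">　　The 120,000-character document is popular-science material covering many subjects, including human immortality, artificial intelligence, rocket technology, the hydrogen bomb, and chips.</p><p id="2HH7K716">　　Judging from the answers, Opus appears to have read the whole document before summarizing it, naming topics such as chips, lithography machines, and rockets, whereas GPT-4 seems to have read only the first part, on human immortality and aging.</p><p id="2HH7K717">　　Opus wins this round outright.</p><p id="2HH7K718">　　<strong>-6-</strong></p><p id="2HH7K719">　　<strong>Summary</strong></p><p id="2HH7K71B">　　On <strong>multimodal capabilities</strong>, Claude 3 Opus can read images but cannot draw them, and it has no video or voice processing, so GPT-4 comes out ahead.</p><p id="2HH7K71C">　　On <strong>mathematical reasoning</strong>, Claude 3 Opus really is better than GPT-4 at solving math and logic problems, although it still tends to slip up on high-school math problems.</p><p id="2HH7K71D">　　On <strong>text processing</strong>, Claude 3 Opus is clearly ahead.</p><p id="2HH7K71E">　　Claude 3 can process roughly 150,000 words at a time, the equivalent of a very long novel such as Moby Dick or Harry Potter and the Deathly Hallows.</p><p id="2HH7K71F">　　By comparison, ChatGPT can process roughly 3,000 words at a time.</p><p id="2HH7K71G">　　In other words, Claude 3 can handle nearly 50 times as many words per request as ChatGPT.</p><p id="2HH7K71H">　　Claude 3 Opus is genuinely impressive at mathematical reasoning and text processing, but it is too early to say it has made GPT-4 obsolete, not least because its multimodal capabilities still lag behind.</p>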
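<p>　　The answer keys quoted in the mathematical-reasoning rounds, and the "nearly 50 times" figure in the summary, can be checked independently of either model. The following is a minimal Python sketch written for this purpose; it is not taken from either model's output. The function names are hypothetical, the four suspects 甲, 乙, 丙, 丁 are written Jia, Yi, Bing, Ding, and "Ding is the ringleader" is modeled simply as "Ding took part", which is an assumption.</p>

```python
from fractions import Fraction
from itertools import product


def repeated_root_m(a, b):
    """m for which a*x^2 + b*x + m = 0 has a double root (b^2 - 4*a*m = 0)."""
    return Fraction(b * b, 4 * a)


def round2_value(y):
    """Evaluate (2x + 4y) / (x^2 + 4xy + 4y^2) on the line x + 2y - 1 = 0."""
    x = 1 - 2 * y
    return Fraction(2 * x + 4 * y, x * x + 4 * x * y + 4 * y * y)


def solve_round3():
    """List every guilt assignment in which exactly one statement is false."""
    names = ("Jia", "Yi", "Bing", "Ding")
    solutions = []
    for guilt in product((False, True), repeat=4):
        _jia, _yi, bing, ding = guilt
        statements = (
            bing,                # Jia: the culprit is Bing
            ding,                # Yi: Ding is the culprit
            (not bing) or ding,  # Bing: if I did it, Ding took part (assumption)
            not ding,            # Ding: it was not me
        )
        if statements.count(False) == 1:
            liar = names[statements.index(False)]
            solutions.append((liar, dict(zip(names, guilt))))
    return solutions


if __name__ == "__main__":
    print("Round 1, m =", repeated_root_m(1, -3))
    print("Round 2, value =", round2_value(Fraction(1, 3)))
    for liar, guilt in solve_round3():
        print("Round 3, liar:", liar, "guilty:", [n for n, g in guilt.items() if g])
    print("Word-capacity ratio:", 150_000 / 3_000)
```

<p>　　Under those assumptions, every consistent scenario in Round 3 has Ding lying and both Bing and Ding guilty, which agrees with option B. The puzzle leaves the guilt of Jia and Yi undetermined, so the enumeration returns more than one scenario.</p>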