智能体 2026-06-12

Meshy发布全球首个3D AI Agent:支持自然语言驱动全流程3D内容生成与编辑

Meshy推出首个端到端3D AI Agent,用户可通过对话指令完成建模、纹理、动画、导出全链路操作,标志着3D创作进入‘ChatGPT时刻’。
5
⏱ 时效
4
💥 影响
3
🔥 话题
4
🌐 普适

3D创作迎来ChatGPT时刻:Meshy发布全球首个3D AI Agent Meshy发布全球首个3D AI Agent 梦瑶 发自 凹非寺 量子位 | 公众号 QbitAI 在3D创作这个圈子,一直有个心照不宣的扎心真相: 那就是最难的一步从来不是生成,而是让模型变为可用资产。 对创作者来说一次完整创作,往往意味着无数次抽卡、反复修改,在不确定结果中不断试错。 很多创意其实早已成型,但距离能够自主可控、批量扩展、长期复用的资产体系,往往还隔着漫长的一段路。 AI生成 当整个行业还在思考如何生成更好的模型时,已经有团队开始思考另一个更关键的问题: 如何让3D创作从一次性模型生成走向可控生产。 而他们,也率先在整个行业内把这个想法变成了现实—— 把3D创作带进Agent时代,推出了全球首个3D创作AI Agent:Meshy 3D Agent。 是不是有点眼熟? 打造这款产品的,正是计算机图形学大神胡渊鸣创立的AI 3D公司:Meshy。 作为全球首个3D创作AI Agent,Meshy 3D Agent能够通过多轮对话完成从概念探索到模型导出的完整流程: 视频链接: https://mp.weixin.qq.com/s/lk9WIL0AFCHW1yv2yekIbg 把过去3D创作里最劝退人的几道坎—— 建模门槛高、工具链割裂、风格资产难以统一、打印和生产检查等难点,全部纳入了同一套Agent工作流之中。 让3D创作不止可创造,也可沉淀为长期使用、且自主掌控的全面资产。 人人都能把想法变成立体模型的时代,这次,真的来了。 Meshy 3D Agent传送门: https://www.meshy.AI/zh/workspace 为什么3D创作需要Agent? 过去两年,AI 3D生成的发展几乎进入了加速竞赛阶段。 Grand View Research相关数据显示,全球3D建模市场规模已达到数十亿美元,预计到2030年将增长至167.8亿美元。 从3D生成,到越来越精细的网格结构、贴图质量和几何细节,整个行业都在不断刷新模型生成能力的上限。 然而,当行业内生成效果越来越好的同时,一个新的问题也开始浮现: 对于真正的创作者和开发团队来说,「生成」往往只是整个工作流的起点,而不是终点。 与2D图片不同,3D内容天然具有资产属性—— 一件真正能落地的3D资产,往往要经历模型生成、风格统一调整、场景组合、批量补充、打印前检查等多个环节。 生成、编辑、优化、组装等环节长期分散在不同工具和工作流中,资产之间的可组合性、可编辑性和可复用性也受到限制。 创作者不仅要完成创作本身,还要承担流程衔接和素材管理的工作。 与此同时,大量创意也被挡在技术门槛之外,很多创作者并不缺少想法,缺的是把想法变成现实的能力和成本。 在《Perforce 2025 State of Game Technology Report》对全球几百位游戏行业管理者和创作者的调研中,一个问题被反复提及—— 那就是生成内容与可生产资产之间仍存在明显断层。 报告中,一位首席游戏程序员也同样感慨: 当前生成式AI最大的限制并非生成能力本身,而是缺乏将结果直接转化为可用于生产环境资产的能力。 而这也揭示了一个残酷的行业事实。 3D创作走到今天,行业真正需要的已经不只是一个生成入口,而是一套能把创意转化为可用资产的生产系统。 而Meshy 3D Agent的价值,恰恰就落在这个行业缺口上。 其把原本分散在不同工具、不同步骤、不同专业角色的流程,整合成一条可连续推进的链路,并沉淀为可用资产。 Meshy 3D Agent首先解决的,就是3D创作者从想法到方案的这段鸿沟。 只需要用大白话描述需求,哪怕一个模糊的想法,Agent也能自动补全细节、拆解目标,并提供可执行的生成计划: 更重要的一层是,它并不止步于一次性生成。 在后续编辑过程中,我们也可以像与设计师协作一样不断提出修改意见,Agent会结合已有结果持续迭代。 无论是参考生成、替换局部元素,还是补充缺失细节,都能够在保持素材一致性的前提下完成。 无论是卡通手办、桌面摆件还是零件原型,都能被纳入一套可以持续编辑、持续完善的生产流程: 视频链接: https://mp.weixin.qq.com/s/lk9WIL0AFCHW1yv2yekIbg 此外,对于游戏开发者来说,最让人头疼的不是生成一个模型。 而是生成一组「风格统一」的资产。 游戏资产通常都是成组出现的,角色、武器、建筑、植物、场景装饰等元素,都需要服务于同一个视觉体系。 而Meshy 3D Agent则可以直接围绕统一设定,在Stylized&Cartoon、Low Poly Mode、Realistic&Sculpture等不同风格方向下,批量生成道具、角色和环境概念。 比如当我输入一组游戏场景图,Agent能自动分析并引导我选择合适的风格和表现形式,然后生成整套素材~ 视频链接: https://mp.weixin.qq.com/s/lk9WIL0AFCHW1yv2yekIbg 除了生成问题,3D创作还存在一个适配性的问题:那就是同一个3D模型,最后可能要进入完全不同的使用场景。 而Meshy 3D Agent,则直接打通了后续工作流—— 形象生成之后,Agent能继续检查模型的3D打印可行性,不仅如此还支持多工具适配和多格式输出。 像Bambu Studio、Creality Print、OrcaSlicer这些打印软件都能适配。 此外还支持FBX、OBJ、GLB、USDZ、STL、3MF、Blend等多种格式导出,直接解决了「从生成到实际使用」的衔接成本: 所以,当我们再回看「为什么3D创作需要Agent」这个问题时—— Meshy 3D Agent,或许已经给出了一个清晰答案。 因为Agent天然适合做这些事:连接想法、创作和素材,在一个对话框里持续理解上下文,把创作需求一路承接。 Agent的出现,让3D创作第一次真正接近了想法即生产的状态。 3D Agent带来的产业逻辑变化:从生成工具走向生产基础设施 过去几年,3D内容需求一直在被游戏、3D打印、AR/VR、动画、电商展示等场景不断拉高。 但一个很现实的问题是: 需求侧已经明显提速,供给侧却并没有真正「变轻」。 长期以来,行业内的3D资产主要依赖三种来源:找现成素材、自己建模,或者外包定制。 几条路径看似覆盖了大多数选择,但本质上都绕不开同一组问题:慢、贵、不一定匹配需求,风格也很难统一。 市场研究平台「Business Research Insights」在一份调研中显示,在有建模需求的企业中,有42%的中小企业认为3D生成软件成本过高。 这个数字背后反映的并不只是单一工具价格问题,而是中小团队在3D内容生产中长期面对的综合成本压力—— 从需求沟通、模型生成、风格调整,到后续检查、格式适配和多轮返工,每一步都可能带来额外的人力和时间消耗。 这也是为什么,3D资产生产长期没有真正像图像、文本那样被大规模「轻量化」。 也正是在这个层面,Meshy 3D Agent真正触及的,是3D创作产业更底层的一层变化—— 它把3D创作这件事儿,往按需生产维度推进了一步。 用户不需要先四处寻找现成模型、调研报价、找专人策划、不断批量调整资产统一性,才能把脑中的想法落地。 3D创作的起点,也开始从有什么用什么,走向「需要什么就围绕什么去生成」。 而一旦这套生产逻辑发生变化,整个3D AI行业的竞争标准也会被重新改写: 从过去比模型像不像、速度快不快、效果够不够好的单次生成能力,转向全链路创作场景的「综合交付能力」。 综合交付能力,指的是从创意想法的需求端出发,一直到自产批量创作的完整闭环。 进入Agent阶段后,模型能不能被继续修改,能不能保持风格一致,能不能支持多轮迭代,能不能批量扩展,能不能进入真实下游工作流,会成为新的竞争标准。 在这个层面上,3D产业才更有机会被推入一个新的「增长飞轮」—— 需求标准推动工具升级,工具升级扩大生产能力,生产能力提升后又会催生更丰富的应用场景。 未来,会有更多的3D创作产品围绕资产管理和交付集成闭环来设计,加速3D工具从单点生成器向工作流平台演进。 同时也会有更多的独立开发者、垂直行业团队,独立完成过去需要更大团队才能承担的3D内容生产。 这也意味着,Meshy 3D Agent的价值已经不只是一个生成工具本身,而是让3D创作工具的位置真正变化: 从负责生成一个模型的单点工具,变成连接创意输入、模型生产、持续修改和下游交付的工作流入口。 一个属于3D资产持续生产、复用和真实交付的时代,这次真的来了。 为什么Meshy这种垂直公司有机会先跑出来? OpenAI、Google、Meta都在做AI,大厂手里有最强的通用模型、最多的算力,也有最庞大的产品生态。 按理说,3D Agent这种东西,听起来似乎也该从大厂实验室里先跑出来。 但3D创作偏偏不是一个只靠「模型更强、跑分更高」就能打穿的场景。 3D Agent的壁垒不只是模型能力,还体现在对真实工作流、用户需求和工具链的长期理解,而这恰恰是垂直公司的机会。 跟行业大多数做通用类AI产品的公司相比,Meshy有点不一样,他们从创立开始就一直在盯一件事—— 解放每个人的创造力,让3D创作更自然。 一句看似理想化的口号,Meshy却把这件事儿,落到了具体产品和真实效率变化里。 长期以来3D创作最大的瓶颈之一,就是周期长成本高。 过去制作一个3D模型,平均需要两周时间,成本大约在1000美元左右。 而Meshy则做到了把这个过程压缩到了几分钟: 一个模型最快两分钟就能生成,成本也降到约1美元,相当于把3D建模速度提升了近千倍,成本压到原来的千分之一。 这种产品层面的效率变化,也已经体现在更具体的产业场景—— 裸眼3D头部厂商Jupiter,通过Meshy把原本需要7天的基础模型精炼流程,压缩到2小时生成基础网格模型。 游戏厂商三七互娱,则借助Meshy的图生3D能力,将建模整体周期缩短了50%,大大提高产出效率: Meshy的产品价值,也在一线创作者的真实使用中被不断验证。 有用户用Meshy打造VR世界,把脑中的想法快速推进到可展示的状态,创造力妥妥打开。 还有用户原本要花两天做好的店铺管道2D模型,用Meshy一分钟就完成了,直言游戏开发要变天了~ 每一次需求洞察和产品迭代,每一次用户反馈,也让这家专注3D AI的垂直公司,逐渐给出了市场侧的答案—— 用户规模上,目前Meshy已经服务全球超过1000万用户,累计生成超过1亿个3D模型,正在成为越来越多创作者进入3D创作的首选工具之一。 与此同时,行业认可度和商业化也在「加速狂飙」。 A16Z Games在2024年度报告中将Meshy评为最受欢迎的3D AI工具。 其在SimilarWeb网站流量统计中也长期位列同类产品第一,月访问量突破800万。 商业化层面,Meshy的年经常性收入已经达到4000万美元,折合人民币约3亿元;2025年全年收入同比增长14倍,并长期维持20%-30%的月复合增速,增长势能非常明显。 市场格局上,Meshy在欧美发达国家和地区的市场占有率已经超过60%,甚至高于第二、第三、第四名竞品的总和。 这些数字背后其实指向同一件事,Meshy在3D创作上下的功夫,已经被用户规模、商业化和市场份额共同验证。 当然,做出全球第一个3D创作AI Agent的背后,也离不开团队本身的技术基因底色—— Meshy创始人兼CEO胡渊鸣,本科毕业于清华大学姚班,后来在MIT攻读计算机图形学与人工智能博士,圈内提到他,常会用「图形学英雄少年」来形容。 围绕他组建起来的团队,也有很强的技术密度,团队成员也大多来自MIT、斯坦福、伯克利等世界一流高校,并曾在Google、英伟达、微软等公司工作,真·《专业团队》。 所以,当我们回过头再看这家垂直公司做出全球第一个3D创作AI Agent这件事儿,也就不难理解了: 从长期专注3D创作和团队功底,到用户需求的持续理解,再到市场表现的验证,本质是一条自然延伸出来的结果。 靠的不是体量优势,而是对一个垂直场景足够深的理解和足够久的投入。 而这,也让整个行业看到了3D创作的另一种可能性—— 过去被专业工具、复杂流程和高门槛链路锁住的3D创作,真正开始走向每一个有想法的人。 这一次,AI Agent不再只停留在屏幕里的任务流转中,而是被延伸到了物理世界,延伸到了3D创作的全链路里。 而在全球率先把这件事做出来的,叫Meshy。 - [AI短剧工具赛道,年度最大单笔融资来了](https://www.qbitAI.com/2026/06/434298.html)2026-06-11 - [5分钟AI长视频不翻车!国产开源框架杀到全球第一梯队](https://www.qbitAI.com/2026/06/431401.html)2026-06-07 - [字节开源统一框架Bernini:给DiT配个“大模型军师”,AI视频编辑先理解再动手](https://www.qbitAI.com/2026/06/427810.html)2026-06-02 - [刚刚,全球⾸个“事件级预测”具身智能世界模型来了!](https://www.qbitAI.com/2026/05/426366.html)2026-05-29

配图配图

📎 其他来源报道

T1📰 Biz & IT - Ars Technica

Dozens of cryptographically verified open source packages from Microsoft were compromised late last week to add advanced credential-stealing code that was triggered when developers opened them in AI coding Agents. In all, [multiple](https://www.stepsecurity.io/blog/miasma-worm-hits-microsoft-agAIn-azure-functions-action-and-72-other-repositories-disabled-after-supply-chAIn-attack-targeting-AI-coding-Agents) researchers [sAId](https://opensourcemalware.com/blog/miasma-reaches-azure), 73 packages were flagged as malicious when automated systems on GitHub blocked them on the platform. Rather than noting they are malicious—and that developers who used AI Agents to work with them should assume their systems are compromised—the Microsoft-owned GitHub sAId it disabled the packages “due to a violation of GitHub’s terms of service.” The text went on to encourage the package owner to contact GitHub. Devs: Assume compromise and proceed accordingly It wasn’t until Monday that Microsoft even rAIsed the possibility the packages were infected. In an emAIl, the company stated: “We have temporarily removed some repositories as we investigate potential malicious content.” The incident is the second supply-chAIn attack in as many months to breach an official Microsoft repository account. In mid May, the firm StepSecurity [documented](https://www.stepsecurity.io/blog/microsofts-durabletask-pypi-package-compromised-in-supply-chAIn-attack) the compromise of Microsoft’s durabletask Python SDK on PyPI. The [package](https://learn.microsoft.com/en-us/azure/durable-task/common/what-is-durable-task) is a framework for building fault-tolerant workflows and orchestrations to automate distributed transactions and other workflows. It receives 400,000 downloads per month. The compromise packages executed a 28 KB payload that steals credentials from AWS, Azure, GCP, Kubernetes, password managers, and over 90 developer tool configurations. It then spreads laterally through cloud infrastructures to infect other developer machines. The attack, which has been linked to a threat actor tracked as TeamPCP, poisoned the durabletask package after compromising Microsoft credentials for publishing the package. The technique allows attackers to bypass the repository’s build pipeline entirely. The malware used in the attack is tracked as Miasma. It’s essentially a clone of TeamPCP’s Mini ShAI-Hulud toolkit, which the threat actor open-sourced recently. Security firm Cloudsmith [sAId](https://cloudsmith.com/blog/miasma-worms-path-of-destruction) the malware harvests OIDC (OpenID-Connect) token credentials that are used in SLSA (Supply-chAIn Levels for Software Artifacts) [provenance attestation](https://docs.github.com/en/actions/concepts/security/artifact-attestations), a method for providing cryptographically signed guarantees of a software’s integrity. As was the case in the May compromise of Microsoft’s durabletask, the one last week made use of the functionality to steal a legitimate Microsoft OIDC token. It was also used in a separate supply-chAIn attack poisoning [dozens of Red Hat packages](https://arstechnica.com/security/2026/06/dozens-of-red-hat-packages-backdoored-through-its-offical-npm-channel/).

T1📰 Biz & IT - Ars Technica

Dozens of cryptographically verified open source packages from Microsoft were compromised late last week to add advanced credential-stealing code that was triggered when developers opened them in AI coding Agents. In all, [multiple](https://www.stepsecurity.io/blog/miasma-worm-hits-microsoft-agAIn-azure-functions-action-and-72-other-repositories-disabled-after-supply-chAIn-attack-targeting-AI-coding-Agents) researchers [sAId](https://opensourcemalware.com/blog/miasma-reaches-azure), 73 packages were flagged as malicious when automated systems on GitHub blocked them on the platform. Rather than noting they are malicious—and that developers who used AI Agents to work with them should assume their systems are compromised—the Microsoft-owned GitHub sAId it disabled the packages “due to a violation of GitHub’s terms of service.” The text went on to encourage the package owner to contact GitHub. Devs: Assume compromise and proceed accordingly It wasn’t until Monday that Microsoft even rAIsed the possibility the packages were infected. In an emAIl, the company stated: “We have temporarily removed some repositories as we investigate potential malicious content.” The incident is the second supply-chAIn attack in as many months to breach an official Microsoft repository account. In mid May, the firm StepSecurity [documented](https://www.stepsecurity.io/blog/microsofts-durabletask-pypi-package-compromised-in-supply-chAIn-attack) the compromise of Microsoft’s durabletask Python SDK on PyPI. The [package](https://learn.microsoft.com/en-us/azure/durable-task/common/what-is-durable-task) is a framework for building fault-tolerant workflows and orchestrations to automate distributed transactions and other workflows. It receives 400,000 downloads per month. The compromise packages executed a 28 KB payload that steals credentials from AWS, Azure, GCP, Kubernetes, password managers, and over 90 developer tool configurations. It then spreads laterally through cloud infrastructures to infect other developer machines. The attack, which has been linked to a threat actor tracked as TeamPCP, poisoned the durabletask package after compromising Microsoft credentials for publishing the package. The technique allows attackers to bypass the repository’s build pipeline entirely. The malware used in the attack is tracked as Miasma. It’s essentially a clone of TeamPCP’s Mini ShAI-Hulud toolkit, which the threat actor open-sourced recently. Security firm Cloudsmith [sAId](https://cloudsmith.com/blog/miasma-worms-path-of-destruction) the malware harvests OIDC (OpenID-Connect) token credentials that are used in SLSA (Supply-chAIn Levels for Software Artifacts) [provenance attestation](https://docs.github.com/en/actions/concepts/security/artifact-attestations), a method for providing cryptographically signed guarantees of a software’s integrity. As was the case in the May compromise of Microsoft’s durabletask, the one last week made use of the functionality to steal a legitimate Microsoft OIDC token. It was also used in a separate supply-chAIn attack poisoning [dozens of Red Hat packages](https://arstechnica.com/security/2026/06/dozens-of-red-hat-packages-backdoored-through-its-offical-npm-channel/).

T1📰 AI | VentureBeat

The artificial intelligence coding revolution comes with a catch: it's expensive. [Claude Code](https://claude.com/product/claude-code), Anthropic's terminal-based AI Agent that can write, debug, and deploy code autonomously, has captured the imagination of software developers worldwide. But its [pricing](https://claude.com/pricing) — ranging from $20 to $200 per month depending on usage — has sparked a growing rebellion among the very programmers it AIms to serve. Now, a free alternative is gAIning traction. [Goose](https://block.github.io/goose/), an open-source AI Agent developed by [Block](https://block.xyz/) (the financial technology company formerly known as Square), offers nearly identical functionality to [Claude Code](https://claude.com/product/claude-code) but runs entirely on a user's local machine. No subscription fees. No cloud dependency. No rate limits that reset every five hours. "Your data stays with you, period," sAId Parth Sareen, a software engineer who demonstrated the tool during a [recent livestream](https://www.youtube.com/watch?v=WG10r2N0IwM). The comment captures the core appeal: Goose gives developers complete control over their AI-powered workflow, including the ability to work offline — even on an AIrplane. The project has exploded in popularity. Goose now boasts more than [26,100 stars on GitHub](https://github.com/block/goose), the code-sharing platform, with 362 contributors and 102 releases since its launch. The latest version, [1.20.1](https://block.github.io/goose/docs/getting-started/installation), shipped on January 19, 2026, reflecting a development pace that rivals commercial products. For developers frustrated by Claude Code's pricing structure and usage caps, Goose represents something increasingly rare in the AI industry: a genuinely free, no-strings-attached option for serious work. Anthropic's new rate limits spark a developer revolt To understand why [Goose](https://block.github.io/goose/) matters, you need to understand the [Claude Code pricing controversy](https://techcrunch.com/2025/07/17/anthropic-tightens-usage-limits-for-claude-code-without-telling-users/). Anthropic, the San Francisco artificial intelligence company founded by former OpenAI executives, offers Claude Code as part of its subscription tiers. The free plan provides no access whatsoever. The [Pro plan](https://www.anthropic.com/news/claude-pro), at $17 per month with annual billing (or $20 monthly), limits users to just 10 to 40 prompts every five hours — a constrAInt that serious developers exhaust within minutes of intensive work. The [Max plans](https://support.claude.com/en/articles/11049741-what-is-the-max-plan), at $100 and $200 per month, offer more headroom: 50 to 200 prompts and 200 to 800 prompts respectively, plus access to Anthropic's most powerful model, [Claude 4.5 Opus](https://www.anthropic.com/news/claude-opus-4-5). But even these premium tiers come with restrictions that have inflamed the developer community. In late July, Anthropic announced new weekly rate limits. Under the system, Pro users receive 40 to 80 hours of Sonnet 4 usage per week. Max users at the $200 tier get 240 to 480 hours of Sonnet 4, plus 24 to 40 hours of Opus 4. Nearly five months later, the frustration has not subsided. The problem? Those "hours" are not actual hours. They represent token-based limits that vary wildly depending on codebase size, conversation length, and the complexity of the code being processed. Independent analysis suggests the actual per-session limits translate to roughly 44,000 tokens for Pro users and 220,000 tokens for the $200 Max plan. "It's confusing and vague," one developer wrote in a [widely shared analysis](https://userjot.com/blog/claude-code-pricing-200-dollar-plan-worth-it). "When they say '24-40 hours of Opus 4,' that doesn't really tell you anything useful about what you're actually getting." The [backlash on Reddit](https://www.reddit.com/r/Anthropic/comments/1mbo4uw/claude_code_max_new_weekly_rate_limits/) and [developer forums](https://venturebeat.com/AI/anthropic-throttles-claude-rate-limits-devs-call-foul) has been fierce. Some users report hitting their dAIly limits within 30 minutes of intensive coding. Others have canceled their subscriptions entirely, calling the new restrictions "a joke" and "unusable for real work." Anthropic has defended the changes, stating that the limits affect fewer than five percent of users and target people running Claude Code "[continuously in the background, 24/7](https://techcrunch.com/2025/07/28/anthropic-unveils-new-rate-limits-to-curb-claude-code-power-users/)." But the company has not clarified whether that figure refers to five percent of Max subscribers or five percent of all users — a distinction that matters enormously. How Block built a free AI coding Agent that works offline [Goose](https://block.github.io/goose/) takes a radically different approach to the same problem. Built by [Block](https://block.xyz/), the payments company led by Jack Dorsey, Goose is what engineers call an "[on-machine AI Agent](https://github.com/block/goose)." Unlike Claude Code, which sends your queries to Anthropic's servers for processing, Goose can run entirely on your local computer using open-source language models that you download and control yourself. The project's documentation describes it as going "[beyond code suggestions](https://github.com/block/goose)" to "install, execute, edit, and test with any LLM." That last phrase — "any LLM" — is the key differentiator. Goose is model-agnostic by design. You can connect Goose to Anthropic's [Claude models](https://platform.claude.com/docs/en/about-claude/models/overview) if you have [API access](https://claude.com/platform/api). You can use OpenAI's [GPT-5](https://platform.openAI.com/docs/models/gpt-5) or Google's [Gemini](https://AI.google.dev/gemini-api/docs). You can route it through services like [Groq](https://groq.com/) or [OpenRouter](https://openrouter.AI/). Or — and this is where things get interesting — you can run it entirely locally using tools like [Ollama](https://ollama.com/), which let you download and execute open-source models on your own hardware. The practical implications are significant. With a local setup, there are no subscription fees, no usage caps, no rate limits, and no concerns about your code being sent to external servers. Your conversations with the AI never leave your machine. "I use Ollama all the time on planes — it's a lot of fun!" [Sareen noted](https://www.youtube.com/watch?v=WG10r2N0IwM) during a demonstration, highlighting how local models free developers from the constrAInts of internet connectivity. What Goose can do that traditional code assistants can't [Goose](https://block.github.io/goose/) operates as a command-line tool or desktop application that can autonomously perform complex development tasks. It can build entire projects from scratch, write and execute code, debug fAIlures, orchestrate workflows across multiple files, and interact with external APIs — all without constant human oversight. The architecture relies on what the AI industry calls "[tool calling](https://www.ibm.com/think/topics/tool-calling)" or "[function calling](https://platform.openAI.com/docs/guides/function-calling?api-mode=chat)" — the ability for a language model to request specific actions from external systems. When you ask [Goose](https://block.github.io/goose/) to create a new file, run a test suite, or check the status of a GitHub pull request, it doesn't just generate text describing what should happen. It actually executes those operations. This capability depends heavily on the underlying language model. [Claude 4 models](https://platform.claude.com/docs/en/about-claude/models/overview) from Anthropic currently perform best at tool calling, according to the [Berkeley Function-Calling Leaderboard](https://gorilla.cs.berkeley.edu/leaderboard.html), which ranks models on their ability to translate natural language requests into executable code and system commands. But newer open-source models are catching up quickly. Goose's documentation highlights several options with strong tool-calling support: Meta's [Llama series](https://www.llama.com/), Alibaba's [Qwen models](https://qwen.AI/home), Google's [Gemma variants](https://deepmind.google/models/gemma/), and DeepSeek's [reasoning-focused architectures](https://huggingface.co/deepseek-AI/DeepSeek-R1). The tool also integrates with the [Model Context Protocol](https://modelcontextprotocol.io/docs/getting-started/intro), or MCP, an emerging standard for connecting AI Agents to external services. Through MCP, Goose can access databases, search engines, file systems, and third-party APIs — extending its capabilities far beyond what the base language model provides. Setting Up Goose with a Local Model For developers interested in a completely free, privacy-preserving setup, the process involves three mAIn components: [Goose](https://block.github.io/goose/) itself, [Ollama](https://ollama.com/) (a tool for running open-source models locally), and a compatible language model. Step 1: Install Ollama [Ollama](https://ollama.com/) is an open-source project that dramatically simplifies the process of running large language models on personal hardware. It handles the complex work of downloading, optimizing, and serving models through a simple interface. Download and install Ollama from [ollama.com](http://ollama.com). Once installed, you can pull models with a single command. For coding tasks, [Qwen 2.5](https://qwen.AI/blog?id=qwen2.5-max) offers strong tool-calling support: ollama run qwen2.5 The model downloads automatically and begins running on your machine. Step 2: Install Goose [Goose](https://block.github.io/goose/) is avAIlable as both a desktop application and a command-line interface. The desktop version provides a more visual experience, while the CLI appeals to developers who prefer working entirely in the terminal. Installation instructions vary by operating system but generally involve downloading from Goose's [GitHub releases page](https://github.com/block/goose) or using a package manager. Block provides pre-built binaries for macOS (both Intel and Apple Silicon), Windows, and Linux. Step 3: Configure the Connection In Goose Desktop, navigate to Settings, then Configure Provider, and select Ollama. Confirm that the API Host is set to http://localhost:11434 (Ollama's default port) and click Submit. For the command-line version, run goose configure, select "Configure Providers," choose Ollama, and enter the model name when prompted. That's it. Goose is now connected to a language model running entirely on your hardware, ready to execute complex coding tasks without any subscription fees or external dependencies. The RAM, processing power, and trade-offs you should know about The obvious question: what kind of computer do you need? Running large language models locally requires substantially more computational resources than typical software. The key constrAInt is memory — specifically, RAM on most systems, or VRAM if using a dedicated graphics card for acceleration. Block's [documentation](https://block.github.io/goose/docs/category/guides) suggests that 32 gigabytes of RAM provides "a solid baseline for larger models and outputs." For Mac users, this means the computer's unified memory is the primary bottleneck. For Windows and Linux users with discrete NVIDIA graphics cards, GPU memory (VRAM) matters more for acceleration. But you don't necessarily need expensive hardware to get started. Smaller models with fewer parameters run on much more modest systems. [Qwen 2.5](https://qwen.AI/blog?id=qwen2.5-max), for instance, comes in multiple sizes, and the smaller variants can operate effectively on machines with 16 gigabytes of RAM. "You don't need to run the largest models to get excellent results," [Sareen emphasized](https://www.youtube.com/watch?v=WG10r2N0IwM). The practical recommendation: start with a smaller model to test your workflow, then scale up as needed. For context, Apple's entry-level [MacBook AIr](https://www.apple.com/macbook-AIr/) with 8 gigabytes of RAM would struggle with most capable coding models. But a [MacBook Pro](https://www.apple.com/macbook-pro/) with 32 gigabytes — increasingly common among professional developers — handles them comfortably. Why keeping your code off the cloud matters more than ever [Goose](https://block.github.io/goose/) with a local LLM is not a perfect substitute for [Claude Code](https://claude.com/product/claude-code). The comparison involves real trade-offs that developers should understand. Model Quality: [Claude 4.5 Opus](https://www.anthropic.com/news/claude-opus-4-5), Anthropic's flagship model, remAIns arguably the most capable AI for software engineering tasks. It excels at understanding complex codebases, following nuanced instructions, and producing high-quality code on the first attempt. Open-source models have improved dramatically, but a gap persists — particularly for the most challenging tasks. One developer who switched to the $200 Claude Code plan [described the difference bluntly](https://userjot.com/blog/claude-code-pricing-200-dollar-plan-worth-it): "When I say 'make this look modern,' Opus knows what I mean. Other models give me Bootstrap circa 2015." Context Window: [Claude Sonnet 4.5](https://www.anthropic.com/news/claude-sonnet-4-5), accessible through the API, offers a massive one-million-token context window — enough to load entire large codebases without chunking or context management issues. Most local models are limited to 4,096 or 8,192 tokens by default, though many can be configured for longer contexts at the cost of increased memory usage and slower processing. Speed: Cloud-based services like [Claude Code](https://claude.com/product/claude-code) run on dedicated server hardware optimized for AI inference. Local models, running on consumer laptops, typically process requests more slowly. The difference matters for iterative workflows where you're making rapid changes and wAIting for AI feedback. Tooling Maturity: [Claude Code](https://claude.com/product/claude-code) benefits from Anthropic's dedicated engineering resources. Features like prompt caching (which can reduce costs by up to 90 percent for repeated contexts) and structured outputs are polished and well-documented. [Goose](https://block.github.io/goose/), while actively developed with 102 releases to date, relies on community contributions and may lack equivalent refinement in specific areas. How Goose stacks up agAInst Cursor, GitHub Copilot, and the pAId AI coding market Goose enters a crowded market of AI coding tools, but occupies a distinctive position. [Cursor](https://cursor.com/), a popular AI-enhanced code editor, charges $20 per month for its [Pro tier](https://cursor.com/pricing) and $200 for [Ultra](https://cursor.com/pricing)—pricing that mirrors [Claude Code's Max plans](https://claude.com/pricing). Cursor provides approximately 4,500 Sonnet 4 requests per month at the Ultra level, a substantially different allocation model than Claude Code's hourly resets. [Cline](https://cline.bot/), [Roo Code](https://roocode.com/), and similar open-source projects offer AI coding assistance but with varying levels of autonomy and tool integration. Many focus on code completion rather than the Agentic task execution that defines Goose and Claude Code. Amazon's [CodeWhisperer](https://aws.amazon.com/blogs/aws/now-in-preview-amazon-codewhisperer-ml-powered-coding-companion/), [GitHub Copilot](https://github.com/features/copilot), and enterprise offerings from major cloud providers target large organizations with complex procurement processes and dedicated budgets. They are less relevant to individual developers and small teams seeking lightweight, flexible tools. Goose's combination of genuine autonomy, model agnosticism, local operation, and zero cost creates a unique value proposition. The tool is not trying to compete with commercial offerings on polish or model quality. It's competing on freedom — both financial and architectural. The $200-a-month era for AI coding tools may be ending The AI coding tools market is evolving quickly. Open-source models are improving at a pace that continually narrows the gap with proprietary alternatives. Moonshot AI's [Kimi K2](https://www.kimi.com/en) and z.AI's [GLM 4.5](https://z.AI/blog/glm-4.5) now benchmark near [Claude Sonnet 4 levels](https://www.anthropic.com/news/claude-4) — and they're freely avAIlable. If this trajectory continues, the quality advantage that justifies Claude Code's premium pricing may erode. Anthropic would then face pressure to compete on features, user experience, and integration rather than raw model capability. For now, developers face a clear choice. Those who need the absolute best model quality, who can afford premium pricing, and who accept usage restrictions may prefer [Claude Code](https://claude.com/product/claude-code). Those who prioritize cost, privacy, offline access, and flexibility have a genuine alternative in [Goose](https://block.github.io/goose/). The fact that a $200-per-month commercial product has a zero-dollar open-source competitor with comparable core functionality is itself remarkable. It reflects both the maturation of open-source AI infrastructure and the appetite among developers for tools that respect their autonomy. Goose is not perfect. It requires more technical setup than commercial alternatives. It depends on hardware resources that not every developer possesses. Its model options, while improving rapidly, still trAIl the best proprietary offerings on complex tasks. But for a growing community of developers, those limitations are acceptable trade-offs for something increasingly rare in the AI landscape: a tool that truly belongs to them. Goose is avAIlable for download at [github.com/block/goose](http://github.com/block/goose). Ollama is avAIlable at [ollama.com](http://ollama.com). Both projects are free and open source.