返回精选
AI 精选动态 智能评分 75

Google Interactions API GA 版发布并支持代理

来源: twitter关注列表
作者: Google AI Developers (@googleaidevs)
发布于: 2026-06-22
收录于: 2026-06-22
AI 推荐理由
首次将托管代码沙盒、异步执行和多模态生成能力集成到单一 API 中,超越了之前的纯推理接口。
核心解读
Google 正式发布 Interactions API GA 版,该 API 提供稳定的脚本架构和新的功能,通过统一的 /interactions 端点支持模型推理和自主代理。新功能包括托管式代理,可通过单次 API 调用启动安全的远程 Linux 沙盒(支持默认 antigravity-preview-05-2026 代理或自定义指令);线性化简化流程,将每个操作表示为独立步骤;异步后台执行支持;扩展的工具生态,支持计算机使用、文件搜索、Google 地图,并包含 Gemini API Docs MCP 服务器和 gemini-interactions-api 技能;同时支持通过 Google 搜索定位的 Nano Banana 2 进行图像生成、Lyria 3 生成音乐以及表情多声道语音合成。
全文
The Interactions API is now GA. Featuring a stable schema and new capabilities via a single /interactions endpoint, it is now our standard interface for both model inference and autonomous agents. Check out the key updates: — Managed Agents: Spin up a secure, remote Linux sandbox for code execution and browsing via a single API call, using the default antigravity-preview-05-2026 agent or your own custom instructions — Simplified Schema: Every action is now represented as its own step (from user_input to model_output, etc.) in a simplified, linear flow — Background Execution: Set background=True to run models and agents asynchronously, letting you poll for results when ready — Expanded tool ecosystem: Native support for Computer Use, File Search, and Google Maps, alongside new agent-building tools like the Gemini API Docs MCP server and gemini-interactions-api Skill — Media generation: Image generation via Google Search-grounded Nano Banana 2, music with Lyria 3, and expressive multi-speaker TTS Read the blog to learn more: https://t.co/ZzfpUGzAvm
#技术更新#产品发布#多模态