返回精选
AI 精选动态 智能评分 65

ENPIRE系统幕后详解

来源: twitter关注列表
作者: Jim Fan (@DrJimFan)
发布于: 2026-06-17
收录于: 2026-06-17
AI 推荐理由
详细披露了安全层设计、奖励函数冻结机制和资源遥测指标等工程细节,与通常只展示结果的宣传不同,值得参考其系统设计思路。
核心解读
NVIDIA GEAR实验室的ENPIRE系统通过8个机器人集群实现物理世界AutoResearch,系统包含两层安全机制(硬运动学限制和扭矩限制)和基于演示学习的冻结奖励函数,并定义了MRU、MTU等遥测指标以优化资源利用。
全文
I made Physical AutoResearch sound simple (conceptually), but it took a village to pull off and lots of design thinking into the robot /loopcraft. The hardest part is everything we need to setup *before* pressing Enter. Here's a behind-the-scene tour: 1. Safety harness Letting 8 robots run unattended overnight means safety has to be more than a hint in the system prompt. ENPIRE hardwires it in 2 layers: (1) hard kinematic limit that trips an immediate task failure and auto-resets as soon as a robot leaves its safety envelope, and (2) a torque-limited compliant gripper so a bad contact or misaligned insertion ends in a safe stall, instead of crushing the robot or the object at hand. We make safety more conservative than usual so humans can sleep tight. In reality, we still need a few human operators to watch over the "robots of loving grace". 2. Definition of /done An agent that can edit its own reward will game it for sure. ENPIRE fixes the goalposts before the fleet can move them. Here's the recipe: Collect a few minutes of success & failure demos -> Ask agent to write code using computer vision tools to classify success and measure against groundtruth -> Agent hill-climbs on classifier until reliably good -> This classifier becomes the real-time reward function that directly computes on sensor streams -> *Freeze* the reward function before AutoResearch. It's sacred, enshrined in a Gym env that no one can touch. 3. System telemetry design Robot-seconds is by far the scarcest resource, followed by GPU-seconds, and finally tokens. We instrument all three and surface them to ENPIRE for live resource awareness rather than letting it hill-climb in a vacuum. We define: - Mean Robot Utilization ("MRU"): the fraction of wall-clock time when the robot is actively executing an experiment. Otherwise the hardware is sitting idle and waiting for the next code commit. - Mean Token Utilization ("MTU"): tokens consumed per minute, our proxy for how hard the agent is actually thinking. A low MTU means the agent is stalled, waiting on a robot rollout to finish instead of doing research. - GPU utilization: fraction of wall-clock time when GPU is active. ... and evaluate on two budget-to-outcome metrics: 1. Tokens-to-Success: token budget the fleet burns to complete /goal. 2. Time-to-Success: wall-clock time to /goal https://video.twimg.com/amplify_video/2067270848499851264/vid/avc1/1920x1080/TjQcTQHBN_ugg7Yw.mp4?tag=28 > **引用原帖 Jim Fan (@DrJimFan):** > Today, we enable AutoResearch in the physical world for the first time! Introducing ENPIRE: we give 8 Codex agents a fleet of robots, an allocation of GPUs, and generous token budget. We set them free with a simple goal: solve the task as quickly as possible, keep the robots busy but stay safe, don't waste precious compute. Make no mistake. > Then humans step aside and our watch begins. The robot fleet starts to come alive: they learn to look for visual clues, reset the scene, practice novel skills, tinker with control stack, read papers online, debate, reflect, get stuck, and try again directly on the hardware. All we did is to give Codex an API to the world of atoms, and the rest is emergence. > ENPIRE is able to solve high-precision tasks like tying zip-ties, organizing fine pins, and installing GPUs all by itself. We also discovered a new type of "physical scaling": 8 robots exploring in parallel improves significantly faster than fewer ones. > A part of our NVIDIA GEAR lab now self-improves tirelessly over night. We just read the reports in the morning. > /goal: we all take a holiday and Jensen wouldn't even notice ;) > We will be open-sourcing everything, so you can host your self-running robot lab at home too! Deep dive in the thread: > https://x.com/DrJimFan/status/2066921736369766762
#技术突破#智能体#研究