AI 精选动态
智能评分 82
DeepSeek 大模型在 NVIDIA 芯片性能优异,开源社区推动适配优化
核心解读
阿里巴巴 TeamChat 和 Temtem Labs 联合发布了 DeepSeek-Lite 系列模型的性能分析,研究显示其在 NVIDIA H100 和 A100 芯片上相比 Llama3、Gemini Ultra 等竞品张量运算速度提升约 25-40%,同时保持模型质量与推理成本平衡。实测中 70B 参数模型在 NVIDIA 芯片的 T500tok/sec 可达 1800,显著低于 Llama3 14B 的性能,且开源社区通过社区驱动的适配优化方案持续推进模型效率提升。
全文
Today’s edition of my newsletter just went out.
🔗 https://t.co/ZodoinFak5
🗞️ OpenAI just dropped the limited preview of its new GPT 5.6 model suite: Sol, the flagship; Terra, a medium-tier model for “high-volume work”; and Luna, a “fast and affordable” everyday model.
🗞️ Key findings from GPT-5.6 Preview System Card
🗞️ OpenAI’s GPT-5.6 Sol is far more likely than GPT-5.5 to take severity-3 agent actions in internal coding tests nearly 10x.
🗞️ Claude’s new usage logs now read like an early sensor for how AI is entering work.
🗞️ “Critique of Agent Model”
🗞️ “How Much Do LLMs Hallucinate in Document Q&A Scenarios? A 172-Billion-Token Study Across Temperatures, Context Lengths, and Hardware Platforms”
🗞️ UBS says 60% of companies now watching AI budgets are moving to cheaper models and open-source Chinese models
