AI Featured Feed · Smart score: 60

Baidu Releases New Unlimited-OCR Model

Source: Twitter following list
Author: Baidu Inc. (@Baidu_Inc)
Published: 2026-06-23
Collected: 2026-06-23
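Why the AI recommends it
Unlimited-OCR uses R-SWA to parse long documents efficiently, and the post reports a quantified lead over DeepSeek-OCR, so it is worth watching as a marker of OCR progress.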
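Key takeaways
Baidu's PaddlePaddle team has released the Unlimited-OCR model. It uses the R-SWA (Reference Sliding Window Attention) mechanism to keep the KV cache constant during decoding, which lets it parse PDFs of hundreds of pages in a single pass. The quoted post reports a score of 93% on the OmniDocBench benchmark, +6% over DeepSeek-OCR.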
Full text
Baidu Inc. (@Baidu_Inc) reposted a post by jzhang533 (@jzhang533):

Baidu's open OCR stack:

- PP-OCR: fast text detection + recognition
- PaddleOCR-VL: turns pages into structured docs
- Qianfan-OCR: promptable document intelligence
- Unlimited-OCR: long-horizon PDF/multi-page parsing

OCR is documents becoming computable.

> **Quoted post by Adina Yakup (@AdinaYakup):**
> Unlimited-OCR 🔥New OCR from @PaddlePaddle
>
> It can parse hundreds of pages in a single pass while maintaining stable speed.
>
> The key idea is R-SWA (Reference Sliding Window Attention), which keeps KV cache constant during decoding.
>
> 🏆 93% on OmniDocBench
> 📈 +6% over DeepSeek-OCR
>
> https://x.com/AdinaYakup/status/2069081754837614998
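
The post says only that R-SWA keeps the KV cache constant during decoding and gives no implementation details. As a rough illustration of how a cache can stay bounded while the sequence grows, the toy sketch below keeps a small fixed set of early entries plus a sliding window of the most recent ones. The split into "reference" and "recent" entries, the class name `BoundedKVCache`, and the parameters `num_reference` and `window` are all assumptions made for this example; this is not Unlimited-OCR's actual design.

```python
from collections import deque


class BoundedKVCache:
    """Toy KV cache: a fixed block of early entries plus a sliding window.

    Hypothetical illustration only; not the R-SWA implementation.
    """

    def __init__(self, num_reference: int, window: int) -> None:
        self.num_reference = num_reference
        self.reference = []                  # filled once, never evicted
        self.recent = deque(maxlen=window)   # deque drops the oldest entry when full

    def append(self, position: int, key, value) -> None:
        entry = (position, key, value)
        if len(self.reference) < self.num_reference:
            self.reference.append(entry)
        else:
            self.recent.append(entry)

    def positions(self) -> list:
        """Token positions a new query could attend to."""
        return [p for p, _, _ in self.reference] + [p for p, _, _ in self.recent]

    def __len__(self) -> int:
        return len(self.reference) + len(self.recent)


def simulate_decode(num_tokens: int, num_reference: int = 4, window: int = 8):
    """Append one placeholder entry per decoded token and record the cache size."""
    cache = BoundedKVCache(num_reference, window)
    sizes = []
    for pos in range(num_tokens):
        # Strings stand in for the real key/value tensors.
        cache.append(pos, key=f"k{pos}", value=f"v{pos}")
        sizes.append(len(cache))
    return cache, sizes


if __name__ == "__main__":
    cache, sizes = simulate_decode(1000)
    print("largest cache size:", max(sizes))
    print("visible positions:", cache.positions())
```

In this sketch the cache never holds more than `num_reference + window` entries, however many tokens are decoded. That bound is the property the post attributes to R-SWA, and it would be consistent with the stable speed the post reports on long documents.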
#ModelRelease #TechBreakthrough #Benchmark