Twinkle v0.4.0 Release
Key takeaway
An expansion of the Training-as-a-Service platform, with multi-instance support and deep learning model optimizations.
Full text
Twinkle is now at v0.4.0! 🔥
The fully open-sourced solution for multi-tenant Training-as-a-Service, with Tinker API compatibility. Now packed with broader model coverage, more training algorithm support, and an improved backend built to scale.
Here’s what’s cooking:
🐳 DeepSeek V4 Support: Flash FSDP2 + Expert Parallelism (EP) training, plus native tool-call parsing and cleanup.
🤖 Qwen3.5 Evolution: Maximize efficiency with padding-free / packed-sequence support and MoE GatedDeltaNet sequence parallelism.
🔮 Gemma 4: Full multimodal training support is officially here, complete with a fresh 12B cookbook!
🧬 LoRA Level-up: Added rsLoRA for Multi-LoRA, FSDP2 for Multi-LoRA SFT, and EP LoRA SFT examples for DeepSeek V4 and Qwen3.5 MoE.
⚡ NPU Acceleration: Huge stability and speed gains with fused operators (RMSNorm, RoPE, SwiGLU, SDPA) and FLA patches.
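For readers new to rsLoRA, here is a minimal sketch of the idea, assuming the standard rank-stabilized formulation: the adapter update is scaled by alpha / sqrt(r) rather than the conventional alpha / r, so the update does not shrink as quickly when the rank grows. The helper name `lora_scale` and its parameters are hypothetical and are not taken from Twinkle's API.

```python
import math


def lora_scale(alpha: float, rank: int, use_rslora: bool = False) -> float:
    """Return the multiplier applied to the low-rank update B @ A.

    Hypothetical helper for illustration only; not part of Twinkle.
    Conventional LoRA scales by alpha / rank.
    rsLoRA (rank-stabilized LoRA) scales by alpha / sqrt(rank).
    """
    if rank <= 0:
        raise ValueError("rank must be a positive integer")
    if use_rslora:
        return alpha / math.sqrt(rank)
    return alpha / rank


if __name__ == "__main__":
    alpha = 16.0
    # Compare both scaling rules across a few adapter ranks.
    for rank in (8, 64, 256):
        plain = lora_scale(alpha, rank)
        stabilized = lora_scale(alpha, rank, use_rslora=True)
        print(f"rank={rank:>3}  lora={plain:.4f}  rslora={stabilized:.4f}")
```

With alpha = 16 and rank 64, the conventional factor is 0.25 while the rsLoRA factor is 2.0, which is why rsLoRA is usually discussed in the context of higher-rank adapters.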
Time to supercharge your cluster and squeeze out every ounce of compute. 🏎️💨
👉 Check out the full release notes at https://t.co/loDlgJbttv, and drop us a ⭐ on GitHub: https://t.co/yVxwZivrvO ❤️