AI 精选动态
智能评分 60
AI 视频模型舌头旋转评测
AI 推荐理由
阅读原帖了解 Gemini 等模型在细致指令测试中的表现,有助于选择合适的视频生成模型核心解读
Alpha Mom (CuiMao) 评价了 Seedance 2.0、Kling 3.0 Pro、Gemini Omni Flash 和 Grok Imagine 1.5 在 10 秒舌头旋转任务中的表现,每次给每个模型 4 次尝试,最终 Gemini Omni Flash 排名第一,Grok Imagine 1.5 第二,Seedance 2.0 第三,Kling 3.0 Pro 第四,并指出 Kling 在一次尝试中因政策违规未通过。
全文
CuiMao (@CuiMao) 转发了 Alpha Mom (@YourAlphaMom) 的帖子:
New tongue-physics test for the best AI video models!
Seedance 2.0, Kling 3.0 Pro, Gemini Omni Flash, and Grok Imagine 1.5 were given a task that sounds ridiculously simple for a human:
Rotate the tongue in a continuous circle for 10 seconds. That’s it.
Each model received exactly four attempts.
Yet this tiny movement turned into an unexpectedly brutal test of instruction following, anatomy, motion consistency, and basic physical control.
And the result will probably surprise a lot of AI video creators. Spoiler: Seedance 2.0 completely failed to live up to its reputation this time.
- Kling 3.0 Pro:
The old man somehow managed to detect a policy violation in one attempt, although the other three generations went through without any problems.
Unfortunately, the result was still the worst of the four.
Kling failed the main instruction every single time: the tongue never actually rotated in a circle.
It moved sideways, twisted, stretched, and performed all kinds of strange anatomical experiments, but it never completed the requested motion.
The image itself looked reasonably realistic, but visual quality was almost irrelevant here. The movement was unnatural, inconsistent, and completely missed the task.
Everything that could go wrong did.
- Grok Imagine 1.5:
This was an unexpectedly good result.
The image still has Grok’s usual slightly cartoonish appearance, but its instruction following was surprisingly strong.
The tongue actually rotated.
Not perfectly, not with completely realistic anatomy, and not always through a clean full circle, but Grok understood what it was supposed to do and made a genuine attempt to execute it.
Considering its performance in several previous tests, this was a very pleasant surprise.
- Gemini Omni Flash:
The only model that handled the task convincingly.
Omni understood the instruction, produced a realistic-looking image, and generated an actual circular tongue movement.
It still isn’t perfect, and the motion does not yet look 100% like something a real person would perform. But compared with the other models, this was easily the cleanest, most controlled, and most believable result.
A genuinely strong performance.
Google finally found a very specific battlefield where Omni Flash can flex.
- Seedance 2.0:
This was the biggest disappointment.
The result was better than Kling’s, but Seedance still failed to perform the requested circular movement.
Seedance remains excellent at large action scenes, complex choreography, and cinematic motion. But in these simple human-body tests, including running and now tongue control, it keeps exposing unexpected weaknesses.
The movement looked more polished than Kling’s, but the core instruction was still not followed.
And for the first time in one of my tests, Seedance finishes below Grok.
- My ranking:
1. Gemini Omni Flash. The best instruction following, the cleanest circular motion, and the most realistic overall result.
2. Grok Imagine 1.5. Not perfect, but it understood the assignment and produced a surprisingly decent attempt.
3. Seedance 2.0. More polished than Kling, but clearly behind Grok and Omni in actual task execution.
4. Kling 3.0 Pro. Censorship, broken anatomy, and almost every possible tongue movement except the one I requested.
What do you think of this test? Do you agree with my ranking?
#AIVideo
https://video.twimg.com/amplify_video/2068046109033828353/vid/avc1/1920x1080/VpG5MSbRxyvKDunn.mp4?tag=28
> **引用原帖 Alpha Mom (@YourAlphaMom):**
> Another body-physics test for the best AI video models.
> This time, Elon’s pride and joy Grok Imagine 1.5 faces the old but stubborn Kling 3.0 Pro, China’s flagship Seedance 2.0, and America’s shiny newcomer Gemini Omni Flash, which developers called “revolutionary,” although I’m still waiting to see the revolution.
> The new battlefield: jump-rope exercises filmed from the front.
> There is a lot to analyze here. But let’s be honest. We all know exactly where everyone will be looking. Or rather, at which two points.
> The result was surprisingly close.
> - Kling 3.0 Pro:
> In this test, Kling was the only model that detected adult content in a completely harmless fitness scene and repeatedly refused to generate it.
> Which is especially funny considering how often Kling is used to bypass censorship with celebrity faces, yet apparently a woman jumping rope is where it draws the moral line.
> After several attempts, it finally produced a usable result.
> The lighting and overall realism are strong, as usual, but the actual body physics are the weakest here. It also missed the visual instructions from the prompt, giving me an older-looking model with less flattering proportions instead of the athletic, curvy character I requested.
> Not terrible, but definitely not Kling’s finest performance.
> - Grok Imagine 1.5:
> Apparently, Elon’s creation performs better from the front than from behind.
> Grok understood the task immediately, generated quickly, and didn’t complain about censorship.
> The image still has that unmistakably cartoonish Grok look, but the jumping motion is surprisingly fun and the body physics are actually decent.
> It feels more like a cutscene from a video game than real footage, but this is still one of Grok’s better results in my recent tests.
> - Gemini Omni Flash:
> Once again, Omni produced a beautiful and polished image.
> Google’s usual slow-motion, low-FPS effect is normally one of its biggest weaknesses, but in this specific test it actually worked in its favor.
> The slower movement makes the secondary motion much easier to see, and the body physics look smooth, convincing, and surprisingly natural.
> I liked this result a lot.
> - Seedance 2.0:
> Seedance performed much better than in the previous body-physics test.
> The footage is dynamic, vibrant, realistic, and visually the strongest of the four.
> The physics still don’t look completely natural to me, but the overall result is so convincing that it deserves first place, or at least a shared victory with Omni Flash.
> Omni may have slightly better body physics, while Seedance wins in movement, realism, energy, and overall image quality.
> - My ranking:
> 1. Seedance 2.0 and Gemini Omni Flash. A shared first place. Omni wins on body physics, while Seedance wins on overall realism, visual quality, and dynamic motion. If I absolutely had to choose one, Seedance would take it by a very small margin.
> 2. Grok Imagine 1.5. Still behind the two flagships, but noticeably better than in the previous tests.
> 3. Kling 3.0 Pro. Too much censorship, weaker instruction following, and the least convincing physics. The realistic lighting and natural-looking footage save it from being a complete failure.
> What’s your ranking?
> And if anyone wants the prompt, ask in the comments and I’ll share it.
> #AIVideo
> https://x.com/YourAlphaMom/status/2067646016724074706