Where AI video still fails
The tech moved fast but 30% of generations still go to the trash. Concrete failure cases and workarounds.
Vendor marketing only shows successful generations. In practice 1 of 3 clips lands in the bin from artifacts. Here is where it breaks.
Hands and held objects
Fingers still glitch on every model. Especially actions with a tool, phone, coffee cup. Symptoms: 6 fingers, twisted joints, object passes through hand.
Workaround: shoot hands separately and composite, or use stock for hands, or frame so hands are not visible (wide shot, view from above).
Text and logos
AI still cannot render readable text in frame longer than 6-8 characters. Logos turn into "something similar".
Workaround: render text and logo separately in After Effects/Premiere over the AI video.
Character consistency
Across three scenes the hero's eye color shifts, face shape changes, ears reshape. Viewers notice.
Workaround: character reference (Sora 2, Runway Gen-4, Kling 2). Lock the face with one image and reference it in every prompt.
Liquid and material physics
Pouring water, smoke, fabric in wind — still unconvincing. Liquid hangs in air, moves wrong.
Workaround: for critical liquid shots — film or stock. For B-roll — ignore, no one inspects.
Duration and consistency
Past 5-8 seconds AI starts losing context. Objects vanish, background changes, lighting jumps.
Workaround: break clip into 3-5 second scenes, stitch in editor. Do not try one prompt for 30 seconds.
Famous people and brand faces
Most models block recognizable people (politicians, actors). When they don't block, they output similar but inaccurate.
Workaround: only with explicit rights. For brand faces — fine-tune the model on consented data.
Brand colors and visual identity
"Make it in our style" is poorly understood by AI. Need either a style-LoRA (open-source models) or a very detailed prompt with HEX colors, fonts, composition.
Workaround: build a set of reference images and tie every generation to them.