Kling’s 3DiMo is a system for “3D-aware implicit motion control” that transfers motion from a reference video onto a new character. It achieves this while giving users camera control over the generated scene.
In demos, a source dance clip is mapped onto different humans, anime characters, and 3D avatars, while a virtual camera orbits, zooms, or tracks the action from above.
Unlike existing tools such as One-Animate or SCALE, which focus only on motion transfer, 3DiMo handles motion and camera view simultaneously. The generated videos maintain consistent character shape and environment even as the viewpoint changes.
This suggests the model builds an implicit 3D understanding of the scene. Examples show detailed hand and finger movements preserved, along with facial expressions, and even creative touches such as adding sakura branches to an upward pan in an anime scene. For now, only a technical paper and demos are available, with no indication that code or weights will be released.
If it eventually ships as a tool or API, 3DiMo could become a powerful option for music videos, VTubers, and virtual production pipelines.
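To make the idea concrete, here is a purely hypothetical sketch of what a request to such a service might look like. Kling has published no API, so every name and parameter below (`MotionTransferRequest`, `CameraKeyframe`, and so on) is invented for illustration; the only thing taken from the demos is the shape of the inputs: a motion source, a target character, and a camera trajectory.

```python
from dataclasses import dataclass, field

# Hypothetical request shape for a 3DiMo-style service. None of these
# names come from Kling; they only illustrate the three inputs the
# demos imply: a reference motion clip, a new character, and a
# user-specified camera path.

@dataclass
class CameraKeyframe:
    time_s: float            # timestamp in the output video
    orbit_deg: float = 0.0   # azimuth around the character
    elevation_deg: float = 0.0
    zoom: float = 1.0        # relative focal-length multiplier

@dataclass
class MotionTransferRequest:
    motion_video: str        # path/URL of the reference dance clip
    character_image: str     # the new character to animate
    camera_path: list[CameraKeyframe] = field(default_factory=list)

# Example: transfer a dance onto an anime character while the camera
# orbits 90 degrees, then tilts upward for an overhead pan.
request = MotionTransferRequest(
    motion_video="dance_reference.mp4",
    character_image="anime_character.png",
    camera_path=[
        CameraKeyframe(time_s=0.0),
        CameraKeyframe(time_s=4.0, orbit_deg=90.0),
        CameraKeyframe(time_s=6.0, orbit_deg=90.0, elevation_deg=30.0),
    ],
)
print(request)
```

The point of the sketch is that camera motion is a first-class input alongside the motion reference, rather than something baked into the source clip, which is what distinguishes 3DiMo from motion-transfer-only tools.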