MOVA: Open-Source Model Generates Synced Video+Audio

MOVA is an open-source model that generates video and audio together instead of as two separate steps.

Available models:

MOVA-360p - faster generation: https://huggingface.co/OpenMOSS-Team/MOVA-360p
MOVA-720p - higher quality: https://huggingface.co/OpenMOSS-Team/MOVA-720p

The interesting part is how it keeps audio and video in sync. Most models generate the two separately and then try to line them up afterwards, which gets messy. MOVA generates both at once.

Setup instructions are in their repo.

It apparently runs on consumer GPUs, though the 720p model needs beefier hardware. The 360p version works fine for testing things out without burning through cloud credits.
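
For trying the 360p model first, here is a minimal sketch of fetching the weights from the Hugging Face links above. It is not the project's own setup procedure: it only downloads the checkpoint files, and it assumes the `huggingface_hub` package is installed. The helper names (`pick_repo`, `download_weights`) and the `checkpoints` directory are made up for illustration; running the model still requires following the repo's instructions.

```python
from pathlib import Path

# Repo IDs taken from the model links listed above.
MOVA_REPOS = {
    "360p": "OpenMOSS-Team/MOVA-360p",
    "720p": "OpenMOSS-Team/MOVA-720p",
}


def pick_repo(resolution: str = "360p") -> str:
    """Return the Hugging Face repo ID for a resolution (hypothetical helper).

    Defaults to 360p, the lighter model suggested for testing.
    """
    if resolution not in MOVA_REPOS:
        raise ValueError(
            f"unknown resolution {resolution!r}; expected one of {sorted(MOVA_REPOS)}"
        )
    return MOVA_REPOS[resolution]


def download_weights(resolution: str = "360p", target_dir: str = "checkpoints") -> Path:
    """Download the checkpoint files only (hypothetical helper).

    Assumes `pip install huggingface_hub`; this does not install or run MOVA.
    """
    # Imported lazily so pick_repo works even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    repo_id = pick_repo(resolution)
    local_dir = Path(target_dir) / repo_id.split("/")[-1]
    snapshot_download(repo_id=repo_id, local_dir=str(local_dir))
    return local_dir
```

Calling `download_weights("360p")` would place the files under `checkpoints/MOVA-360p`; switching the argument to `"720p"` fetches the larger model once the smaller one has been tested.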
