Generate 1080p video with synchronized native audio from a text prompt and references. Aspect ratios: 16:9, 9:16, 1:1, 4:3, 3:4. Duration: 3–15s.
"character1 walks confidently down a neon-lit Tokyo street at night, cinematic lighting, smooth tracking shot"
"character1 dances joyfully in a sunlit meadow with flowers and butterflies, warm summer afternoon"
"character1 stands on a mountaintop overlooking a deep valley at golden-hour sunrise, wind in his hair"
"character1 sips coffee at a cozy cafe by a rain-streaked window, warm afternoon light, gentle camera push-in"
Related models
Happy Horse Text-to-Video
alibaba/happy-horse/text-to-video
Generate 1080p video with synchronized native audio from a text prompt. Aspect r...
Happy Horse Image-to-Video
alibaba/happy-horse/image-to-video
Alibaba's #1-ranked Happy Horse 1.0 — generate 1080p video with synchronized nat...
Happy Horse Video Edit
alibaba/happy-horse/video-edit
HappyHorse video editing supports advanced video editing through natural languag...
Wan 2.7 — Image to Video
alibaba/wan/v2.7/image-to-video
Wan 2.7 delivers enhanced motion smoothness, superior scene fidelity, and greate...
Start generating with Happy Horse Reference-to-Video
Get API access in minutes. No GPU setup, no infrastructure to manage.