Generate 1080p video with synchronized native audio from a text prompt. Aspect ratios: 16:9, 9:16, 1:1, 4:3, 3:4. Duration: 3–15s.
Input
Per second of generated video (720p baseline)
Output
ExampleExample output from Happy Horse Text-to-Video
Pricing
Criteria
Per second of generated video (720p baseline)
per second of video
7s of video for $1
Criteria
1080p
per second of video
3s of video for $1
Overview
Happy Horse Text-to-Video is Alibaba's flagship 1080p video generator with synchronized native audio built in — no separate audio model, no lip-sync rig, no post-production. Send a single prompt; get a fully-scored clip back.
Key capabilities
- ●Native audio: ambient sound, music, voice, foley — generated in lock-step with the visuals so they actually match (no overlay tricks)
- ●Multilingual: prompts and any embedded dialogue work across major languages
- ●Five aspect ratios: 16:9, 9:16, 1:1, 4:3, 3:4 — covers landscape ads, vertical short-form, square social, and portrait formats from one endpoint
- ●3-15 second clips at 720p or 1080p
- ●Cinematic motion: handles complex camera moves (dolly, push-in, aerial), shallow DOF, golden-hour lighting prompts well
Family
Part of the Happy Horse family — pair with the variants when you need a different starting modality:
| Variant | Input | Use it for |
|---|---|---|
| Text-to-Video | text prompt | one-shot clips from a brief |
| Image-to-Video | image + optional prompt | animating a still or hero shot |
| Video Edit | source video + edit prompt | transforming an existing clip (style, scene swap) |
| Reference-to-Video | text + 1-9 references | multi-character scenes, brand-consistent subjects |
Tech specs
- ●Resolutions: 720p, 1080p
- ●Duration: 3-15s
- ●Audio: native, in-sync, prompt-controlled
- ●Latency: 60-180s typical for a 5s clip; queue depth varies during peak hours
- ●Pricing: $0.14/s at 720p, $0.28/s at 1080p — simple per-second billing, no minimums
Frequently asked questions
Related models
Happy Horse Image-to-Video
alibaba/happy-horse/image-to-video
Alibaba's #1-ranked Happy Horse 1.0 — generate 1080p video with synchronized nat...
Happy Horse Video Edit
alibaba/happy-horse/video-edit
HappyHorse video editing supports advanced video editing through natural languag...
Happy Horse Reference-to-Video
alibaba/happy-horse/reference-to-video
Generate 1080p video with synchronized native audio from a text prompt and refer...
HeyGen Video Agent V3
heygen/v3/video-agent
Generate videos with a single prompt. Describe what you want in plain text, and ...
Start generating with Happy Horse Text-to-Video
Get API access in minutes. No GPU setup, no infrastructure to manage.