Model Market
Search, filter, and launch Playground in one continuous flow.
Capability
Input type
Price range ($)
Free trial
Only show starter-friendly models
Recently Added

grok
Grok Imagine Video
Grok Imagine Video offers hosted Grok video generation for text-to-video and image-to-video workflows.

vidu
Vidu Q3 Pro
Vidu Q3 Pro is Vidu’s advanced AI video model for higher-end text-to-video and image-to-video creation, designed to generate up to 16-second 1080p clips with native, synced audio, precise camera control, and stronger storytelling quality for ads, animation, and cinematic short-form content.
vidu
Vidu Q3 Turbo
Vidu Q3 Turbo is a speed-focused version of the Vidu Q3 video model, built for fast text-to-video and image-to-video generation with synchronized audio, short-form clip creation, and responsive iteration for creators who want quicker turnaround

seedance
Seedance 2.0
Seedance 2.0 is a multimodal AI video model developed by ByteDance. Unlike models limited to "text-to-video," Seedance 2.0 allows users to combine text, image, video, and audio as reference inputs, generating content with greater control and richer detail. Its announcement has sparked a massive wave of interest. Official sources state that the Seedance 2.0 API launch was postponed over copyright risk concerns; however, insider sources reveal a public rollout is slated for mid-to-late March 2026. ModelHunter AI will be among the first platforms to integrate Seedance2.0 API. Email [email protected] to join the waitlist.

seedance
Seedance 1.5 Pro
Seedance 1.5 Pro is ByteDance’s joint audio-video generation model built to follow complex prompts with higher precision, combining native synchronized audio, strong multilingual lip-sync, and film-grade cinematic motion for more immersive text-to-video and image-to-video creation.
Popular Models
All Models

grok
Grok Imagine Video
Grok Imagine Video offers hosted Grok video generation for text-to-video and image-to-video workflows.

vidu
Vidu Q3 Pro
Vidu Q3 Pro is Vidu’s advanced AI video model for higher-end text-to-video and image-to-video creation, designed to generate up to 16-second 1080p clips with native, synced audio, precise camera control, and stronger storytelling quality for ads, animation, and cinematic short-form content.
vidu
Vidu Q3 Turbo
Vidu Q3 Turbo is a speed-focused version of the Vidu Q3 video model, built for fast text-to-video and image-to-video generation with synchronized audio, short-form clip creation, and responsive iteration for creators who want quicker turnaround

seedance
Seedance 2.0
Seedance 2.0 is a multimodal AI video model developed by ByteDance. Unlike models limited to "text-to-video," Seedance 2.0 allows users to combine text, image, video, and audio as reference inputs, generating content with greater control and richer detail. Its announcement has sparked a massive wave of interest. Official sources state that the Seedance 2.0 API launch was postponed over copyright risk concerns; however, insider sources reveal a public rollout is slated for mid-to-late March 2026. ModelHunter AI will be among the first platforms to integrate Seedance2.0 API. Email [email protected] to join the waitlist.

seedance
Seedance 1.5 Pro
Seedance 1.5 Pro is ByteDance’s joint audio-video generation model built to follow complex prompts with higher precision, combining native synchronized audio, strong multilingual lip-sync, and film-grade cinematic motion for more immersive text-to-video and image-to-video creation.

seedance
Seedance 1.0 Pro
Seedance 1.0 Pro is ByteDance’s advanced video generation model for text-to-video and image-to-video creation, designed to produce smooth 1080p multi-shot videos with strong prompt understanding, cinematic motion, and rich visual detail.

seedance
Seedance 1.0 Pro Fast
Seedance 1.0 Pro Fast is a speed-optimized version of the Seedance 1.0 Pro family, commonly considered as a faster, lower-cost video model that preserves the core multi-shot text-to-video and image-to-video strengths of ByteDance’s Seedance line while prioritizing quicker generation and better efficiency.

seedance
Seedance 1.0 Lite
Seedance 1.0 Lite is ByteDance’s lightweight video generation model for fast, cost-efficient text-to-video and image-to-video creation, positioned as a more accessible version of the Seedance 1.0 family while still supporting multi-shot video generation, smooth motion, and short-form outputs.

kling
Kling V3.0
Kling V3.0 is Kuaishou’s latest flagship AI video model, positioned as an all-in-one creative engine for native multimodal creation with stronger consistency, more photorealistic output, up to 15-second video generation, and native audio for higher-end cinematic text-to-video and image-to-video workflows.
kling
Kling V2.6
Kling V2.6 is Kuaishou’s video generation model built around simultaneous audio-visual generation, letting creators produce video, voice, dialogue, and sound effects together in one workflow while improving coherence between what appears on screen and what is heard.

seedream
Seedream 5.0 Lite
Seedream 5.0 lite is ByteDance’s unified multimodal image generation model, built with deeper reasoning and online search capabilities to deliver stronger prompt understanding, better visual reasoning, and more accurate, context-aware image creation.

seedream
Seedream 4.5
Seedream 4.5 is ByteDance’s upgraded image creation and editing model, designed to deliver higher consistency and fidelity through stronger reference-image preservation, more accurate multi-image editing, and improved typography and dense text rendering for professional visual creative work.

seedream
Seedream 4.0
Seedream 4.0 is ByteDance’s new-generation image creation model that unifies image generation and image editing in a single architecture, built to handle complex multimodal tasks such as knowledge-based generation, visual reasoning, and reference-consistent creation while producing high-definition images at up to 4K resolution.

seedream
Seedream 3.0
Seedream 3.0 is ByteDance's high-resolution bilingual image generation model, built for Chinese and English prompt understanding with stronger text rendering, better structural accuracy, improved visual aesthetics, and native high-definition image creation.

gemini
Nano Banana 2
Nano Banana 2 is Google’s latest image generation and editing model, also called Gemini 3.1 Flash Image, built to combine fast “Flash” speed with stronger visual quality, sharper instruction-following, and more precise edits.

gemini
Veo 3.1
Veo 3.1 is the quality-focused Veo video model for premium text-to-video and image-to-video generation with default background audio.

gemini
Veo 3.1 Fast
Veo 3.1 Fast is the speed-focused Veo variant for quicker text-to-video and image-to-video iteration with default background audio.

elevenlabs
Eleven v3
Eleven v3 is an expressive dialogue-to-speech model for multi-speaker voice generation with audio tags and multilingual support.

wan
Wan 2.6
Wan 2.6 is an official Wan video generation model family supporting text-to-video, image-to-video, and video-to-video workflows in one unified async API.