Grok Imagine Video 1.5 AI Video Generator

Upload a still and let Grok Imagine Video 1.5 render a cinematic clip with native sound — 5s, 10s, or 15s at 480p/720p.

Upload & Describe

JPG, PNG, or WEBP up to 20MB. NSFW is checked on upload.

0/4000

Result

Your generated Grok Imagine Video 1.5 clip will appear here.
Examples

See what Grok Imagine Video 1.5 can create

Turn on audio to hear the native sound generation. Each clip below started from a single image and was generated in one pass with no post-production.

UGC product spot with timed beats and synced dialogue

"(0-5s) Medium shot, she speaks warmly while gesturing to a product. (5-10s) Slow push-in to a close-up, glowing skin and expressive eyes. (10-15s) Cut to an over-shoulder framing of the vanity as she smiles. Glossy, warm, cinematic."

Product hero shot with a slow spin

"Brightly colored athletic running shoe resting on mossy ground, red to yellow gradient upper with grid pattern, thick sculpted neon green foam sole, red laces, wavy yellow eyestay overlay, surrounded by green moss and small ferns, blurred tree branches and bright blue sky background, extreme low angle close-up, vibrant product photography, sharp natural sunlight and doing a slow spin"

Landscape still animated with an orchestral score

"Camera tracks forward over the fjord as mist begins to drift between the mountains and the water ripples. A small red boat starts moving across the frame. Orchestral strings swell as the shot rises"

Cinematic scene with motion and ambient audio

"The character turns toward the camera and looks up, and behind him rain starts to fall. Crisp rainfall and fishing town ambience"

New in the video toolkit

One Image, One Cinematic Clip — With the Sound Baked In

LetsMkVideo already covers text-to-video, image-to-video, and effects. Grok Imagine Video 1.5 adds the piece the other tools do not: native sound generation in the same pass as the video. Upload one still, describe the motion and mood, and you get back an MP4 with synchronized dialogue, ambience, and score — no separate foley or voiceover step.

What Grok Imagine Video 1.5 Adds to Your Toolkit

Native audio, identity-locked animation, and three durations that ship — what you gain when Grok Imagine Video 1.5 joins the LetsMkVideo lineup.

Native Sound, Baked In

Grok Imagine Video 1.5 generates dialogue, ambient sound, and score in the same pass as the video. The MP4 you download already has synced audio — toggle it off only when you want a silent cut for further work.

Single-Image, Identity-Locked

Upload one still and Grok Imagine Video 1.5 uses it as the identity and composition lock for the entire clip. The face, pose, and framing stay consistent frame-by-frame — no drift, no re-interpretation.

5s, 10s, or 15s — Pick a Duration That Ships

Three durations, no 1-second granularity you will never use. Short 5s hooks for social, 10s product beats, 15s establishing shots — match the cut to the channel.

480p Draft, 720p Deliver

Iterate cheaply at 480p, switch to 720p for the version you ship. Eight aspect ratios (auto, 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3) cover landscape, portrait, square, and vertical-social crops.

Prompt-Guided Direction

Describe the camera move, action, and mood in plain language. Timed beats, dialogue cues, and genre cues let you steer motion and sound together — from slow push-ins to dialogue-driven cuts.

A Photographed, Not Generated, Look

Grok Imagine Video 1.5 favors real skin texture, practical imperfections, and photographed motion over the plastic 'AI aesthetic'. The look holds up next to live-action footage — which is exactly what a video toolkit needs.

One Still → One Sounding MP4

Three steps from an uploaded image to a downloadable, sounding clip.

01

Upload a Starting Still

Drop in one image — JPG, PNG, or WEBP up to 20MB. The still anchors identity and composition for the clip.

02

Set Duration, Resolution, Aspect, and Audio

Pick 5s, 10s, or 15s, choose 480p or 720p, set the aspect ratio, decide whether native audio is on. Then write the motion, camera move, and mood.

03

Generate and Download the MP4

Grok Imagine Video 1.5 renders video and native audio in one pass. Preview, iterate on the prompt, and download the MP4 that ships.

Grok Imagine Video 1.5 + LetsMkVideo FAQ

LetsMkVideo covers text-to-video, image-to-video, and effects. Grok Imagine Video 1.5 adds native sound: every clip ships with synchronized dialogue, ambient sound, and score by default. Use it whenever the shot needs to sound like a scene, not just look like one.
Grok Imagine Video 1.5 is xAI's image-to-video model. Upload a single still, describe the motion and mood in a prompt, and Grok Imagine Video 1.5 renders a 5s, 10s, or 15s MP4 at 480p or 720p, with native sound generation (dialogue, ambience, score) produced in the same pass as the video.
Yes. Native sound is generated alongside the video in a single pass, so the output MP4 ships with synchronized dialogue, ambient sound, and score by default. Toggle audio off when you want a silent cut for layered editing or a music-only cut.
Duration options are 5, 10, or 15 seconds — three lengths that map cleanly to social hooks, product beats, and short establishing shots. Resolution options are 480p and 720p. Aspect ratios include auto (follows the input still), 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, and 2:3.
One starting image (JPG, PNG, or WEBP, up to 20MB) plus an optional text prompt. The image anchors identity and composition; the prompt steers motion, camera, and mood. There is no text-to-video-only mode — every clip starts from a still.
Credit cost scales with duration and resolution. A 5-second 480p clip costs 2 credits, a 15-second 480p clip costs 6 credits, and 720p clips cost 2x the 480p rate (so a 5s 720p clip costs 4 credits). The exact cost is shown in the playground before you click Generate.
Most clips render in a few minutes, depending on duration, resolution, and current load. Shorter 480p clips finish fastest; longer 720p clips with native audio take more time. The playground shows live progress and notifies you the moment the MP4 is ready to download.
Yes. Generated videos can be used for commercial work — ads, product pages, social campaigns, client deliverables — when you have the rights to the source still and prompt input and comply with the service terms. Always confirm you have the rights to the source image before generating.

Add Native Sound to Your Next Clip

Drop a still into Grok Imagine Video 1.5 and download a cinematic MP4 with native audio — only on LetsMkVideo. Start generating with free credits today.