Spending 40 minutes tweaking a prompt that still looks wrong is the most common Omni frustration I hear. This guide has 200 pre-engineered prompts covering cinematic 8K visuals, character consistency across generations, and advanced creator workflows. Takes you 30 seconds to use: grab it here.
Google published their Gemini Omni prompt guide on May 20th. It covered the framework. It skipped the prompts.
No copy-paste examples. No quota reality. No comparison with Veo. No explanation of why your carefully written paragraph gets half-ignored by the model. Just the framework, clean and untested.
This is what they left out.
What Gemini Omni actually is (not the marketing version)
Gemini Omni Flash is not a video plugin bolted onto Gemini. Google DeepMind built it from the ground up as what they call an "any-to-any world model" — meaning the same architecture processes text, images, audio, and video simultaneously, not sequentially through separate pipelines. You hand it a text prompt and a reference image and a piece of music, and it handles all three as one unified input rather than routing them to different models behind the scenes.
Output is a 10-second video clip with synchronized native audio. That 10-second cap is a policy decision, not a model constraint, based on everything in the architecture papers.
Three things matter most for creators before you touch the prompt box:
First: conversational editing is the real feature. Before Omni, every change meant re-prompting from scratch, which meant a completely different output. Now you can type "change the lighting to golden hour, keep everything else" and it adjusts that specific element while preserving what you already built.
Second: quota burns faster than you expect. One tester from PromptsLove hit 86% of their daily AI Pro allowance in two clips during a complex multimodal session. Plan sessions around this, not against it.
Third: SynthID watermarking is mandatory on every output. It is invisible to human eyes but detectable by Google's verification tools. Every video you publish from Gemini Omni carries this marker. Factor that into your publishing workflow.
Pricing and access (the table nobody publishes clearly)
Plan | Monthly Cost | Omni Access |
|---|---|---|
YouTube free tier | $0 | Limited, Google One integration required |
AI Plus | $20/month | Included in standard quota |
AI Pro | $30/month | Expanded quota, better priority |
AI Ultra | $100/month | Priority access, highest quota |
Developer API | Not yet live | Planned, no date confirmed |
AI Pro is the tier most serious creators will land on. At $30/month it gives you meaningful quota for testing and light production. If you are doing daily video content, AI Ultra is worth the math. And be aware: the quota depletes per generation complexity, not per clip count. A heavily multimodal prompt (text plus reference image plus audio input) burns significantly more quota than a clean text-to-video prompt.
Gemini Omni vs Veo: they are not the same thing
This is the question that gets the most confused answers online. PixVerse covers it briefly but pushes their own tool in the comparison. So here is the neutral version.
Dimension | Gemini Omni Flash | Veo |
|---|---|---|
Architecture | Unified multimodal (any-to-any) | Specialized video generation model |
Primary strength | Mixed-input creation + conversational editing | Cinematic quality, longer video |
Input types | Text, image, audio, video simultaneously | Primarily text and image |
Output length | 10 seconds (policy cap) | Up to 60+ seconds |
Audio | Native synchronized audio generation | Separate |
Editing | Conversational (chat-based iteration) | Re-prompt from scratch |
Access | Gemini, Flow, YouTube Shorts, YouTube Create | Google One, Vertex AI |
API | Coming soon | Available via Vertex |
Best for | Content creators, short-form, multimodal projects | Filmmakers, long-form, professional production |
If you want a 10-second product video clip with voice synced to a brand track, Gemini Omni is the faster workflow. If you want a two-minute cinematic sequence, Veo is still the right call. They are not competing for the same use case.
The 6-dimension prompt framework (what Google published, expanded with what works)
Google's official guide describes six dimensions. They are all real. But the official version does not tell you which dimensions have the most leverage, or what happens when you skip one.
Dimension 1: Shot framing and motion. This is the most influential dimension for creative control. State the camera position and movement before anything else.
Skip it and the model defaults to a medium shot with slow drift. Sometimes that is exactly right. More often it is not.
Dimension 2: Style. The visual language of the clip. Cinematic, Studio Ghibli, film noir, claymation, watercolor animation, architectural visualization. Name the reference clearly. The model has strong associations with well-known visual styles and uses them precisely.
Dimension 3: Lighting. Emotional control. A warm single-source lamp and flat midday daylight can make the same scene feel completely different. Name the source, the quality (crisp vs diffuse vs ethereal), and the direction.
Dimension 4: Location. One phrase. The model expands from intention, so you rarely need more than "rain-soaked Tokyo alleyway" or "minimalist white studio." Over-describing location can pull the model's attention away from your subject and character.
Dimension 5: Action. What happens, who does it, how things move. The event at the center of the shot. This dimension and shot framing together do the most work in the output.
Dimension 6: Text rendering. Gemini Omni can place readable text inside generated video, which is something most AI video models get wrong or skip entirely. If you want captions, titles, or labels that are actually legible in the output, you have to explicitly specify them here. Otherwise the model will not include them.
Covering all six in one prompt produces consistently better output than covering four or five. Not because of some magic checklist, but because each dimension reduces the model's ambiguity about what you actually want.
Camera vocabulary: the full reference
This is the section most prompt guides skip or abbreviate. Camera terms function as technical commands for Gemini Omni. The model has real associations with filmmaking language and responds to it with precision.
Shot types:
Close-up. Tight on subject, fills the frame. Medium shot. Subject from waist up. Wide shot. Subject in full environmental context. Extreme wide. Environment dominant, subject small or absent. Over-the-shoulder. Camera behind one person looking at another. POV. Camera as the viewer's eyes. Dutch angle. Frame tilted to signal tension or disorientation.
Camera motion:
Oner / one continuous shot. No cuts, everything happens in a single unbroken take. Locked off / static. Camera does not move at all. Push in / punch in. Camera moves slowly toward the subject. Pull out / pull back. Camera retreats from the subject. Dolly zoom. Subject holds the same frame size while the background perspective warps. Named after Hitchcock who used it first. Orbit / arc. Camera circles around the subject. Tilt up / tilt down. Camera pivots vertically on a fixed axis. Pan left / pan right. Camera pivots horizontally. Handheld. Subtle natural movement, documentary feel. Natural smartphone zoom. Emulates a phone zooming in, slightly imperfect. Film camera. Cinematic grain and specific motion characteristics. Webcam style. Flat, slightly distorted, intentionally low-fidelity.
Practical tip: Combine one shot type with one motion command. "Close-up, push in slowly" gives Gemini Omni clear simultaneous instructions about where the camera starts and where it goes. Stacking more than two camera instructions in one prompt tends to produce outputs that split the difference rather than execute any of them cleanly.
Multi-turn conversational editing: the workflow nobody uses correctly
This is genuinely the most underused capability in Gemini Omni. Most people treat it like a text-to-video tool where you write a prompt, get a clip, and accept the result. That is leaving most of the model's capability on the table.
Here is how the actual workflow goes:
Generate a base clip first. Keep this first prompt clean and specific about what matters most to you. Do not try to specify everything at once.
Then make one surgical change at a time. The key syntax is: "Change [specific element]. Keep everything else identical." That phrase matters. "Keep everything else identical" is not just nice phrasing; it signals to the model that you want preservation, not regeneration.
Some examples of how this works in practice:
You generate a clip of a woman walking through a neon-lit city street. You like the look but the lighting feels too cold. You type: "Change the lighting to warm amber streetlamps. Keep everything else identical." The model adjusts the light quality while maintaining the character, the motion, the camera angle, and the environment.
You generated a product video for a watch on a marble surface. You want the camera angle lower. "Shift the camera to a low angle, looking up at the product. Keep everything else identical."
You generated an explainer animation but the pacing feels rushed. "Slow the motion of all elements by 40 percent. Keep everything else identical."
Do one change per turn when precision matters. Pile in five changes at once and you lose the ability to identify which one caused a problem, and the model is more likely to regenerate broadly rather than preserve what you built.
30 copy-paste prompts by use case
None of the 7 competitors in this space has a clean, organized copy-paste prompt library. C7 (AI Blew My Mind on Substack) comes closest but is paywalled. This is the free version.
Cinematic short films
Wide shot, slow dolly forward. A lone figure walks toward a lighthouse at dusk, fog rolling across the water, warm amber glow from the lighthouse windows, cinematic 4K, film grain.
Extreme close-up of hands turning the pages of an old handwritten journal, natural window light from the left, shallow depth of field, vintage 16mm film style, gentle page-turning sounds.
Overhead drone shot of a circular clearing in a dense forest, early morning mist, golden light breaking through the canopy, slow clockwise arc, cinematic, no characters.
Product visualization
Locked off, centered frame. A luxury watch rests on a sheet of dark brushed concrete. Camera slowly pushes in. Studio lighting, single overhead softbox, deep shadows on either side. No background, pure product focus.
Close-up macro shot. A glass bottle of serum is picked up by a hand from a marble surface. Morning light from the right. Camera tilts up to follow the bottle. Clean, editorial photography style.
Medium shot. A pair of sneakers rotates slowly on a floating platform, cinematic lighting, black background, subtle rim light from behind, architectural visualization quality.
Talking characters and voiceover
Medium close-up. A friendly avocado with a face speaks directly to camera: "I'm not just a snack. I'm a lifestyle." Bright studio lighting, white background, slight camera handheld wobble, Pixar-style 3D animation.
Close-up on the face of a scientist looking directly into camera in a lab setting. She says: "We were not expecting these results." Fluorescent lighting, slight depth of field blur on background, documentary style.
Medium shot, locked off. An animated illustrated character in a 1920s speakeasy stands at a bar and recites the first verse of a short poem. Vintage watercolor style, sepia tones.
Explainer animations
Top-down flat animation showing coins multiplying over time with a progress bar and year counter. Kurzgesagt style. Clean white background, bright primary colors, no characters, data visualization aesthetic.
Side-view cross-section of a coffee machine showing water path from reservoir through heating element to portafilter. Schematic line art style, labeled components, slow step-by-step reveal.
Animated bar chart racing showing global population growth from 1900 to 2024. Country flags, speed builds gradually, data visualization style, clean background.
UGC-style ads
Handheld POV shot walking through a farmers market in bright morning sunlight, person reaching for a product from a wooden stall, natural ambient sound, real-life documentary feel, no professional lighting.
Natural smartphone zoom. A person sits cross-legged on a couch, holding a product up to camera and saying: "I was skeptical but I have been using this for three weeks." Warm indoor lamp light, casual home setting.
Wide shot, handheld. A solo traveler walks toward the entrance of a train station pulling a small suitcase. Headphones visible. Golden hour light. City ambient sounds. UGC travel content aesthetic.
YouTube Shorts and social video
Vertical frame, medium shot. A chef in a home kitchen holds up a single ingredient, turns it toward camera, and sets it down on a wooden cutting board. Quick cut implied by motion. Bright kitchen lighting, casual style.
Vertical frame, close-up. An open book with highlighted text. A hand with a pen underlines a passage. Camera tilts up slowly to reveal the reader's face, thoughtful expression. Study aesthetic, warm desk lamp.
Vertical frame, close-up on a phone screen. Finger taps through an app interface. Screen content visible and readable. Shallow depth of field on the finger.
Storyboard-driven scenes
[0:00-0:03] Medium shot. Character enters through a heavy wooden door into a candlelit medieval library. Camera locked off, slight natural flicker from candles. [0:03-0:07] Push in slowly to a particular book on a shelf, glowing faintly. [0:07-0:10] Character reaches out toward the book.
[0:00-0:04] Aerial establishing shot of a coastal cliff at sunrise, ocean far below. Slow drift downward. [0:04-0:10] Medium shot. Character stands at the cliff edge, wind in hair, looking out. Camera orbits gently from left to right.
World-knowledge grounding
Historically accurate scene of a 15th century Venetian canal. Gondoliers in period costume, merchant boats, Renaissance palazzo architecture in background. Overcast morning light, no anachronisms. Oil painting style.
Scientifically accurate visualization of neurons firing in the human brain. Bright electrical pulses traveling along dendrites, blue and amber bioluminescence, macro scale, slow motion, 3D scientific illustration style.
Culturally accurate depiction of a traditional Japanese tea ceremony. Indoor tatami setting, morning light from shoji screens, deliberate slow movements, no spoken dialogue. Documentary cinematic style.
Lofi and music video
Slow pan across a lo-fi illustrated bedroom at night. Rain on a window. A cartoon character sits at a desk studying. Warm lamp glow, retro anime aesthetic, soft focus, ambient rain sounds.
Overhead shot of a vinyl record spinning on a turntable. Needle in groove. Warm amber light from one direction. Slow push in toward the center label. Film grain, 16mm.
Medium shot. A solo musician plays upright bass in an empty jazz club at 2am. Single overhead spotlight. Rest of the room dark. Camera slowly orbits left. Cinematic, intimate.
Fashion and ecommerce
Medium shot. Model walks slowly toward camera through a minimalist concrete space wearing a structured jacket. Natural daylight from skylights above. Clean, editorial photography aesthetic.
Extreme close-up. Fabric texture of a handmade textile. Hands slowly spread the fabric apart showing the weave pattern. Warm natural light from the side. No background music. Pure texture focus.
Medium to close-up pull. A mannequin displays a winter coat in a clean studio. Camera slowly pulls in to show stitching detail on the collar. Architectural visualization style, pure white background.
Newsletter and content creator
Screen recording aesthetic. A Beehiiv newsletter draft appears on screen, text populates line by line. Minimal desktop setup visible in background, morning light. Lo-fi music implied by ambient warmth in the frame.
Gemini Omni for content creators: specific use cases
Every competitor in this space writes for a generic "creator" or a developer. Nobody writes for the people who actually make up the majority of AI tool users right now: newsletter writers, short-form video creators, lofi channel builders, and small brand owners doing their own content.
For newsletter and Beehiiv creators. Gemini Omni can generate short 10-second clips that you embed as background visuals or header animations in your posts. A newsletter about AI trends could open each issue with a generated clip matching that week's theme, custom-produced in minutes. No stock footage subscription. No licensing fees. The SynthID watermark is invisible in video thumbnails and does not appear in text.
For lofi YouTube channels. This is the use case I find most interesting personally, because lofi content is architecturally perfect for Gemini Omni's 10-second output. Lofi YouTube is built on looping visuals. A 10-second clip of a rainy window, a spinning vinyl record, or a cartoon character at a desk loops perfectly for a 2-hour stream. You generate the clip once, loop it in your video editor, sync it to your track, done. No animator needed. Try prompt 24, 25, or 26 above.
For ecommerce and small fashion brands. Product videos are expensive to produce when you need them for every SKU. Gemini Omni lets you generate a short product visualization clip per product in roughly the time it takes to write the prompt and wait for the generation. Use prompts 4 through 6 as starting templates. The conversational editing workflow means you can adjust lighting and framing per product without starting from scratch each time.
For short-form content creators. The UGC-style prompt category (prompts 13-15) generates clips that look shot on a phone, not produced in a studio. For social content where authenticity matters more than polish, this is the format that fits. You can generate B-roll for your Reels or Shorts without needing a camera crew.
World-knowledge grounding: what it actually does
Most AI video tools treat generation as a pure visual task. You describe a scene and the model renders it. Gemini Omni does something different when world-knowledge grounding is active: it applies Gemini's training on history, physics, biology, and culture as a constraint on the generation process.
What that means practically is this. Ask it to generate a historically accurate 15th century Venetian scene, and it does not just produce something that looks vaguely old. It applies what it knows about 15th century Venice, specifically, to the architecture, the boats, the clothing, the light quality, the street layouts. Ask it to depict neurons firing accurately and it does not produce a decorative glowing-brain effect. It models the actual mechanism.
For educational content creators and explainer channels, this is the capability with the most long-term potential. You can generate scientifically accurate animations of natural processes without needing a motion graphics artist who spent three months learning what mitochondria actually look like.
One honest caveat: the output quality for world-knowledge grounding varies by subject. Physics and famous historical contexts are strong. Niche scientific subfields and lesser-documented historical periods are less reliable. Cross-check anything you plan to publish as educational content.
The quota problem: how to not burn your allowance in two clips
Real quota management, not the version from the FAQ page.
Each generation costs quota based on the complexity of the inputs you provide, not just the fact that you pressed the generate button. A text-only prompt is cheaper than a prompt with a reference image. A prompt with a reference image and an audio file costs more than either of those alone.
Two ways this plays out for different users:
If you are on AI Pro ($30/month) and doing exploratory testing with heavy multimodal inputs, your daily quota can run out in 4-6 clips. This surprises people who expected to generate 20-30 clips per day based on how other tools work.
If you are using Gemini Omni for production rather than exploration, the multi-turn editing workflow is actually a quota-saving strategy. Instead of regenerating a clip five times until it looks right, you generate once and edit twice. Three total generation events instead of five, for essentially the same creative result.
Practical session planning: on AI Pro, budget 6-8 total generation events per session (including edits). Use text-only prompts for initial ideation runs, then add reference images when you have a direction you want to refine. Save your heaviest multimodal inputs for the clips you already know you want to produce.
On AI Ultra, the budget is more comfortable for daily production workflows. At $100/month it is worth calculating whether the output replaces something you currently pay for, like stock footage, B-roll production, or a motion graphics freelancer.
Common mistakes and how to fix them
Here is the section I wish existed three weeks ago.
Mistake 1: Over-specifying the background. Gemini Omni is good at filling in environment details when given an intention. When you over-specify every element of the background, you can accidentally deprioritize your subject and the motion quality takes a hit. Give the overall atmosphere in one or two phrases. Let the model handle the details.
Mistake 2: Stacking five edits into one multi-turn prompt. The output will be different from what you built, not an iteration of it. One change per turn when you want precision. Five changes at once when you are willing to accept a different direction.
Mistake 3: Not specifying a camera type for conversational video. If you want the output to look like it was shot on a phone rather than a cinema camera, say "handheld" or "natural smartphone zoom" explicitly. The model defaults toward cinematic framing, which is sometimes the opposite of what UGC-style content needs.
Mistake 4: Expecting perfect text rendering without specifying it. Text inside AI-generated video is genuinely tricky for most models. Gemini Omni handles it better than alternatives, but only when you explicitly prompt for it in the text rendering dimension. If you want a caption or title card inside the video, specify it in dimension 6 of the prompt framework. Do not assume the model will add it.
Mistake 5: Ignoring the reference image input. Most users default to text-only prompts because that is the workflow they know from other AI tools. Gemini Omni's reference image input changes the quality ceiling on character consistency and style adherence significantly. Even a rough sketch or a screenshot from another video gives the model a visual anchor that a text description cannot fully replicate.
Build your own Gemini Omni prompts: the 6D template
Copy this and fill it in for any video you want to generate.
[Shot framing]: [close-up / medium / wide / extreme wide / POV / over-the-shoulder] [Camera motion]: [locked off / push in / pull out / orbit / dolly zoom / handheld / pan left / pan right] [Style]: [cinematic 4K / Studio Ghibli / film noir / vintage 16mm / claymation / watercolor / cyberpunk / architectural / editorial] [Lighting]: [natural daylight / warm amber lamp / single overhead softbox / backlit silhouette / fluorescent / golden hour / moonlight] [Location]: [one-phrase description of the environment] [Action]: [what happens, who does it, what moves] [Text rendering (if needed)]: [specify any text that should appear in the frame]
Example filled in:
Close-up, slow push in. Cinematic 4K. Single overhead softbox, warm side fill from the right. Marble surface in a clean studio. A hand places a glass bottle of serum down slowly. No text.
That produces a usable product video prompt in under a minute. Adjust one element at a time using the multi-turn editing workflow from there.
Gemini Omni vs Sora vs Kling vs Runway: where it fits in the wider field
Gemini Omni Flash is the only major AI video model built natively around conversational editing and mixed-input creation. That is a real architectural difference, not a marketing claim.
Sora (OpenAI) produces longer outputs and has stronger cinematic quality on complex scenes. Its main limitation for working creators is the lack of conversational editing and the input restrictions in the consumer version.
Kling AI has competitive quality in the 5-10 second range and is faster to access via API. It does not have Gemini's world-knowledge grounding or the native audio generation.
Runway is the most developed end-to-end production tool and has the strongest ecosystem for video editing workflows. It targets professional video editors more than content creators, and pricing reflects that.
For a newsletter writer, a lofi channel, or a small brand doing product content: Gemini Omni at $20-30/month is the most accessible entry point with the fastest edit iteration cycle. For a filmmaker who needs 60-second sequences with professional color and motion: Veo or Runway are still the better fit.
None of these are going to stay static. This comparison is accurate as of early June 2026. The gap between them has been narrowing at roughly 4-6 week intervals.
What Google's official guide did not tell you
Three things, specifically.
One: the 10-second cap is going to move. Every signal from the architecture papers and Google's own commentary suggests the model can sustain longer sequences. The 10-second limit is a controlled rollout decision. Plan your creative workflows around the constraint that exists now, but do not build assumptions into long-term projects that require it to stay there.
Two: the API is still not live for developers. Google announced it as "planned" at I/O 2026, no date confirmed. If you are building a product that depends on Gemini Omni API access, you are working with incomplete information about timeline and pricing. Worth tracking the developer blog directly.
Three: character consistency across clips is still the hardest problem. Single-clip quality has improved dramatically. Keeping the same character looking the same across five separate 10-second clips that tell a story is genuinely difficult with any current AI video model, including this one. The reference image workflow from MindStudio's storyboard guide is the best current approach: generate one clean character reference, attach it to every scene prompt, and use the preserve syntax in conversational editing. It helps. It does not fully solve it yet.
The bottom line
Gemini Omni Flash is the first AI video tool that actually behaves like a collaborator instead of a slot machine. You generate, you react, you adjust. That workflow change matters more than any single quality benchmark.
My honest take: the 10-second limit is frustrating for anyone trying to build longer narrative content, and quota management on the mid-range tiers requires more planning than it should. But for the use cases where it fits (product video, short-form social, lofi visuals, explainer animation, newsletter assets), nothing else has a faster iteration cycle right now.
What I am still watching: whether the API lands with pricing that makes it practical to integrate into publishing tools, or whether it stays consumer-only for the near term. That distinction determines whether this becomes a tool creators use directly, or infrastructure that tools are built on top of. Both outcomes are interesting for different reasons.
What are you actually making with it? Drop a comment. Specifically curious about anyone using it for educational content or ecommerce.
Suggested tags: Artificial Intelligence, Gemini, Video Generation, Content Creators, AI Tools
Internal link targets:
Your Google I/O 2026 coverage article (for model context)
Your AI Image Generation Guide on Gumroad (from the style section)
Any existing Lofi channel content you have published
Your Daperdash store or related ecommerce articles
Your Beehiiv newsletter signup (from the newsletter use case section)
FAQ schema block (add to page source as JSON-LD):
Q: Is Gemini Omni the same as Veo?
A: No. Gemini Omni Flash is a unified multimodal model built for mixed-input video creation and conversational editing. Veo is Google's specialized video generation model designed for longer, higher-fidelity cinematic output. They serve different use cases and different creator types.
Q: What is the maximum video length in Gemini Omni?
A: Currently 10 seconds. This is a policy cap during the initial rollout, not a model constraint. Google has indicated longer sequences may come in future updates.
Q: How many videos can I generate per day on Gemini Omni?
A: It depends on your plan and prompt complexity. AI Pro users running heavy multimodal prompts can hit quota in 4-6 generations. Text-only prompts cost less quota. Using the multi-turn editing workflow (one edit at a time) conserves quota compared to regenerating from scratch.
Q: Can Gemini Omni edit existing videos through conversation?
A: Yes. You generate a base clip, then type change instructions in natural language. The key syntax is "Change [element]. Keep everything else identical." The model preserves what you already built and adjusts only what you specified.
Q: Does Gemini Omni generate audio natively?
A: Yes. Output includes synchronized native audio (ambient sound, music, sound effects, and dialogue) generated from the same prompt. You can also provide your own audio as an input reference.
Q: Is there a Gemini Omni developer API?
A: Not yet as of June 2026. Google announced API access as planned at I/O 2026 but has not confirmed a launch date or pricing structure.
Q: Can I remove the SynthID watermark from Gemini Omni videos?
A: No. SynthID watermarking is mandatory on all outputs and non-removable. The watermark is imperceptible to human viewers but detectable by Google's verification tools.
Q: What is world-knowledge grounding in Gemini Omni?
A: It means the model applies Gemini's training on history, physics, biology, and culture as a real constraint during generation, not just a lookup. A historically accurate scene prompt produces output that reflects actual historical knowledge rather than a generic vintage aesthetic.
Q: Is Gemini Omni available outside the US?
A: Access is rolling out progressively. As of June 2026, availability through Gemini, Flow, YouTube Shorts, and YouTube Create varies by region. Check Google One eligibility in your country directly.
Q: How do I maintain character consistency across multiple Gemini Omni clips?
A: Generate a clean, neutral reference image of your character first. Attach that image as a reference input to every scene prompt. Use the preserve syntax during conversational editing ("keep the character identical"). This minimizes drift across clips though does not eliminate it entirely.

