FREE-Edit: Using Editing-aware Injection in Rectified Flow Models for Zero-shot Image-Driven Video Editing

FREE-Edit is a zero-shot image-driven video editing framework based on rectified flow models that utilizes an Editing-aware (REE) injection method to dynamically modulate attention injection intensity according to an optical flow-warped editing mask, thereby avoiding semantic conflicts and producing higher-quality results without fine-tuning.

Maomao Li, Yunfei Liu, Yu Li

Published 2026-03-03
📖 4 min read☕ Coffee break read

Imagine you have a home video of your family walking down a street. You want to edit the very first frame: maybe you want to swap your dad's boring gray jacket for a flashy superhero cape, or replace a boring gray car with a bright red sports car.

The goal of FREE-Edit is to take that single edited first frame and magically apply that change to the rest of the video, while keeping everything else (like the people walking, the cars moving, and the background scenery) exactly the same.

Here is the problem the paper solves, explained simply:

The Problem: The "Copy-Paste" Glitch

Existing video editing tools try to do this by looking at the original video and the new edited frame, then trying to blend them together. But they often use a "one-size-fits-all" approach.

Think of it like a bad photocopier.

  • If you tell the photocopier to "copy the whole page," it copies the superhero cape and the background, which ruins the video because the background shouldn't change.
  • If you tell it to "don't copy anything," the video loses its natural movement, and the people look like they are floating on a static image.

The old methods (called "vanilla injection") were like a photocopier that couldn't tell the difference between the part you wanted to change (the cape) and the part you wanted to keep (the street). This resulted in weird glitches where the superhero cape might suddenly disappear, or the street might start morphing into something else.

The Solution: The "Smart Mask" (REE Injection)

The authors of this paper created a new method called FREE-Edit (which stands for Free Rectified Edit, but you can think of it as Free Real Editing).

They introduced a clever trick called REE (Editing-awaRE) Injection. Here is how it works using a creative analogy:

1. The "Edit Map" (The Mask)

First, the computer looks at the original first frame and your new edited first frame. It draws a digital "mask" (like a stencil) over the parts that changed.

  • Example: The mask covers the jacket but leaves the face and the background alone.

2. The "Flow Tracker" (Optical Flow)

The video is moving! The jacket moves as the person walks. The computer uses a technology called Optical Flow (which is like a super-smart tracker) to follow that mask through the entire video.

  • Analogy: Imagine painting a red dot on a moving car. As the car drives, the red dot moves with it. The computer tracks that "red dot" (the edited area) frame by frame so it knows exactly where the edit is at any given second.

3. The "Smart Switch" (Modulation Weight)

This is the magic part. The computer creates a switch for every single pixel in the video.

  • If the pixel is in the "Edit Zone" (the jacket): The switch is turned OFF. The computer says, "Do not touch this! Keep the new superhero cape exactly as you drew it."
  • If the pixel is in the "Safe Zone" (the street): The switch is turned ON. The computer says, "Copy the movement and texture from the original video so the street looks natural."

Why is this better?

  • Old Way: Like trying to paint over a moving wall with a roller that covers everything. You either mess up the wall or the paint doesn't stick.
  • FREE-Edit: Like having a team of artists where one person holds a stencil perfectly over the moving wall, and another person paints only the exposed parts. The result is perfect: the wall moves naturally, but the new paint stays exactly where you put it.

The Result

The paper shows that this method works without needing to retrain the AI (it's "zero-shot," meaning it's ready to use immediately).

  • Motion: The people and cars move naturally because the computer didn't mess with the background.
  • Consistency: The superhero cape stays on the person the whole time and doesn't glitch out.
  • Speed: Because it uses a fast model, it can edit videos in real-time.

In a nutshell: FREE-Edit is a smart video editor that knows exactly what to change and what to leave alone, using a digital tracking system to ensure your edits look natural and stay put, even when the video is moving.