Building Realtime Editing on FLUX2: Inference Acceleration and Distillation with Reinforcement Learning (Preview)
A systems-and-training breakdown of how a FLUX2-based editor was pushed toward realtime interaction using cache-aware two-step inference, causal attention distillation, and reward-guided DMDR.