super hub Mixed citations

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Chengyue Gong, Qiang Liu, Xingchao Liu · 2022 · cs.LG · arXiv 2209.03003

Mixed citation behavior. Most common role is background (53%).

414 Pith papers citing it

Background 53% of classified citations

open full Pith review browse 414 citing papers more from Chengyue Gong arXiv PDF

abstract

We present rectified flow, a surprisingly simple approach to learning (neural) ordinary differential equation (ODE) models to transport between two empirically observed distributions \pi_0 and \pi_1, hence providing a unified solution to generative modeling and domain transfer, among various other tasks involving distribution transport. The idea of rectified flow is to learn the ODE to follow the straight paths connecting the points drawn from \pi_0 and \pi_1 as much as possible. This is achieved by solving a straightforward nonlinear least squares optimization problem, which can be easily scaled to large models without introducing extra parameters beyond standard supervised learning. The straight paths are special and preferred because they are the shortest paths between two points, and can be simulated exactly without time discretization and hence yield computationally efficient models. We show that the procedure of learning a rectified flow from data, called rectification, turns an arbitrary coupling of \pi_0 and \pi_1 to a new deterministic coupling with provably non-increasing convex transport costs. In addition, recursively applying rectification allows us to obtain a sequence of flows with increasingly straight paths, which can be simulated accurately with coarse time discretization in the inference phase. In empirical studies, we show that rectified flow performs superbly on image generation, image-to-image translation, and domain adaptation. In particular, on image generation and translation, our method yields nearly straight flows that give high quality results even with a single Euler discretization step.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 41 method 32 baseline 1 dataset 1 other 1

citation-polarity summary

background 40 use method 32 unclear 2 baseline 1 use dataset 1

claims ledger

abstract We present rectified flow, a surprisingly simple approach to learning (neural) ordinary differential equation (ODE) models to transport between two empirically observed distributions \pi_0 and \pi_1, hence providing a unified solution to generative modeling and domain transfer, among various other tasks involving distribution transport. The idea of rectified flow is to learn the ODE to follow the straight paths connecting the points drawn from \pi_0 and \pi_1 as much as possible. This is achieved by solving a straightforward nonlinear least squares optimization problem, which can be easily sca

authors

Chengyue Gong Qiang Liu Xingchao Liu

co-cited works

representative citing papers

Lip Forcing: Few-Step Autoregressive Diffusion for Real-time Lip Synchronization

cs.CV · 2026-06-09 · conditional · novelty 8.0

Lip Forcing distills a 14B bidirectional video diffusion teacher into autoregressive students that achieve real-time lip synchronization at 31 FPS using two denoising steps without CFG.

WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling

eess.AS · 2026-06-02 · unverdicted · novelty 8.0

WavTTS is the first raw-waveform diffusion TTS model using DiT flow matching and multi-scale mel supervision that approaches SOTA latent zero-shot performance while beating prior end-to-end models.

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

cs.CV · 2026-05-13 · unverdicted · novelty 8.0

AnyFlow enables any-step video diffusion by distilling flow-map transitions over arbitrary time intervals with on-policy backward simulation.

What Time Is It? How Data Geometry Makes Time Conditioning Optional for Flow Matching

cs.LG · 2026-05-08 · unverdicted · novelty 8.0

Data geometry makes time identifiable from noisy interpolants at rate O(1/sqrt(d-k)), rendering the time-blindness gap asymptotically negligible relative to coupling variance.

Generative Modeling with Flux Matching

cs.LG · 2026-05-08 · unverdicted · novelty 8.0

Flux Matching generalizes score-based generative modeling by using a weaker objective that admits infinitely many non-conservative vector fields with the data as stationary distribution, enabling new design choices beyond traditional score matching.

Divergence is Uncertainty: A Closed-Form Posterior Covariance for Flow Matching

cs.LG · 2026-05-01 · unverdicted · novelty 8.0 · 3 refs

Derives closed-form posterior covariance for flow matching from divergence of velocity field, enabling post-hoc uncertainty on pre-trained models including one-step generators.

How to Guide Your Flow: Few-Step Alignment via Flow Map Reward Guidance

cs.LG · 2026-04-29 · unverdicted · novelty 8.0 · 3 refs

FMRG reformulates guidance as deterministic optimal control, deriving a single-trajectory method using the flow map that matches or exceeds baselines on reward-guided generation and inverse problems with 3 NFEs at text-to-image scale.

ReConText3D: Replay-based Continual Text-to-3D Generation

cs.CV · 2026-04-15 · conditional · novelty 8.0

ReConText3D is the first replay-memory framework for continual text-to-3D generation that prevents catastrophic forgetting on new textual categories while preserving quality on previously seen classes.

OP-GRPO: Efficient Off-Policy GRPO for Flow-Matching Models

cs.CV · 2026-04-05 · unverdicted · novelty 8.0

OP-GRPO is the first off-policy GRPO method for flow-matching models that reuses trajectories via replay buffer and importance sampling corrections, matching on-policy performance with 34.2% of the training steps.

Flow-GRPO: Training Flow Matching Models via Online RL

cs.CV · 2025-05-08 · unverdicted · novelty 8.0

Flow-GRPO is the first online RL method for flow matching models, raising GenEval accuracy from 63% to 95% and text-rendering accuracy from 59% to 92% with little reward hacking.

Consistency Models

cs.LG · 2023-03-02 · conditional · novelty 8.0

Consistency models achieve fast one-step generation with SOTA FID of 3.55 on CIFAR-10 and 6.20 on ImageNet 64x64 by directly mapping noise to data, outperforming prior distillation techniques.

Building Normalizing Flows with Stochastic Interpolants

cs.LG · 2022-09-30 · conditional · novelty 8.0 · 2 refs

Normalizing flows are constructed by learning the velocity of a stochastic interpolant via a quadratic loss derived from its probability current, yielding an efficient ODE-based alternative to diffusion models.

Cross-Space Distillation: Teaching One-Step Students with Modern Diffusion Teachers

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

Introduces a Bridge latent interface that maps mismatched student latents into teacher space, enabling distillation from modern diffusion teachers to compact one-step students and raising SD 1.5 HPSv3 from 5.4 to 9.4 while keeping one-step speed.

FlexiSLM: A Dynamic and Controllable Frame Rate Spoken Language Model

cs.SD · 2026-06-30 · unverdicted · novelty 7.0

FlexiSLM is the first spoken language model supporting dynamic and controllable frame rates on speech input and output, outperforming fixed-rate 7B models at high quality and enabling faster inference at lower rates like 6.25 Hz.

Panel Flow Matching: A Generative Approach to Learning Distributions of Longitudinal Data

stat.ME · 2026-06-27 · unverdicted · novelty 7.0

Panel Flow Matching is a generative method to estimate panel densities from longitudinal data with statistical guarantees under irregular sampling, supporting completion, synthetic data, and classification.

MammoFlow: Multiview Mammogram Synthesis with Anatomically Consistent Flow Matching

cs.CV · 2026-06-26 · unverdicted · novelty 7.0

MammoFlow adds geometric alignment and EMD tissue-distribution consistency to a pretrained flow-matching model to generate anatomically paired mammograms, reporting superior quality and a 5% downstream AUC gain.

TempAct: Advancing Temporal Plausibility in Autoregressive Video Generation via Planner-Executor RL

cs.CV · 2026-06-26 · unverdicted · novelty 7.0 · 2 refs

TempAct introduces a planner-executor RL framework with hierarchical group exploration and rewards to improve temporal consistency in autoregressive video diffusion models.

Parallel Rollout Approximation for Pixel-Space Autoregressive Image Generation

cs.CV · 2026-06-26 · unverdicted · novelty 7.0

PRA approximates sequential rollout training in parallel for pixel-space AR models via intermediate states and a pixel decoder, achieving FID 2.58 (135M params) and 1.94 (511M params) on ImageNet-1K 256x256, new SOTA among pixel-space AR models.

PolyFlow: Continuous Topology Embedding Flow Matching for Artist-style Mesh Generation

cs.GR · 2026-06-25 · unverdicted · novelty 7.0

PolyFlow converts discrete meshes to continuous per-vertex representations using a topology embedder and applies flow matching for parallel artist-style mesh generation that outperforms autoregressive baselines on Toys4K in Chamfer and Hausdorff distances.

Focusing on What Matters: Saliency-Harnessing Accurate Routing for Diffusion MoE

cs.CV · 2026-06-25 · unverdicted · novelty 7.0

SharpMoE is a plug-and-play post-training method that uses clean latent features and a trajectory routing loss to enable accurate saliency-based routing in diffusion MoE models for improved visual generation.

Bridging Vision and Language Concepts through Optimal Transport Semantic Flow

cs.CV · 2026-06-25 · unverdicted · novelty 7.0

OTF-CBM replaces static cosine similarity in vision-language CBMs with data-driven optimal transport flow to improve concept alignment, accuracy, and faithfulness.

Flow Annealing Posterior Sampling for Function-Space Regression and Inverse Problems

stat.ML · 2026-06-21 · unverdicted · novelty 7.0

FAPS is a new function-space posterior sampling method built on flow-matching priors that unifies stochastic-process regression and PDE inverse problems while avoiding explicit prior density evaluation.

CoDMD: Copula-aware Distribution Matching Distillation for Fast Video Generation

cs.CV · 2026-06-20 · unverdicted · novelty 7.0

CoDMD adds a copula-matching regularizer to DMD for distilling 50-step video diffusion models to 4 steps, reporting VBench scores of 84.46/84.87 on 1.3B/14B Wan-2.1-T2V models.

Intrinsic Flow Matching on Quantum Pure-State Manifolds with Phase-Aligned Transport

cs.LG · 2026-06-19 · unverdicted · novelty 7.0

IFM learns deterministic tangent velocity fields on CP^{d-1} via Pancharatnam phase-aligned paths, recovering marginal transport with endpoint and stability guarantees while showing empirical gains over Euclidean flow matching on quantum benchmarks.

citing papers explorer

Showing 9 of 9 citing papers after filters.

Flow Annealing Posterior Sampling for Function-Space Regression and Inverse Problems stat.ML · 2026-06-21 · unverdicted · none · ref 11 · internal anchor
FAPS is a new function-space posterior sampling method built on flow-matching priors that unifies stochastic-process regression and PDE inverse problems while avoiding explicit prior density evaluation.
Training-Free Generative Sampling via Moment-Matched Score Smoothing stat.ML · 2026-05-14 · unverdicted · none · ref 34 · internal anchor
MM-SOLD is a training-free particle sampler whose large-particle limit converges to a moment-matched Gibbs distribution obtained by exponentially tilting a score-smoothed target.
Is Flow Matching Just Trajectory Replay for Sequential Data? stat.ML · 2026-02-09 · unverdicted · none · ref 74 · internal anchor
Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented dynamical system sampler.
Flow-Based Conformal Predictive Distributions stat.ML · 2026-02-07 · unverdicted · none · ref 28 · internal anchor
Differentiable nonconformity scores induce flows that sample conformal prediction set boundaries, and mixing flows across levels produces conformal predictive distributions whose quantiles match the sets.
On The Hidden Biases of Flow Matching Samplers stat.ML · 2025-12-18 · unverdicted · none · ref 30 · internal anchor
Empirical flow matching introduces coupled biases from plug-in estimation, including altered statistical targets, non-gradient minimizers, and non-unique dynamics via flux-null fields, with base distribution controlling kinetic energy tails.
SURGE: Approximation and Training Free Particle Filter for Diffusion Surrogate stat.ML · 2026-05-18 · unverdicted · none · ref 15 · 2 links · internal anchor
SURGE is an unbiased particle filter that fuses diffusion-model simulations with noisy observations via sequential Monte Carlo reweighting over diffusion trajectories.
Simple Approximation and Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures stat.ML · 2026-05-18 · unverdicted · none · ref 6 · internal anchor
URGE performs unbiased inference-time scaling for diffusion models by attaching multiplicative path weights from Girsanov estimation and resampling trajectories, with a proven equivalence to prior particle-wise SMC schemes.
Conditional flow matching for physics-constrained inverse problems with finite training data stat.ML · 2026-03-14 · unverdicted · none · ref 36 · internal anchor
Conditional flow matching learns a velocity field to sample from measurement-conditioned posteriors in physics inverse problems, with early stopping to prevent variance collapse and selective memorization under finite training data.
Notes on generative modeling: flow matching, diffusion, optimal transport and Schr{\"o}dinger bridge stat.ML · 2026-06-29 · unverdicted · none · ref 19 · internal anchor
Notes recapitulating high-level principles of generative modeling and showing connections between optimal transport, Schrödinger bridge, and flow matching.

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

hub tools

citation-role summary

citation-polarity summary

claims ledger

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer