UniFlow: Zero-Shot LiDAR Scene Flow for Autonomous Driving

Siyi Li¹, Qingwen Zhang², Ishan Khatri³, Kyle Vedder¹, Deva Ramanan³, Neehar Peri³

¹University of Pennsylvania ²KTH Royal Institute of Technology ³Carnegie Mellon University

ECCV 2026 Accepted to the European Conference on Computer Vision

Paper Code

Front-center RGB images (top), LiDAR sensor positions (middle), and BEV LiDAR point clouds (bottom) for Argoverse 2, Waymo, nuScenes, and TruckScenes. All four datasets use different sensors, and collect data in different environments.

Abstract

LiDAR scene flow is the task of estimating per-point 3D motion between consecutive point clouds. Recent methods achieve centimeter-level accuracy on popular autonomous vehicle (AV) datasets, but are typically only trained and evaluated on a single sensor. In this paper, we aim to learn general motion priors that transfer to diverse and unseen LiDAR sensors.

However, prior work in LiDAR semantic segmentation and 3D object detection demonstrate that naively training on multiple datasets yields worse performance than single dataset models. Interestingly, we find that this conventional wisdom does not hold for motion estimation, and that state-of-the-art scene flow methods greatly benefit from cross-dataset training without architectural modification. We posit that low-level tasks such as motion estimation may be less sensitive to sensor configuration; indeed, our analysis shows that models trained on fast-moving objects (e.g., from highway datasets) perform well on fast-moving objects, even across different datasets.

Informed by our analysis, we propose UniFlow, a feedforward model that unifies and trains on multiple large-scale LiDAR scene flow datasets with diverse sensor placements and point cloud densities. Our frustratingly simple solution establishes a new state-of-the-art on Waymo and nuScenes, improving over prior work by 5.1% and 35.2% respectively. Moreover, UniFlow achieves state-of-the-art accuracy on unseen datasets like TruckScenes and AEVAScenes, outperforming prior dataset-specific models by 30.1% and 22.5% respectively.

Cross-Dataset Generalization Correlates with Velocity Distribution

Cross-Dataset Generalization Correlates with Velocity Distribution. The velocity distributions for the AV2, Waymo, nuScenes, and TruckScenes train sets (top). The Dynamic Mean EPE per velocity bin of Flow4D trained on AV2, Waymo, nuScenes, TruckScenes, and UniFlow (bottom). Notably, Flow4D trained on TruckScenes outperforms Flow4D trained on any other dataset for fast-moving objects (2.0, ∞) across all datasets, as TruckScenes contains the largest number of fast-moving objects.

Quantitative Results

Method	AV2	Waymo	nuScenes
NSFP	0.422	0.574	0.602
FastNSF	0.383	–	0.560
SeFlow	0.309	0.328	0.554
ICP Flow	0.331	–	–
DeFlow	0.276	–	0.314
SSF	0.181	0.264	0.220
Flow4D	0.145	0.215	0.230
ΔFlow	0.113	0.198	0.216
UniFlow
UniFlow-SSF	0.156	0.234	0.144
UniFlow-Flow4D	0.132	0.191	0.196
UniFlow-ΔFlow	0.118	0.188	0.140

In-domain performance. We compare UniFlow against recent scene flow methods on AV2, Waymo, and nuScenes using Dynamic Bucket-Normalized Mean EPE. UniFlow establishes a new state-of-the-art on Waymo and nuScenes, improving over prior work by 5.1% and 35.2% respectively.

Method	TruckScenes	AEVAScenes
NSFP	0.658	–
FastNSF	0.588	–
SeFlow	0.681	–
ICP Flow	0.472	–
DeFlow	0.570	–
SSF	0.453	0.759^ZS
Flow4D	0.456	0.433^ZS
ΔFlow	0.402	0.402^ZS
UniFlow
UniFlow-SSF	0.435^ZS	0.639^ZS
UniFlow-Flow4D	0.281^ZS	0.448^ZS
UniFlow-ΔFlow	0.101^ZS	0.344^ZS

Generalization to unseen datasets. We compare UniFlow against recent scene flow methods on TruckScenes and AEVAScenes using Dynamic Bucket-Normalized Mean EPE. UniFlow outperforms prior dataset-specific models by 30.1% on TruckScenes and 22.5% on AEVAScenes. We mark zero-shot results with ^ZS.

Qualitative Results

Zero-Shot Generalization on TruckScenes. Compared with the dataset-specific ΔFlow model, ΔFlow (UniFlow) produces more accurate motion estimates, with better robustness to rain artifacts (on the top left) and stronger generalization to rare vehicles (middle row) and long-range vehicles (bottom row).

Video

Challenging rainy sequence from TruckScenes. As shown above and in the video, ΔFlow (left) frequently produces artifacts on rain streaks and background points, which become especially pronounced during occlusions, and predicts inconsistent flow vectors on dynamic objects. In contrast, ΔFlow (UniFlow) (right) yields significantly more stable and coherent motion fields.

CVPR 2026 Challenge

We are hosting the 2026 AV2 Scene Flow Challenge to encourage broad community involvement in LiDAR scene flow across diverse autonomous-driving datasets. Participants are allowed to train their models on any publicly available datasets and will be evaluated on Argoverse 2, Waymo, TruckScenes, nuScenes, and AEVAScenes.

2026 AV2 Sceneflow Challenge Leaderboard

#	Method	Team	Mean Dynamic	AV2	Waymo	TruckScenes	nuScenes	AEVAScenes	Date
2	UniFlow-DeltaFlow	AV2 Host Team	0.2421	0.1088	0.1109	0.2408	0.1678	0.2415	2026-04-06
4	UniFlow-Flow4D	AV2 Host Team	0.2644	0.1184	0.1182	0.2485	0.2085	0.2837	2026-04-06
5	DeltaFlow-AV2	AV2 Host Team	0.2688	0.1033	0.1085	0.2769	0.1895	0.2645	2026-04-06
6	UniFlow-SSF	AV2 Host Team	0.2968	0.1367	0.1366	0.2597	0.1868	0.3686	2026-04-06
7	SSF-long-AV2	AV2 Host Team	0.4691	0.1597	0.2020	0.6409	0.4338	0.6300	2026-04-06
8	TeFlow-AV2 ^unsup.	AV2 Host Team	0.6125	0.2019	0.1979	0.3884	0.3822	0.3875	2026-04-10
9	SeFlow++-AV2 ^unsup.	AV2 Host Team	0.7025	0.2827	0.3192	0.8555	0.6965	0.8057	2026-04-10
10	VoteFlow-AV2 ^unsup.	AV2 Host Team	0.7038	0.2904	0.3067	0.8310	0.7035	0.7961	2026-04-10
11	SeFlow-AV2 ^unsup.	AV2 Host Team	0.7367	0.3113	0.3494	0.8885	0.7543	0.8412	2026-04-10

Dynamic Bucket-Normalized Mean EPE on the AV2 2026 Scene Flow Challenge leaderboard. Lower is better.

Citation

@inproceedings{li2026uniflow, title={UniFlow: Zero-Shot LiDAR Scene Flow for Autonomous Vehicles}, author={Siyi Li and Qingwen Zhang and Ishan Khatri and Kyle Vedder and Eric Eaton and Deva Ramanan and Neehar Peri}, booktitle={European Conference on Computer Vision (ECCV)}, year={2026} }