A computer vision technique that computes the apparent motion of pixels between consecutive video frames, producing a dense motion field. Optical flow enables temporal understanding in video analysis pipelines for action recognition and scene dynamics.
Optical flow algorithms estimate the displacement of each pixel from one frame to the next, producing a 2D vector field where each vector indicates the direction and magnitude of motion. Classical methods use brightness constancy and smoothness constraints, while modern deep learning approaches directly predict flow fields from frame pairs using neural networks.
State-of-the-art models include RAFT (Recurrent All-pairs Field Transforms) and FlowFormer, which iteratively refine flow estimates using correlation volumes between feature maps. Output is typically a 2-channel image (horizontal and vertical displacement) at the input resolution. Optical flow is computationally intensive, with GPU processing required for real-time applications on high-resolution video.