This page is work in progress.

Towards Learning Monocular 3D Object Localization From 2D Labels using the Physical Laws of Motion

Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation

Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer