ViTPose - Human Pose Estimation

Upload an image to detect human poses and visualize the skeleton overlay.

Model: ViTPose-base-simple (Xu et al., 2022) with COCO 17 keypoints.


Note: This demo uses the full image as a bounding box for single-person pose estimation. For multi-person scenarios, an object detector (e.g., RT-DETR) would be used upstream.