by Michele Laurelli
A computer vision task that identifies and localizes objects within an image using bounding boxes.
Object detection combines classification and localization. Popular architectures include YOLO (real-time), Faster R-CNN (accurate), and DETR (transformer-based).
YOLO for real-time detection
Faster R-CNN for high accuracy
Pedestrian detection in autonomous driving