by Michele Laurelli
Large-scale dataset for object detection, segmentation, and captioning with 330k images.
Contains 80 object categories with instance segmentation masks, plus image captions and keypoint annotations. A standard benchmark for detection and segmentation.
Object detection benchmark
Instance segmentation
Image captioning
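
To make the annotation types mentioned above more concrete, here is a minimal sketch of how per-instance annotations might be read. It assumes a COCO-style JSON layout with `images`, `categories`, and `annotations` lists and `[x, y, width, height]` boxes. This is an assumption, since the entry itself does not specify a file format. The sample record, file name, and helper names (`count_instances_per_category`, `bbox_area`) are hypothetical and only for illustration.

```python
from collections import Counter

# Tiny hand-written sample (not real data). Field names follow the
# COCO-style layout assumed in the lead-in.
sample = {
    "images": [
        {"id": 1, "file_name": "example_1.jpg", "width": 640, "height": 480},
    ],
    "categories": [
        {"id": 1, "name": "person"},
        {"id": 2, "name": "dog"},
    ],
    "annotations": [
        {
            "id": 10,
            "image_id": 1,
            "category_id": 1,
            "bbox": [10.0, 20.0, 100.0, 200.0],  # assumed [x, y, width, height]
            # Polygon mask as a flat list of x, y coordinates.
            "segmentation": [[10.0, 20.0, 110.0, 20.0, 110.0, 220.0, 10.0, 220.0]],
        },
        {
            "id": 11,
            "image_id": 1,
            "category_id": 1,
            "bbox": [300.0, 50.0, 80.0, 160.0],
            "segmentation": [[300.0, 50.0, 380.0, 50.0, 380.0, 210.0, 300.0, 210.0]],
        },
        {
            "id": 12,
            "image_id": 1,
            "category_id": 2,
            "bbox": [200.0, 300.0, 120.0, 90.0],
            "segmentation": [[200.0, 300.0, 320.0, 300.0, 320.0, 390.0, 200.0, 390.0]],
        },
    ],
}


def count_instances_per_category(dataset):
    """Return a {category name: number of annotated instances} mapping."""
    id_to_name = {c["id"]: c["name"] for c in dataset["categories"]}
    counts = Counter(id_to_name[a["category_id"]] for a in dataset["annotations"])
    return dict(counts)


def bbox_area(bbox):
    """Area of a box given as [x, y, width, height]."""
    _x, _y, width, height = bbox
    return width * height


if __name__ == "__main__":
    print(count_instances_per_category(sample))
    print(bbox_area(sample["annotations"][0]["bbox"]))
```

In practice the same functions could be applied to a dictionary loaded with `json.load` from an annotation file, provided the file actually follows this layout.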