Caltech-UCSD Birds-200-2011

Browse

Click here to browse the dataset.

Details

Caltech-UCSD Birds-200-2011 (CUB-200-2011) is an extended version of the CUB-200 dataset, with roughly double the number of images per class and new part location annotations. For detailed information about the dataset, please see the technical report linked below.

  • Number of categories: 200

  • Number of images: 11,788

  • Annotations per image: 15 Part Locations, 312 Binary Attributes, 1 Bounding Box

Some related datasets are Caltech-256, the Oxford Flower Dataset, and Animals with Attributes. More datasets are available at the Caltech Vision Dataset Archive.

Citation

If you use CUB-200-2011 in your work, please cite the technical report:

  • Wah C., Branson S., Welinder P., Perona P., Belongie S. “The Caltech-UCSD Birds-200-2011 Dataset.” Computation & Neural Systems Technical Report, CNS-TR-2011-001. download pdf

BibTeX
@techreport{WahCUB_200_2011,
	Title = {{The Caltech-UCSD Birds-200-2011 Dataset}},
	Author = {Wah, C. and Branson, S. and Welinder, P. and Perona, P. and Belongie, S.},
	Year = {2011}
	Institution = {California Institute of Technology},
	Number = {CNS-TR-2011-001}
}

Download

You can download the dataset using the links below:

Contact

Contact Catherine Wah for questions about the dataset.

Publications

The following publications use the dataset. Please contact us if you are using the dataset, and we will add your paper to the list.

  • Wah C., Branson S., Perona P., Belongie S., “Interactive Localization and Recognition of Fine-Grained Visual Categories”, IEEE International Conference on Computer Vision (ICCV), Barcelona, 2011. pdf

  • Branson S., Perona P., Belongie S. “Strong Supervision From Weak Annotation: Interactive Training of Deformable Part Models”, IEEE International Conference on Computer Vision (ICCV), Barcelona, 2011. pdf, supplementary material

  • Branson S., Wah C., Babenko B., Schroff F., Welinder P., Perona P., Belongie S., “Visual Recognition with Humans in the Loop”, European Conference on Computer Vision (ECCV), Heraklion, Crete, Sept., 2010. pdf

  • Welinder, P., Perona, P. Online crowdsourcing: rating annotators and obtaining cost-effective labels. Workshop on Advancing Computer Vision with Humans in the Loop at CVPR. 2010. pdf