Caltech Camera Traps

This data set contains 244,497 images from 140 camera locations in the Southwestern United States, with labels for 22 animal categories, primarily at the species level (for example, the most common labels are opossum, raccoon, and coyote), and approximately 66,000 bounding box annotations. Approximately 70% of images are labeled as empty.

More information about this data set is available here.

If you use this data set, please cite the associated manuscript:

Sara Beery, Grant Van Horn, Pietro Perona. Recognition in Terra Incognita. Proceedings of the 15th European Conference on Computer Vision (ECCV 2018). (bibtex)

Annotations are provided in the .json format used by the COCO data set.

This data set is released under the Community Data License Agreement (permissive variant).

For questions about this data set, contact

Download links:

Images (104GB)
Metadata (100MB)
Bounding boxes (30MB)

Having trouble downloading? Check out our FAQ.