This data set contains 243,100 images from 140 camera locations in the Southwestern United States, with labels for 21 animal categories (plus empty), primarily at the species level (for example, the most common labels are opossum, raccoon, and coyote), and approximately 66,000 bounding box annotations. Approximately 70% of images are labeled as empty.
More information about this data set is available here.
Citation, license, and contact information
If you use this data set, please cite the associated manuscript:
Sara Beery, Grant Van Horn, Pietro Perona. Recognition in Terra Incognita. Proceedings of the 15th European Conference on Computer Vision (ECCV 2018). (bibtex)
This data set is released under the Community Data License Agreement (permissive variant).
For questions about this data set, contact firstname.lastname@example.org.
Annotations are provided in COCO Camera Traps .json format.
For information about mapping this dataset’s categories to a common taxonomy, see this page.
Downloading the data
Images are available in the following Azure Blob Storage folder:
…and the following Google Cloud Storage folder:
A link to a zipfile is also provided below, but – whether you want the whole data set, a specific folder, or a subset of the data (e.g. images for one species) – we recommend checking out our guidelines for accessing images without using giant zipfiles.
We have also divided locations (i.e., cameras) into training and validation splits to allow for consistent benchmarking on this data set. The file describing this split specifies a train/val split for all locations in the data set, and also provides the train/val split used in the ECCV paper listed above. The “eccv_train” split here corresponds to the “train” locations and all “cis” locations in the ECCV paper; the “eccv_val” split here corresponds to all “trans” locations in the ECCV paper.
Having trouble downloading? Check out our FAQ.
CCT20 Benchmark subset
The initial publication of this data set in Beery et al. 2018 proposed a specific subset of Caltech Camera Traps data for benchmarking. Images in this benchmark dataset (CCT20) have been downsized to a maximum of 1024 pixels on a side.
The CCT20 benchmark set is available here:
Other useful links
Information about mapping camera trap datasets to a common taxonomy is available here.
Posted by Dan Morris.