This data set contains approximately 25,000 camera trap images representing 20 species; the most common labels are red deer, mouflon, and white-tailed deer. All images within a sequence share the same species label, even if the animal does not appear in every image of that sequence. Around 900 bounding boxes are included. The sequences are very challenging, with highly cluttered and dynamic scenes. Image resolutions range from 1920 × 1080 to 2048 × 1536, and sequence lengths range from 3 to more than 300 frames.
If you use this data set, please cite the associated manuscript:
Zhang, Z., He, Z., Cao, G., & Cao, W. (2016). Animal detection from highly cluttered natural scenes using spatiotemporal object region proposals and patch verification. IEEE Transactions on Multimedia, 18(10), 2079-2092.
Annotations are provided in the whitespace-delimited format used by the authors (inside the zipfile, along with a README documenting its format), as well as in the COCO Camera Traps .json format used for most data sets on lila.science.
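As a rough illustration of working with the .json annotations, the sketch below counts images per species label, counts bounding boxes, and measures sequence lengths. It assumes the file follows the usual COCO Camera Traps layout (top-level "images", "annotations", and "categories" lists, a "seq_id" field on each image, and an optional "bbox" on each annotation); those field names, and the helper names load_annotations and summarize, are assumptions for this example and should be checked against the actual file.

```python
import json
from collections import Counter, defaultdict


def load_annotations(path):
    """Load a COCO Camera Traps style .json file into a dict."""
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


def summarize(data):
    """Return (images per label, bounding-box count, frames per sequence).

    Assumes the COCO Camera Traps field names described in the lead-in.
    """
    id_to_name = {c["id"]: c["name"] for c in data["categories"]}

    # Collect the set of labels per image, so an image carrying several
    # boxes of the same species is counted once for that species.
    labels_per_image = defaultdict(set)
    n_boxes = 0
    for ann in data["annotations"]:
        labels_per_image[ann["image_id"]].add(id_to_name[ann["category_id"]])
        if "bbox" in ann:  # only some annotations carry a bounding box
            n_boxes += 1

    label_counts = Counter(
        name for names in labels_per_image.values() for name in names
    )

    # Group images by sequence; fall back to the image id if "seq_id" is absent.
    seq_lengths = Counter(im.get("seq_id", im["id"]) for im in data["images"])
    return label_counts, n_boxes, seq_lengths
```

A typical call would be summarize(load_annotations("annotations.json")), where the path is a placeholder for wherever the downloaded .json file lives. Because labels are assigned per sequence, the per-label image counts include frames in which the animal is not visible.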
This data set is released under the Community Data License Agreement (permissive variant).