Snapshot Serengeti (v1)

This data set contains 1.2M sequences of camera trap images, totaling 3.2M images, from seasons one through six of the Snapshot Serengeti project. Labels are provided for 48 animal categories, primarily at the species level (for example, the most common labels are wildebeest, zebra, and Thomson’s gazelle). Approximately 70% of images are labeled as empty. We have also added approximately 150,000 bounding box annotations to approximately 78,000 of those images.

The images and species-level labels are described in more detail in the associated manuscript:

Swanson AB, Kosmala M, Lintott CJ, Simpson RJ, Smith A, Packer C (2015) Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna. Scientific Data 2: 150026. (DOI) (bibtex)

Please cite this manuscript if you use this data set.

Annotations are provided in COCO Camera Traps .json format. Note that annotations are tied to images, but are only reliable at the sequence level. For example, there are rare sequences in which two of three images contain a lion, but the third is empty (lions, it turns out, walk away sometimes), but all three images would be annotated as “lion”.

We have also divided locations (i.e., cameras) into training and validation splits to allow for consistent benchmarking on this data set.

For questions about this data set, contact Sarah Huebner at the University of Minnesota.

This data set is released under the Community Data License Agreement (permissive variant).

The original Snapshot Serengeti data set included a “human” class label; for privacy reasons, we have removed those images from this version of the data set. Those labels are still present in the metadata. If those images are important to your work, contact us; in some cases it will be possible to release those images under an alternative license.

Additional metadata related to the aggregation of human labels into consensus labels is available in an addendum.

Data download links:

Season 1 (242GB)
Season 2 (361GB)
Season 3 (244GB)
Season 4 (368GB)
Season 5 (579GB)
Season 6 (362GB)
Metadata (1GB)
Bounding boxes (50MB)
Recommended train/val splits

Having trouble downloading? Check out our FAQ.


Posted by Dan Morris.