Overview
I’m breaking the fourth wall right from the start here: this dataset comes from my house, and is intended to fill some specific gaps in public camera trap data that are difficult to fill from “real” camera trap data. In particular, the following things are important to developing and testing AI models for camera traps, but are largely missing from public camera trap data (at least as of December 2024):
- Images of humans (which are excluded from all public sources I’m aware of, including LILA, because consent is complicated when it comes to camera traps)
- Images from consumer-grade camera traps (because the intersection between “people who curate large datasets for public release” and “people who are content to entrust their data collection to a $50 camera from Amazon that stops working in a light fog” is… small). But these cameras are increasingly common, and they’re different (they generally provide lower image quality than research-grade cameras, and are consequently more difficult for AI).
- Video, especially intact, full-size videos (as opposed to sequences of frames sampled from videos, or videos reduced to thumbnail size) (because camera trap video is still a growing phenomenon)
So, as something between “a test data set”, “a random hobby project”, and “an attempt to make stone soup”, I’m releasing this dataset containing ~20k images (in ~6.7k sequences) and ~4.5k videos from my yard in the Seattle area. Images and videos containing any people other than me have been removed, which is why I don’t use the label “human”, I use the label “dan” (although AFAIK I am human). Audio has been removed from all the videos. The most common labels are “empty” (16,779 instances), “dan” (2,484 instances), “coyote” (1,566 instances), “squirrel” (1,340 instances), and “dog” (638 instances).
Speaking of which, is this all just a really complicated excuse to post pictures of my dog on the Internet? Maybe. Maybe it is.
Given that there is just one human in the data, this dataset is unlikely to be useful for training models for recognizing humans, but I hope it’s useful for basic infrastructure testing… i.e., does my code do something reasonable when humans are present in images/video? That kind of basic testing has historically been surprisingly difficult.
And speaking of stone soup… if we do want to get to a more diverse collection of human images, it would be great if other camera trap folks were interested in contributing! If we are the subjects, we can consent to being included, and we can help unlock this important aspect of training data that has historically been very difficult to get into the public domain (for good reason). In other words, I’d like to see this dataset grow from “Seattle-ish Camera Traps” to “Camera Trappers in Camera Traps”. If you have data that you’re interested in contributing, email me!
About dates and times in this data set
The absolute dates and times are totally meaningless in this dataset… I suspect this is also a common feature in consumer-grade camera traps, where it’s much less likely that anyone bothers to set the time/date correctly. However, the top-level “check date” folders are meaningful (so you could, for example, ask whether more animals visited my neighborhood during COVID), and the times/dates within a “check date” folder are meaningful in a relative sense, so the division of images into sequences should be reliable.
Citation, license, and contact information
For questions about this data set, contact Dan Morris.
This data set is released under the Community Data License Agreement (permissive variant).
Data format
Annotations (including species tags and unique location identifiers) are provided in COCO Camera Traps format.
For information about mapping this dataset’s categories to a common taxonomy, see this page.
Downloading the data
Metadata is available here.