Overview
Masai giraffes are endangered from illegal killing and habitat loss. As part of the world’s largest giraffe conservation science project, the Wild Nature Institute has identified thousands of individual Masai giraffes in Tanzania by their unique and unchanging coat patterns. Photogrammetry allows researchers to also track these giraffes’ heights and sizes over time from images, by combining keypoint annotations with laser rangefinder data that indicates the distance between the camera and the giraffe.
This dataset contains 29,806 images of giraffes, annotated with keypoints representing four canonical measurement points: the top of the ossicones (horns), the top of the head, the indentation where the neck meets the chest, and the bottom front of the front hoof. Annotations were created by citizen-science volunteers on Zooniverse. Around 40,000 images were annotated in total; this dataset excludes images that were annotated by fewer than five volunteers, and also excludes a test set that can be used in the future to validate algorithms developed based on this dataset to automate keypoint identification.
Data format
Annotations are provided in a .json file, structured as:
{ "info": [metadata about the data set (version, etc.)], "annotations": [list of annotations] } annotation { "filename": str, "id": int, "collection": str (always "train"), "keypoints": { 'too': [list of keypoints], 'toh': [list of keypoints], 'ni': [list of keypoints], 'fbh': [list of keypoints] } } keypoint { "median_x": float, "median_y": float, "x": [], "y": [] }Keypoint identifiers have the following interpretations, which are also documented in info[‘tool_names’]:
- too: top of ossicones
- toh: top of head
- ni: neck indent
- fbh: front bottom hoof
x and y contain the raw annotations provided by individual annotators; median_x and median_y are the medians of those values when at least three annotations are present for a specific keypoint. When fewer than three annotations are present for a specific keypoint, median_x and median_y are empty.
Not all keypoints are visible in all images; some images may contain as few as one annotated keypoint (for example, if only the chest of the giraffe is visible).
Sample Python code to read this dataset and plot keypoints (e.g. to produce the thumbnail image at the bottom of this page) is available here.
Citation, license, and contact information
For questions about this dataset, contact Derek Lee (derek@wildnatureinstitute.org) at the Wild Nature Institute.
This dataset is released under the Community Data License Agreement (permissive variant).
Downloading the data
Download images from GCP (196GB)
Download annotations from GCP (8MB)
Download images from AWS (196GB)
Download annotations from AWS (8MB)
Download images from Azure (196GB)
Download annotations from Azure (8MB)
Having trouble downloading? Check out our FAQ.
Posted by Dan Morris.