NOAA Arctic Seals 2019

Overview

This dataset consists of around 80,000 color and IR (thermal) images, associated with flights conducted in Alaska by the NOAA Alaska Fisheries Science Center in 2019. Images have been annotated with around 28,000 bounding boxes (14,000 on color images, 14,000 on thermal images) on ice seals.

Data format

Metadata are provided as a .csv file, in which each row represents a detection (a bounding box on an RGB image and the corresponding thermal image); important columns include:

  • detection_type: class associated with this bounding box, e.g. “ringed_seal”, “ringed_pup”
  • rgb_left,rgb_right,rgb_top,rgb_bottom: bounding box location in absolute (pixel) coordinates on the RGB image; the origin of the bounding box is in the upper-left of the image, so “bottom” is the smaller of the two y coordinates, but represents the logical “top” of the bounding box
  • ir_left,ir_right,ir_top,ir_bottom: bounding box location in absolute (pixel) coordinates on the IR image; the origin of the bounding box is in the upper-left of the image, so “bottom” is the smaller of the two y coordinates, but represents the logical “top” of the bounding box
  • rgb_image_path: path to the RGB image associated with this detection within the blob container linked below
  • ir_image_path: path to the IR image associated with this detection within the blob container linked below

Citation

If you use these data in a publication or report, please use the following citation:

Alaska Fisheries Science Center, 2021: A Dataset for Machine Learning Algorithm Development.

Contact information

For questions about this data set, contact Erin Moreland and Stacie Hardy at NOAA Fisheries.

License

This data set is released under the Community Data License Agreement (permissive variant).

Accessing the data

Annotations are available here:

A list of all files in the data set – including empty images with no annotations – is available here:

Images are available in the following storage containers:

gs://public-datasets-lila/noaa-kotz
https://lilablobssc.blob.core.windows.net/noaa-kotz

So, for example, the image referred to in the metadata file as:

Images/fl04/CENT/test_kotz_2019_fl04_C_20190510_000310.667291_rgb.jpg

…is available at either of the following URLs:

The full data set is around 88,000 images and 1TB, so downloading all the data is not recommended. See LILA’s direct image access guide for information about downloading individual files or mounting storage containers.

Posted by Dan Morris.