google-research-datasets/sanpo_dataset

SANPO: A (S)cene Understanding, (A)ccessibility, (N)avigation, (P)athfinding, (O)bstacle Avoidance Dataset

Dataset

SANPO (Scene Understanding, Accessibility, Navigation, Pathfinding, Obstacle Avoidance) is a multi-attribute dataset of common outdoor scenes from urban, park, and suburban settings. At its core, SANPO is a video-first dataset with both real (SANPO-Real) and synthetic (SANPO-Synthetic) counterparts. The real data was collected through an extensive data collection effort. The synthetic data was curated in collaboration with our external partner, Parallel Domain.

Dataset Contents

SANPO has...

  • Human egocentric viewpoint: All data is captured from eye-level and chest-level perspectives by real-world volunteer runners.
  • Stereo video: Each camera optionally includes both left and right lenses, which may be downloaded separately.
  • Real as well as synthetic data: Parallel Domain provides 113,000 frames of synthetic data that closely match the real-world capture conditions.
  • Sparse and dense depth maps: dense depth from an ML disparity estimation method (CREStereo) and sparse depth from the ZED API.
  • Camera poses
  • Temporally consistent segmentation annotations from crowd annotators for a subset of frames
  • High-level attributes such as environment type, visibility, and motion.

Each session is a separate recording of data. A SANPO-Real session contains:

  • High-level session attributes such as environment type and visibility
  • Two stereo videos
  • Cameras' hardware information
  • IMU data
  • Two depth maps (in meters): one from the ZED cameras and one computed with the CREStereo algorithm (with respect to the left lens)
  • Optional temporally consistent panoptic segmentation annotation (with respect to the left lens)

A SANPO-Synthetic session contains:

  • One video
  • Camera's hardware information used in the simulation
  • IMU data
  • Depth map (in meters)
  • Temporally consistent panoptic segmentation annotation

All video frames are stored as PNG files. Segmentation masks are also saved as PNG files. Depth maps are NumPy arrays (saved as npz files). All other data (including the segmentation taxonomy, IMU data, and session attributes) is in either CSV or JSON files.
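As a rough illustration of working with the npz depth maps, the sketch below loads one archive and summarizes its valid pixels. The array key inside each archive and the encoding of missing depth are not documented here, so the code takes the first stored array and assumes that holes appear as zero, NaN, or infinity. The function names `load_depth_npz` and `valid_depth_stats` are hypothetical helpers, not part of any SANPO tooling.

```python
import numpy as np


def load_depth_npz(path):
    """Return the first array stored in an .npz archive as float32 (meters)."""
    with np.load(path) as archive:
        # Assumption: the key name is unknown, so use the first array in the file.
        key = archive.files[0]
        return archive[key].astype(np.float32)


def valid_depth_stats(depth):
    """Summarize pixels with finite, positive depth.

    Assumption: missing depth (e.g. in sparse maps) is stored as 0, NaN, or inf.
    """
    valid = np.isfinite(depth) & (depth > 0)
    if not valid.any():
        return {"valid_fraction": 0.0, "min_m": None, "max_m": None}
    return {
        "valid_fraction": float(valid.mean()),
        "min_m": float(depth[valid].min()),
        "max_m": float(depth[valid].max()),
    }
```

Checking the valid fraction first is a quick way to tell a sparse map from a dense one before feeding either into a training pipeline.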

Train/Test Splits

We provide lists of mutually exclusive session IDs for training and testing for both the real and synthetic counterparts of our dataset. These lists are provided in a folder named splits within the sanpo-real and sanpo-synthetic folders.
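If you build your own pipeline on top of the split lists, a quick sanity check that the two lists do not overlap can catch accidental leakage. The sketch below assumes each list is a text file with one session ID per line; the actual file layout inside the splits folder is not described here, so adjust `read_session_ids` (a hypothetical helper) to match what you find.

```python
from pathlib import Path


def read_session_ids(path):
    """Read session IDs, one per line, skipping blank lines (assumed layout)."""
    lines = Path(path).read_text().splitlines()
    return [line.strip() for line in lines if line.strip()]


def overlapping_sessions(train_ids, test_ids):
    """Return the sorted session IDs that appear in both splits."""
    return sorted(set(train_ids) & set(test_ids))
```

An empty result from `overlapping_sessions` means the two lists are mutually exclusive, as intended.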

Privacy

All data collection is done in compliance with local, state, and city laws. Every volunteer was able to review each video in the data collection app before uploading it. All videos are processed to blur personally identifiable information (PII) such as faces and license plates. If any sample is found to be inadequately processed, please contact us immediately at sanpo_dataset@google.com.

Paper

SANPO is described in our publication, arXiv preprint arXiv:2309.12172 (full citation below).

Download Data

All SANPO data can be downloaded directly from our Google Cloud Storage bucket. You can also browse through the dataset and download specific files using the gsutil cp command:

gsutil -m cp -r "gs://gresearch/sanpo_dataset/v0/{FILE_OR_DIR_PATH}" .

See the gsutil documentation for instructions on installing the gsutil tool.

Selective download

The full dataset is large (~6TB), so you may want to download only the portions you need. Each EXCLUDES line in the script below skips one portion of the data. Comment out the lines for the portions you do want to download:

#!/usr/bin/env bash
SRC=gs://gresearch/sanpo_dataset/v0

EXCLUDES=""

# Exclude frames from the right lens (stereo)
EXCLUDES=$EXCLUDES'|.*/right'
# Exclude segmentation maps
EXCLUDES=$EXCLUDES'|.*/segmentation_masks'
# Exclude SANPO-synthetic depth maps and CREStereo depth maps
EXCLUDES=$EXCLUDES'|.*/depth_maps'
# Exclude depth maps from the ZED API (sparse depth)
EXCLUDES=$EXCLUDES'|.*/zed_depth_maps'

# Exclude SANPO-Real
EXCLUDES=$EXCLUDES'|sanpo-real'
# Exclude SANPO-Synthetic
EXCLUDES=$EXCLUDES'|sanpo-synthetic'

# Quote the pattern and source so the shell does not glob-expand or split them
echo "Running: gsutil -m rsync -r -x '${EXCLUDES#|}' $SRC ."
gsutil -m rsync -r -x "${EXCLUDES#|}" "$SRC" .
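
To preview what a given set of excludes would skip before starting a multi-terabyte transfer, the pattern can be tried against a few paths in Python. This is a sketch under assumptions: it treats the `-x` pattern as a Python regular expression matched from the start of each relative path, and the example paths in the usage note are made up for illustration rather than taken from the bucket. `build_exclude_pattern` and `keep_paths` are hypothetical helper names.

```python
import re


def build_exclude_pattern(parts):
    """Join alternatives with '|', mirroring the EXCLUDES variable in the script."""
    return "|".join(parts)


def keep_paths(paths, pattern):
    """Return the paths that do not match the pattern at their start.

    An empty pattern is treated here as "exclude nothing"; that is a choice
    made for this sketch, not a statement about gsutil's behavior.
    """
    if not pattern:
        return list(paths)
    regex = re.compile(pattern)
    return [p for p in paths if not regex.match(p)]
```

For example, `keep_paths(["sanpo-real/s1/cam/left/0.png", "sanpo-real/s1/cam/right/0.png"], build_exclude_pattern([".*/right"]))` keeps only the left-lens path. Note that `.*/depth_maps` does not match a `zed_depth_maps` directory, because the pattern requires a `/` immediately before `depth_maps`.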

License & Contact

We release the SANPO dataset under the Creative Commons V4.0 license. You are free to share and adapt this data for any purpose.

If you find this dataset useful, please consider citing our paper:

@article{sanpo_dataset,
  title={{SANPO}: A Scene Understanding, Accessibility, Navigation, Pathfinding, Obstacle Avoidance Dataset},
  author={Sagar M. Waghmare and Kimberly Wilber and Dave Hawkey and Mikhail Sirotenko and Xuan Yang and Matthew Wilson and Stephanie Debats and Cattalyya Nuengsigkapian and Astuti Sharma and Lars Pandikow and Huisheng Wang and Hartwig Adam},
  journal={arXiv preprint arXiv:2309.12172},
  year={2024}
}

If you have any questions about the dataset or paper, please send us an email at sanpo_dataset@google.com.
