rlds_dataset_builder/stanford_mask_vit at main · s-tian/rlds_dataset_builder

README.md

This dataset accompanies the paper MaskViT: Masked Visual Pre-Training for Video Prediction.

The raw data before preprocessing can be found here: https://drive.google.com/file/d/1olPNo1p-XIcLcwbifC0Z71pQijP7eY6h/view?usp=drive_link.

At a high level, this is "RoboNet-style" data: random interaction data collected by a robot arm interacting with a bin of objects, which are swapped out periodically. The objects are mostly soft stuffed toys and plastic objects. The robot arm is controlled by a random policy with the autograsp primitive enabled. The data was collected using the Visual Foresight codebase.

The dataset was collected using a single robot, in two parts of roughly equal size. The two parts have different camera configurations and slightly different object distributions.

The dataset contains 9109 train episodes and 91 validation episodes. Each episode contains 30 steps.
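As a quick sanity check on the numbers above, the following minimal sketch computes the total number of episodes and steps from the stated split sizes. The dictionary keys and the `split_summary` helper are illustrative names, not identifiers from the dataset builder, and the split labels are not confirmed to match the builder's actual split names.

```python
# Split sizes and episode length as stated in this README.
# The keys "train" and "validation" are illustrative labels only.
EPISODES = {"train": 9109, "validation": 91}
STEPS_PER_EPISODE = 30


def split_summary(episodes, steps_per_episode):
    """Return per-split step counts plus overall totals (hypothetical helper)."""
    steps = {name: count * steps_per_episode for name, count in episodes.items()}
    return {
        "steps": steps,
        "total_episodes": sum(episodes.values()),
        "total_steps": sum(steps.values()),
    }


summary = split_summary(EPISODES, STEPS_PER_EPISODE)
print(summary)
```

Comparing these totals against the counts observed after building the dataset is one way to confirm that no episodes were dropped during preprocessing.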

Example trajectories:
