Masked Visual Pre-training for Motor Control

Masked Visual Pre-training for Motor Control
This is a PyTorch implementation of the paper Masked Visual Pre-training for Motor Control. It contains the benchmark suite, pre-trained models, and the training code to reproduce the results from the paper.
Installation
Please see INSTALL.md
for installation instructions.
Pre-trained visual enocoders
We provide pre-trained visual encoders used in the paper. The models are in the same format as mae and timm:
backbone | objective | data | md5 | download |
---|---|---|---|---|
ViT-S | MAE | in-the-wild | model | |
ViT-S | MAE | ImageNet | model | |
ViT-S | Supervised | ImageNet | model |
By default,