Prompting Decisicion Transformer for Few-Shot Policy Generalization

Official code repository for Prompt-DT. [website][paper]

Prompt-DT Architecture:

Installation

We tested the code in Ubuntu 20.04.

We recommend using Anaconda to create a virtual environment.

conda create --name prompt-dt python=3.8.5 conda activate prompt-dt

Our experiments require MuJoCo as well as mujoco-py. Install them by following the instructions in the mujoco-py repo.
Install environments and dependencies with the following commands:

# install dependencies pip install -r requirements.txt # install environments ./install_envs.sh

We log experiments with wandb. Check out the wandb quickstart doc to create an account.

Download Datasets

We share example datasets via this Google Drive link.
Download the "data" folder.

wget -O data.zip 'https://drive.google.com/uc?export=download&id=1rZufm-XRq1Ig-56DejkQUX1si_WzCGBe&confirm=True' unzip data.zip rm data.zip

Organize folders as follows.

. ├── config ├── data │ ├── ant_dir │ ├── cheetah_dir │ ├── cheetah_vel │ └── ML1-pick-place-v2 ├── envs ├── prompt_dt └── ...

Run Experiments

# Prompt-DT python pdt_main.py --env cheetah_dir # choices:['cheetah_dir', 'cheetah_vel', 'ant_dir', 'ML1-pick-place-v2'] # Prompt-MT-BC python pdt_main.py --no-rtg --no-r # MT-ORL python pdt_main.py --no-prompt # MT-BC-Finetune python pdt_main.py --no-prompt --no-rtg --no-r --finetune

Acknowledgements

The code for prompt-dt is based on decision-transformer. We build environments based on repos including macaw, rand_param_envs, and metaworld.

References

If you find our code helpful for your research, please consider citing the paper!

@inproceedings{xu2022prompting, title={Prompting Decision Transformer for Few-Shot Policy Generalization}, author={Xu, Mengdi and Shen, Yikang and Zhang, Shun and Lu, Yuchen and Zhao, Ding and Tenenbaum, Joshua and Gan, Chuang}, booktitle={International Conference on Machine Learning}, pages={24631--24645}, year={2022}, organization={PMLR} }

Contributions

Suggestions for enhancing and improving the code are welcome. Please email mengdixu@andrew.cmu.edu with comments and suggestions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prompting Decisicion Transformer for Few-Shot Policy Generalization

Installation

Download Datasets

Run Experiments

Acknowledgements

References

Contributions

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
config		config
envs		envs
prompt_dt		prompt_dt
.gitignore		.gitignore
README.md		README.md
install_envs.sh		install_envs.sh
pdt_main.py		pdt_main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Prompting Decisicion Transformer for Few-Shot Policy Generalization

Installation

Download Datasets

Run Experiments

Acknowledgements

References

Contributions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages