Skip to content

wadx2019/qvpo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

[NeurIPS 2024] Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

Code release for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization (NeurIPS 2024).

[paper] [project page]

Requirements

Installations of PyTorch and MuJoCo are needed. A suitable conda environment named qvpo can be created and activated with:

conda create qvpo
conda activate qvpo

To get started, install the additionally required python packages into you environment.

pip install -r requirements.txt

Running

Running experiments based our code could be quite easy, so below we use HalfCheetah-v3 task as an example.

python main.py --env_name HalfCheetah-v3--weighted --aug

Citation

If you find this repository useful in your research, please consider citing:

@inproceedings{
ding2024diffusionbased,
title={Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization},
author={Shutong Ding and Ke Hu and Zhenhao Zhang and Kan Ren and Weinan Zhang and Jingyi Yu, Jingya Wang and Ye Shi},
booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
year={2024},
url={https://arxiv.org/abs/2405.16173}
}

Acknowledgement

The code of QVPO is based on the implementation of DIPO.

About

official implementation of QVPO

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages