😀
Have a nice day!
major in reinforcement learning
-
Institute of Automation, Chinese Academy of Sciences
- Beijing
-
RLHF-Reward-Modeling Public
Forked from RLHFlow/RLHF-Reward-ModelingRecipes to train reward model for RLHF.
Python Apache License 2.0 UpdatedJan 25, 2025 -
-
PikPakAutoOfflineDownloadBot Public
自动PikPak离线下载+aria2下载+释放网盘空间的TG机器人
-
JOWA Public
Official code for the paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"
-
-
RIME_ICML2024 Public
Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)
-
SC-Tune Public
Forked from ivattyue/SC-TuneOfficial code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
-
LiteLoaderQQNT-Markdown Public
Forked from Ikaleio/LiteLoaderQQNT-Markdown为QQ添加Markdown渲染支持
-
-