Overview Repositories 10 Projects 0 Packages 0 Stars 86

Jie Cheng CJReinforce

😀

Have a nice day!

Majoring in reinforcement learning

15 followers · 13 following

Institute of Automation, Chinese Academy of Sciences
Beijing

RLHF-Reward-Modeling Public
Forked from RLHFlow/RLHF-Reward-Modeling

Recipes for training reward models for RLHF.

Python Apache License 2.0 Updated Jan 25, 2025
ProcessBench Public
Forked from QwenLM/ProcessBench

Python Updated Jan 24, 2025
PikPakAutoOfflineDownloadBot Public

Telegram bot for automatic PikPak offline downloads, aria2 downloads, and freeing up cloud drive space

Python 454 83 Updated Dec 5, 2024
JOWA Public

Official code for the paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"

machine-learning reinforcement-learning deep-learning artificial-intelligence transformer atari few-shot

Python 16 GNU General Public License v3.0 Updated Dec 1, 2024
JOWA_agents Public

JavaScript 1 Updated Oct 21, 2024
RIME_ICML2024 Public

Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)

reinforcement-learning deep-learning robotics artificial-intelligence manipulation locomotion preference-learning

Python 27 2 MIT License Updated Oct 15, 2024
SC-Tune Public
Forked from ivattyue/SC-Tune

Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"

Python 1 MIT License Updated Apr 25, 2024
LiteLoaderQQNT-Markdown Public
Forked from Ikaleio/LiteLoaderQQNT-Markdown

Adds Markdown rendering support to QQ

JavaScript 2 Do What The F*ck You Want To Public License Updated Feb 22, 2024
CJReinforce Public

1 Updated Jan 7, 2024
RL-for-vision-navigation-in-EpMineEnv Public

RL course project: visual navigation

ASP.NET 2 Updated May 19, 2023
