
Pull Request: Addition of "general-preference" Branch for General Preference Model (GPM) #201

Open · wants to merge 8 commits into base: main

Conversation

kirigayahitsugi

Dear RewardBench Team,

I am submitting a Pull Request to introduce a new branch named "general-preference" to support the General Preference Model (GPM). The following modifications have been implemented:

  1. Added a new pipeline, GPMPipeline.
  2. Updated run_rm.py to accommodate our model's requirements.
  3. Included our model in the REWARD_MODEL_CONFIG.
  4. Added a script named run_rm_rewardbench.sh.
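
For illustration, the config registration (item 3) might look roughly like this. The key names below are assumptions for the sketch, not the actual rewardbench schema:

```python
# Hypothetical sketch of registering the GPM models in REWARD_MODEL_CONFIG;
# key names are illustrative, not the real rewardbench schema.
REWARD_MODEL_CONFIG = {
    "general-preference/GPM-Llama-3.1-8B": {
        "pipeline_builder": "GPMPipeline",  # the new pipeline added in this PR
        "quantized": False,
        "custom_dialogue": False,
    },
}

# Looking up a model name resolves to its pipeline and options.
config = REWARD_MODEL_CONFIG["general-preference/GPM-Llama-3.1-8B"]
```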

Thank you for considering this addition!

Best regards,
Grace Zhang

natolambert (Collaborator) left a comment

Hey @kirigayahitsugi -- sorry for the delay. In general this is great. I'm wondering how we can minimize modifications to the script. Right now, some of the modifications affect all models (e.g. truncation) instead of just the new model types. I am wondering if we can do this a bit more cleanly.

The same goes for the custom classifier section, which adds a lot of code.

Lastly, it seems like we want to remove the extra bash script you accidentally checked in?

@@ -89,6 +89,13 @@ def get_args():
choices=["eager", "sdpa", "flash_attention_2"],
help="Attention implementation to use (default: None)",
)
#### custom arguments added for general-preference/GPM-Llama-3.1-8B and general-preference/GPM-Gemma-2B
natolambert (Collaborator):

I'd like to not add any args if possible, if just for one model. Given we already have a config structure.
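
One way to do this (a minimal sketch; the function and option names here are hypothetical, not the existing rewardbench API) is to resolve per-model options from the config structure rather than from new CLI arguments:

```python
# Sketch: pull model-specific options from the existing config structure
# instead of adding new CLI flags; all names here are hypothetical.
DEFAULT_OPTIONS = {"truncation_side": None, "padding_side": None}

def resolve_model_options(model_name, reward_model_config):
    """Merge a model's config overrides over the shared defaults."""
    options = dict(DEFAULT_OPTIONS)
    options.update(reward_model_config.get(model_name, {}))
    return options

# Only the GPM entry opts into non-default tokenizer sides; other
# models fall through to the defaults untouched.
config = {
    "general-preference/GPM-Gemma-2B": {
        "truncation_side": "right",
        "padding_side": "left",
    },
}
gpm_opts = resolve_model_options("general-preference/GPM-Gemma-2B", config)
other_opts = resolve_model_options("some/other-model", config)
```

This keeps `get_args()` unchanged and scopes the new behavior to the models that declare it.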

Comment on lines 137 to 138
# #### added for general-preference/GPM-Llama-3.1-8B and general-preference/GPM-Gemma-2B for debugging
# config = REWARD_MODEL_CONFIG["general-preference/GPM-Gemma-2B"]
natolambert (Collaborator):

remove

Comment on lines 188 to 190
#### added for general-preference/GPM-Llama-3.1-8B and general-preference/GPM-Gemma-2B
tokenizer.truncation_side = "right"
tokenizer.padding_side = "left"
natolambert (Collaborator):

See above comment
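
One way to realize that suggestion (a minimal sketch with hypothetical config keys and a stand-in tokenizer, not the actual rewardbench code) is to gate the tokenizer settings behind the model's config entry:

```python
# Sketch: apply tokenizer-side overrides only when the model's config
# requests them, so all other models keep their defaults.
class FakeTokenizer:
    """Stand-in for a Hugging Face tokenizer's side attributes."""
    truncation_side = "left"
    padding_side = "right"

def apply_tokenizer_overrides(tokenizer, model_config):
    # Touch the tokenizer only for models that declare overrides.
    if model_config.get("truncation_side"):
        tokenizer.truncation_side = model_config["truncation_side"]
    if model_config.get("padding_side"):
        tokenizer.padding_side = model_config["padding_side"]
    return tokenizer

gpm_tok = apply_tokenizer_overrides(
    FakeTokenizer(), {"truncation_side": "right", "padding_side": "left"}
)
default_tok = apply_tokenizer_overrides(FakeTokenizer(), {})
```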
