From e090a5267ff9520d68ec028e6c263038e396b915 Mon Sep 17 00:00:00 2001 From: zitiangao Date: Sun, 3 Nov 2024 19:54:56 +1100 Subject: [PATCH] add SC-MCTS* --- README.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/README.md b/README.md index 979ce7e..4163e9a 100644 --- a/README.md +++ b/README.md @@ -184,6 +184,9 @@ Also check out 🔤 Reasoning in Large Language Models - An Emergent Ability ### 2024 +1. **[Interpretable Contrastive Monte Carlo Tree Search Reasoning.](https://arxiv.org/abs/2410.01707)** + + *Zitian Gao, Boye Niu, Xuzheng He, Haotian Xu, Hongzhang Liu, Aiwei Liu, Xuming Hu, Lijie Wen.* Preprint'24 1. **[Training Language Models to Self-Correct via Reinforcement Learning.](https://arxiv.org/abs/2409.12917)**