zd21/ReST-MCTS_SciGLM-6B_Self-Rewarding-DPO_2nd
• 1 • 14
zd21/ReST-MCTS_SciGLM-6B_ReST-MCTS_Policy_2nd
• 40.9k • 7
zd21/ReST-MCTS_SciGLM-6B_ReST-EM-CoT_2nd
• 28.9k • 8
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_Self-Rewarding-DPO_2nd
• 1 • 7
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-MCTS_2nd
• 26k • 8
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-EM-CoT_2nd
• 36.6k • 7
zd21/ReST-MCTS_Llama3-8b-Instruct_Self-Rewarding-DPO_2nd
• 1 • 7
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-MCTS_Policy_2nd
• 32.3k • 8
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-EM-CoT_2nd
• 33.2k • 7
zd21/ReST-MCTS_SciGLM-6B_Self-Rewarding-DPO_1st
• 33.5k • 9
zd21/ReST-MCTS_SciGLM-6B_ReST-MCTS_Policy_1st
• 30.1k • 5
zd21/ReST-MCTS_SciGLM-6B_ReST-EM-CoT_1st
• 55.8k • 5
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_Self-Rewarding-DPO_1st
• 1 • 8
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-MCTS_1st
• 38.7k • 6
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-EM-CoT_1st
• 74k • 6
zd21/ReST-MCTS_Llama3-8b-Instruct_Self-Rewarding-DPO_1st
• 1 • 7
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-MCTS_Policy_1st
• 33.7k • 6
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-EM-CoT_1st
• 73.1k • 5
zd21/ReST-MCTS-Llama3-8b-Instruct-Policy-1st
• 33.7k • 12
• 7
zd21/ReST-MCTS-Llama3-8b-Instruct-PRM-1st
• 673k • 36
• 9