Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models
Paper
• 2407.03181 • Published
Models from the ACL 2025 paper "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs".