DCoT - a haritzpuerto Collection

DCoT

updated Jun 10, 2025

Models from the ACL 2025 paper "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"

Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models

Paper • 2407.03181 • Published Jul 3, 2024 • 1

haritzpuerto/LLaMA2-7B-dcot

Text Generation • Updated Jul 16, 2024 • 7 • 2

haritzpuerto/LLaMA2-13B-dcot

Text Generation • Updated Jul 16, 2024

haritzpuerto/LLaMA2-70B-dcot

Text Generation • Updated Jul 16, 2024 • 1

haritzpuerto/phi-2-dcot

Text Generation • Updated Jul 16, 2024 • 4 • 1

haritzpuerto/phi-1.5-dcot

Text Generation • Updated Jul 16, 2024