-
-
-
-
-
-
Inference Providers
Active filters: cpo
NBA55/Experiment_with_trained_model_Final_CPO_for_all_3_issues-epoch-2
Updated
smohammadi/llama2-lora-aligned-cpo
Updated
NBA55/Final_Experiment_with_trained_model_Final_CPO_for_all_3_issues-epoch-2
Updated
Text Generation
• 0.1B • Updated
• 2
ravithejads/test_model_sft
Text Generation
• 0.1B • Updated
maxmyn/c4ai-takehome-model-simpo
Text Generation
• 0.1B • Updated
• 3
Text Generation
• 0.1B • Updated
• 1
Text Generation
• 0.1B • Updated
• 2
CharlesLi/OpenELM-1_1B-SimPO
Text Generation
• 1B • Updated
• 1
CharlesLi/OpenELM-1_1B-CPO
Text Generation
• 1B • Updated
• 1
NBA55/CPO_with_baseline_modalh
Text Generation
• 7B • Updated
• 1
NBA55/CPO_with_trained_model_for_all_3_issues-epoch-2
Updated
rawsh/mirrorqwen2.5-0.5b-SimPO
Text Generation
• 0.5B • Updated
Text Generation
• 0.5B • Updated
• 4
rawsh/mirrorqwen2.5-0.5b-SimPO-0
Text Generation
• 0.5B • Updated
• 3
mradermacher/mirrorqwen2.5-0.5b-SimPO-GGUF
0.5B • Updated
• 119
mradermacher/mirrorqwen2.5-0.5b-SimPO-0-GGUF
0.5B • Updated
• 76
rawsh/mirrorqwen2.5-0.5b-SimPO-1
Text Generation
• 0.5B • Updated
• 7
rawsh/mirrorqwen2.5-0.5b-SimPO-2
Text Generation
• 0.5B • Updated
• 2
rawsh/mirrorqwen2.5-0.5b-SimPO-3
Text Generation
• 0.5B • Updated
• 5
mradermacher/mirrorqwen2.5-0.5b-SimPO-1-GGUF
0.5B • Updated
• 180
mradermacher/mirrorqwen2.5-0.5b-SimPO-2-GGUF
0.5B • Updated
• 228
mradermacher/mirrorqwen2.5-0.5b-SimPO-3-GGUF
0.5B • Updated
• 42
Aratako/Llama-Gemma-2-27b-CPO_SimPO-iter1
Text Generation
• 27B • Updated
• 10
• 1
Aratako/Llama-Gemma-2-27b-CPO_SimPO-iter2
Text Generation
• 27B • Updated
• 8
• 1
Aratako/gemma-2-2b-axolotl-simpo-v1.0
Text Generation
• Updated
• 3
Aratako/gemma-2-2b-axolotl-simpo-v1.0-merged
Text Generation
• Updated
• 6
• 1
mradermacher/gemma-2-2b-axolotl-simpo-v1.0-merged-GGUF
3B • Updated
• 54
mjhamar/Meta-Llama-3.1-8B-Instruct-cpo-beir
Text Generation
• 8B • Updated
• 1