GRPO/PPO Finetunes for Creative Writing
DV
AI & ML interests
Post training @ https://dphn.ai
Recent Activity
updated a dataset about 16 hours ago
NewEden/RL-Seed-Mix-Iter-2 published a dataset about 16 hours ago
NewEden/RL-Seed-Mix-Iter-2 updated a dataset 3 days ago
NewEden/RL-Seed-Mix-Iter-1Organizations
models 112
Delta-Vector/Rei-24B-KTO
Text Generation • 24B • Updated • 147 • 16
Delta-Vector/Dr-House-Evals
Updated
Delta-Vector/Qwen-ckpt-100
Text Generation • Updated • 2
Delta-Vector/Austral-4.5B-Winton
Text Generation • 5B • Updated • 8 • 11
Delta-Vector/Nanuq-R1-9B
Text Generation • 11B • Updated • 4 • 4
Delta-Vector/Nanuq-R1-14B
Text Generation • 14B • Updated • 7 • 2
Delta-Vector/Austral-AFM-SFT
5B • Updated • 4
Delta-Vector/Elenchus
545k • Updated • 2
Delta-Vector/Austral-32B-GLM4-Winton
Text Generation • 33B • Updated • 5 • 8
Delta-Vector/Austral-GLM4-SFT
33B • Updated • 2
datasets 123
Delta-Vector/CAI-critic-revision-8k-cleaned-sharegpt
Viewer • Updated • 8.1k • 17
Delta-Vector/Ursa-Armored-Core-6-Lore
Viewer • Updated • 166 • 22
Delta-Vector/wordlist
Viewer • Updated • 253 • 14
Delta-Vector/Tauri-RL-Styles
Viewer • Updated • 32 • 82
Delta-Vector/Hydrus-Olmo-3-sft-dedup-ngram-filter-r1
Viewer • Updated • 1.67M • 4
Delta-Vector/Ursa-Armored-Core-Lore-Kimi
Viewer • Updated • 286 • 6
Delta-Vector/Hydrus-Hardcode-Dphn
Viewer • Updated • 220 • 18
Delta-Vector/Hydrus-Smoltalk-3-Subset-Demarkdownified
Viewer • Updated • 92.1k • 8
Delta-Vector/Hydrus-Next-Coder-Single-turn
Viewer • Updated • 17.3k • 45
Delta-Vector/Tauri-Complex-JSON-Formatting
Viewer • Updated • 8.05k • 31 • 1