·
AI & ML interests
None yet
Organizations
models
14
wxzhang/dpo-selective-buffer-spo-shift
Text Generation
•
7B
•
Updated
•
149
wxzhang/dpo-selective-redteaming
Text Generation
•
7B
•
Updated
•
22
wxzhang/dpo-selective-buffer-safeipo
Text Generation
•
7B
•
Updated
•
54
wxzhang/dpo-selective-alpaca
Text Generation
•
7B
•
Updated
•
12
wxzhang/dpo-selective-bufferdata
Text Generation
•
Updated
•
24
wxzhang/dpo-selective-longerrun
Text Generation
•
7B
•
Updated
•
134
wxzhang/dpo-selective-mixdata
Text Generation
•
7B
•
Updated
•
9
wxzhang/zephyr-7b-dpo-full
Text Generation
•
7B
•
Updated
•
59
wxzhang/selective-pairrm-33079692-mt2
Text Generation
•
7B
•
Updated
•
9
wxzhang/selective-pairrm-33076849-mt1
Text Generation
•
7B
•
Updated
•
12