arxiv:2503.05731
Satya
skrishna
AI & ML interests
Safe A(G)I
Models 41
skrishna/smolm-toxicity-classifier
Text Classification • 0.1B • Updated
• 4
skrishna/sft-ref-policy-copy
Text Generation • 0.1B • Updated
• 2
skrishna/sft-model-copy
Text Generation • 0.1B • Updated
• 1
skrishna/gpt2-toxicity-classifier
Updated
• 9
skrishna/gpt2-fineweb-soap-20250422_112211
Text Generation • 0.1B • Updated
• 2
skrishna/gpt2-fineweb-20250421_194111-64
Text Generation • 0.1B • Updated
• 3
skrishna/gpt2-fineweb
Updated
skrishna/ethicsU-llama3-8b-w2s
Updated
skrishna/ethicsU-gptxl-weak2
Updated
skrishna/ethicsU-gptxl-weak
Updated
Datasets 76
skrishna/toxigen_annotated_mod
• Updated
• 8.96k • 18
skrishna/toy-toxicity-dataset
• Updated
• 40k • 8
skrishna/toxicity-reward-dataset
• Updated
• 40k • 11
skrishna/SECURE-VOOD
• Updated
• 466 • 10
skrishna/SECURE-RERT
• Updated
• 1k • 19
skrishna/SECURE-MAET
• Updated
• 1.07k • 565
skrishna/SECURE-KCV
• Updated
• 466 • 22
skrishna/SECURE-CPST
• Updated
• 100 • 5
skrishna/SECURE-CWET
• Updated
• 965 • 23
skrishna/cti-rcm
• Updated
• 1k • 4