Satya

skrishna

·

https://satyapriyakrishna.com/

AI & ML interests

Safe A(G)I

Recent Activity

upvoted a paper 5 days ago

A Threshold Exceedance Framework for CBRN Uplift Evaluation in Frontier Language Models

liked a model 7 months ago

liked a dataset 7 months ago

hf-internal-testing/dailytalk-dummy

View all activity

Organizations

Papers 14

arxiv:2503.05731

arxiv:2410.12491

arxiv:2409.12941

arxiv:2407.14937

models 41

skrishna/smolm-toxicity-classifier

Text Classification • 0.1B • Updated Aug 15, 2025 • 3

skrishna/sft-ref-policy-copy

Text Generation • 0.1B • Updated Jun 18, 2025 • 6

skrishna/sft-model-copy

Text Generation • 0.1B • Updated Jun 18, 2025 • 6

skrishna/gpt2-toxicity-classifier

Updated May 16, 2025

skrishna/gpt2-fineweb-soap-20250422_112211

Text Generation • 0.1B • Updated Apr 22, 2025 • 6

skrishna/gpt2-fineweb-20250421_194111-64

Text Generation • 0.1B • Updated Apr 21, 2025 • 7

skrishna/gpt2-fineweb

Updated Apr 21, 2025

skrishna/ethicsU-llama3-8b-w2s

Updated Nov 8, 2024

skrishna/ethicsU-gptxl-weak2

Updated Nov 8, 2024

skrishna/ethicsU-gptxl-weak

Updated Nov 8, 2024

datasets 76

skrishna/toxigen_annotated_mod

Viewer • Updated May 25, 2025 • 8.96k • 11

skrishna/toy-toxicity-dataset

Viewer • Updated May 22, 2025 • 40k • 10

skrishna/toxicity-reward-dataset

Viewer • Updated May 16, 2025 • 40k • 19

skrishna/SECURE-VOOD

Viewer • Updated Mar 27, 2025 • 466 • 14

skrishna/SECURE-RERT

Viewer • Updated Mar 27, 2025 • 1k • 7

skrishna/SECURE-MAET

Viewer • Updated Mar 27, 2025 • 1.07k • 23

skrishna/SECURE-KCV

Viewer • Updated Mar 27, 2025 • 466 • 22

skrishna/SECURE-CPST

Viewer • Updated Mar 27, 2025 • 100 • 9

skrishna/SECURE-CWET

Viewer • Updated Mar 27, 2025 • 965 • 27

skrishna/cti-rcm

Viewer • Updated Mar 25, 2025 • 1k • 17

View 76 datasets