Zhan Su
zhan1993
AI & ML interests
None yet
Organizations
None yet
models 76
zhan1993/library-mistral7B_flan_5ep_higher_lr
Updated
zhan1993/BeaverTails_filtered_train_experts
Updated
zhan1993/shared_experts_trained_from_lora_soup_llama
Updated
zhan1993/phi-3-10-clusters-Spectral-merge
Text Generation • 4B • Updated
zhan1993/gpt-neo-125m_merged_lora_merge
Text Generation • 0.1B • Updated
zhan1993/mathqa_trained_from_lorasoup
Updated
zhan1993/code_trained_from_lorasoup
Updated • 2
zhan1993/gptneo_1B_flan_10_experts-epoch_2
Updated
zhan1993/library-phi_2-v3-10-flan-clusters
Updated
zhan1993/trained_gpt125m_experts_colab
Updated
datasets 35
zhan1993/task_adapter_dataset
Viewer • Updated • 3.13k • 9
zhan1993/BeaverTails_filtered_safe
Viewer • Updated • 16k • 12
zhan1993/coconot_experts_train
Viewer • Updated • 12.5k • 10
zhan1993/BeaverTails_filtered_train
Viewer • Updated • 185k • 10
zhan1993/BeaverTails_filtered_test
Viewer • Updated • 18.7k • 9
zhan1993/gsm-8k-perturb
Viewer • Updated • 1.32k • 4
zhan1993/mathqa_lora_soup_repo
Viewer • Updated • 395k • 7
zhan1993/coconot_original_train_routing
Viewer • Updated • 500 • 6
zhan1993/coconot_contrast_eval
Viewer • Updated • 379 • 6
zhan1993/coconot_original_eval
Viewer • Updated • 1k • 6