Mtlbiohacker (Maxime cote)

liked a model 9 months ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29, 2025 • 646k • 2.4k

liked a dataset 9 months ago

cais/hle

Benchmark • Updated 28 days ago • 2.5k • 29.4k • 706

liked a Space 9 months ago

Hallucinations Leaderboard

🔥

145

View and submit LLM evaluations

liked a model 9 months ago

cais/zephyr_7b_r2d2

Text Generation • 7B • Updated Feb 26, 2024 • 152 • 4

liked a Space 9 months ago

Emma R1

🚀

1

MAI-DS-R1 finetuned by @mtlbiohacker

liked a model 9 months ago

Qwen/Qwen3-235B-A22B

Text Generation • Updated Jul 26, 2025 • 480k • 1.08k

liked a dataset 9 months ago

agentica-org/DeepCoder-Preview-Dataset

Viewer • Updated Apr 9, 2025 • 25k • 3.23k • 97

liked a Space 9 months ago

Rabbits Leaderboard

💊

20

Visualize and analyze language model robustness to drug name synonyms

liked 2 datasets 10 months ago

google/bigbench

Updated Jan 18, 2024 • 536 • 66

google/IFEval

Viewer • Updated Aug 14, 2024 • 541 • 60.4k • 131

liked 2 models 10 months ago

Mtlbiohacker/Emma_R1

Text Generation • Updated May 14, 2025 • 1

microsoft/Phi-4-reasoning-plus

Text Generation • Updated Nov 24, 2025 • 225k • 334

liked a dataset 10 months ago

spawn99/GPQA-diamond-ClaudeR1

Viewer • Updated Jan 25, 2025 • 198 • 241 • 7

liked a model 10 months ago

microsoft/MAI-DS-R1

Text Generation • Updated Dec 15, 2025 • 106 • 294

liked a dataset 10 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8, 2025 • 3.91M • 2.18k • 643

liked a Space 11 months ago

Leaderboard / SeaEval

🥇

9

Explore NLP leaderboard metrics

liked a model 11 months ago

aaditya/Llama3-OpenBioLLM-70B

Text Generation • Updated Jan 18, 2025 • 2.83k • 497

liked 2 Spaces 11 months ago

Open LLM Leaderboard

🏆

13.9k

Track, rank and evaluate open LLMs and chatbots

MMLU-Pro Leaderboard

🥇

241

More advanced and challenging multi-task evaluation

liked a model 11 months ago

tencent/Tencent-Hunyuan-Large

Text Generation • Updated Jan 19, 2025 • 56 • 618
