tinyBenchmarks

community
https://github.com/felipemaiapolo/tinyBenchmarks

Recent Activity

borgr submitted a paper 19 days ago
General Agent Evaluation
moonfolk authored a paper 9 months ago
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
borgr authored a paper 11 months ago
Pretraining Language Models for Diachronic Linguistic Change Discovery

Team members: Lucas Weber, Mikhail Yurochkin, Felipe Maia Polo, Leshem Choshen

models 0

None public yet

datasets 7

tinyBenchmarks/tinyMMLU

Viewer • Updated Jul 8, 2024 • 385 • 13.8k • 24

tinyBenchmarks/tinyHellaswag

Viewer • Updated May 25, 2024 • 50k • 2.49k • 5

tinyBenchmarks/tinyTruthfulQA

Preview • Updated May 25, 2024 • 1.79k • 4

tinyBenchmarks/tinyWinogrande

Preview • Updated May 25, 2024 • 2.04k • 5

tinyBenchmarks/tinyGSM8k

Preview • Updated May 25, 2024 • 6.93k • 9

tinyBenchmarks/tinyAI2_arc

Preview • Updated May 25, 2024 • 2.46k • 4

tinyBenchmarks/tinyAlpacaEval

Viewer • Updated Apr 19, 2024 • 100 • 148 • 7