Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
araag2 's Collections
OpenCTEval Benchmark Datasets
Medical-LLMs
TAI-P2

OpenCTEval Benchmark Datasets

updated Jan 14

A collection that supports the development of the OpenCTEval Benchmark, a medical dataset catered towards LLM reasoning over Clinical Trial (CT) data

Upvote
1

  • araag2/MedNLI

    Viewer • Updated Jul 28, 2025 • 42.1k • 106

  • araag2/MedQA

    Viewer • Updated Jul 28, 2025 • 38.2k • 58

  • araag2/MedMCQA

    Viewer • Updated Jul 31, 2025 • 579k • 66

  • araag2/PubMedQA

    Viewer • Updated Jul 31, 2025 • 821k • 31

  • araag2/RCT_Summary

    Viewer • Updated Jul 31, 2025 • 154k • 16

  • araag2/Evidence_Inference_v2

    Viewer • Updated Nov 4, 2025 • 37.5k • 62 • 1

  • araag2/HINT

    Viewer • Updated Nov 4, 2025 • 37.4k • 25

  • araag2/Trial_Meta-Analysis

    Viewer • Updated Oct 17, 2025 • 3.5k • 107 • 1

  • araag2/TREC_Clinicial-Decision-Support

    Viewer • Updated Jul 31, 2025 • 309k • 56

  • araag2/TREC_Precision-Medicine

    Viewer • Updated Jul 31, 2025 • 121k • 19

  • araag2/TREC_Clinical-Trials

    Viewer • Updated Nov 4, 2025 • 307k • 36 • 1

  • araag2/SemEval_NLI4CT

    Viewer • Updated Jul 31, 2025 • 29.4k • 76

  • araag2/NLI4PR

    Viewer • Updated Sep 7, 2025 • 35k • 22

  • araag2/TrialPanorama

    Viewer • Updated Jan 13 • 1.23M • 84

  • araag2/TrialBench

    Viewer • Updated Jan 16 • 579k • 59
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs