Queryable LoRA: Instruction-Regularized Routing Over Shared Low-Rank Update Atoms Paper • 2605.08423 • Published 10 days ago • 2
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10, 2025 • 81
view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs nielsr • Apr 7 • 61
Multiplication in Multimodal LLMs: Computation with Text, Image, and Audio Inputs Paper • 2604.18203 • Published 28 days ago • 6
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 9 days ago • 148
view article Article I Built a RAG System That Listens to Live BBC News and Answers Questions About "What Happened 10 Minutes Ago" RakshitAralimatti • Dec 9, 2025 • 14
Benchmarking Debiasing Methods for LLM-based Parameter Estimates Paper • 2506.09627 • Published Jun 11, 2025 • 1
Platonic Representations for Poverty Mapping: Unified Vision-Language Codes or Agent-Induced Novelty? Paper • 2508.01109 • Published Aug 1, 2025 • 4
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30, 2025 • 49
Selecting Optimal Candidate Profiles in Adversarial Environments Using Conjoint Analysis and Machine Learning Paper • 2504.19043 • Published Apr 26, 2025 • 4
view article Article Cohere on Hugging Face Inference Providers 🔥 +5 reach-vb, burtenshaw, merve, celinah, alexrs, julien-c, sbrandeis • Apr 16, 2025 • 129