Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published 5 days ago • 12
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published 4 days ago • 46
EEG Foundation Models: Progresses, Benchmarking, and Open Problems Paper • 2601.17883 • Published 9 days ago • 19
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published 5 days ago • 46
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published 5 days ago • 14
CGPT: Cluster-Guided Partial Tables with LLM-Generated Supervision for Table Retrieval Paper • 2601.15849 • Published 12 days ago • 14
AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking Paper • 2601.17645 • Published 9 days ago • 22
Linear representations in language models can change dramatically over a conversation Paper • 2601.20834 • Published 6 days ago • 21
view article Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 14 days ago • 34
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models Paper • 2601.10387 • Published 19 days ago • 12
LucaOne Collection Generalized biological foundation model with unified nucleic acid and protein language(Nature Machine Intelligence),https://github.com/LucaOne/LucaOne • 6 items • Updated Dec 31, 2025 • 2
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 29 days ago • 37
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published Dec 23, 2025 • 18
Are We on the Right Way to Assessing LLM-as-a-Judge? Paper • 2512.16041 • Published Dec 17, 2025 • 34
Hierarchical Dataset Selection for High-Quality Data Sharing Paper • 2512.10952 • Published Dec 11, 2025 • 2
Causal Judge Evaluation: Calibrated Surrogate Metrics for LLM Systems Paper • 2512.11150 • Published Dec 11, 2025 • 6