Benchmark and Evaluation

Humanity's Last Exam
Paper • 2501.14249 • Published • 77 upvotes

Benchmarking LLMs for Political Science: A United Nations Perspective
Paper • 2502.14122 • Published • 2 upvotes

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval
Paper • 2503.04644 • Published • 21 upvotes

ExpertGenQA: Open-ended QA generation in Specialized Domains
Paper • 2503.02948 • Published

Toward Stable and Consistent Evaluation Results: A New Methodology for Base Model Evaluation
Paper • 2503.00812 • Published

Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content
Paper • 2503.16031 • Published • 3 upvotes

JudgeLRM: Large Reasoning Models as a Judge
Paper • 2504.00050 • Published • 62 upvotes

FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents
Paper • 2504.13128 • Published • 7 upvotes

Cost-of-Pass: An Economic Framework for Evaluating Language Models
Paper • 2504.13359 • Published • 4 upvotes

Benchmarking LLMs' Swarm Intelligence
Paper • 2505.04364 • Published • 20 upvotes

A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models
Paper • 2505.07591 • Published • 11 upvotes

On Robustness and Reliability of Benchmark-Based Evaluation of LLMs
Paper • 2509.04013 • Published • 4 upvotes