When Metrics Disagree: Automatic Similarity vs. LLM-as-a-Judge for Clinical Dialogue Evaluation Paper • 2603.00314 • Published 26 days ago • 1
Layer-wise dynamic rank for compressing large language models Paper • 2509.25622 • Published Oct 4, 2025 • 1