SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement Paper • 2603.06333 • Published 6 days ago • 1
The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness Paper • 2603.09200 • Published 3 days ago • 5
Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents Paper • 2602.23556 • Published 14 days ago
Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries Paper • 2603.04413 • Published Feb 3
When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning Paper • 2603.03475 • Published 9 days ago
I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift Paper • 2603.01297 • Published 11 days ago
Dial E for Ethical Enforcement: institutional VETO power as a governance primitive Paper • 2603.00617 • Published 12 days ago
Soft Clustering Anchors for Self-Supervised Speech Representation Learning in Joint Embedding Prediction Architectures Paper • 2602.09040 • Published Jan 30
Assessing LLM Reliability on Temporally Recent Open-Domain Questions Paper • 2602.11165 • Published Jan 17
Neural FOXP2 -- Language Specific Neuron Steering for Targeted Language Improvement in LLMs Paper • 2602.00945 • Published Feb 1
Stochastic CHAOS: Why Deterministic Inference Kills, and Distributional Variability Is the Heartbeat of Artifical Cognition Paper • 2601.07239 • Published Jan 12 • 3
ECLIPTICA -- A Framework for Switchable LLM Alignment via CITA - Contrastive Instruction-Tuned Alignment Paper • 2601.06157 • Published Jan 6
SPINAL -- Scaling-law and Preference Integration in Neural Alignment Layers Paper • 2601.06238 • Published Jan 8 • 1
Catch Me If You Can: How Smaller Reasoning Models Pretend to Reason with Mathematical Fidelity Paper • 2512.00552 • Published Nov 29, 2025