ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs Paper • 2402.11764 • Published Feb 19, 2024
Where LLM Agents Fail and How They can Learn From Failures Paper • 2509.25370 • Published Sep 29, 2025 • 12
The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs Paper • 2509.03730 • Published Sep 3, 2025 • 2
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models Paper • 2409.15454 • Published Sep 23, 2024 • 2
Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs Paper • 2602.07276 • Published 7 days ago • 10
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers Paper • 2602.05115 • Published 9 days ago • 18 • 9
Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs Paper • 2602.07276 • Published 7 days ago • 10
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models Paper • 2409.15454 • Published Sep 23, 2024 • 2
The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs Paper • 2509.03730 • Published Sep 3, 2025 • 2
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers Paper • 2602.05115 • Published 9 days ago • 18 • 9
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers Paper • 2602.05115 • Published 9 days ago • 18
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published 13 days ago • 39