Jonathan H. Parker
reply-guy
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
upvoted a paper 9 days ago
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs
Organizations
None yet