arxiv:2602.16165
Jon Peng
jonp07
ยท
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents updated
a model 2 days ago
jonp07/GRPO-ALFWorld published
a model 2 days ago
jonp07/GRPO-ALFWorld