Jon Peng

jonp07

·

AI & ML interests

None yet

Recent Activity

published a model 3 days ago

jonp07/slime_sif

updated a model 3 days ago

jonp07/slime_sif

authored a paper 5 months ago

HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents

View all activity

Organizations

Papers 1

arxiv:2602.16165

models 2

jonp07/slime_sif

Updated 3 days ago

jonp07/GRPO-ALFWorld

datasets 0

None public yet