Chen JingQi's picture

5 2

Chen JingQi

KyrinChen

·

KyrinChen

AI & ML interests

Agent & RL

Recent Activity

upvoted a paper 3 days ago

AI Can Learn Scientific Taste

upvoted a paper 14 days ago

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

upvoted a paper about 2 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

View all activity

Organizations

None yet

upvoted a paper 3 days ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published 4 days ago • 253

upvoted a paper 14 days ago

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published 15 days ago • 55

upvoted a paper about 2 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 51

upvoted a paper 2 months ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published Jan 4 • 58

upvoted a paper 4 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 242