Qiaosheng Chen

cqsss

https://cqsss.github.io/

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 5 days ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published 8 days ago • 57

upvoted a collection 8 days ago

Bee

Collection • 10 items • Updated Dec 17, 2025 • 13

upvoted a paper 9 days ago

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published 11 days ago • 32

upvoted 2 papers 11 days ago

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published 11 days ago • 125

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published 15 days ago • 151

upvoted a paper 23 days ago

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Paper • 2601.12346 • Published 26 days ago • 49

upvoted an article 29 days ago

Visualize and understand GPU memory in PyTorch

Article • Dec 24, 2024 • 264

upvoted a paper about 1 month ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Paper • 2601.07779 • Published Jan 12 • 28

upvoted a paper about 2 months ago

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published Dec 27, 2025 • 48

upvoted a collection about 2 months ago

JanusCoder

Collection • 8 items • Updated 11 days ago • 11

upvoted a paper 2 months ago

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Paper • 2512.04784 • Published Dec 2, 2025 • 25

upvoted 2 papers 3 months ago

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published Nov 19, 2025 • 24

InteractScience: Programmatic and Visually-Grounded Evaluation of Interactive Scientific Demonstration Code Generation

Paper • 2510.09724 • Published Oct 10, 2025 • 11

upvoted 3 papers 4 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 97

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9, 2025 • 27

AutoPR: Let's Automate Your Academic Promotion!

Paper • 2510.09558 • Published Oct 10, 2025 • 53

upvoted 2 papers 5 months ago

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published Sep 29, 2025 • 19

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

upvoted 2 papers 6 months ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27, 2025 • 37

CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback

Paper • 2507.22080 • Published Jul 25, 2025 • 9
