3 48 15

Shengyuan Ding

ChrisDing1105

https://github.com/SYuan03

SYuan03

AI & ML interests

SII is an institution dedicated to innovation in education and research in the field of AI.

Recent Activity

liked a model 7 days ago

internlm/Intern-S1-Pro

upvoted a paper 8 days ago

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

upvoted a paper 28 days ago

SmartSearch: Process Reward-Guided Query Refinement for Search Agents

View all activity

Organizations

None yet

liked a model 7 days ago

internlm/Intern-S1-Pro

Image-Text-to-Text • Updated 2 days ago • 10k • 243

upvoted a paper 8 days ago

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Paper • 2602.02437 • Published 9 days ago • 75

upvoted 3 papers 28 days ago

liked a Space about 1 month ago

Qwen Image Edit 2511

🏆

305

Edit images based on natural language instructions

upvoted 2 papers about 2 months ago

DEER: Draft with Diffusion, Verify with Autoregressive Models

Paper • 2512.15176 • Published Dec 17, 2025 • 44

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Paper • 2512.11799 • Published Dec 12, 2025 • 30

authored a paper 2 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 49

upvoted 2 papers 2 months ago

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published Dec 8, 2025 • 59

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 49

commented a paper 2 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 49 •

upvoted a paper 2 months ago

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 22

upvoted 3 papers 3 months ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 44

UniREditBench: A Unified Reasoning-based Image Editing Benchmark

Paper • 2511.01295 • Published Nov 3, 2025 • 39

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 30

upvoted a paper 4 months ago

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19

liked a model 4 months ago

Qwen/Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated Oct 15, 2025 • 2.79M • • 746

upvoted 2 papers 4 months ago

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

Paper • 2510.11341 • Published Oct 13, 2025 • 35

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 109

Shengyuan Ding

AI & ML interests

Recent Activity

Organizations

ChrisDing1105's activity

Qwen Image Edit 2511