8 13 46

Shuhuai Ren

ShuhuaiRen

https://renshuhuai-andy.github.io/

AI & ML interests

NLP, Multi-modal

Recent Activity

upvoted a paper about 1 month ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

liked a model 6 months ago

XiaomiMiMo/MiMo-Audio-Tokenizer

upvoted a collection 6 months ago

MiMo-Audio

View all activity

Organizations

upvoted a paper about 1 month ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published Feb 3 • 46

liked a model 6 months ago

XiaomiMiMo/MiMo-Audio-Tokenizer

Updated Sep 19, 2025 • 186 • 31

upvoted a collection 6 months ago

MiMo-Audio

Collection

4 items • Updated 11 days ago • 25

upvoted a paper 6 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85

liked a dataset 7 months ago

apf1/datafilteringnetworks_2b

Updated Feb 28, 2025 • 113 • 20

New activity in XiaomiMiMo/MiMo-VL-7B-RL-2508 7 months ago

add hints for placing visual input and thinking control

#2 opened 7 months ago by

ShuhuaiRen

New activity in XiaomiMiMo/MiMo-VL-7B-SFT-2508 7 months ago

add hints for placing visual input and thinking control

#2 opened 7 months ago by

ShuhuaiRen

liked 2 models 7 months ago

XiaomiMiMo/MiMo-VL-7B-SFT-2508

Image-Text-to-Text • 8B • Updated Aug 21, 2025 • 8.27k • 35

XiaomiMiMo/MiMo-VL-7B-RL-2508

Image-Text-to-Text • 8B • Updated Aug 21, 2025 • 134k • 86

upvoted a collection 7 months ago

MiMo-VL

Collection

6 items • Updated Dec 17, 2025 • 39

liked a model 7 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 734k • • 12.5k

liked a Space 8 months ago

RISEBench Gallery

👀

A Gallery of Generation Results on RISEBench

liked 2 Spaces 9 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Generate a curated web‑text dataset for LLM training

The Ultra-Scale Playbook

🌌

3.74k

The ultimate guide to training LLM on large GPU Clusters

authored a paper 9 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80

upvoted a paper 9 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80

liked 2 models 10 months ago

XiaomiMiMo/MiMo-VL-7B-SFT

Image-Text-to-Text • 8B • Updated Jun 7, 2025 • 185 • 54

XiaomiMiMo/MiMo-VL-7B-RL

Image-Text-to-Text • 8B • Updated Jun 7, 2025 • 1.1k • 168

authored a paper 10 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 82

liked a dataset 10 months ago

BGLab/BioTrove

Viewer • Updated Dec 13, 2024 • 163M • 379 • 17

Shuhuai Ren

AI & ML interests

Recent Activity

Organizations

ShuhuaiRen's activity

add hints for placing visual input and thinking control

add hints for placing visual input and thinking control

RISEBench Gallery

FineWeb: decanting the web for the finest text data at scale

The Ultra-Scale Playbook