2 19 3

Oooowi

ZiruiZheng

zhengzirui

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 months ago

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

upvoted a paper 5 months ago

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

updated a model 5 months ago

ZiruiZheng/coco_data

View all activity

Organizations

upvoted a paper 3 months ago

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

Paper • 2512.03041 • Published Dec 2, 2025 • 64

upvoted a paper 5 months ago

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Paper • 2509.09676 • Published Sep 11, 2025 • 35

updated a model 5 months ago

ZiruiZheng/coco_data

Updated Sep 17, 2025

published a model 5 months ago

ZiruiZheng/coco_data

Updated Sep 17, 2025

upvoted 2 papers 5 months ago

RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10, 2025 • 73

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

updated a dataset 6 months ago

ZiruiZheng/coco_data

Updated Aug 18, 2025 • 54

published a dataset 6 months ago

ZiruiZheng/coco_data

Updated Aug 18, 2025 • 54

updated 2 collections 7 months ago

text-to-image

Collection

2 items • Updated Aug 3, 2025

in-context learning

Collection

3 items • Updated Aug 3, 2025

upvoted a paper 9 months ago

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4, 2025 • 43

liked a model 10 months ago

Hzzone/GLIGEN_COCO

Updated May 10, 2024 • 1 • 1

upvoted a paper 11 months ago

ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement

Paper • 2504.01934 • Published Apr 2, 2025 • 22

commented a paper 11 months ago

Towards Physically Plausible Video Generation via VLM Planning

Paper • 2503.23368 • Published Mar 30, 2025 • 40 •

upvoted 2 papers 11 months ago

Towards Physically Plausible Video Generation via VLM Planning

Paper • 2503.23368 • Published Mar 30, 2025 • 40

AMD-Hummingbird: Towards an Efficient Text-to-Video Model

Paper • 2503.18559 • Published Mar 24, 2025 • 5

upvoted a paper 12 months ago

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published Feb 27, 2025 • 30

upvoted 2 papers about 1 year ago

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Paper • 2502.08639 • Published Feb 12, 2025 • 43

Autoregressive Video Generation without Vector Quantization

Paper • 2412.14169 • Published Dec 18, 2024 • 14

New activity in stabilityai/stable-diffusion-3.5-medium about 1 year ago

Doesn't work boys - we'll get 'em next time. FIX INSIDE

#10 opened over 1 year ago by

mushroomfleet

Oooowi

AI & ML interests

Recent Activity

Organizations

ZiruiZheng's activity

Doesn't work boys - we'll get 'em next time. FIX INSIDE