Huanyu_Zhang's picture

3 10 3

Huanyu_Zhang

huanyu112

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

GENIUS: Generative Fluid Intelligence Evaluation Suite

upvoted a paper 2 days ago

GEBench: Benchmarking Image Generation Models as GUI Environments

upvoted a paper 6 days ago

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

View all activity

Organizations

upvoted a paper about 6 hours ago

GENIUS: Generative Fluid Intelligence Evaluation Suite

Paper • 2602.11144 • Published about 20 hours ago • 37

upvoted a paper 2 days ago

GEBench: Benchmarking Image Generation Models as GUI Environments

Paper • 2602.09007 • Published 3 days ago • 37

upvoted a paper 6 days ago

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Paper • 2601.21037 • Published 15 days ago • 15

upvoted a paper 9 days ago

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published 10 days ago • 16

submitted a paper to Daily Papers 9 days ago

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published 10 days ago • 16

liked a dataset 9 days ago

VIBE-Benchmark/VIBE-Benchmark

Viewer • Updated 10 days ago • 2.65k • 276 • 2

updated a dataset 10 days ago

VIBE-Benchmark/VIBE-Benchmark

Viewer • Updated 10 days ago • 2.65k • 276 • 2

updated 13 datasets 11 days ago

VIBE-Benchmark/VIBE-Seedream4.0

Viewer • Updated 11 days ago • 1.03k • 15

VIBE-Benchmark/VIBE-Seedream4.5

Viewer • Updated 11 days ago • 1.03k • 21

VIBE-Benchmark/OmniGen

Viewer • Updated 11 days ago • 1.03k • 29

VIBE-Benchmark/VIBE-Banana-Flash

Viewer • Updated 11 days ago • 1.01k • 38

VIBE-Benchmark/VIBE-GPT-Image

Viewer • Updated 11 days ago • 1.01k • 105

VIBE-Benchmark/Edit-R1-Qwen-Image-Edit-2509

Viewer • Updated 11 days ago • 1.03k • 20

VIBE-Benchmark/Qwen-Image-Edit-2509

Viewer • Updated 11 days ago • 1.03k • 19

VIBE-Benchmark/VIBE-Qwen-Image-Edit

Viewer • Updated 11 days ago • 934 • 38

VIBE-Benchmark/FLUX2-dev

Viewer • Updated 11 days ago • 1.03k • 1.16k

VIBE-Benchmark/OmniGen2

Viewer • Updated 11 days ago • 1.03k • 19 • 1

VIBE-Benchmark/UniWorld-V1

Viewer • Updated 11 days ago • 1.03k • 13

VIBE-Benchmark/BAGEL

Viewer • Updated 11 days ago • 1.03k • 1.14k

VIBE-Benchmark/Step1X-Edit-v1p2

Viewer • Updated 11 days ago • 934 • 602 • 1