2 24

zhijie deng PRO

zhijie3

https://thudzj.github.io/

thudzj

AI & ML interests

None yet

Recent Activity

published a Space 3 days ago

zhijie3/think-then-generate

updated a Space 3 days ago

zhijie3/think-then-generate

upvoted a paper 7 days ago

LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

View all activity

Organizations

published a Space 3 days ago

Think Then Generate

🖼

updated a Space 3 days ago

Think Then Generate

🖼

upvoted a paper 7 days ago

LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

Paper • 2512.16229 • Published 12 days ago • 15

upvoted 2 papers 12 days ago

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Paper • 2512.14681 • Published 13 days ago • 39

DEER: Draft with Diffusion, Verify with Autoregressive Models

Paper • 2512.15176 • Published 13 days ago • 41

upvoted a paper about 1 month ago

Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

Paper • 2511.16175 • Published Nov 20 • 12

upvoted a paper 2 months ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published Oct 20 • 122

upvoted a paper 5 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 145

updated a Space 5 months ago

D2F LLaDA Instruct 8B

👁

Diffusion LLMs Can Do Faster-Than-AR Inference via Discret

upvoted a paper 5 months ago

Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing

Paper • 2508.09192 • Published Aug 8 • 30

published a Space 5 months ago

D2F LLaDA Instruct 8B

👁

Diffusion LLMs Can Do Faster-Than-AR Inference via Discret

upvoted a paper 6 months ago

Scaling Speculative Decoding with Lookahead Reasoning

Paper • 2506.19830 • Published Jun 24 • 12

upvoted 3 papers 7 months ago

LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks

Paper • 2506.00411 • Published May 31 • 31

Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

Paper • 2505.19949 • Published May 26 • 16

Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition

Paper • 2505.19788 • Published May 26 • 13

upvoted a paper 8 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21 • 47

authored 3 papers 9 months ago

commented a paper 9 months ago

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published Apr 1 • 67 •

zhijie deng PRO

AI & ML interests

Recent Activity

Organizations

zhijie3's activity

Think Then Generate

Think Then Generate

D2F LLaDA Instruct 8B

D2F LLaDA Instruct 8B