2 19 7

Wenming Tu

tutu0604

https://danjuan-77.github.io/

danjuan-77

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

upvoted a paper 4 days ago

What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion

liked a model about 1 month ago

OpenMOSS-Team/MOSS-Audio-4B-Instruct

View all activity

Organizations

None yet

upvoted a paper about 22 hours ago

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

Paper • 2605.09413 • Published 5 days ago • 5

upvoted a paper 4 days ago

What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion

Paper • 2605.07915 • Published 7 days ago • 8

liked a model about 1 month ago

OpenMOSS-Team/MOSS-Audio-4B-Instruct

Audio-Text-to-Text • 5B • Updated Apr 14 • 30.4k • 52

upvoted 2 papers about 2 months ago

Vec-LUT: Vector Table Lookup for Parallel Ultra-Low-Bit LLM Inference on Edge Devices

Paper • 2512.06443 • Published Dec 6, 2025 • 3

OxyGen: Unified KV Cache Management for Vision-Language-Action Models under Multi-Task Parallelism

Paper • 2603.14371 • Published Mar 15 • 4

liked a model about 2 months ago

Soul-AILab/SoulX-Duplug-0.6B

Updated Mar 17 • 92 • 16

upvoted a paper 2 months ago

\$OneMillion-Bench: How Far are Language Agents from Human Experts?

Paper • 2603.07980 • Published Mar 9 • 27

upvoted a paper 3 months ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

updated a model 4 months ago

tutu0604/UltraVoice-SFT

Text-to-Speech • Updated Jan 27 • 12

New activity in tutu0604/UltraVoice-SFT 4 months ago

Add metadata (pipeline tag, library name) and improve model card content

#1 opened 7 months ago by

nielsr

upvoted a paper 4 months ago

UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models

Paper • 2510.22588 • Published Oct 26, 2025 • 1

liked a Space 5 months ago

Qwen-Image-Edit-2511-LoRAs-Fast

🎃

1.42k

Demo of the Collection of Qwen Image Edit LoRAs

upvoted 2 papers 5 months ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 79

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 22

liked a dataset 6 months ago

tutu0604/UltraVoice

Viewer • Updated Nov 13, 2025 • 101k • 537 • 14

New activity in tutu0604/UltraVoice 6 months ago

Add task categories and additional tags to dataset card metadata

#2 opened 7 months ago by

nielsr

updated a dataset 7 months ago

tutu0604/UltraVoice

Viewer • Updated Nov 13, 2025 • 101k • 537 • 14

published 2 datasets 7 months ago

tutu0604/UltraVoice-SLAM-Omni

Updated Oct 27, 2025 • 3

tutu0604/UltraVoice

Viewer • Updated Nov 13, 2025 • 101k • 537 • 14

published a model 7 months ago

tutu0604/UltraVoice-SFT

Text-to-Speech • Updated Jan 27 • 12

Wenming Tu

AI & ML interests

Recent Activity

Organizations

tutu0604's activity

Add metadata (pipeline tag, library name) and improve model card content

Qwen-Image-Edit-2511-LoRAs-Fast

Add task categories and additional tags to dataset card metadata