yang (kq)

AI & ML interests: None yet
Recent Activity
liked a model 3 days ago: Jackrong/Qwopus3.5-27B-v3-GGUF
liked a model 5 days ago: QuantTrio/gemma-4-31B-it-AWQ
liked a model 5 days ago: QuantTrio/gemma-4-31B-it-AWQ-6Bit

Organizations
None yet
This is the best quant version in the world, better than FP8
5 · 2 · #2 opened about 1 month ago by kq
Can we get a 9B-FP8 version next?
14 · 4 · #5 opened about 1 month ago by kq
SVG improvement needed
#35 opened about 2 months ago by kq
How to fix: KeyError: 'model.layers.30.mlp.shared_expert.gate_gate_up_proj.weight'
🔥 1 · 2 · #1 opened 2 months ago by kq
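The two KeyError threads in this list both point at a checkpoint weight name the loader expected but could not find. As a minimal, generic debugging sketch (not taken from any of these threads, and using made-up key names purely for illustration), one common first step is to list the checkpoint's actual parameter names and find the closest matches to the missing key:

```python
# Hedged sketch: a KeyError such as
# 'model.layers.30.mlp.shared_expert.gate_gate_up_proj.weight' usually means
# the name the loading code asks for does not match what the checkpoint
# actually stores. Listing near-miss keys narrows the mismatch down quickly.
import difflib

def nearest_keys(checkpoint_keys, missing_key, n=3):
    """Return the checkpoint keys most similar to the one the loader asked for."""
    return difflib.get_close_matches(missing_key, checkpoint_keys, n=n, cutoff=0.0)

# Toy example with hypothetical key names (not the real checkpoint's keys):
keys = [
    "model.layers.30.mlp.shared_expert.gate_proj.weight",
    "model.layers.30.mlp.shared_expert.up_proj.weight",
    "model.layers.30.mlp.shared_expert.down_proj.weight",
]
for k in nearest_keys(keys, "model.layers.30.mlp.shared_expert.gate_gate_up_proj.weight"):
    print(k)
```

In practice the checkpoint keys would come from the loaded state dict (or from iterating a safetensors shard) rather than a hand-written list; the comparison step is the same either way.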
How much VRAM is needed to run this model? 8× RTX 3090 = 192 GB isn't enough to run the context.
1 · #12 opened 6 months ago by kq
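The VRAM question above comes down to simple arithmetic: weight memory plus KV-cache memory versus the 8 × 24 GB = 192 GB that eight RTX 3090s provide. A rough back-of-envelope sketch (every model number below is a hypothetical placeholder, not the actual model in that thread):

```python
# Hedged back-of-envelope VRAM estimate. All parameters passed in the
# example call are illustrative placeholders, not real model specs.
def vram_gb(params_b, bytes_per_param, layers, kv_heads, head_dim,
            context_len, kv_bytes=2, batch=1):
    """Estimate GiB needed for weights plus KV cache (ignores activations
    and framework overhead, so the real figure is somewhat higher)."""
    weights = params_b * 1e9 * bytes_per_param
    # KV cache: 2 tensors (K and V) per layer, one entry per position.
    kv = 2 * layers * kv_heads * head_dim * context_len * kv_bytes * batch
    return (weights + kv) / 1024**3

# Hypothetical large MoE at 16-bit weights with a long context:
# well above the 192 GB an 8x RTX 3090 node offers.
print(round(vram_gb(235, 2, 94, 4, 128, 131072), 1))
```

The sketch ignores activation memory and per-GPU fragmentation from tensor parallelism, both of which push the practical requirement higher still.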
FP8/4-bit version, please
4 · 5 · #7 opened 7 months ago by zhanghx0905
AttributeError: 'FusedMoE' object has no attribute 'moe'
2 · #1 opened 7 months ago by kq
Any idea how to fix this: KeyError: 'layers.31.mlp.shared_expert.down_proj.weight'
4 · #1 opened 7 months ago by kq
Multiple function_tool calls needed
1 · #12 opened 8 months ago by kq
So, consider building a model for GPUs?
3 · #1 opened about 1 year ago by kq
Supports function calls/structured outputs
4 · #2 opened over 1 year ago by luijait
I am waiting for your release; just waiting here
2 · 3 · #1 opened over 1 year ago by kq