Man Cub
mancub
AI & ML interests
None yet
Recent Activity
new activity about 8 hours ago
adamjen/Devstral-Small-2-24B-Opus-Reasoning:How to use it with llama-server ? new activity 3 days ago
ubergarm/Qwen3.5-122B-A10B-GGUF:How to split this model between 2 (3) GPUs and CPU/RAM ? new activity 6 days ago
noctrex/Mistral-Small-4-119B-2603-MXFP4_MOE-GGUF:Poor performance and pretty lobotomizedOrganizations
None yet
How to use it with llama-server ?
👀 1
2
#1 opened about 8 hours ago
by
mancub
How to split this model between 2 (3) GPUs and CPU/RAM ?
17
#12 opened 7 days ago
by
mancub
Poor performance and pretty lobotomized
2
#1 opened 7 days ago
by
mancub
Love the license, confused by some of the decisions.
🤝👍 13
15
#15 opened 8 days ago
by
CyborgPaloma
My personal vLLM launch cmd on my old personal 2x3090 workstation
4
#1 opened 23 days ago
by
tclf90
It's really good.
👀 1
26
#3 opened 28 days ago
by
Shuasimodo
Increasing the precision of some of the weights when quantizing
👍 4
57
#2 opened about 1 month ago
by
Shuasimodo
New activity in TeichAI/GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill-GGUF about 2 months ago
A draft model with less parameters, for speculative thinking?
8
#5 opened about 2 months ago
by
mancub
Jan 21: All GLM-4.7-Flash quants reuploaded - much better outputs!
🔥❤️ 7
29
#10 opened 2 months ago
by
danielhanchen
Fast loras
2
#8 opened 3 months ago
by
melmass
Wan-Lighting : 4steps per model or 4steps total?
4
#59 opened 8 months ago
by
NielsGx
Can we have a Llama-3.1-8B-Lexi-Uncensored-V2_fp8_scaled.safetensors
🔥 1
12
#10 opened 11 months ago
by
drguolai
Within Seconds ?
7
#8 opened 12 months ago
by
Daemontatox
Is it censored output?
12
#2 opened 12 months ago
by
KurtcPhotoED
Please work with llama.cpp before releasing new models.
2
#10 opened 11 months ago
by
bradhutchings
Lack of 33B models?
👍 1
7
#1 opened over 2 years ago
by
mancub
No config.json ?
3
#1 opened almost 3 years ago
by
0x12d3
is this working properly?
23
#1 opened almost 3 years ago
by
Boffy
Uh oh, the "q's"...
27
#2 opened almost 3 years ago
by
mancub
Are you making k-quant series of this model?
3
#1 opened almost 3 years ago
by
mancub