Robert Shaw
robertgshaw2
Update tokenizer_config.json (5)
#3 opened 10 months ago by erichartford

What is an actorder group and what are the advantages of running this in vLLM? (4)
#1 opened 12 months ago by nickandbro

Can I apply a LoRA? (2)
#1 opened about 1 year ago by RonanMcGovern

Nice model, any info on scripts used to quantize? (1)
#1 opened about 1 year ago by RonanMcGovern

How to download the model with transformer library (5)
#6 opened about 1 year ago by Rick10

Update README.md (3)
#25 opened about 1 year ago by robertgshaw2

Issue running on vLLM using FP8 (2)
#3 opened about 1 year ago by ffleandro

vllm says the requested model does not exist (2)
#1 opened over 1 year ago by shivams101

Storage format differs from other w4a16 models (2)
#2 opened over 1 year ago by timdettmers

Model weights are not loaded (4)
#3 opened over 1 year ago by MarvelousMouse

Can not be inferenced with vllm openai server (1)
#1 opened over 1 year ago by jjqsdq

Code example request with vllm (2)
#1 opened over 1 year ago by ShiningJazz

4bit quantisation does not reduce vram usage. (1)
#2 opened over 1 year ago by fu-man

How to run Meta-Llama-3-70B-Instruct-FP8 using several devices? (5)
#3 opened over 1 year ago by Fertel

Reproduction (2)
#792 opened over 1 year ago by robertgshaw2

Fails to run with nm-vllm (1)
#1 opened over 1 year ago by clintonruairi

Update chart template
#2 opened over 1 year ago by robertgshaw2