David's picture

In a Training Loop 🔄

David

dnhkng

·

AI & ML interests

Exploratory Research -> DIYAGI

Recent Activity

new activity 21 days ago

google/gemma-4-31B-it:Can anyone improve the model using the Rys methodology—by duplicating a block of layers?

liked a model about 1 month ago

NeuroSenko/MiniMax-M2.7-exl3

new activity about 2 months ago

dnhkng/RYS-Qwen3.5-27B-FP8-XL:Any bf16/fp16 varient?

View all activity

Organizations

New activity in google/gemma-4-31B-it 21 days ago

Can anyone improve the model using the Rys methodology—by duplicating a block of layers?

#60 opened about 1 month ago by

New activity in dnhkng/RYS-Qwen3.5-27B-FP8-XL about 2 months ago

Any bf16/fp16 varient?

#1 opened about 2 months ago by

New activity in cyankiwi/MiniMax-M2.5-AWQ-4bit 2 months ago

Transformers support

#2 opened 2 months ago by

New activity in mratsim/MiniMax-M2.5-BF16-INT4-AWQ 3 months ago

FP8 + INT4 version

#2 opened 3 months ago by

New activity in mratsim/MiniMax-M2.1-FP8-INT4-AWQ 4 months ago

Looking forward to trying this!

#2 opened 4 months ago by

New activity in unsloth/GLM-4.7-GGUF 5 months ago

Original FP8 Weights

#9 opened 5 months ago by

New activity in mistralai/Mistral-Large-3-675B-Instruct-2512-NVFP4 6 months ago

The model says NVFP4 for H100s and A100s

#1 opened 6 months ago by

New activity in HuggingFaceTB/SmolVLM-256M-Instruct over 1 year ago

ONNX Demo code

#4 opened over 1 year ago by

New activity in HuggingFaceTB/SmolVLM-500M-Instruct over 1 year ago

ONNX decoder model uses non-standard operators

#6 opened over 1 year ago by

New activity in MaziyarPanahi/calme-2.4-rys-78b over 1 year ago

Collaboration?

#10 opened over 1 year ago by

🎉 Congrats 🎉

#5 opened over 1 year ago by

New activity in dnhkng/RYS-XLarge over 1 year ago

What's the status of the Rys models / training method?

#4 opened over 1 year ago by

New activity in open-llm-leaderboard/open_llm_leaderboard over 1 year ago

Model fail, re-eval request 😊

#885 opened almost 2 years ago by

New activity in MaziyarPanahi/calme-2.1-rys-78b over 1 year ago

Rys loss

#5 opened almost 2 years ago by

New activity in dnhkng/RYS-XLarge over 1 year ago

Request

#3 opened almost 2 years ago by

New activity in dnhkng/RYS-Llama-3.1-8B-Instruct over 1 year ago

Adding Evaluation Results

#2 opened over 1 year ago by

leaderboard-pr-bot

New activity in dnhkng/RYS-Llama3.1-Large over 1 year ago

Adding Evaluation Results

#1 opened over 1 year ago by

leaderboard-pr-bot

New activity in dnhkng/RYS-Medium over 1 year ago

Ready for quanting?

#3 opened over 1 year ago by

New activity in dnhkng/RYS-Phi-3-medium-4k-instruct over 1 year ago

Adding Evaluation Results

#2 opened over 1 year ago by

leaderboard-pr-bot

New activity in dnhkng/RYS-Llama-3.1-8B-Instruct over 1 year ago

Interesting model of LLaMa-3.1-8B-Instruct - explain what you did please - if you can. I'm curious. 😋

#1 opened almost 2 years ago by