In a Training Loop
617.8
TFLOPS
56
3
16
David
dnhkng
138 followers · 3 following
AI & ML interests
Exploratory Research -> DIYAGI
Recent Activity
new
activity
21 days ago
google/gemma-4-31B-it:
Can anyone improve the model using the Rys methodology by duplicating a block of layers?
liked
a model
about 1 month ago
NeuroSenko/MiniMax-M2.7-exl3
new
activity
about 2 months ago
dnhkng/RYS-Qwen3.5-27B-FP8-XL:
Any bf16/fp16 variant?
Organizations
dnhkng's activity
New activity in
google/gemma-4-31B-it
21 days ago
Can anyone improve the model using the Rys methodology by duplicating a block of layers?
11
#60 opened about 1 month ago by
Regrin
New activity in
dnhkng/RYS-Qwen3.5-27B-FP8-XL
about 2 months ago
Any bf16/fp16 variant?
1
#1 opened about 2 months ago by
EclipseMist
New activity in
cyankiwi/MiniMax-M2.5-AWQ-4bit
2 months ago
Transformers support
#2 opened 2 months ago by
dnhkng
New activity in
mratsim/MiniMax-M2.5-BF16-INT4-AWQ
3 months ago
FP8 + INT4 version
14
#2 opened 3 months ago by
bigstorm
New activity in
mratsim/MiniMax-M2.1-FP8-INT4-AWQ
4 months ago
Looking forward to trying this!
2
17
#2 opened 4 months ago by
dnhkng
New activity in
unsloth/GLM-4.7-GGUF
5 months ago
Original FP8 Weights
6
#9 opened 5 months ago by
Ano-Nimus
New activity in
mistralai/Mistral-Large-3-675B-Instruct-2512-NVFP4
6 months ago
The model says NVFP4 for H100s and A100s
1
2
#1 opened 6 months ago by
dnhkng
New activity in
HuggingFaceTB/SmolVLM-256M-Instruct
over 1 year ago
ONNX Demo code
8
17
#4 opened over 1 year ago by
cnmoro
New activity in
HuggingFaceTB/SmolVLM-500M-Instruct
over 1 year ago
ONNX decoder model uses non-standard operators
4
#6 opened over 1 year ago by
robertknight
New activity in
MaziyarPanahi/calme-2.4-rys-78b
over 1 year ago
Collaboration?
1
10
#10 opened over 1 year ago by
dnhkng
Congrats
7
#5 opened over 1 year ago by
dnhkng
New activity in
dnhkng/RYS-XLarge
over 1 year ago
What's the status of the Rys models / training method?
2
#4 opened over 1 year ago by
smcleod
New activity in
open-llm-leaderboard/open_llm_leaderboard
over 1 year ago
Model fail, re-eval request
8
#885 opened almost 2 years ago by
dnhkng
New activity in
MaziyarPanahi/calme-2.1-rys-78b
over 1 year ago
Rys loss
6
#5 opened almost 2 years ago by
dnhkng
New activity in
dnhkng/RYS-XLarge
over 1 year ago
Request
1
4
#3 opened almost 2 years ago by
bartowski
New activity in
dnhkng/RYS-Llama-3.1-8B-Instruct
over 1 year ago
Adding Evaluation Results
#2 opened over 1 year ago by
leaderboard-pr-bot
New activity in
dnhkng/RYS-Llama3.1-Large
over 1 year ago
Adding Evaluation Results
#1 opened over 1 year ago by
leaderboard-pr-bot
New activity in
dnhkng/RYS-Medium
over 1 year ago
Ready for quanting?
1
#3 opened over 1 year ago by
bartowski
New activity in
dnhkng/RYS-Phi-3-medium-4k-instruct
over 1 year ago
Adding Evaluation Results
#2 opened over 1 year ago by
leaderboard-pr-bot
New activity in
dnhkng/RYS-Llama-3.1-8B-Instruct
over 1 year ago
Interesting model of LLaMa-3.1-8B-Instruct - explain what you did please - if you can. I'm curious.
8
#1 opened almost 2 years ago by
Joseph717171