Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
14
5
10
Nathan Godey
nthngdy
Follow
davanstrien's profile picture
alessiodevoto's profile picture
rntc's profile picture
15 followers
·
2 following
https://nathangodey.github.io/
nthngdy
NathanGodey
AI & ML interests
None yet
Recent Activity
submitted
a paper
3 days ago
Lost in Backpropagation: The LM Head is a Gradient Bottleneck
updated
a model
6 days ago
nthngdy/matryoshka-baselines
published
a model
6 days ago
nthngdy/matryoshka-baselines
View all activity
Organizations
nthngdy
's models
60
Sort: Recently updated
nthngdy/matryoshka-baselines
Updated
6 days ago
•
20
nthngdy/matryoshka-1B
Text Generation
•
1B
•
Updated
6 days ago
•
377
nthngdy/matritest_1B
Text Generation
•
0.6B
•
Updated
20 days ago
•
135
nthngdy/matritest_600M
Text Generation
•
0.4B
•
Updated
20 days ago
•
136
nthngdy/matritest_300M
Text Generation
•
0.2B
•
Updated
20 days ago
•
155
nthngdy/matritest_van_1B
Text Generation
•
1B
•
Updated
20 days ago
•
68
nthngdy/matritest_van_600M
Text Generation
•
0.6B
•
Updated
20 days ago
•
68
nthngdy/matritest_van_300M
Text Generation
•
0.3B
•
Updated
20 days ago
•
66
nthngdy/matritest_van_100M
Text Generation
•
0.1B
•
Updated
20 days ago
•
70
nthngdy/matritest_100M
Text Generation
•
0.1B
•
Updated
20 days ago
•
161
nthngdy/bttl_2B
Updated
Jan 19
nthngdy/llama2-0b-unit-test_qfilt
Updated
Mar 10, 2025
•
1.01k
nthngdy/Llama-3.1-70B-Instruct_qfilt
Updated
Mar 7, 2025
•
569
nthngdy/olmo24b-random
Updated
Mar 3, 2025
•
1
nthngdy/DeepSeek-R1-Distill-Qwen-1.5B_qfilt
Updated
Mar 3, 2025
•
572
nthngdy/DeepSeek-R1-Distill-Llama-8B_qfilt
Updated
Mar 3, 2025
•
571
nthngdy/llama24b-random
Updated
Feb 26, 2025
•
1
nthngdy/olmo2-1B-random
Updated
Feb 6, 2025
nthngdy/Qwen2.5-7B-Instruct_qfilt
Updated
Feb 6, 2025
•
561
nthngdy/Qwen2.5-7B_qfilt
Updated
Feb 6, 2025
•
552
nthngdy/phi-4_qfilt
Updated
Feb 6, 2025
•
566
nthngdy/Mistral-Small-24B-Instruct-2501_qfilt
Updated
Feb 6, 2025
•
570
nthngdy/Meta-Llama-3.1-405B_qfilt
Updated
Feb 6, 2025
•
561
nthngdy/Llama-3.2-3B-Instruct_qfilt
Updated
Feb 6, 2025
•
569
nthngdy/Llama-3.1-70B_qfilt
Updated
Feb 6, 2025
•
565
nthngdy/Llama-3.2-1B-Instruct_qfilt
Updated
Nov 28, 2024
•
571
nthngdy/Llama-3.2-1B_qfilt
Updated
Nov 28, 2024
•
566
nthngdy/Llama-3.2-3B_qfilt
Updated
Nov 28, 2024
•
691
nthngdy/Llama-3.1-8B_qfilt
Updated
Nov 28, 2024
•
571
nthngdy/Llama-3.1-8B-Instruct_qfilt
Updated
Nov 28, 2024
•
1.18k
Previous
1
2
Next