Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
24
1
7
Rylan Schaeffer
PRO
RylanSchaeffer
Follow
masonwang025's profile picture
h-d-h's profile picture
brando's profile picture
7 followers
·
5 following
RylanSchaeffer
AI & ML interests
None yet
Recent Activity
updated
a dataset
5 days ago
RylanSchaeffer/math_perturbed
published
a dataset
6 days ago
RylanSchaeffer/math_perturbed
View all activity
Organizations
RylanSchaeffer
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
Qwen/Qwen2-1.5B
8 months ago
Number of Pretraining Tokens per Qwen 2.5 Model?
1
#9 opened 8 months ago by
RylanSchaeffer
New activity in
monology/pile-uncopyrighted
over 1 year ago
How was the valid and test split created?
3
#6 opened over 1 year ago by
Parallaxixs
New activity in
cleanrl/EleutherAI_pythia-1b-deduped__reward__tldr
over 1 year ago
Documentation for model training & data?
👍
2
2
#1 opened over 1 year ago by
RylanSchaeffer
New activity in
trl-internal-testing/tldr-preference-sft-trl-style
over 1 year ago
Corresponding Training Dataset for Reward Model?
2
#4 opened over 1 year ago by
RylanSchaeffer
New activity in
Ray2333/GRM-llama3-8B-sftreg
over 1 year ago
Abnormally Large Memory Footprint?
2
#2 opened over 1 year ago by
RylanSchaeffer
Some weights of the model checkpoint at Ray2333/GRM-llama3-8B-sftreg were not used when initializing
1
#1 opened over 1 year ago by
RylanSchaeffer
New activity in
openbmb/Eurus-RM-7b
over 1 year ago
How to use with batch size > 1?
1
#9 opened over 1 year ago by
RylanSchaeffer
New activity in
bigcode/starcoder
almost 2 years ago
valueerror: error initializing torch.distributed using env:// rendezvous: environment variable master_addr expected, but not set
1
#68 opened over 2 years ago by
mahi22muki
New activity in
google/gemma-7b
about 2 years ago
Dont download, google scuttled this model
🤯
👍
1
16
#77 opened about 2 years ago by
Tom-Neverwinter
New activity in
allenai/OLMo-1B
about 2 years ago
Why is there no intermediate checkpoint between 500B-1300B?
4
#11 opened about 2 years ago by
siqi-zz
New activity in
LLM360/Amber
about 2 years ago
Inquiry on Open-Sourcing Model Checkpoints
6
#3 opened over 2 years ago by
Mars2050
New activity in
mistralai/Mistral-7B-Instruct-v0.2
about 2 years ago
TypeError: bad operand type for unary -: 'NoneType'
👍
1
5
#19 opened over 2 years ago by
Jason571
New activity in
allenai/OLMo-1B
about 2 years ago
Unable to Load Model
3
#10 opened about 2 years ago by
RylanSchaeffer
Load more