22 40 255

AI Safety Research

AISafety

https://humanaligned.ai

AI & ML interests

LLMs, planning, EA

Recent Activity

upvoted an article about 18 hours ago

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

liked a Space 9 days ago

ProlificAI/humaine-leaderboard

liked a dataset 9 days ago

ProlificAI/social-reasoning-rlhf

View all activity

Organizations

New activity in Goodfire/DeepSeek-R1-SAE-l37 21 days ago

Expansion factor same on logic vs. math?

#2 opened 21 days ago by

AISafety

New activity in transcendingvictor/delphi-llama2-100k-validation-logprobs about 1 month ago

seems logits

#2 opened about 1 month ago by

AISafety

New activity in deepseek-ai/DeepSeek-V3.2-Exp about 1 month ago

The whale is back

❤️ 7

#8 opened 3 months ago by

Nechintosh

New activity in LiquidAI/LFM2-ColBERT about 2 months ago

Scores

#1 opened about 2 months ago by

AISafety

commented a paper 3 months ago

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26, 2025 • 40 •

New activity in deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 7 months ago

Any plans for a Qwen3-32B model?

👍 13

#9 opened 7 months ago by

wanghf

commented a paper 9 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9, 2025 • 76 •

New activity in deepseek-ai/DeepSeek-V3-0324 9 months ago

That's the best day in our "open AI" world.

🤗 ❤️ 40

#28 opened 9 months ago by

DOFOFFICIAL

New activity in hexgrad/Kokoro-82M 12 months ago

Synthetic Data Collection PAUSED Jan 16

🚀 🔥 17

#21 opened 12 months ago by

hexgrad

New activity in Qwen/QwQ-32B-Preview about 1 year ago

QuietSTAR?

#9 opened about 1 year ago by

albatrossbirdie

New activity in NousResearch/Hermes-2-Theta-Llama-3-70B over 1 year ago

Exl2 quants

#1 opened over 1 year ago by

bullerwins

New activity in zai-org/glm-4-9b over 1 year ago

使用 lm_eval 测试时报错了

#1 opened over 1 year ago by

xianf

Full list of languages?

🔥 1

#2 opened over 1 year ago by

RASMUS

New activity in zai-org/glm-4-9b-chat-1m over 1 year ago

希望提供gguf版本

#6 opened over 1 year ago by

windkkk

Can you quantify this model in exl2?

👀 1

#7 opened over 1 year ago by

xldistance

New activity in Qwen/Qwen-72B about 2 years ago

Training loss?

👍 1

#2 opened about 2 years ago by

borgr

New activity in facebook/MusicGen over 2 years ago

MusicGen: a very calming song, rain in the background casual...

#15 opened over 2 years ago by

radames

New activity in bigcode/starcoder over 2 years ago

Unable to Deploy to Amazon SageMaker using Supplied Deploy Code

👍 2

#48 opened over 2 years ago by

garystafford

The command outputs = model.generate(inputs) is throwing error "RuntimeError: "LayerNormKernelImpl" not implemented for 'Half'"

#47 opened over 2 years ago by

ayaan-k1

AI Safety Research

AI & ML interests

Recent Activity

Organizations

AISafety's activity

Expansion factor same on logic vs. math?

seems logits

The whale is back

Scores

Any plans for a Qwen3-32B model?

That's the best day in our "open AI" world.

Synthetic Data Collection PAUSED Jan 16

QuietSTAR?

Exl2 quants

使用 lm_eval 测试时报错了

Full list of languages?

希望提供gguf版本

Can you quantify this model in exl2?

Training loss?

MusicGen: a very calming song, rain in the background casual...

Unable to Deploy to Amazon SageMaker using Supplied Deploy Code

The command outputs = model.generate(inputs) is throwing error "RuntimeError: "LayerNormKernelImpl" not implemented for 'Half'"