Prakamya Mishra

Prakamya

amd

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 4 days ago • 59

upvoted a paper 1 day ago

Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report

Paper • 2601.21051 • Published 5 days ago • 12

upvoted a paper about 2 months ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 38

updated a collection about 2 months ago

SAND

5 items • Updated Dec 6, 2025 • 1

updated 2 models about 2 months ago

amd/SAND-Math-Qwen2.5-32B

Text Generation • 33B • Updated Dec 6, 2025 • 29 • 3

amd/SAND-MathScience-DeepSeek-Qwen32B

Text Generation • 33B • Updated Dec 6, 2025 • 9 • 2

updated a dataset about 2 months ago

amd/SAND-Post-Training-Dataset

Viewer • Updated Dec 6, 2025 • 27.9k • 174 • 3

published a dataset about 2 months ago

amd/SAND-Post-Training-Dataset

Viewer • Updated Dec 6, 2025 • 27.9k • 174 • 3

published 2 models about 2 months ago

amd/SAND-MathScience-DeepSeek-Qwen32B

Text Generation • 33B • Updated Dec 6, 2025 • 9 • 2

amd/SAND-Math-Qwen2.5-32B

Text Generation • 33B • Updated Dec 6, 2025 • 29 • 3

updated 2 collections 2 months ago

SAND

5 items • Updated Dec 6, 2025 • 1

Quark Quantized PTPC FP8 Models

PTPC model quantized by quark • 9 items • Updated 18 days ago

updated a collection 3 months ago

Instella ✨

Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. • 13 items • Updated Dec 5, 2025 • 10

commented a paper 3 months ago

Instella: Fully Open Language Models with Stellar Performance

Paper • 2511.10628 • Published Nov 13, 2025 • 5

updated 3 models 3 months ago

amd/AMD-OLMo-1B-SFT-DPO

Text Generation • 1B • Updated Nov 17, 2025 • 40 • 23

amd/AMD-OLMo-1B-SFT

Text Generation • 1B • Updated Nov 17, 2025 • 66 • 20

amd/AMD-OLMo-1B

Text Generation • 1B • Updated Nov 17, 2025 • 91 • 25