Open to Work

2 21 4

Kevork Sulahian

herooooooooo

bad_at_ai

AI & ML interests

LLM

Recent Activity

liked a Space 3 days ago

AdithyaSK/rl-environments-guide

new activity 24 days ago

dwko/Alpamayo-R1-10B-4bit:Share steps

updated a model about 1 month ago

herooooooooo/nemo_gym_sudoku_finetune_4bit

View all activity

Organizations

liked a Space 3 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

111

Building and scaling RL environments for LLM training

New activity in dwko/Alpamayo-R1-10B-4bit 24 days ago

Share steps

#2 opened 24 days ago by

herooooooooo

updated a model about 1 month ago

herooooooooo/nemo_gym_sudoku_finetune_4bit

Text Generation • 2B • Updated Mar 29 • 8

published a model about 1 month ago

herooooooooo/nemo_gym_sudoku_finetune_4bit

Text Generation • 2B • Updated Mar 29 • 8

updated a model about 2 months ago

herooooooooo/Qwen3.5-4B_lora_math_vision

Image-Text-to-Text • Updated Mar 25 • 1

published a model about 2 months ago

herooooooooo/Qwen3.5-4B_lora_math_vision

Image-Text-to-Text • Updated Mar 25 • 1

commented on Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem about 2 months ago

What kind of annotation tool did you use, if it's open could you please link it? I am hoping to fine-tune to use-cases without data, so I would need to first generate it and then add/fix some of the reasoning steps

upvoted 2 articles about 2 months ago

Article

Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, and Simulation

Mar 16

•

Article

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

Jan 5

•

upvoted 2 articles 2 months ago

Article

Mixture of Experts (MoEs) in Transformers

Feb 26

•

159

Article

How I Landed Multiple Senior LLM Engineer Offers (And The Brutal Reality of AI Interviews Right Now)

Feb 26

•

published an article 2 months ago

Article

How I Landed Multiple Senior LLM Engineer Offers (And The Brutal Reality of AI Interviews Right Now)

Feb 26

•

authored a paper 3 months ago

Gaming the Answer Matcher: Examining the Impact of Text Manipulation on Automated Judgment

Paper • 2601.08849 • Published Dec 22, 2025 • 3

upvoted a paper 4 months ago

Gaming the Answer Matcher: Examining the Impact of Text Manipulation on Automated Judgment

Paper • 2601.08849 • Published Dec 22, 2025 • 3

updated a model 5 months ago

herooooooooo/functiongemma_test_version

Updated Dec 24, 2025

published a model 5 months ago

herooooooooo/functiongemma_test_version

Updated Dec 24, 2025

updated a model 11 months ago

herooooooooo/llama_finetuned_lora

Updated Jun 7, 2025

published a model 11 months ago

herooooooooo/llama_finetuned_lora

Updated Jun 7, 2025

upvoted an article 11 months ago

Article

Demystifying DeepSeekMath’s Data Pipeline: A FastText-Based Reproduction and Analysis

Jun 1, 2025

•

published an article 11 months ago

Article

Demystifying DeepSeekMath’s Data Pipeline: A FastText-Based Reproduction and Analysis

Jun 1, 2025

•

Kevork Sulahian

AI & ML interests

Recent Activity

Organizations

herooooooooo's activity

The ultimate guide to RL environments: building and scaling them in the LLM era

Share steps

Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, and Simulation

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

Mixture of Experts (MoEs) in Transformers

How I Landed Multiple Senior LLM Engineer Offers (And The Brutal Reality of AI Interviews Right Now)

How I Landed Multiple Senior LLM Engineer Offers (And The Brutal Reality of AI Interviews Right Now)

Demystifying DeepSeekMath’s Data Pipeline: A FastText-Based Reproduction and Analysis

Demystifying DeepSeekMath’s Data Pipeline: A FastText-Based Reproduction and Analysis