

Igor Kilbas

kaleinaNyan


oKatanaaa

AI & ML interests

Computer Vision, NLP

Organizations

None yet

kaleinaNyan's collections (4)

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published Aug 11, 2025 • 50
Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 98
RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 32

A series of English/Russian instruction-following models.

kaleinaNyan/kolibri-mistral-0427-2.gguf

7B • Updated May 13, 2024 • 5 • 2
kaleinaNyan/kolibri-mistral-0427.gguf

7B • Updated May 13, 2024 • 16 • 1
kaleinaNyan/kolibri-mistral-0427-upd.gguf

7B • Updated May 23, 2024 • 6
kaleinaNyan/kolibri-mistral-0427-upd

Text Generation • 7B • Updated May 23, 2024 • 1 • 1

A series of encoder-transformer models for cheap evaluation of LLMs on the Russian Hard LLM Arena.

kaleinaNyan/jina-v3-rullmarena-judge-041024

0.6B • Updated Oct 9, 2024 • 1 • 1
kaleinaNyan/jina-v3-rullmarena-judge-300924

0.6B • Updated Oct 9, 2024 • 1 • 2
kaleinaNyan/jina-v3-rullmarena-judge

0.6B • Updated Sep 27, 2024 • 8 • 3

A series of English/Russian reasoning models.

kaleinaNyan/eule-qwen2.5instruct-14b-111224

15B • Updated Dec 14, 2024 • 1
kaleinaNyan/eule-qwen2.5instruct-7b-111224

8B • Updated Dec 14, 2024 • 1

