tokyotech-llm

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

s-mizuki-nlp updated a model 4 days ago

tokyotech-llm/Qwen3-Swallow-32B-CPT-v0.2

s-mizuki-nlp updated a model 4 days ago

tokyotech-llm/Qwen3-Swallow-30B-A3B-CPT-v0.2

s-mizuki-nlp updated a model 4 days ago

tokyotech-llm/Qwen3-Swallow-8B-CPT-v0.2

View all activity

Organization Card

Community About org cards

Swallow LLM

Research and development of large language models conducted by the members mainly in Okazaki Laboratory and Yokota Laboratory at Institute of Science Tokyo (formerly known as Tokyo Institute of Technology)

From Okazaki Laboratory, Institute of Science Tokyo, the following members:
- Naoaki Okazaki
- Sakae Mizuki
- Youmi Ma
- Sangwhan Moon
- Koki Maeda
- Masanari Ohi
- Hinari Shimada
- Taihei Shiotani
- Koshiro Saito
- Tatsuya Ichinose
- Naoya Matsushita
- Sora Miyamoto
- Nguyen Tien Dung
- Yuta Katayama
From YOKOTA Laboratory, Institute of Science Tokyo, the following members:
- Rio Yokota
- Kazuki Fujii
- Taishi Nakamura
- Takumi Okamoto
- Ishida Shigeki
- Masaki Kawamura
- Yukito Tajima
From Artificial Intelligence Research Center, AIST, Japan, the following members:
- Hiroya Takamura

Collections 16

View 16 collections

models 132

tokyotech-llm/Qwen3-Swallow-8B-SFT-v0.2

Text Generation • 8B • Updated 4 days ago • 7.02k • 4

tokyotech-llm/Qwen3-Swallow-32B-CPT-v0.2

Text Generation • 33B • Updated 4 days ago • 216 • 1

tokyotech-llm/Qwen3-Swallow-30B-A3B-CPT-v0.2

Text Generation • 31B • Updated 4 days ago • 402

tokyotech-llm/Qwen3-Swallow-8B-CPT-v0.2

Text Generation • 8B • Updated 4 days ago • 534 • 1

tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2-AWQ-INT4

Text Generation • 33B • Updated 4 days ago • 424 • 1

tokyotech-llm/Qwen3-Swallow-30B-A3B-RL-v0.2-AWQ-INT4

Text Generation • 31B • Updated 4 days ago • 496

tokyotech-llm/Qwen3-Swallow-8B-RL-v0.2-AWQ-INT4

Text Generation • 8B • Updated 4 days ago • 720

tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2

Text Generation • 33B • Updated 4 days ago • 695 • 1

tokyotech-llm/Qwen3-Swallow-30B-A3B-RL-v0.2

Text Generation • 31B • Updated 4 days ago • 701 • 5

tokyotech-llm/Qwen3-Swallow-8B-RL-v0.2

Text Generation • 8B • Updated 4 days ago • 2.33k • 2

View 132 models

datasets 19

tokyotech-llm/Swallow-Nemotron-Post-Training-Dataset-v1

Viewer • Updated 6 days ago • 8.84M • 489 • 2

tokyotech-llm/lmsys-chat-1m-synth

Updated 8 days ago • 833 • 20

tokyotech-llm/s1-test-time-scaling-synth-public

Viewer • Updated 8 days ago • 59k • 16

tokyotech-llm/swallow-code-v2

Viewer • Updated Nov 8, 2025 • 147M • 174k • 32

tokyotech-llm/swallow-math-v2

Viewer • Updated Nov 6, 2025 • 17.4M • 5.23k • 27

tokyotech-llm/swallow_english_mt_bench

Viewer • Updated Aug 18, 2025 • 80 • 216

tokyotech-llm/MMLU-ProX-English

Updated Aug 18, 2025 • 323

tokyotech-llm/MMLU-Pro-English

Updated Aug 18, 2025 • 523

tokyotech-llm/MMLU-ProX-Japanese

Updated Aug 18, 2025 • 610

tokyotech-llm/JEMHopQA

Viewer • Updated Aug 8, 2025 • 3.78k • 247

View 19 datasets