📚Traditional Chinese Translation Dataset 收集繁體中文在語言模型上存在多國語言翻譯的資料集,例如:中轉英、中轉越南等。繁體中文與東亞、東南亞關係密切,需考量未來延展性 Heng666/MultiCCAligned-TW-Corpus Viewer • Updated Feb 19, 2024 • 3.13M • 8 • 6 Heng666/OpenSubtitles-TW-Corpus Viewer • Updated Feb 20, 2024 • 7.22M • 9 • 3 Heng666/TED2020-TW-Corpus Viewer • Updated Feb 2, 2024 • 1.74M • 15 • 5
Taiwan-Patent-Series 專注解決台灣專利用於語言模型上應用 Heng666/Taiwan-patent-qa-eval Viewer • Updated Feb 28, 2024 • 192 • 44 • 2 Heng666/Taiwan-patent-qa Viewer • Updated Feb 28, 2024 • 1.22k • 28 • 5 Heng666/Taiwan-patent-corpus Viewer • Updated Feb 28, 2024 • 28 • 3 • 1
Asis_LLM_LeaderBoard Runtime error Featured 562 Open Ko-LLM Leaderboard 📉 562 Explore and filter language model benchmark results Runtime error 56 Open Multilingual Llm Leaderboard 🐨 56 Search for model performance across languages and benchmarks Running 9 Leaderboard / SeaEval 🥇 9 Explore NLP leaderboard metrics
Runtime error Featured 562 Open Ko-LLM Leaderboard 📉 562 Explore and filter language model benchmark results
Runtime error 56 Open Multilingual Llm Leaderboard 🐨 56 Search for model performance across languages and benchmarks
Traditional_Chinese_Aya_series 繁體中文化 Heng666/Traditional_Chinese-aya_collection Viewer • Updated Feb 19, 2024 • 2.02M • 291 • 8 Heng666/Traditional_Chinese-aya_dataset Viewer • Updated Feb 19, 2024 • 4.91k • 103 • 4 Heng666/Traditional_Chinese-aya_evaluation_suite Viewer • Updated Feb 19, 2024 • 650 • 13 • 3
Taiwan-pretrain-llm-zh_tw-corpus 本清冊收集用於訓練 繁體中文資料的資料集。特別適合需要自行訓練語言模型者使用 bigscience-data/roots_zh-tw_wikipedia Viewer • Updated Dec 12, 2022 • 197k • 16 • 12 erhwenkuo/wikinews-zhtw Viewer • Updated Oct 10, 2023 • 9.83k • 42 • 5 graelo/wikipedia Viewer • Updated Sep 10, 2023 • 105M • 2.47k • 71 botp/yentinglin-zh_TW_c4 Viewer • Updated Aug 16, 2023 • 5.18M • 68 • 7
translation traintogpb/aihub-flores-koen-integrated-prime-small-30k Viewer • Updated May 23, 2024 • 33.4k • 53 • 8 UdS-LSV/menyo20k_mt Updated Jan 18, 2024 • 59 • 3 ryo0634/bsd_ja_en Viewer • Updated Jan 11, 2024 • 24.2k • 62 • 12 albertvillanova/sat Updated Oct 24, 2022 • 72
traintogpb/aihub-flores-koen-integrated-prime-small-30k Viewer • Updated May 23, 2024 • 33.4k • 53 • 8
📚Traditional Chinese Translation Dataset 收集繁體中文在語言模型上存在多國語言翻譯的資料集,例如:中轉英、中轉越南等。繁體中文與東亞、東南亞關係密切,需考量未來延展性 Heng666/MultiCCAligned-TW-Corpus Viewer • Updated Feb 19, 2024 • 3.13M • 8 • 6 Heng666/OpenSubtitles-TW-Corpus Viewer • Updated Feb 20, 2024 • 7.22M • 9 • 3 Heng666/TED2020-TW-Corpus Viewer • Updated Feb 2, 2024 • 1.74M • 15 • 5
Traditional_Chinese_Aya_series 繁體中文化 Heng666/Traditional_Chinese-aya_collection Viewer • Updated Feb 19, 2024 • 2.02M • 291 • 8 Heng666/Traditional_Chinese-aya_dataset Viewer • Updated Feb 19, 2024 • 4.91k • 103 • 4 Heng666/Traditional_Chinese-aya_evaluation_suite Viewer • Updated Feb 19, 2024 • 650 • 13 • 3
Taiwan-Patent-Series 專注解決台灣專利用於語言模型上應用 Heng666/Taiwan-patent-qa-eval Viewer • Updated Feb 28, 2024 • 192 • 44 • 2 Heng666/Taiwan-patent-qa Viewer • Updated Feb 28, 2024 • 1.22k • 28 • 5 Heng666/Taiwan-patent-corpus Viewer • Updated Feb 28, 2024 • 28 • 3 • 1
Taiwan-pretrain-llm-zh_tw-corpus 本清冊收集用於訓練 繁體中文資料的資料集。特別適合需要自行訓練語言模型者使用 bigscience-data/roots_zh-tw_wikipedia Viewer • Updated Dec 12, 2022 • 197k • 16 • 12 erhwenkuo/wikinews-zhtw Viewer • Updated Oct 10, 2023 • 9.83k • 42 • 5 graelo/wikipedia Viewer • Updated Sep 10, 2023 • 105M • 2.47k • 71 botp/yentinglin-zh_TW_c4 Viewer • Updated Aug 16, 2023 • 5.18M • 68 • 7
Asis_LLM_LeaderBoard Runtime error Featured 562 Open Ko-LLM Leaderboard 📉 562 Explore and filter language model benchmark results Runtime error 56 Open Multilingual Llm Leaderboard 🐨 56 Search for model performance across languages and benchmarks Running 9 Leaderboard / SeaEval 🥇 9 Explore NLP leaderboard metrics
Runtime error Featured 562 Open Ko-LLM Leaderboard 📉 562 Explore and filter language model benchmark results
Runtime error 56 Open Multilingual Llm Leaderboard 🐨 56 Search for model performance across languages and benchmarks
translation traintogpb/aihub-flores-koen-integrated-prime-small-30k Viewer • Updated May 23, 2024 • 33.4k • 53 • 8 UdS-LSV/menyo20k_mt Updated Jan 18, 2024 • 59 • 3 ryo0634/bsd_ja_en Viewer • Updated Jan 11, 2024 • 24.2k • 62 • 12 albertvillanova/sat Updated Oct 24, 2022 • 72
traintogpb/aihub-flores-koen-integrated-prime-small-30k Viewer • Updated May 23, 2024 • 33.4k • 53 • 8