instruction-pretrain/instruction-synthesizer Text Generation • 7B • Updated Mar 1, 2025 • 53 • 79
Running 131 TxT360: Trillion Extracted Text 📖 131 Explore and analyze the TxT360 dataset for LLM pre-training