eshmoideas's Collections Training
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper
• 2509.02479
• Published • 84
scikit-learn/sklearn-transformers
Text Classification
• Updated • 25
keras-io/swin-transformers
Image Classification
• Updated • 22
• 4
keras-io/structured-data-classification-grn-vsn
Tabular Classification
• Updated • 25
• 9
keras-io/timeseries_transformer_classification
Time Series Forecasting
• Updated • 20
• 13
nvidia/Llama-4-Maverick-17B-128E-Eagle3
Updated • 242
• 9
nvidia/DeepSeek-R1-0528-NVFP4
Text Generation
• 397B • Updated • 6.53k
• 42
EnvX: Agentize Everything with Agentic AI
Paper
• 2509.08088
• Published • 8
MachineLearningLM/MachineLearningLM-7B-v1
Text Generation
• 8B • Updated • 40
• 14
mradermacher/MachineLearningLM-7B-v1-GGUF
8B • Updated • 196
• 5
nvidia/DirectDiscriminativeOptimization
Text Classification
• 73B • Updated • 31
• 10
Qwen/WorldPM-72B-UltraFeedback
Text Classification
• 73B • Updated • 156
• 7
Qwen/WorldPM-72B-HelpSteer2
Text Classification
• 73B • Updated • 273
• 10
Text Classification
• 73B • Updated • 45
• 81