zd21
/

GLM-Z1-9B-0414-TDRM

Model card Files Files and versions

README.md exists but content is empty.

Downloads last month: 1

Safetensors

Model size

9B params

Tensor type

BF16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including zd21/GLM-Z1-9B-0414-TDRM

TDRM

Learning Smooth Reward Models with Temporal Difference for LLM RL and Inference • 14 items • Updated 4 days ago • 2