arxiv:2602.18633
Taiwei Shi
MaksimSTW
AI & ML interests
reinforcement learning, alignment, human-AI collaboration, and computational social science
Recent Activity
authored
a paper
about 14 hours ago
DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning authored
a paper
15 days ago
Experiential Reinforcement Learning