Real Physical Benchmark
non-profit
AI & ML interests
None defined yet.
Recent Activity
yingmanji authored a paper about 7 hours ago
Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization
leelin authored a paper about 1 month ago
Vision-Language Models Can Self-Improve Reasoning via Reflection
leelin authored a paper about 1 month ago
PaLMR: Towards Faithful Visual Reasoning via Multimodal Process Alignment
Team members: 8
PhysicalBenchmark's datasets
None public yet