[EMNLP'25] A Benchmark for Assessing VLM Safety with Real-World Memes
DongGeon Lee
oneonlee
AI & ML interests
Data-centric natural language processing, AI Safety
Recent Activity
upvoted a collection about 5 hours ago: COMPASS
authored a paper about 11 hours ago: Everyday Physics in Korean Contexts: A Culturally Grounded Physical Reasoning Benchmark
authored a paper about 11 hours ago: Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures