Open to Collab

Abidoye Aanuoluwapo

Aanuoluwapo65

https://anuoluwapo65.github.io/

AI & ML interests

Computer vision and multimodal learning

Recent Activity

upvoted a paper about 5 hours ago

NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control

upvoted a paper about 5 hours ago

Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines?

upvoted a paper about 5 hours ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

View all activity

Organizations

upvoted 5 papers about 5 hours ago

NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control

Paper • 2602.09070 • Published 17 days ago • 44

Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines?

Paper • 2602.14111 • Published 11 days ago • 56

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published 14 days ago • 57

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published 17 days ago • 206

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published 2 days ago • 433

upvoted an article about 5 hours ago

Article

Deploying Open Source Vision Language Models (VLM) on Jetson

2 days ago

•

upvoted 5 papers about 24 hours ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published 24 days ago • 136

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published 13 days ago • 145

upvoted an article 1 day ago

Article

Scaling Self Supervised Learning for Histology: introducing Phikon

Oct 31, 2023

•

liked a model 1 day ago

sentence-transformers/static-retrieval-mrl-en-v1

liked a dataset 1 day ago

nisten/battlefield-medic-sharegpt

Viewer • Updated Apr 8, 2025 • 3.33k • 58 • 20

liked a Space 2 days ago

Open VLM Leaderboard

🌎

998

VLMEvalKit Evaluation Results Collection

upvoted an article 3 days ago

Article

What is going on with AlphaFold3?

May 21, 2024

•

liked a Space 5 days ago

4M Demo

⚡

203

4M: Massively Multimodal Masked Modeling

upvoted 3 papers 5 days ago

Agent READMEs: An Empirical Study of Context Files for Agentic Coding

Paper • 2511.12884 • Published Nov 17, 2025 • 26

HeartMuLa: A Family of Open Sourced Music Foundation Models

Paper • 2601.10547 • Published Jan 15 • 44

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 147

Abidoye Aanuoluwapo

AI & ML interests

Recent Activity

Organizations

Aanuoluwapo65's activity

Deploying Open Source Vision Language Models (VLM) on Jetson

Scaling Self Supervised Learning for Histology: introducing Phikon

Open VLM Leaderboard

What is going on with AlphaFold3?

4M Demo