Deepmind

company

https://deepmind.com/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

nielsr submitted a paper about 20 hours ago

MonkeyOCRv2: A Visual-Text Foundation Model for Document AI

nielsr submitted a paper 2 days ago

Xiaomi-Robotics-U0: Unified Embodied Synthesis with World Foundation Model

nielsr submitted a paper 7 days ago

Single-Rollout Asynchronous Optimization for Agentic Reinforcement Learning

View all activity

Papers

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

Unified Neural Scaling Laws

View all Papers

deepmind 's papers 28

Submitted by

taesiri

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

deepmind

Submitted by

Ethan Caballero

Unified Neural Scaling Laws

deepmind

Submitted by

Jihwan Kim

LiteFrame: Efficient Vision Encoders Unlock Frame Scaling in Video LLMs

deepmind

Submitted by

taesiri

Context Training with Active Information Seeking

deepmind

Submitted by

taesiri

ProEval: Proactive Failure Discovery and Efficient Performance Estimation for Generative AI Evaluation

deepmind

Submitted by

taesiri

ELT: Elastic Looped Transformers for Visual Generation

deepmind

Submitted by

Allen Nie

Understanding the Challenges in Iterative Generative Optimization with LLMs

deepmind

Submitted by

Yu

Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

deepmind

Submitted by

taesiri

Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD

deepmind

Submitted by

taesiri

A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

deepmind

Submitted by

Vladimir Kulikov

Versatile Editing of Video Content, Actions, and Dynamics without Training

deepmind

Submitted by

Allen Nie

POLCA: Stochastic Generative Optimization with LLM

deepmind

Submitted by

taesiri

Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models

deepmind

Submitted by

Junyi Zhang

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

deepmind

Submitted by

Rishabh Kabra

A Mixed Diet Makes DINO An Omnivorous Vision Encoder

deepmind

2

Submitted by

Ziyi Wu

360Anything: Geometry-Free Lifting of Images and Videos to 360°

deepmind

Submitted by

taesiri

Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

deepmind

Submitted by

Fu-Yun Wang

Image Diffusion Preview with Consistency Solver

deepmind

Submitted by

taesiri

The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality

deepmind

Submitted by

taesiri

Evaluating Gemini Robotics Policies in a Veo World Simulator

deepmind

Submitted by

taesiri

SIMA 2: A Generalist Embodied Agent for Virtual Worlds

deepmind

Submitted by

taesiri

Robot Learning from a Physical World Model

deepmind

Submitted by

Tyler Zhu

Dynamic Reflections: Probing Video Representations with Text Alignment

deepmind

2

Submitted by

Kaizhao Liang

Cautious Weight Decay

deepmind

Submitted by

Ming Zhong

Vibe Checker: Aligning Code Evaluation with Human Preference

deepmind

2

Submitted by

Nilesh Gupta

Scalable In-context Ranking with Generative Models

deepmind

Submitted by

taesiri

Video models are zero-shot learners and reasoners

deepmind

Submitted by

Sam Motamed

Do generative video models learn physical principles from watching videos?

deepmind