VAMOS: A Hierarchical Vision-Language-Action Model for Capab

This collection (7 items) contains VLM planner checkpoints, affordance module checkpoints for Spot and HOUND, training datasets, and a demo.
This model is a merged LoRA fine-tune of google/paligemma2-3b-pt-224, trained on the mateoguaman/vamos_navigation_only_dataset dataset using TRL.

Note that this model was NOT trained with language annotations, so it is not steerable: it performs only point-based navigation, without preference conditioning.
Coming Soon
This model is a fine-tuned derivative of google/paligemma2-3b-pt-224 and is subject to the Gemma Terms of Use. The training data includes content licensed under CC BY-NC 4.0, so this model and its outputs are provided for non-commercial use only. Please see the accompanying LICENSE and NOTICE files for full details.
Base model: google/paligemma2-3b-pt-224