Moozonian

💻 Developer Nexus: vila

GitHub

NVlabs/VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

⭐ 3770 | 🍴 315

AnjieCheng/NaVILA

[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"

⭐ 531 | 🍴 45

mit-han-lab/vila-u

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

⭐ 418 | 🍴 19

yang-zj1026/NaVILA-Bench

Vision-Language Navigation Benchmark in Isaac Lab

⭐ 297 | 🍴 25