vila - Moozonian Search

GitHub

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

⭐ 3770 | 🍴 315

GitHub

[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"

⭐ 531 | 🍴 45

GitHub

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

⭐ 418 | 🍴 19

GitHub

Vision-Language Navigation Benchmark in Isaac Lab

⭐ 297 | 🍴 25
