AImpact

Moondream 1: the 1.6B VLM that runs on Raspberry Pi

In one sentence: Moondream is a 1.6B-parameter vision-language model (VLM) capable of image captioning, visual question answering (VQA), and object detection on edge hardware such as Raspberry Pi boards and Android smartphones.


Until 2024, models that could see and describe images typically required powerful GPUs and cloud infrastructure. Moondream showed that this is not necessarily the case: with just 1.6 billion parameters, it runs on a Raspberry Pi, an Android phone, or a laptop without a dedicated GPU. It can describe images, answer questions about them, and locate objects, all locally, without sending data to any server. This opens the door to visual intelligence in everyday devices.
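
To give a rough sense of why a 1.6-billion-parameter model can fit on small devices, the sketch below estimates the memory needed for the weights alone at a few common numeric precisions. The bytes-per-parameter values are generic assumptions, not confirmed details of how Moondream is distributed, and the estimate ignores activations, the image encoder's working memory, and runtime overhead.

```python
# Back-of-the-envelope weight-memory estimate for a 1.6B-parameter model.
# Assumption: the precisions below are generic numeric formats chosen for
# illustration; the article does not state which format Moondream ships in.

PARAMS = 1.6e9  # parameter count stated in the article

# Hypothetical storage widths, in bytes per parameter.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "float16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Return the storage needed for the weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    for fmt, width in BYTES_PER_PARAM.items():
        print(f"{fmt:>8}: {weight_memory_gb(PARAMS, width):.1f} GB")
```

Under these assumptions the weights take roughly 3.2 GB at 16-bit precision and under 1 GB at 4-bit precision, which is the order of magnitude that makes devices with a few gigabytes of RAM plausible targets.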

Companies

vikhyatk (independent)

Tools

Moondream 1, Moondream 2

Tags

Edge AI, VLM, Tiny Model, On-Device, VQA
