Apple VLM: On-Device Performance
A demonstration of Apple's on-device VLM, a quantized Qwen model, evaluating Q&A accuracy, responsiveness to prompting, multilingual visual and text support, and hardware-specific resource use.
Apple has released a VLM that is a quantized, fine-tuned version of Qwen, optimized for Apple Silicon devices running iOS and macOS. I want to show some experiments on when it works and when it fails. For example, how good is it at Q&A? How responsive is it to prompting? Which languages can it handle, both visually and textually? Does resource usage differ across hardware? What tunability does Apple offer by default?