Large Multimodal Models(LMM)

Despite its unlimited capacity, generative artificial intelligence (AI) can only do so much due to its environmental perception

Generative AI research after the commonly used Large Language Models (LLMs), including the ChatGPT original model, which could only parse text

Digital assistants and productivity tools will also become much more helpful

Qualcomm Technologies strives to enable multimodal AI on devices

LLaVA, a community-driven LMM with over seven billion parameters, on a Snapdragon 8 Gen 3 Mobile Platform-powered Android phone

Given the multimodal hype, this work is vital. Last week, Microsoft introduced the Phi-3.5 family of visual and linguistic devices

At its Made by Google presentation, Google promoted LMMs and introduced the multimodal input model Gemini Nano

Qualcomm Technologies is partnering with Google and many LMM and LLM manufacturers, including Meta's Llama series