Skip to Content
author name

Le Xue

author title Senior Applied Scientist
My Background

Le Xue is an AI researcher working on multimodal foundation models such as Multimodal LLMs and Multimodal 3D foundation models. He leads AI research for series of projects of xGen-MM(BLIP-3) -- A Family of Open Large Multimodal Models.

My Expertise

His research interests includes Multimodal AI, LLMs, 3D Vision, and Autonomous Agents.

When I'm Not Writing ...

I enjoy traveling, playing the guitar, and working out.

Architecture, Training and Dataset Github Code: https://github.com/JiuhaiChen/BLIP3o Models: https://huggingface.co/BLIP3o/BLIP3o-Model Demo: https://huggingface.co/spaces/BLIP3o/blip-3o Motivation OpenAI’s GPT-4o has demonstrated state-of-the-art performance in image understanding, generation and editing tasks. Emerging hypotheses of its architecture suggest a hybrid…

Get the latest articles in your inbox.