Multimodal AI
AI systems that can process and generate multiple types of data — text, images, audio, and video — within a single model. Multimodal models like GPT-4o can analyze images, generate text, understand speech, and produce audio responses in a unified conversation.