Multimodal: Audio, Image, Video Processing
Detailed explanation of multimodal capability in ADK Go—how to let Agent process non-text content like audio, images, video.
Table of Contents
Multimodal: Audio, Image, Video Processing
Modern AI is not just text—images, audio, video can all be understood. ADK Go supports multimodal input, letting Agent process richer content.
