Google Gemini API Documentation

The Google AI Studio and Gemini API documentation walks developers through multimodal capabilities that process text, images, audio, and video in a single request. The guides cover grounding with Google Search, structured output generation, context caching for long documents, and system instructions. The documentation is useful for understanding how multimodal AI differs from text-only approaches.
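
To make "text and an image in a single request" concrete, here is a minimal Python sketch that only builds a request body and sends nothing over the network. The field names (`contents`, `parts`, `inline_data`, `mime_type`, `system_instruction`) follow the publicly documented REST format for `generateContent` as best understood here; treat them as assumptions and check them against the current API reference. The helper name `build_multimodal_request` is hypothetical and is not part of any Google library.

```python
import base64
import json


def build_multimodal_request(prompt, image_bytes, mime_type="image/png",
                             system_instruction=None):
    """Return a JSON-serializable body that pairs a text prompt with an inline image.

    Field names are assumed from the public REST format; verify before use.
    """
    body = {
        "contents": [
            {
                "parts": [
                    # Text part of the prompt.
                    {"text": prompt},
                    # Image part: raw bytes are base64-encoded so they fit in JSON.
                    {
                        "inline_data": {
                            "mime_type": mime_type,
                            "data": base64.b64encode(image_bytes).decode("ascii"),
                        }
                    },
                ]
            }
        ]
    }
    # Optional system instruction, kept separate from the user content.
    if system_instruction is not None:
        body["system_instruction"] = {"parts": [{"text": system_instruction}]}
    return body


if __name__ == "__main__":
    payload = build_multimodal_request(
        "Describe this image.",
        b"placeholder image bytes",  # stand-in data, not a real image
        system_instruction="Answer in one sentence.",
    )
    print(json.dumps(payload, indent=2))
```

The point of the sketch is the shape of the payload: one `parts` list carries both modalities, which is what distinguishes a multimodal request from a text-only one where the list would hold a single text entry.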