
Implement Gemini Multimodal Interaction

Inspect the original prompt language first, then copy or adapt it once you know how it fits your workflow.

Linked challenge: Gemini-powered Voice Navigator Agent

Format: Text-first
Lines: 1
Sections: 1

Prompt source

Original prompt text with formatting preserved for inspection.

1 line
1 section
No variables
0 checklist items
Extend your Google ADK agent to use Gemini 1.5 Pro for processing user utterances and generating responses. Your agent should be able to interpret natural language commands related to navigation and safety. Show how you would configure the ADK to use Gemini as the primary LLM for reasoning and how to pass a user's voice input to Gemini and then use Gemini's response to drive the `speak` tool. Also, outline how you would pass environmental context (like current location and activity) to Gemini.
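The wiring the prompt asks for can be sketched without the ADK itself. The sketch below is a minimal, library-free illustration of the flow: serialize environmental context, combine it with the transcribed utterance, send that to the model, and route the reply to a `speak` tool. The function names (`speak`, `build_context`, `handle_utterance`) and the stubbed model are hypothetical; in a real Google ADK agent you would instead set the agent's `model` parameter to a Gemini model ID and register `speak` in its tools list.

```python
# Hypothetical, library-free sketch of the prompt's flow.
# A real implementation would call Gemini via google-adk or the
# Gemini API; the model call is stubbed here so the wiring is testable.

spoken = []  # captures what the speak tool emits


def speak(text: str) -> None:
    """The agent's text-to-speech tool (stubbed: records output)."""
    spoken.append(text)


def build_context(location: str, activity: str) -> str:
    """Serialize environmental context for inclusion in the model prompt."""
    return f"Current location: {location}. Current activity: {activity}."


def handle_utterance(utterance: str, context: str, llm) -> str:
    """Send the transcribed voice input plus context to the model,
    then drive the speak tool with the model's reply."""
    prompt = (
        "You are a voice navigation and safety assistant.\n"
        f"{context}\n"
        f"User said: {utterance}"
    )
    reply = llm(prompt)
    speak(reply)
    return reply


# Stub standing in for a Gemini call.
def fake_gemini(prompt: str) -> str:
    if "crosswalk" in prompt:
        return "Stop: you are approaching a crosswalk."
    return "Continue straight for 200 meters."


ctx = build_context("Main St and 3rd Ave", "walking toward a crosswalk")
handle_utterance("What's ahead of me?", ctx, llm=fake_gemini)
print(spoken[-1])  # → Stop: you are approaching a crosswalk.
```

The design point worth keeping when adapting this: context is injected into the prompt on every turn, so the model always reasons over the user's current location and activity rather than a stale snapshot.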

Adaptation plan

Keep the source stable, then change the prompt in a predictable order so the next run is easier to evaluate.

Keep stable

Hold the task contract and output shape stable so generated implementations remain comparable.

Tune next

Update libraries, interfaces, and environment assumptions to match the stack you actually run.

Verify after

Test failure handling, edge cases, and any code paths that depend on hidden context or secrets.
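The failure-handling step above can be made concrete with a small check. The sketch below is hypothetical (the `safe_handle` wrapper and `FALLBACK` string are not part of any library): it verifies that a model outage or timeout produces a safe spoken fallback instead of propagating an exception into the voice loop.

```python
# Hypothetical sketch: verify the agent fails safe when the model
# call raises, rather than leaving the user without a response.

FALLBACK = "Sorry, I could not process that. Please repeat."


def safe_handle(utterance: str, llm) -> str:
    """Wrap the model call so errors never reach the voice loop."""
    try:
        return llm(utterance)
    except Exception:
        return FALLBACK  # degrade to a fixed, safe spoken message


def broken_llm(prompt: str) -> str:
    raise TimeoutError("model unavailable")


# Edge cases worth covering: model outage, empty utterance.
assert safe_handle("Where am I?", broken_llm) == FALLBACK
assert safe_handle("", broken_llm) == FALLBACK
print("failure-handling checks passed")
```

For a navigation and safety agent, the fallback message matters as much as the happy path: a silent failure mid-crossing is itself a safety bug, so treat the fallback text as part of the task contract you hold stable.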