feat(google-genai): add Gemini Text-to-Speech (TTS) implementation #5132

apappascs · 2025-12-21T10:39:47Z

This commit adds Text-to-Speech support for Google GenAI (Gemini) with the following components:

API Client Layer

GeminiTtsApi: Low-level REST client for Gemini TTS API
Support for single-speaker and multi-speaker (conversational) TTS
PCM audio format (s16le, 24kHz, mono)
Request/response POJOs with Jackson annotations
Convenience factory methods for simpler API usage

Model Layer

GeminiTtsModel: Spring AI TextToSpeechModel implementation
GeminiTtsOptions: Builder-based configuration with runtime overrides
Support for 30+ voices across 24+ languages
Prompt-based style control (accent, pace, delivery)

Spring Boot Integration

Auto-configuration with properties binding
Dedicated starter: spring-ai-starter-model-google-genai-tts
Configuration prefix: spring.ai.google.genai.tts
Conditional bean creation based on spring.ai.model.audio.speech property

Thank you for taking time to contribute this pull request!
You might have already read the contributor guide, but as a reminder, please make sure to:

Add a Signed-off-by line to each commit (git commit -s) per the DCO
Rebase your changes on the latest main branch and squash your commits
Add/Update unit tests as needed
Run a build and make sure all tests pass prior to submission

For more details, please check the contributor guide.
Thank you upfront!

This commit adds comprehensive Text-to-Speech support for Google GenAI (Gemini) with the following components: ## API Client Layer - GeminiTtsApi: Low-level REST client for Gemini TTS API - Support for single-speaker and multi-speaker (conversational) TTS - PCM audio format (s16le, 24kHz, mono) - Request/response POJOs with Jackson annotations - Convenience factory methods for simpler API usage ## Model Layer - GeminiTtsModel: Spring AI TextToSpeechModel implementation - GeminiTtsOptions: Builder-based configuration with runtime overrides - Support for 30+ voices across 24+ languages - Prompt-based style control (accent, pace, delivery) ## Spring Boot Integration - Auto-configuration with properties binding - Dedicated starter: spring-ai-starter-model-google-genai-tts - Configuration prefix: spring.ai.google.genai.tts - Conditional bean creation based on spring.ai.model.audio.speech property Signed-off-by: Alexandros Pappas <apappascs@gmail.com>

apappascs · 2025-12-21T10:43:41Z

cc: @ddobrin

apappascs force-pushed the feature/google-genai-tts branch from f5abaa9 to b716b59 Compare December 21, 2025 10:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(google-genai): add Gemini Text-to-Speech (TTS) implementation #5132

feat(google-genai): add Gemini Text-to-Speech (TTS) implementation #5132

apappascs commented Dec 21, 2025

apappascs commented Dec 21, 2025

Labels

1 participant

feat(google-genai): add Gemini Text-to-Speech (TTS) implementation #5132

Are you sure you want to change the base?

feat(google-genai): add Gemini Text-to-Speech (TTS) implementation #5132

Conversation

apappascs commented Dec 21, 2025

API Client Layer

Model Layer

Spring Boot Integration

apappascs commented Dec 21, 2025

Labels

1 participant