Add Speech-to-Text (stt) to elevenlabs (#40063)

2025-12-10 00:30:02 -06:00 · 2025-10-10 09:01:33 -06:00 · 2025-10-10 09:01:33 -06:00 · 98b68d8fdb
commit 98b68d8fdb
parent 8e3d0bb7af
1 changed files with 27 additions and 2 deletions
--- a/source/_integrations/elevenlabs.markdown
+++ b/source/_integrations/elevenlabs.markdown
@ -7,6 +7,7 @@ ha_release: 2024.8
 ha_iot_class: Cloud Polling
 ha_domain: elevenlabs
 ha_platforms:
  - stt
  - tts
 ha_config_flow: true
 ha_integration_type: service
@ -14,22 +15,35 @@ ha_codeowners:
  - '@sorgfresser'
 ---
-The **ElevenLabs** text-to-speech {% term integrations %} adds support for [ElevenLabs](https://elevenlabs.io/) as text-to-speech to read a text with natural sounding voices.
+The **ElevenLabs** {% term integrations %} adds support for [ElevenLabs](https://elevenlabs.io/) as text-to-speech to read a text with natural sounding voices and speech-to-text to convert speech into text.
 ## Prerequisites
 - You need an ElevenLabs account to use this integration. A free account suffices for basic usage.
 - For custom voices or more quota you need a subscription.
 - You need your API key from the ElevenLabs website.
 - Your API key requires the following permissions:
  - Text to Speech
  - Speech to Text
  - Voices (Read only)
  - Models (Read only)
 {% include integrations/config_flow.md %}
 ### Text-to-speech
 For an overview of which languages can be used, check the [complete list of supported languages](https://elevenlabs.io/languages).
 For a description of which voices are available for use, check your VoiceLab voices.
 To see the available models and their benefits, check the [models documentation](https://elevenlabs.io/docs/speech-synthesis/models).
 ### Speech-to-text
 An overview of supported languages is available at [complete list of supported languages](https://elevenlabs.io/docs/capabilities/speech-to-text#supported-languages).
 List of models and their capabilities can be determined at [models documentation](https://elevenlabs.io/docs/capabilities/speech-to-text#models).
 ## Action speak
 The `tts.speak` action is the modern way to use TTS. Add the `speak` action, select the entity for your ElevenLabs TTS (it's named ElevenLabs by default), select the media player entity or group to send the TTS audio to, and enter the message to speak.
@ -61,14 +75,25 @@ voice:
  required: false
  type: string
 model:
-  description: "Model ID for the model to use. Will override the default model of the entity!"
+  description: "Model ID for the text-to-speech model to use. Will override the default model of the entity!"
  required: false
  type: string
 Speech-to-text model:
  description: "Model ID for the speech-to-text model to use. Will override the default model of the entity!"
  required: false
  type: string
 Auto-detect language:
  description: "Should speech-to-text auto-detect the language spoken, overrides the language selected in voice assistant!"
  required: false
  type: boolean
  default: false
 {% endconfiguration %}
 For more information about using text-to-speech with Home Assistant and more details on all the options it provides, see the [TTS documentation](/integrations/tts/).
 For more information about using speech-to-text with Home Assistant and more details on all the options it provides, see the [STT documentation](/integrations/stt/).
 ## Removing the integration
 This integration follows standard integration removal. No extra steps are required.