mirror of
https://github.com/home-assistant/home-assistant.io.git
synced 2025-12-10 00:30:02 -06:00
Add Speech-to-Text (stt) to elevenlabs (#40063)
This commit is contained in:
parent
8e3d0bb7af
commit
98b68d8fdb
@ -7,6 +7,7 @@ ha_release: 2024.8
|
|||||||
ha_iot_class: Cloud Polling
|
ha_iot_class: Cloud Polling
|
||||||
ha_domain: elevenlabs
|
ha_domain: elevenlabs
|
||||||
ha_platforms:
|
ha_platforms:
|
||||||
|
- stt
|
||||||
- tts
|
- tts
|
||||||
ha_config_flow: true
|
ha_config_flow: true
|
||||||
ha_integration_type: service
|
ha_integration_type: service
|
||||||
@ -14,22 +15,35 @@ ha_codeowners:
|
|||||||
- '@sorgfresser'
|
- '@sorgfresser'
|
||||||
---
|
---
|
||||||
|
|
||||||
The **ElevenLabs** text-to-speech {% term integrations %} adds support for [ElevenLabs](https://elevenlabs.io/) as text-to-speech to read a text with natural sounding voices.
|
The **ElevenLabs** {% term integrations %} adds support for [ElevenLabs](https://elevenlabs.io/) as text-to-speech to read a text with natural sounding voices and speech-to-text to convert speech into text.
|
||||||
|
|
||||||
## Prerequisites
|
## Prerequisites
|
||||||
|
|
||||||
- You need an ElevenLabs account to use this integration. A free account suffices for basic usage.
|
- You need an ElevenLabs account to use this integration. A free account suffices for basic usage.
|
||||||
- For custom voices or more quota you need a subscription.
|
- For custom voices or more quota you need a subscription.
|
||||||
- You need your API key from the ElevenLabs website.
|
- You need your API key from the ElevenLabs website.
|
||||||
|
- Your API key requires the following permissions:
|
||||||
|
- Text to Speech
|
||||||
|
- Speech to Text
|
||||||
|
- Voices (Read only)
|
||||||
|
- Models (Read only)
|
||||||
|
|
||||||
{% include integrations/config_flow.md %}
|
{% include integrations/config_flow.md %}
|
||||||
|
|
||||||
|
### Text-to-speech
|
||||||
|
|
||||||
For an overview of which languages can be used, check the [complete list of supported languages](https://elevenlabs.io/languages).
|
For an overview of which languages can be used, check the [complete list of supported languages](https://elevenlabs.io/languages).
|
||||||
|
|
||||||
For a description of which voices are available for use, check your VoiceLab voices.
|
For a description of which voices are available for use, check your VoiceLab voices.
|
||||||
|
|
||||||
To see the available models and their benefits, check the [models documentation](https://elevenlabs.io/docs/speech-synthesis/models).
|
To see the available models and their benefits, check the [models documentation](https://elevenlabs.io/docs/speech-synthesis/models).
|
||||||
|
|
||||||
|
### Speech-to-text
|
||||||
|
|
||||||
|
An overview of supported languages is available at [complete list of supported languages](https://elevenlabs.io/docs/capabilities/speech-to-text#supported-languages).
|
||||||
|
|
||||||
|
List of models and their capabilities can be determined at [models documentation](https://elevenlabs.io/docs/capabilities/speech-to-text#models).
|
||||||
|
|
||||||
## Action speak
|
## Action speak
|
||||||
|
|
||||||
The `tts.speak` action is the modern way to use TTS. Add the `speak` action, select the entity for your ElevenLabs TTS (it's named ElevenLabs by default), select the media player entity or group to send the TTS audio to, and enter the message to speak.
|
The `tts.speak` action is the modern way to use TTS. Add the `speak` action, select the entity for your ElevenLabs TTS (it's named ElevenLabs by default), select the media player entity or group to send the TTS audio to, and enter the message to speak.
|
||||||
@ -61,14 +75,25 @@ voice:
|
|||||||
required: false
|
required: false
|
||||||
type: string
|
type: string
|
||||||
model:
|
model:
|
||||||
description: "Model ID for the model to use. Will override the default model of the entity!"
|
description: "Model ID for the text-to-speech model to use. Will override the default model of the entity!"
|
||||||
required: false
|
required: false
|
||||||
type: string
|
type: string
|
||||||
|
Speech-to-text model:
|
||||||
|
description: "Model ID for the speech-to-text model to use. Will override the default model of the entity!"
|
||||||
|
required: false
|
||||||
|
type: string
|
||||||
|
Auto-detect language:
|
||||||
|
description: "Should speech-to-text auto-detect the language spoken, overrides the language selected in voice assistant!"
|
||||||
|
required: false
|
||||||
|
type: boolean
|
||||||
|
default: false
|
||||||
|
|
||||||
{% endconfiguration %}
|
{% endconfiguration %}
|
||||||
|
|
||||||
For more information about using text-to-speech with Home Assistant and more details on all the options it provides, see the [TTS documentation](/integrations/tts/).
|
For more information about using text-to-speech with Home Assistant and more details on all the options it provides, see the [TTS documentation](/integrations/tts/).
|
||||||
|
|
||||||
|
For more information about using speech-to-text with Home Assistant and more details on all the options it provides, see the [STT documentation](/integrations/stt/).
|
||||||
|
|
||||||
## Removing the integration
|
## Removing the integration
|
||||||
|
|
||||||
This integration follows standard integration removal. No extra steps are required.
|
This integration follows standard integration removal. No extra steps are required.
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user