Audio generation step by step screenshot

This project is not covered by Drupal’s security advisory policy.

Leverages either Google TTS or AI Text to Speech (e.g. Gemini, ElevenLabs, OpenAI, etc) to generate long audio files using a curated view mode controlled version of your content item using natural voice.

Features

  1. Provides a 'Save and generate audio' button next to save to allow content editors to trigger generation on demand when they have made a notable change.
  2. Runs via batch with chunking breaking at natural pauses (between paragraphs where possible, otherwise sentences) to ensure smooth audio when combining the per chunk audio clips.
  3. Handles any length of content.
  4. Provides voice control (dictated by e.g. OpenAI, Gemini, or other AI Provider options) or via Google TTS options, including language selection like en-GB instead of en-US.
  5. Allows SSML pronunciation control dictionary management (basic) when using Google TTS (or basic phonetical find and replace for non SSML AI generated voices).
  6. Text being read has fine-grained controlled via View Mode and template
  7. Automated test coverage for reliability and stability. Security coverage opt-in once stable release made.

Post-Installation

See the README.md file (to be transferred here eventually).

Optionally composer require google/cloud-text-to-speech if you prefer to use Google TTS; it is cheaper for volume but sounds less natural.

Similar projects

Supporting organizations: 
Initial development and maintenance

Project information

Releases