
title: Text to Speech
description: Convert text into natural-sounding speech using advanced AI voice synthesis models.
url: /ai/docs/tts

Text to Speech

The Text to Speech (TTS) demo application transforms written text into high-quality spoken audio. It leverages state-of-the-art AI models to generate lifelike voices in various languages and styles.

Features

Discover the powerful capabilities of this AI-powered voice synthesis solution:

  • Access a wide range of voices from providers like [ElevenLabs](https://elevenlabs.io/), including different accents, ages, and emotional tones, to find the right match for your content.
  • Experience near-instantaneous audio generation with streaming delivery, so you hear results as soon as the audio starts to arrive.
  • Enjoy a full-featured playback interface with controls for playback speed and options to download generated audio files.
  • Fine-tune your audio output with adjustable parameters for pitch, speed, and pauses to get a more natural delivery (available options vary by provider).
  • Benefit from an interface designed to make turning text into speech straightforward, even for first-time users.

AI models

This application primarily uses specialized text-to-speech models from ElevenLabs.


For comprehensive information about available voices and advanced customization techniques, consult the ElevenLabs SDK documentation.

Structure

The Text to Speech feature is split across several packages and apps in the monorepo, so the core logic, the API, and each client can evolve independently:

Core

The @turbostarter/ai package (packages/ai) contains the essential logic under modules/tts:

  • Comprehensive types, validation schemas, and constants specific to TTS functionality
  • Core API logic for processing text-to-speech requests and interfacing with AI models
  • Robust handling of generated audio file uploads to cloud storage
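
As a rough illustration of what "types, validation schemas, and constants" can look like for a TTS request, here is a minimal, dependency-free sketch. Every name in it (`TtsRequest`, `validateTtsRequest`, `MAX_TEXT_LENGTH`, the `speed` range) is an assumption made for this example, not the actual contents of modules/tts, which may well use a schema library instead of hand-written checks.

```typescript
// Hypothetical constant: the real limit (if any) lives in packages/ai.
const MAX_TEXT_LENGTH = 5000;

// Hypothetical request shape for a TTS generation call.
interface TtsRequest {
  text: string;
  voiceId: string;
  speed: number;
}

type ValidationResult =
  | { kind: "valid"; value: TtsRequest }
  | { kind: "invalid"; errors: string[] };

// Checks an untrusted payload and collects every problem it finds.
function validateTtsRequest(input: unknown): ValidationResult {
  const errors: string[] = [];
  const data = (typeof input === "object" && input !== null ? input : {}) as Record<string, unknown>;

  const rawText = data["text"];
  const text = typeof rawText === "string" ? rawText.trim() : "";
  if (text.length === 0) {
    errors.push("text is required");
  } else if (text.length > MAX_TEXT_LENGTH) {
    errors.push("text exceeds " + MAX_TEXT_LENGTH + " characters");
  }

  const rawVoice = data["voiceId"];
  const voiceId = typeof rawVoice === "string" ? rawVoice : "";
  if (voiceId.length === 0) {
    errors.push("voiceId is required");
  }

  // Speed is optional and defaults to 1; the 0.5 to 2 range is an assumption.
  const rawSpeed = data["speed"] === undefined ? 1 : data["speed"];
  let speed = 1;
  if (typeof rawSpeed !== "number" || !(rawSpeed >= 0.5 && rawSpeed <= 2)) {
    errors.push("speed must be a number between 0.5 and 2");
  } else {
    speed = rawSpeed;
  }

  if (errors.length > 0) {
    return { kind: "invalid", errors };
  }
  return { kind: "valid", value: { text, voiceId, speed } };
}
```

Returning a tagged result rather than throwing lets the API layer decide how to turn validation failures into an HTTP response.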

API

The packages/api package defines the backend API endpoints using Hono:

  • src/modules/ai/tts/tts.router.ts: implements Hono RPC routes for TTS generation, handles input validation, applies critical middleware (authentication, credit management), and invokes the core functionality from @turbostarter/ai.
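
The flow described for the router (authentication, input validation, credit management, then a call into the core package) can be sketched without any framework. The snippet below is not Hono code and not the project's middleware: `Session`, `CREDIT_COST`, `handleTtsRequest`, the status codes, and the order of the checks are all assumptions chosen to show one plausible arrangement. The `synthesize` callback stands in for the core functionality from @turbostarter/ai.

```typescript
// Hypothetical session shape produced by an authentication step.
interface Session {
  userId: string;
  credits: number;
}

type TtsResponse<T> =
  | { status: 200; audio: T; creditsLeft: number }
  | { status: 400 | 401 | 402; error: string };

// Assumed flat cost per generation; real pricing may depend on text length or model.
const CREDIT_COST = 1;

function handleTtsRequest<T>(
  session: Session | null,
  body: { text?: string; voiceId?: string },
  synthesize: (text: string, voiceId: string) => T,
): TtsResponse<T> {
  // 1. Authentication: reject anonymous callers before doing any work.
  if (session === null) {
    return { status: 401, error: "unauthenticated" };
  }

  // 2. Input validation: both fields must be present and non-empty.
  const text = (body.text ?? "").trim();
  const voiceId = body.voiceId ?? "";
  if (text.length === 0 || voiceId.length === 0) {
    return { status: 400, error: "text and voiceId are required" };
  }

  // 3. Credit management: make sure the caller can pay for the generation.
  if (session.credits < CREDIT_COST) {
    return { status: 402, error: "insufficient credits" };
  }

  // 4. Core call: credits are deducted only after synthesis returns,
  //    so a thrown error leaves the balance untouched.
  const audio = synthesize(text, voiceId);
  session.credits -= CREDIT_COST;
  return { status: 200, audio, creditsLeft: session.credits };
}
```

Because `synthesize` is injected, the same function can be exercised in tests with a fake, while a real handler would pass a function that calls the TTS provider (typically returning a promise or a stream).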

Web

The Next.js application (apps/web) provides the user interface:

  • src/app/[locale]/(apps)/tts/**: contains the Next.js App Router pages and layouts for the TTS experience
  • src/components/tts/**: houses reusable React components specific to the TTS interface (text input area, voice selector, audio player, etc.)

Mobile

The Expo/React Native application (apps/mobile) provides the native mobile experience:

  • src/app/tts/**: defines the screens for the mobile TTS interface
  • src/components/tts/**: contains React Native components optimized for the mobile experience
  • API interaction: utilizes the same Hono RPC client (packages/api) as the web app for consistent communication with the backend

Because the web and mobile apps talk to the backend through the same Hono RPC client, both platforms share a single API contract, while each keeps a UI implementation tailored to its environment.
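
To make the idea of a shared client concrete, the sketch below shows one request builder reused by two callers. It is a dependency-free stand-in, not the Hono RPC client the project uses; the route path `/api/ai/tts`, the `TtsInput` shape, and `createTtsClient` are hypothetical names introduced only for illustration.

```typescript
// Hypothetical input shape; the real contract is defined in packages/api.
interface TtsInput {
  text: string;
  voiceId: string;
}

interface HttpRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Hypothetical route path, chosen for this example only.
const TTS_PATH = "/api/ai/tts";

// Builds requests against one backend; web and mobile would both import this.
function createTtsClient(baseUrl: string) {
  // Strip trailing slashes so "https://host/" and "https://host" behave the same.
  const root = baseUrl.replace(/\/+$/, "");
  return {
    buildRequest(input: TtsInput): HttpRequest {
      return {
        url: root + TTS_PATH,
        method: "POST",
        headers: { "content-type": "application/json" },
        body: JSON.stringify(input),
      };
    },
  };
}

// Two clients, as a web app and a mobile app might create them.
const webClient = createTtsClient("https://example.com/");
const mobileClient = createTtsClient("https://example.com");
```

Since both clients come from the same factory, a change to the route or payload shape is made in one place and picked up by every platform at compile time.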