Skip to main content

Getting Started

Overview

Our WebSocket API offers real-time audio streaming capabilities for text-to-speech (TTS) synthesis. This document provides detailed guidelines on how to establish a WebSocket connection and send requests to synthesize speech effectively.

Features

  • Real-time audio streaming: Utilizes WebSockets for low-latency, real-time streaming.
  • Flexible output formats: Supports both binary and JSON frame outputs for audio data.
  • Robust error handling: Categorizes errors as recoverable and unrecoverable, ensuring the stability of the WebSocket connection.