Version: 2.0.0

Create a voice

This endpoint creates a voice and optionally starts training

note

Create voice API is only available for Business plan users. If you're running into trouble, upgrade to a Business plan or higher on the billing page.

Voice types

There are two types of voices you can create on the Resemble platform: Rapid Voice Clone and Professional Voice Clone.

Rapid Voice Clone

A Rapid Voice Clone is a quick and easy way to create a voice for your content. Using as little as 10 seconds of recordings, you can create a voice clone in under a minute.

Professional Voice Clone

A Professional Voice Clone provides a more accurate way of creating a voice. It requires at least 10 minutes of recordings and takes around 40 minutes to create. This allows for a more detailed and personalized voice, as the AI has more data to work with.

Voice Data

There are 2 ways to provide data for a voice:

Providing a URL to a dataset when creating the voice
Uploading individual recordings using the recording API

Option 1: Providing a URL to a dataset when creating the voice

Rapid Voice

Create a voice using the "Create a voice" endpoint and provide a URL to the dataset in the dataset_url attribute. The dataset must be a wav file of at least 10 seconds.
After creating the voice, follow the Build a voice documentation to start training.

Professional Voice

Create a voice using the "Create a voice" endpoint and provide a URL to the dataset in the dataset_url attribute. Please see here for acceptable dataset formats.
The dataset will first be analyzed and then training will begin automatically.

Option 2: Uploading individual recordings using the recording API

Rapid Voice

Create a voice using the "Create a voice" endpoint and omit the dataset_url attribute.
Use the instructions on the "Create a recording" page to upload recordings to your voice.
Upon uploading at least 3 recordings, follow the Build a voice documentation to start training.

Professional Voice

Create a voice using the "Create a voice" endpoint and omit the dataset_url attribute.
Use the instructions on the "Create a recording" page to upload recordings to your voice.
Upon uploading at least 20 recordings, follow the Build a voice documentation to start training.

HTTP Request

POST https://app.resemble.ai/api/v2/voices

JSON Body Parameters	Type	Description
name	string	Name of the voice
voice_type	(optional) string	The type of voice to create. Either `rapid` or `professional`. If not provided defaults to `professional`
dataset_url	(optional) string	A URL to a dataset on which to train the voice on. Please see here for acceptable dataset formats
callback_uri	(optional) string	A URL (webhook) that will be notified upon voice training completion Please see here for callback details

HTTP Response

{
  "success": true,
  "item": {
    "uuid": <string>,
    "name": <string>,
    "status": <string>,
    "dataset_url": <string>,
    "created_at": <UTC Date>,
    "updated_at": <UTC Date>,
  }
}

Callback

If you've provided a callback_uri when you created a voice, you will receive the following POST request when the voice has completed training.

Training Completion Callback

This callback happens when your training completes without any issues.

{
    "ok": true,
    "id": "<string>",
    "status": "finished",
    "recordings": [],
    "issue": null
}

Dataset Issue Callback

If the status is set to dataset_issue, this callback will contain detailed information about the issue and problematic recordings:

{
  "ok": "<boolean>",
  "id": "<string>",
  "status": "dataset_issue",
  "issue": "Detailed description of the dataset issue.",
  "recordings": [
    {
      "uuid": "<string>",
      "name": "<string>",
      "transcript": "<string>",
      "stoi_score": "<number>",
      "pesq_score": "<number>"
      "si_dr_score": "<number>",
      "resemble_sample_score": "<number>",
      "is_active": "<boolean>",
      "is_outlier": "<boolean>",
      "is_silent": "<boolean>"
    },
    ...
  ]
}

JSON Body Parameters	Type	Description
id	string	The UUID of the voice this callback is for.
status	string	The status of the voice, such as `finished` or `dataset_issue`.
issue	string	A detailed description of the issue, if any.
recordings	array	The `recordings` array provides detailed feedback for each problematic recording, including scores for STOI, PESQ, and SI-SDR, as well as flags indicating whether the recording is active, an outlier, or silent.

Examples

NodeJS

import { Resemble } from '@resemble/node'

Resemble.setApiKey('YOUR_API_TOKEN')

await Resemble.v2.voices.create({ name: "Chef", dataset_url: "https://../dataset.zip", callback_uri: "http://example.com/cb" })

Try it out

API Key:

JSON Body:

Create a voice

Voice types​

Rapid Voice Clone​

Professional Voice Clone​

Voice Data​

Option 1: Providing a URL to a dataset when creating the voice​

Rapid Voice​

Professional Voice​

Option 2: Uploading individual recordings using the recording API​

Rapid Voice​

Professional Voice​

HTTP Request​

HTTP Response​

Callback​

Training Completion Callback​

Dataset Issue Callback​

Examples​

Try it out​

Voice types

Rapid Voice Clone

Professional Voice Clone

Voice Data

Option 1: Providing a URL to a dataset when creating the voice

Rapid Voice

Professional Voice

Option 2: Uploading individual recordings using the recording API

Rapid Voice

Professional Voice

HTTP Request

HTTP Response

Callback

Training Completion Callback

Dataset Issue Callback

Examples

Try it out