Dataset formats for voice upload
Resemble accepts data in two formats:
- A single audio/wav file.
- A gzipped file (.tar.gz) that includes a
metadata.csv
file and awavs
directory. Thewavs
directory should have audio files between 1.5 seconds to 15 seconds in length. Each wav file should have an entry in themetadata.csv
file that includes the filename without the extension, and the transcript for that file. The|
character serves as the delimiter inmetadata.csv
.
For example:
data/
metadata.csv
wavs/
file1.wav
file2.wav
file3.wav
Where the metadata file is formatted as follows:
file1|This is the text that is included in file one.