| # filename | phonemes | speaker id (speaker id is not used, so anything random is fine) | |
| /path/to/file.wav|hello world but in phonemes|0 |
| # filename | phonemes | speaker id (speaker id is not used, so anything random is fine) | |
| /path/to/file.wav|hello world but in phonemes|0 |