AutoTTS: End-to-End Text-to-Speech Synthesis Through Differentiable Duration Modeling

IEEE Conference Publication, IEEE Xplore