Can Knowledge of End-to-End Text-to-Speech Models Improve Neural Midi-to-Audio Synthesis Systems? | IEEE Conference Publication | IEEE Xplore