Diverse and Vivid Sound Generation from Text Descriptions | IEEE Conference Publication | IEEE Xplore