Conferences >2024 28th International Confe...

Transformer Based Multimodal Summarization and Highlight Abstraction Approach for Texts and Speech Audios

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Multimodal summarization is a kind of summarization application in which its inputs and/or outputs can be in different data types like text, video, and audio. In this stu...Show More

Metadata

Abstract:

Multimodal summarization is a kind of summarization application in which its inputs and/or outputs can be in different data types like text, video, and audio. In this study, a new approach based on fine tuning of different pre-trained transformers was developed for abstractive and extractive summarization of audio and text data. In the proposed method, abstractive and extractive summaries of text data are provided only as text, while extractive summaries of audio data are presented as both text and audio data. Abstractive summaries of the audio data are presented as text only. Transformers with text2text input-output relationship were used in both extractive and abstractive summarization processes of the proposed method. For the training and inference processes of audio this type of data to be handled in transformers, an ASR step was followed before the summarization step. The experimental results obtained were given in detail and compared with similar approaches in the literature. As a result of the comparison, it was seen that the proposed method achieved better performance than similar prior approaches.

Published in: 2024 28th International Conference on Information Technology (IT)

Date of Conference: 21-24 February 2024

Date Added to IEEE Xplore: 25 March 2024

ISBN Information:

ISSN Information:

DOI: 10.1109/IT61232.2024.10475775

Conference Location: Zabljak, Montenegro

Contents

References is not available for this document.

Transformer Based Multimodal Summarization and Highlight Abstraction Approach for Texts and Speech Audios

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Transformer Based Multimodal Summarization and Highlight Abstraction Approach for Texts and Speech Audios

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?