Sentence Segmentation from Unformatted Text using Language Modeling and Sequence Labeling Approaches | IEEE Conference Publication | IEEE Xplore

Sentence Segmentation from Unformatted Text using Language Modeling and Sequence Labeling Approaches


Abstract:

Current research devoted to the Natural Language Processing problem of sentence segmentation from raw text. The focus was directed to the task of segmentation of auto-gen...Show More

Abstract:

Current research devoted to the Natural Language Processing problem of sentence segmentation from raw text. The focus was directed to the task of segmentation of auto-generated transcripts for videos that do not have any punctuation and segmentation. Two general approaches to solve the problem of sentence segmentation were proposed and experiments concluded on a comparison of results of pre-trained transformer-based models. Research on how different approach of solving problem affects results were carried out. As a result, the sequence labeling approach turned out to be the most suitable.
Date of Conference: 06-09 October 2020
Date Added to IEEE Xplore: 02 July 2021
ISBN Information:
Conference Location: Kharkiv, Ukraine

Contact IEEE to Subscribe

References

References is not available for this document.