Abstract:
Currently, there is limited research on entity relation extraction in the domain of ancient Chinese, primarily due to the scarcity of publicly available datasets for mode...Show MoreMetadata
Abstract:
Currently, there is limited research on entity relation extraction in the domain of ancient Chinese, primarily due to the scarcity of publicly available datasets for model training. To advance the study of entity relation extraction tasks in ancient Chinese, this paper reconstructs a CMAG-ERE2.0 dataset based on "Comprehensive Mirror to Aid in Government" Addressing issues such as missing relation predictions and inaccurate long-span entity boundary recognition prevalent in existing models applied to ancient Chinese texts, we propose a joint entity relation extraction model tailored for this context. In our model, relative position information of tokens enhances constraints on long-span entity boundaries, while multi-step fusion of relationship type information establishes connections between relational hints and entities, facilitating precise extraction of both long-span entities and latent relations. Additionally, we implement multi-task decoding for modeling purposes, which significantly improves the model’s ability to extract entities and relations within complex textual environments. Experimental results demonstrate that our proposed model effectively mitigates these challenges.
Published in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Date of Conference: 06-11 April 2025
Date Added to IEEE Xplore: 07 March 2025
ISBN Information:
ISSN Information:
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Types Of Information ,
- Information Fusion ,
- Related Entities ,
- Relation Extraction ,
- Entity Relation Extraction ,
- Ancient Chinese ,
- Related Information ,
- Position Information ,
- Dataset For Model Training ,
- Loss Function ,
- Generation Sequencing ,
- Feature Representation ,
- Attention Mechanism ,
- Multilayer Perceptron ,
- Trainable Parameters ,
- Subject And Object ,
- Linker Domain ,
- Sequence Labeling ,
- Decoder Module ,
- Sentence Information ,
- Entity Pairs ,
- Modern Chinese
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Types Of Information ,
- Information Fusion ,
- Related Entities ,
- Relation Extraction ,
- Entity Relation Extraction ,
- Ancient Chinese ,
- Related Information ,
- Position Information ,
- Dataset For Model Training ,
- Loss Function ,
- Generation Sequencing ,
- Feature Representation ,
- Attention Mechanism ,
- Multilayer Perceptron ,
- Trainable Parameters ,
- Subject And Object ,
- Linker Domain ,
- Sequence Labeling ,
- Decoder Module ,
- Sentence Information ,
- Entity Pairs ,
- Modern Chinese
- Author Keywords