Loading [MathJax]/extensions/MathZoom.js
ANeTCM: A Novel MRC Framework for Traditional Chinese Medicine Named Entity Recognition | IEEE Journals & Magazine | IEEE Xplore

ANeTCM: A Novel MRC Framework for Traditional Chinese Medicine Named Entity Recognition


This diagram illustrates the TCM-BERT model, which utilizes continuous pre-training with traditional Chinese medicine (TCM) cases. The model processes input sequences and...

Abstract:

Traditional Chinese medicine (TCM) named entity recognition for supporting downstream tasks is receiving increasing attention. However, mainstream named entity recognitio...Show More

Abstract:

Traditional Chinese medicine (TCM) named entity recognition for supporting downstream tasks is receiving increasing attention. However, mainstream named entity recognition models applied to the TCM domain are still affected by the following two challenges: lack of domain knowledge and imbalance between entity classes. Therefore, we propose ANeTCM, a model that enhances both domain knowledge and inter-entity balance. Specifically, we first use a large number of TCM medical case data to continuously pretrain Roberta and enhance its domain knowledge. Secondly, the sequence annotation is converted into a machine reading comprehension task, and gated linear units are incorporated to further enhance the model’s feature learning capability. Finally, the weights of the samples are adjusted using a normal distribution to address the imbalance of entity classes. We conducted extensive experiments on two TCM named entity recognition datasets and selected several competitive models. The experimental results show the effectiveness of our model.
This diagram illustrates the TCM-BERT model, which utilizes continuous pre-training with traditional Chinese medicine (TCM) cases. The model processes input sequences and...
Published in: IEEE Access ( Volume: 12)
Page(s): 113235 - 113243
Date of Publication: 16 August 2024
Electronic ISSN: 2169-3536

References

References is not available for this document.