Abstract:
Named Entity Recognition (NER) represents a fundamental operation within Natural Language Processing (NLP), focused on the extraction and classification of specific entit...Show MoreMetadata
Abstract:
Named Entity Recognition (NER) represents a fundamental operation within Natural Language Processing (NLP), focused on the extraction and classification of specific entities embedded in textual data. Given the rising linguistic diversity present in digital communications, there is an escalating need for NER systems to be proficient in identifying and categorizing entities across a spectrum of languages. However, developing NER models for resource-poor languages presents significant challenges due to limited labeled data and linguistic resources. This paper examines methodologies for enhancing the ability of NLP models to perform NER across diverse languages by transferring knowledge from high-resource languages to low-resource languages. We delve into advanced approaches such as cross-lingual transfer learning, multilingual embeddings, and cross-lingual model adaptation. Cross-lingual transfer learning utilizes pre-trained models from high-resource languages to initialize NER systems for low-resource languages, thereby facilitating the effective transfer of linguistic knowledge across language boundaries. Multilingual embeddings provide a shared representation space for words across languages, facilitating the transfer of linguistic knowledge. Additionally, cross-lingual model adaptation techniques aim to adapt existing NER models to new languages through fine-tuning or domain adaptation. By enhancing the generalizability of NER models through cross-lingual knowledge transfer, we enable these models to perform effectively across diverse linguistic contexts, including both resource-rich and resource-poor languages. These advancements contribute to broader accessibility and applicability of NER technology across languages and cultures, facilitating more inclusive and comprehensive language processing applications.
Date of Conference: 21-23 November 2024
Date Added to IEEE Xplore: 11 February 2025
ISBN Information: