Improved GPT2 Event Extraction Method Based on Mixed Attention Collaborative Layer Vector | IEEE Journals & Magazine | IEEE Xplore

Improved GPT2 Event Extraction Method Based on Mixed Attention Collaborative Layer Vector


Improved GPT2 Event Extraction Method Based on Mixed Attention Collaborative Layer Vector

Abstract:

As internet information expands rapidly, extracting valuable event information from unstructured text has become an important research topic. This paper proposes an impro...Show More

Abstract:

As internet information expands rapidly, extracting valuable event information from unstructured text has become an important research topic. This paper proposes an improved GPT2 model, termed HACLV-GPT2, which is the initial utilization of a GPT-like architecture for the purpose of event extraction. The model utilizes a generative input template and incorporates a hybrid attention mechanism to enhance the understanding of complex contexts. Additionally, the HACLV-GPT2 model employs a layer-vector fusion strategy to optimize the output of Transformer Blocks, effectively boosting prediction performance. The experimental results show that the HACLV-GPT2 model performs excellently in both event argument extraction and event type detection tasks, with F1 values of 0.8020 and 0.9614, respectively, surpassing several baseline models. This outcome fully validates the effectiveness and superiority of the proposed method. Furthermore, ablation experiments confirm the critical role of the hybrid attention mechanism and layer-vector fusion strategy in performance improvement.
Improved GPT2 Event Extraction Method Based on Mixed Attention Collaborative Layer Vector
Published in: IEEE Access ( Volume: 12)
Page(s): 160074 - 160082
Date of Publication: 29 October 2024
Electronic ISSN: 2169-3536

Funding Agency:


References

References is not available for this document.