dCollection Digital Academic Information Distribution System

Artificial Intelligence Open-Domain News Event Extraction Method Based on BERT Huang Hairui

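Subject (keywords): Event extraction, Open-domain event extraction, Neural topic model
Subject (DDC): 006.31
Publisher: Ajou University Graduate School
Advisor: Tae-Sun Chung
Year of publication: 2024
Degree conferred: February 2024
Degree: Master's
Department and major: Graduate School, Department of Artificial Intelligence
URI: http://www.dcollection.net/handler/ajou/000000033549
Language of text: English
Copyright: Ajou University theses are protected by copyright.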

Abstract

News is an important way for people to obtain information. In the open news environment, news types are increasingly diverse and the volume of news is huge, which causes problems such as information overload and redundancy. The open-domain event extraction task aims to identify and extract various types of event information from predefined text, and it is usually based on methods such as pre-training or neural topic modeling.

However, existing methods have a number of problems. First, existing pre-trained models suffer from insufficient feature vector extraction and excessively high embedding dimensions. Second, existing methods are not semantically rich enough and lack syntactic structural information, resulting in poor readability of results and insufficient extraction accuracy. To address these issues, this paper first improves the open-domain event extraction method based on the BERT-based neural topic model, and then dynamically integrates semantic and syntactic dependency information to obtain rich semantic and syntactic features and further improve model performance.

The main research is as follows. An improved neural topic modeling method based on BERT is proposed. First, pre-trained BERT is used in the encoding layer to obtain contextual representations of the feature sequences. Second, the UMAP dimensionality reduction method is used to obtain more extensive local and global information, and the joint distribution of variables is combined with a deep latent-variable probabilistic graphical model to further optimize parameter inference and learning. Finally, a self-attention mechanism is introduced to assign weights to different nodes, reducing the influence of noisy data so that the model attends to the more critical features, which further improves the performance of the open-domain event extraction model.

Keywords: Event extraction, Open-domain event extraction, Neural topic model.
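
The self-attention step mentioned in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: it assumes scaled dot-product self-attention with identity projections (queries, keys, and values are the input vectors themselves, with no learned weights), and the `nodes` vectors are hypothetical stand-ins for reduced embeddings.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(vectors):
    """Scaled dot-product self-attention with Q = K = V = vectors.

    Assumption: no learned projection matrices, so this only shows how
    attention weights are derived from pairwise similarity.
    """
    dim = len(vectors[0])
    scale = math.sqrt(dim)
    outputs, weights = [], []
    for query in vectors:
        # Similarity of this node to every node, scaled by sqrt(dim).
        scores = [sum(a * b for a, b in zip(query, key)) / scale for key in vectors]
        row = softmax(scores)
        weights.append(row)
        # Output is the attention-weighted average of all node vectors.
        outputs.append(
            [sum(w * v[i] for w, v in zip(row, vectors)) for i in range(dim)]
        )
    return outputs, weights


# Hypothetical node vectors: the first two are similar, the third is not.
nodes = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
outputs, weights = self_attention(nodes)
```

In this toy input the first node gives more weight to the similar second node than to the dissimilar third one, which is the sense in which attention can reduce the influence of unrelated, noisy nodes.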

Table of contents

I. Introduction
II. Background
III. Related work
3.1 Dimensionality reduction methods
3.2 Probabilistic graphical models
3.3 Inference Methods
3.4 Attention Mechanism
3.5 Graph Neural Networks
IV. Proposed Method
4.1 Problem Analysis
4.2 Model Architecture
4.2.1 Embedding Layer
4.2.2 Dimensionality Reduction Layer
4.2.3 Model Generation
4.2.4 Inference and Parameter Learning Layer
V. Experiment
5.1 Dataset
5.2 Evaluation Metrics
5.3 Experimental results and analysis
5.3.1 Main experiment results and analysis
5.3.2 Ablation experiment results and analysis
VI. Conclusion
