Unsupervised Anomaly Detection in Multi-Aspect Data via Tensor Decomposition and Hidden Markov Models
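- Subject (keywords) Anomaly Detection, Time-series analysis, Tensors, Hidden Markov models, Gaussian mixture model
- Subject (DDC) 006.31
- Publisher Ajou University Graduate School
- Advisor 이슬
- Publication year 2024
- Degree conferral date August 2024
- Degree Master's
- Department and major Graduate School, Department of Artificial Intelligence
- URI http://www.dcollection.net/handler/ajou/000000033909
- Language of text English
- Copyright Ajou University theses are protected by copyright.
Abstract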
Anomaly detection in unlabeled multi-aspect bio-signals with semi-periodic patterns is a challenging task. We propose a novel unsupervised anomaly scoring method, called importance, to address this problem. Our approach combines Tucker decomposition and Gaussian Mixture Hidden Markov Models (GM-HMM) to simultaneously capture the latent patterns in the multi-aspect structure and the inherent temporal patterns of the data. The novelty of the importance score stems from 1) a new definition of the error contribution of the input values, and 2) a weight definition for the temporal factor based on GM-HMM. This weighted error contribution enables more accurate anomaly detection than existing methods. Extensive experiments on synthetic multi-aspect time-series data demonstrate the effectiveness of the importance score for anomaly detection compared to other approaches. Further evaluations on three real-world bio-signal datasets provide empirical evidence of its effectiveness in detecting unusual signals.
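As a rough illustration of the decomposition step mentioned in the abstract, the sketch below fits a Tucker model to a small synthetic tensor and scores each time step by its squared reconstruction error. It is not the thesis's importance score: the GM-HMM temporal weighting is omitted, a truncated HOSVD stands in for whatever Tucker fitting procedure the thesis uses, and the tensor layout (time x channel x feature), the ranks, and all function names are assumptions made for this example.

```python
import numpy as np

def unfold(tensor, mode):
    """Mode-n unfolding: bring `mode` to the front and flatten the other axes."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def mode_product(tensor, matrix, mode):
    """Multiply `tensor` by `matrix` along axis `mode` (the n-mode product)."""
    moved = np.moveaxis(tensor, mode, 0)
    out = np.tensordot(matrix, moved, axes=(1, 0))
    return np.moveaxis(out, 0, mode)

def hosvd(tensor, ranks):
    """Truncated HOSVD: a simple, non-iterative way to obtain a Tucker model."""
    factors = []
    for mode, rank in enumerate(ranks):
        u, _, _ = np.linalg.svd(unfold(tensor, mode), full_matrices=False)
        factors.append(u[:, :rank])
    core = tensor
    for mode, u in enumerate(factors):
        core = mode_product(core, u.T, mode)
    return core, factors

def reconstruct(core, factors):
    """Rebuild the tensor from the core and the factor matrices."""
    out = core
    for mode, u in enumerate(factors):
        out = mode_product(out, u, mode)
    return out

def time_step_errors(tensor, ranks, time_mode=0):
    """Unweighted squared reconstruction error per time step (hypothetical score)."""
    core, factors = hosvd(tensor, ranks)
    residual = tensor - reconstruct(core, factors)
    other_axes = tuple(a for a in range(tensor.ndim) if a != time_mode)
    return np.sum(residual ** 2, axis=other_axes)

# Synthetic semi-periodic signal with multilinear rank (2, 2, 2).
rng = np.random.default_rng(0)
t = np.linspace(0, 8 * np.pi, 200)
time_factor = np.stack([np.sin(t), np.cos(t)], axis=1)   # 200 x 2
channel_factor = rng.normal(size=(4, 2))
feature_factor = rng.normal(size=(6, 2))
true_core = rng.normal(size=(2, 2, 2))
X = np.einsum("abc,ia,jb,kc->ijk", true_core, time_factor,
              channel_factor, feature_factor)
X += 0.01 * rng.normal(size=X.shape)                     # small measurement noise
X[120] += rng.normal(scale=3.0, size=X[120].shape)       # injected anomaly

errors = time_step_errors(X, ranks=(2, 2, 2))
print(int(np.argmax(errors)))  # index of the time step with the largest error
```

In this toy setting the low-rank model absorbs the periodic structure, so the injected slice is left with the largest residual. A temporal weight such as the GM-HMM-based one described in the abstract would then rescale these per-step errors.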
Table of Contents
1 Introduction 1
1.1 Contributions 3
2 Fundamental 4
2.1 Tensor Operation 5
2.2 Tucker Decomposition 7
2.3 Hidden Markov Model 8
2.3.1 Gaussian Mixture - Hidden Markov Model (GM-HMM) 11
3 Method 14
3.1 Tensorizing and Decomposition Signal 15
3.2 Temporal Weight Calculation with GM-HMM 16
3.3 Importance 19
4 Experiment 21
4.1 Model Selection 23
4.2 Validation of Weight 24
4.3 Detecting of Anomalities in Bio-Signals via Importance Score 25
4.3.1 Anomaly Detection on Fetal ECG Dataset 25
4.3.2 Anomaly Detection on Sleep Stage Dataset 26
4.3.3 Anomaly Detection on VitalDB Dataset 27
5 Conclusion 29

