Date of Award:
5-2024
Document Type:
Dissertation
Degree Name:
Doctor of Philosophy (PhD)
Department:
Computer Science
Committee Chair(s)
Shuhan Yuan
Committee
Shuhan Yuan
Committee
Soukaina Filali Boubrahimi
Committee
Curtis Dyreson
Committee
John Edwards
Committee
Luis Gordillo
Abstract
In the digital transformation era, safeguarding online systems against anomalies – unusual patterns indicating potential threats or malfunctions – has become crucial. This dissertation embarks on enhancing the accuracy, explainability, and ethical integrity of anomaly detection systems. By integrating advanced machine learning techniques, it improves anomaly detection performance and incorporates fairness and explainability at its core.
The research tackles performance enhancement in anomaly detection by leveraging few-shot learning, demonstrating how systems can effectively identify anomalies with minimal training data. This approach overcomes data scarcity challenges. Reinforcement learning is employed to iteratively refine models, enhancing decision-making processes. Transfer learning enables the application of insights across domains, improving system versatility. The integration of Generative Pre-trained Transformer (GPT) models marks a significant advancement, offering enhanced precision in anomaly identification through sophisticated language modeling techniques.
Exploring explainability and root cause analysis, the dissertation introduces advanced frameworks that shed light on the mechanisms behind anomaly detections across different data types. InterpretableSAD enhances sequential log data analysis, pinpointing specific anomalous events to clarify detection processes. RootCLAM addresses tabular data anomalies through causal inference, identifying root causes and suggesting actionable mitigation strategies. The narrative extends to time series data with RecAD and AERCA; RecAD offers algorithmic recourse by proposing minimal-cost actions for correcting anomalies, while AERCA utilizes an autoencoder-based framework to unravel Granger causality, illuminating causative factors behind anomalies. These frameworks empower users with the knowledge and tools to understand and act upon findings, facilitating the identification of irregularities across diverse data landscapes.
Ethical integrity remains paramount, addressed through the CFAD framework, which ensures counterfactual fairness by embedding ethical principles directly into anomaly detection processes. This guarantees equitable treatment across scenarios, advocating for technologies that serve all users equitably and challenge inherent biases.
Extensive evaluations on various datasets demonstrate the proposed models' effectiveness in addressing anomaly detection challenges. This dissertation contributes to advancing techniques that are not only accurate but also interpretable and fair, promoting the responsible use of anomaly detection in real-world applications. This dissertation lays a solid foundation for further exploration into advanced anomaly detection techniques, promising to guide the development of even more robust, transparent, and equitable systems in the digital age.
Checksum
6aea3f097554a9a87d1d4263da38b2f9
Recommended Citation
Han, Xiao, "Achieving Responsible Anomaly Detection" (2024). All Graduate Theses and Dissertations, Fall 2023 to Present. 126.
https://digitalcommons.usu.edu/etd2023/126
Included in
Copyright for this work is retained by the student. If you have any questions regarding the inclusion of this work in the Digital Commons, please email us at .