Identification of stock market manipulation using a hybrid ensemble approach

Anomaly detection in time series data is a complex data mining issue with many useful, real-world applications. Anomalies in datasets represent deviations in the expected behaviour of a system and can indicate rare but significant events that require intervention. Market manipulation is a serious is...

Fuld beskrivelse

Saved in:
Bibliografiske detaljer
Main Authors: Quinn, Pearse, Toman, Marinus, Curran, Kevin
Format: UMS Journal (OJS)
Sprog:eng
Udgivet: Universitas Muhammadiyah Surakarta 2023
Fag:
Online adgang:https://journals2.ums.ac.id/index.php/arstech/article/view/2576
Tags: Tilføj Tag
Ingen Tags, Vær først til at tagge denne postø!
_version_ 1805340805404033024
author Quinn, Pearse
Toman, Marinus
Curran, Kevin
author_facet Quinn, Pearse
Toman, Marinus
Curran, Kevin
author_sort Quinn, Pearse
collection OJS
description Anomaly detection in time series data is a complex data mining issue with many useful, real-world applications. Anomalies in datasets represent deviations in the expected behaviour of a system and can indicate rare but significant events that require intervention. Market manipulation is a serious issue in financial jurisdictions worldwide, with financial regulators such as the SEC constantly trying to prevent it and prosecute those guilty of it. This paper makes use of state-of-the-art deep learning techniques as well as more classical statistical techniques in order to detect anomalies in five real-world datasets. The predictions of these models are then aggregated in two different ensemble models. The results of the individual models as well as the ensemble models, are evaluated, and F1-Score measures performance. Nine individual models, consisting of three models based on LSTM with Dynamic Thresholding, three ARIMA models and three Exponential Smoothing models, were used to generate predictions of anomalies based on daily trading volumes. The individual predictions of these models were then aggregated, with two different ensemble methods being used, namely the majority voting ensemble method and the ensemble averaging aggregation method. While both performed well, the majority voting ensemble method was seen to be the superior method in this study, with an average F1Score of 0.494, compared to an F1Score of 0.414 for the ensemble averaging aggregation method.
format UMS Journal (OJS)
id oai:ojs2.journals2.ums.ac.id:article-2576
institution Universitas Muhammadiyah Surakarta
language eng
publishDate 2023
publisher Universitas Muhammadiyah Surakarta
record_format ojs
spelling oai:ojs2.journals2.ums.ac.id:article-2576 Identification of stock market manipulation using a hybrid ensemble approach Quinn, Pearse Toman, Marinus Curran, Kevin Anomaly Detection Deep Learning Exponential Smoothing Long Short-Term Memory (LSTM) Market Manipulation Anomaly detection in time series data is a complex data mining issue with many useful, real-world applications. Anomalies in datasets represent deviations in the expected behaviour of a system and can indicate rare but significant events that require intervention. Market manipulation is a serious issue in financial jurisdictions worldwide, with financial regulators such as the SEC constantly trying to prevent it and prosecute those guilty of it. This paper makes use of state-of-the-art deep learning techniques as well as more classical statistical techniques in order to detect anomalies in five real-world datasets. The predictions of these models are then aggregated in two different ensemble models. The results of the individual models as well as the ensemble models, are evaluated, and F1-Score measures performance. Nine individual models, consisting of three models based on LSTM with Dynamic Thresholding, three ARIMA models and three Exponential Smoothing models, were used to generate predictions of anomalies based on daily trading volumes. The individual predictions of these models were then aggregated, with two different ensemble methods being used, namely the majority voting ensemble method and the ensemble averaging aggregation method. While both performed well, the majority voting ensemble method was seen to be the superior method in this study, with an average F1Score of 0.494, compared to an F1Score of 0.414 for the ensemble averaging aggregation method. Universitas Muhammadiyah Surakarta 2023-11-29 info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion application/pdf https://journals2.ums.ac.id/index.php/arstech/article/view/2576 10.23917/arstech.v4i2.2576 Applied Research and Smart Technology (ARSTech); Vol. 4 No. 2 (2023): Applied Research and Smart Technology; 53-63 2722-9645 2722-9637 eng https://journals2.ums.ac.id/index.php/arstech/article/view/2576/882 Copyright (c) 2023 Pearse Quinn, Marinus Toman, Kevin Curran https://creativecommons.org/licenses/by/4.0
spellingShingle Anomaly Detection
Deep Learning
Exponential Smoothing
Long Short-Term Memory (LSTM)
Market Manipulation
Quinn, Pearse
Toman, Marinus
Curran, Kevin
Identification of stock market manipulation using a hybrid ensemble approach
title Identification of stock market manipulation using a hybrid ensemble approach
title_full Identification of stock market manipulation using a hybrid ensemble approach
title_fullStr Identification of stock market manipulation using a hybrid ensemble approach
title_full_unstemmed Identification of stock market manipulation using a hybrid ensemble approach
title_short Identification of stock market manipulation using a hybrid ensemble approach
title_sort identification of stock market manipulation using a hybrid ensemble approach
topic Anomaly Detection
Deep Learning
Exponential Smoothing
Long Short-Term Memory (LSTM)
Market Manipulation
topic_facet Anomaly Detection
Deep Learning
Exponential Smoothing
Long Short-Term Memory (LSTM)
Market Manipulation
url https://journals2.ums.ac.id/index.php/arstech/article/view/2576
work_keys_str_mv AT quinnpearse identificationofstockmarketmanipulationusingahybridensembleapproach
AT tomanmarinus identificationofstockmarketmanipulationusingahybridensembleapproach
AT currankevin identificationofstockmarketmanipulationusingahybridensembleapproach