https://doi.org/10.1140/epjs/s11734-022-00714-3
Regular Article
Detecting epileptic seizures using machine learning and interpretable features of human EEG
1
National Medical and Surgical Center named after N. I. Pirogov, Ministry of Healthcare of the Russian Federation, Nizhnyaya Pervomaiskaya Str. 70, 105203, Moscow, Russia
2
Faculty of Information Technology and Big Data Analysis, Financial University under the Government of the Russian Federation, Leningradskii Pr. 49, 125167, Moscow, Russia
3
Baltic Center for Artificial Intelligence and Neurotechnology, Immanuel Kant Baltic Federal University, Nevskogo Str. 14, 236041, Kaliningrad, Russia
4
Neuroscience and Cognitive Technology Laboratory, Center for Technologies in Robotics and Mechatronics Components, Innopolis University, Universitetskaya Str. 1, 420500, Innopolis, Russia
Received:
31
August
2022
Accepted:
26
October
2022
Published online:
17
November
2022
Epilepsy is a neurological disorder distinguished by sudden and unexpected seizures. To diagnose epilepsy, clinicians register the signals of brain electric activity (electroencephalograms, EEG) and extract segments with seizures. It enables characterizing their type and finding an onset zone, a brain area where they originate. This procedure requires manual EEG deciphering, which is slow and necessitates the assistance of machine learning (ML) algorithms. Traditionally, ML handles this issue in a supervised fashion, i.e., after the training on the representative data, it constructs a boundary in the feature space that separates classes. As the number of features grows, this boundary becomes complex and less generalized. The feature space of brain data is high dimensional. The standard recording includes 30 signals and 50 frequencies resulting in 1500 features. Using additional time-domain features may further enlarge the feature space. Thus, selecting appropriate features is a big part of the successful classification. The selection procedure relies on either a data-based mathematical approach (e.g., principal components, PCs) or the expert domain knowledge of data (explainable features, EFs). Here, we demonstrate the benefits of using EFs. For the EEG data of 30 epileptic patients, we trained a RandomForest algorithm using PCs and EFs. The feature importance analysis revealed that explainable features outperform principal components.
© The Author(s), under exclusive licence to EDP Sciences, Springer-Verlag GmbH Germany, part of Springer Nature 2022. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.