Abstract
1- Introduction
2- Feature extraction
3- Epileptic seizure detection
4- Methods for feature evaluation
5- Experimental results
6- Conclusions
References
Abstract
Since the manual detection of electrographic seizures in continuous electroencephalogram (EEG) monitoring is very time-consuming and requires a trained expert, attempts to develop automatic seizure detection are diverse and ongoing. Machine learning approaches are intensely being applied to this problem due to their ability to classify seizure conditions from a large amount of data, and provide pre-screened results for neurologists. Several features, data transformations, and classifiers have been explored to analyze and classify seizures via EEG signals. In the literature, some jointly-applied features used in the classification may have shared similar contributions, making them redundant in the learning process. Therefore, this paper aims to comprehensively summarize feature descriptions and their interpretations in characterizing epileptic seizures using EEG signals, as well as to review classification performance metrics. To provide meaningful information of feature selection, we conducted an experiment to examine the quality of each feature independently. The Bayesian error and non-parametric probability distribution estimation were employed to determine the significance of the individual features. Moreover, a redundancy analysis using a correlation-based feature selection was applied. The results showed that the following features – variance, energy, nonlinear energy, and Shannon entropy computed on a raw EEG signal, as well as variance, energy, kurtosis, and line length calculated on wavelet coefficients – were able to significantly capture the seizures. When compared with a baseline method of classifying all epochs as normal, an improvement of 4.77–13.51% in the Bayesian error was obtained.
Introduction
An epileptic seizure, as defined by the International League Against Epilepsy [1], is a temporary event of symptoms due to synchronization of abnormally excessive activities of neurons in the brain. It has been estimated that approximately 65 million people around the world are affected by epilepsy [2]. Nevertheless, it is still a time-consuming process for neurologists to review continuous electroencephalograms (EEGs) to monitor epileptic patients. Therefore, several researchers have developed differenttechniques that help neurologists to identify an epilepsy occurrence [3–5]. The whole process of automated epileptic seizure analysis primarily consists of data acquisition, signal pre-processing, feature extraction, feature or channel selection, and classification. This paper focuses on a selection of features commonly used in the literature, including statistical parameters (mean, variance, skewness, and kurtosis), amplitude-related parameters (energy, nonlinear energy, line length, maximum and minimum values) and entropyrelated measures. These features can be categorized based on their interpretation or the domain from which the features are calculated. While some studies have considered a particular group of features applicable to their proposed classification method [6–8], others have applied various groups of features extracted from the time, frequency, and time-frequency domains. For example, 55 features were used with a support vector machine (SVM) and postprocessing for neonatal seizure detection, which provided a good detectionrate of 89.2% withone false detectionperhour [9]. Feature redundancy and relevance analysis were applied to 132 features to reduce the vector dimension [10]. Fast correlation based-filter proposed in [11], correlation-based selection (CFS) [12], and ReliefF established in [13], were utilized to select non-redundant features and the filtered features were classified by an artificial neural network (ANN). It turned out that 30-optimal selected features by ReliefF achieved the best result with 91% sensitivity and 95% specificity. These studies showed thatthere is the need to review feature selection as relevant features are directly related to the seizure classification performance.