Detecting Parkinson’s disease from shoe-mounted accelerometer sensors using convolutional neural networks optimized with modified metaheuristics

Luka Jovanovic; Robertas Damaševičius; Rade Matic; Milos Kabiljo; Vladimir Simic; Goran Kunjadic; Milos Antonijevic; Miodrag Zivkovic; Nebojsa Bacanin

doi:10.7717/peerj-cs.2031

Detecting Parkinson’s disease from shoe-mounted accelerometer sensors using convolutional neural networks optimized with modified metaheuristics

Luka Jovanovic¹, Robertas Damaševičius ², Rade Matic³, Milos Kabiljo³, Vladimir Simic^4,5, Goran Kunjadic⁶, Milos Antonijevic⁷, Miodrag Zivkovic⁷, Nebojsa Bacanin^7,8

1Faculty of Technical Sciences, Singidunum University, Belgrade, Serbia

2Department of Applied Informatics, Vytautas Magnus University, Akademija, Lithuania

3Department for Information Systems and Technologies, Belgrade Academy for Business and Arts Applied Studies, Belgrade, Serbia

4Faculty of Transport and Traffic Engineering, University of Belgrade, Belgrade, Serbia

5College of Engineering, Department of Industrial Engineering and Management, Yuan Ze University, Taoyuan City, Taiwan

6Higher Colleges of Technology, Abu Dhabi, United Arab Emirates

7Faculty of Informatics and Computing, Singidunum University, Belgrade, Serbia

8MEU Research Unit, Middle East University, Amman, Jordan

DOI: 10.7717/peerj-cs.2031

Published: 2024-05-13
Accepted: 2024-04-09
Received: 2024-01-19

Academic Editor: Bilal Alatas

Subject Areas: Algorithms and Analysis of Algorithms, Artificial Intelligence, Data Mining and Machine Learning, Neural Networks
Keywords: Parkinson’s disease, Convolutional neural network, Optimization, Extreme gradient boosting, Metaheuristics, Wearable sensors, Smart healthcare

Copyright: © 2024 Jovanovic et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

Cite this article: Jovanovic L, Damaševičius R, Matic R, Kabiljo M, Simic V, Kunjadic G, Antonijevic M, Zivkovic M, Bacanin N. 2024. Detecting Parkinson’s disease from shoe-mounted accelerometer sensors using convolutional neural networks optimized with modified metaheuristics. PeerJ Computer Science 10:e2031 https://doi.org/10.7717/peerj-cs.2031

The authors have chosen to make the review history of this article public.

Abstract

Neurodegenerative conditions significantly impact patient quality of life. Many conditions do not have a cure, but with appropriate and timely treatment the advance of the disease could be diminished. However, many patients only seek a diagnosis once the condition progresses to a point at which the quality of life is significantly impacted. Effective non-invasive and readily accessible methods for early diagnosis can considerably enhance the quality of life of patients affected by neurodegenerative conditions. This work explores the potential of convolutional neural networks (CNNs) for patient gain freezing associated with Parkinson’s disease. Sensor data collected from wearable gyroscopes located at the sole of the patient’s shoe record walking patterns. These patterns are further analyzed using convolutional networks to accurately detect abnormal walking patterns. The suggested method is assessed on a public real-world dataset collected from parents affected by Parkinson’s as well as individuals from a control group. To improve the accuracy of the classification, an altered variant of the recent crayfish optimization algorithm is introduced and compared to contemporary optimization metaheuristics. Our findings reveal that the modified algorithm (MSCHO) significantly outperforms other methods in accuracy, demonstrated by low error rates and high Cohen’s Kappa, precision, sensitivity, and F1-measures across three datasets. These results suggest the potential of CNNs, combined with advanced optimization techniques, for early, non-invasive diagnosis of neurodegenerative conditions, offering a path to improve patient quality of life.

Introduction

Neurodegenerative diseases encompass a set of progressive conditions marked by the gradual deterioration and demise of nerve cells (neurons) in either the brain or the peripheral nervous system (Dugger & Dickson, 2017; Kovacs, 2018). Such diseases commonly lead to a decline in cognitive function, movement, and various other neurological functions (Katsuno et al., 2018; Christidi et al., 2018). The most common neurodegenerative diseases include Alzheimer’s disease, Parkinson’s disease, Amyotrophic Lateral Sclerosis, Multiple Sclerosis, Creutzfeldt-Jakob disease, and many others. The precise origins of neurodegenerative diseases are frequently intricate and multifaceted, incorporating genetic, environmental, and lifestyle factors (Hou et al., 2019; Popa-Wagner et al., 2020; Bianchi, Herrera & Laura, 2021). Diagnosis typically entails a blend of clinical assessment, medical history analysis, and, in special cases, imaging or genetic testing (Gómez-Río et al., 2016; Zhou et al., 2020; García & Bustos, 2018; Hansson, 2021). Despite continuous research and progress in comprehending these disorders, numerous neurodegenerative diseases persist. Treatment primarily centers on symptom management and improving the quality of life for persons grappling with these disorders (Maneu, Lax & Cuenca, 2022; Aza et al., 2022; Mortberg, Vallabh & Minikel, 2022).

The diagnostics of neurodegenerative diseases pose several challenges, reflecting the complexity of these conditions and the boundaries of the current medical approaches (Domínguez-Fernández et al., 2023; Kumar et al., 2023). Although early detection is vital for timely intervention and better management of the condition, many disorders, such as Alzheimer’s and Parkinson’s, often manifest symptoms only in the later stages, making early detection challenging (Shusharina et al., 2023; Bhat et al., 2018). Moreover, the overlap of symptoms between different neurodegenerative disorders complicates accurate diagnosis. Distinguishing between conditions with similar clinical presentations is crucial for appropriate treatment and management. Lastly, early symptoms may be very subtle, and many patients seek treatment only when symptoms seriously threaten their quality of life (Morel et al., 2022; Rosqvist, Schrag & Odin, 2022; Meng et al., 2022).

Early diagnosis of Parkinson’s disease (and other neurodegenerative disorders in general) provides several advantages, contributing to better patient outcomes and overall disease management (Stern, 1993; Pagan, 2012; Kobylecki, 2020). It allows the prompt beginning of the treatment, and at the same time medications and therapy can be more efficient in controlling the disease’s symptoms if started early enough, consequently significantly improving the patient’s quality of life (Murman, 2012; Hauser et al., 2009; Armstrong & Okun, 2020). From the healthcare point of view, medical staff can monitor the progression of the disease over time, allowing tailored treatment plans to the individual patients, minimizing the symptoms’ impact on patient’s daily activities (van Halteren et al., 2020; Fröhlich et al., 2022). Reduced economic burden should not be neglected as well, as early diagnostics can minimize the necessary hospitalizations and visits to the emergency rooms. Therefore, it allows more efficient medical staff and resource allocation that will in the long run reduce the overall costs (Yang et al., 2020; Boina, 2022; Soh et al., 2022; Radder et al., 2020).

Data-driven models for Parkinson’s disease detection leverage different types of sensors and contemporary technologies to collect and analyze data related to movement, walking patterns (Priya et al., 2021), handwriting tremors (Bernardo et al., 2022), phonation (Hashim et al., 2023), computer keypress time data (Bernardo et al., 2021) and other symptoms commonly associated with Parkinson’s disorder (Khoury et al., 2019; Dai, Tang & Wang, 2019; Chandrabhatla, Pomeraniec & Ksendzovsky, 2022). These relatively cheap wearable sensors play a crucial role in remote monitoring, early detection, and ongoing management of Parkinson’s disease. Wearable gadgets, like smartwatches and fitness trackers, frequently incorporate accelerometers and gyroscopes, enabling them to track patterns of movement and identify subtle changes in gait and tremor (Mughal et al., 2022; Reichmann, Klingelhoefer & Bendig, 2023; Schalkamp et al., 2023). Gait-freezing, for instance, is a typical symptom observed in individuals with Parkinson’s disease, defined as a sudden and temporary inability to initiate or continue walking despite the willingness to move (Pozzi et al., 2019; Gao et al., 2020; Lewis et al., 2022; Moreira-Neto et al., 2022). This phenomenon is easily detectable by a simple gyroscopic system built in the patient’s shoes (Pardoel et al., 2019; Marcante et al., 2021).

Methods belonging to artificial intelligence (AI) and subcategories like machine learning (ML) are pivotal in the early identification and surveillance of Parkinson’s disease, employing diverse techniques such as machine learning and data analysis (Senturk, 2020; Belić et al., 2019; Gupta et al., 2023). These methodologies analyze intricate datasets to discern patterns that signify the presence of the disease. For example, AI methods are capable of analyzing medical imaging data, such as magnetic resonance imaging (MRI) or computed tomography (CT) scans, to discover subtle changes in brain structures associated with Parkinson’s diseases (Xu & Zhang, 2019; Zhang, 2022; Francis, Rajan & Pandian, 2022). Additionally, AI approaches are capable of analyzing data from wearable devices to track changes in motor function, such as gait abnormalities or tremors linked to conditions like Parkinson’s disease (Balaji, Brindha & Balakrishnan, 2020; Wu et al., 2023). These algorithms may also aid in examining genetic and biomarker data to pinpoint specific markers related to particular diseases. Consequently, AI methods can help differentiate among various neurodegenerative diseases that might exhibit similar clinical symptoms but possess distinct underlying pathology (Khoury et al., 2019; Thapa et al., 2020; Noor et al., 2019). In this way, doctors are enabled to make better-informed decisions.

Recognizing the potential benefits of data-driven diagnostic approaches is vital, nevertheless, it is important to acknowledge the specific challenges they pose. Sufficient volumes of data are required, and the quality and consistency of the data are paramount. Ethical considerations concerning data collection and processing, patient consent, and data privacy require thorough attention. Moreover, it is vital to verify and regularly enhance AI models to ensure their accuracy, making them suitable for clinical use. Nonetheless, AI techniques show great promise, as evident in recent publications (Singh et al., 2019; Tăuţan, Ionescu & Santarnecchi, 2021; Lin et al., 2020; Khaliq et al., 2023). Another advantage of employing AI approaches is the possibility to apply feature analysis tools like Shapley Additive Explanations (SHAP) (Lundberg et al., 2020), which can contribute to a better comprehending of both the particular disorder and diagnostic process itself (Liu et al., 2022; McFall et al., 2023; Junaid et al., 2023). SHAP analysis improves the transparency of the model, interpretability, and trust in the obtained results, therefore allowing an informed decision-making process in general. Interpretability of the outcomes is vital, as it provides clear insight into each feature’s contribution to the overall model forecasts. Feature importance, on the other hand, allows quantification of each feature’s significance, therefore enabling feature prioritization with respect to their impact on the model’s forecasts.

The central difficulty in the domain of AI and ML is centered on identifying the suitable values for the hyperparameters of the model being used. This problem is accentuated by the principle encapsulated in the “no free lunch” theorem (NFL) (Wolpert & Macready, 1997), underlying that there is no universally superior method for consistently outperforming all others across a diverse range of problems. Essentially, this theorem underscores the need to tailor hyperparameter configurations for each unique problem to attain satisfactory performance. Failing to choose the optimal hyperparameters inevitably leads to a suboptimal level of performance of the utilized model. Manual fine-tuning of a model for each specific problem is an exceptionally intricate and time-intensive procedure, which is inherently an NP-hard optimization challenge. Therefore, conventional deterministic algorithms are not appropriate for resolving it. In the domain of stochastic methods, metaheuristics approaches are regarded as very powerful optimization tools, exhibiting considerable potential in this field, which is evidenced by a significant number of recent relevant publications (Todorovic et al., 2023; Petrovic et al., 2023; Zivkovic et al., 2023; Bacanin et al., 2023; Nematzadeh et al., 2022; Esmaeili, Bidgoli & Hakami, 2022; Chou et al., 2022; Abbas et al., 2023; Chou, Nguyen & Chang, 2022).

This manuscript addresses the analysis of gait, a critical aspect in the process of diagnosing Parkinson’s disease (Jankovic, 2015; Von Coelln et al., 2021; Mirelman et al., 2019). Recent relevant studies have emphasized the significance of gait analysis (Wang et al., 2023), revealing that perturbations in gait can manifest during the early stages of the disease (Pistacchi et al., 2017; Di Biase et al., 2020; Ghislieri et al., 2021). Common characteristics of gait anomalies in Parkinson’s disorder include shuffling steps, diminished arm swinging, freezing during walking, and postural instability (Perumal & Sankar, 2016; Morris et al., 2001). These anomalies are associated with the underlying drop of dopamine and other alterations in brain activity affecting motor control and coordination. Considering that alterations in gait represent among the earliest symptoms of the disorder, a robust gait classification can prove highly valuable for medical staff, aiding them in the diagnostic process.

A research gap is present in the literature in terms of observing an immense amount of collected sensor data in the form of images to better address positional relations in the data while reducing computational demands through limited local connectivity in CNN. The use of CNNs is well established throughout the fields of computer vision, where CNNs excel in classification tasks. However, their application is challenging when observing continuous data measurements, such as the data acquired from the shoe-mounted sensors. By converting the sensor data to the image format, it can be conveniently used as an input for a CNN classifier. Therefore this research seeks to address this literature gap utilizing a convolutional neural network (CNN) to reduce the number of attributes, as the employed real-world medical dataset is intricate. This approach has yet to be explored in the literature and has yet to be applied to the detection of Parkinson’s disease. XGBoost model is then used to produce the final classification. Additionally, an improved version of the novel sinh cosh optimizer (SCHO) (Bai et al., 2023) has been employed to optimize the hyperparameters of the model for this specific challenge. As one of the most recent additions to the metaheuristics family, the potential of SCHO has not yet been thoroughly explored. Moreover, the empirical trials that were executed before the main experiments have shown that the elementary version of SCHO attains very promising results, and it was consequently chosen for further improvements. Hence, the principal contributions of this research can be succinctly outlined as follows:

An improved version of the SCHO algorithm was devised, to enhance the elementary variant of the metaheuristics.
This devised algorithm was incorporated as the component of the ML framework to discover the optimal collection of hyperparameters for the particular gait analysis problem.
The assessment of the proposed model was conducted with the standard gait dataset which is associated with Parkinson’s disorder. The simulation results were subsequently juxtaposed with the models optimized by alternative state-of-the-art metaheuristics algorithms, accompanied by a statistical assessment of the simulation results.
Exploring a unique perspective on Parkinson’s sensor data in the form of images in combination with CNN to better determine relations between data points.
SHAP has been employed to interpret the obtained outcomes, providing a deeper comprehension of both the model and the significance of the attributes.

The rest of the manuscript is prepared as follows. In “Background”, a literature survey on AI and data-driven diagnostic procedures is conducted, as well as medical classification problems. Additionally, a brief overview of the CNNs, XGBoost model, and metaheuristics optimization is given. “Methods” initially presents the plain version of SCHO metaheuristics, outlines its limitations, and suggests alterations to improve the algorithm. The experimental setup is detailed in “Setup”, while “Results” encompasses the simulation outcomes, and statistical assessment, followed by the SHAP analysis of top-performing models. Concluding remarks and directions for future research activities in this challenging field are presented in “Conclusion”.

Background

The incorporation of AI and data-driven methodologies presents numerous advantages in medicine, particularly in diagnostic procedures (Dai, Tang & Wang, 2019; Anikwe et al., 2022; Basile et al., 2023; Lee & Yoon, 2021). Modern diagnostic methods within Healthcare 4.0, encompassing the integration of Internet of Things (IoT) gadgets, generate a substantial data influx, which is observed as a trend that continues to escalate (Krishnamoorthy, Dua & Gupta, 2023; Kishor & Chakraborty, 2022; Greco et al., 2020; Javaid & Khan, 2021). AI methods demonstrate the capability to rapidly and accurately assess complex datasets, and in many cases surpass human medical experts in the diagnostics of different diseases and disorders. These models excel in identifying delicate patterns and disparities that humans may evade. Moreover, AI approaches exhibit high efficiency, supporting early diagnostics by discovering subtle illness markers in their earliest stages, and facilitating prompt interventions and correct choice of treatment (Hunter, Hindocha & Lee, 2022; Paul et al., 2022; Rashid et al., 2022; Van der Schaar et al., 2021). Early detection is associated with enhanced chances of patient recovery and a significant decrease in overall healthcare costs (Johnson, Albizri & Simsek, 2022; Rajpurkar et al., 2022; Muhammad et al., 2020).

The application of AI streamlines the decision-making process for healthcare experts, supporting prompt diagnostics and decreasing patient waiting delays, consequently improving the general efficacy of the healthcare systems (Basile et al., 2023; Tang et al., 2021; Stewart et al., 2023; Lång et al., 2023; Alowais et al., 2023). Well-developed trailblazing AI frameworks consistently produce results, regardless of the time of day or medical provider’s practical knowledge, contributing to a reduction in errors attributed to human factor (Yeasmin, 2019; Haleem et al., 2020; Gaba, 2018). Furthermore, AI-powered diagnostics may generate significant cost reduction through optimization of resource allocation, minimization of unnecessary tests, and prevention of wrong or late diagnoses (Blasiak, Khong & Kee, 2020; Munavalli et al., 2021; Lång et al., 2021; Dembrower et al., 2020).

Despite numerous publications dealing with the growing interest and potential of AI in medicine, and without a doubt numerous positive facets, some common drawbacks must be considered critically. First of all, claiming that AI can outperform human doctors and revolutionize medicine overnight leads to overambitious expectations, and finishes with undermined trust if the concepts are not implemented fast enough. Another point to be highlighted here is the shortage of interpretability and transparency. It is vital to comprehend why some model has made a particular prediction, notably in the medical domain, where each decision made may have life-changing consequences for the patient. Lastly, the implementation of these models in practice demands considerable resources, like funds, infrastructural changes, and training for medical workers.

Convolutional neural networks

Convolutional neural networks (CNNs or ConvNets) represent a class of deep neural networks specifically crafted for tasks related to visual data, such as images and videos. Renowned for their exceptional efficacy in computer vision applications, CNNs have consistently demonstrated cutting-edge performance across diverse challenges, like image classification, object detection, segmentation, and beyond (Li et al., 2021; Gu et al., 2018; Yamashita et al., 2018).

The core of the CNNs consists of the convolutional layers, which perform convolution operations over input data by applying filters (kernels) to identify patterns and features. Afterward, pooling layers reduce the spatial dimensions while retaining important features. Activation functions, like ReLU, are used for introducing the non-linear element to the network, which allows it to acquire intricate relations within the data. Fully connected layers, connecting each neuron in one layer to every neuron in the following layer, are employed in the final stages to perform classification tasks. Flattening is utilized to convert the output of the convolutional and pooling layers, which is fed to the fully connected layers. Batch normalization is commonly applied to standardize the input of every layer, aiding in the stabilization and acceleration of the training process. Finally, dropout serves as a regularization method wherein randomly selected cells are removed during the training phase to mitigate the risk of overfitting.

The accuracy of the model is heavily influenced by hyperparameters, making them a crucial aspect of optimization (Wang, Zhang & Zhang, 2019). Examples of hyperparameters include the number of kernels and their size in each convolutional layer, the learning rate, batch size, the architecture involving the count of convolutional and fully-connected (dense) layers, weight regularization in dense layers, the choice of activation function, the dropout rate, and more. Hyperparameter optimization is not a universally solvable process for all problems, necessitating a “trial and error” approach. However, these approaches are time-consuming and offer no guarantee of outcomes, contributing to their classification as NP-hard. Metaheuristics algorithms have shown promising outcomes in handling such challenges (Yamasaki, Honma & Aizawa, 2017; Qolomany et al., 2017; Bochinski, Senst & Sikora, 2017). For an in-depth mathematical formulation of CNN, refer to Albawi, Mohammed & Al-Zawi (2017), and a more recent exploration on the same topic is provided in Gu et al. (2018).

CNNs are frequently used for image and video recognition, medical image analysis, face recognition, and more (Krizhevsky, Sutskever & Hinton, 2012; Ranjan et al., 2017; Balaban, 2015; Spetlík, Franc & Matas, 2018; Cai, Gao & Zhao, 2020; Ting, Tan & Sim, 2019). A particularly important application of CNNs is the field of medical images, where they are successfully applied for the classification of brain tumors (Bíngol & Alatas, 2021; Bezdan et al., 2021b), breast cancer (Zuluaga-Gomez et al., 2021) and thoracic diseases (Abiyev & Ma’aitaH, 2018). Renowned pre-trained CNN architectures such as AlexNet (Krizhevsky, Sutskever & Hinton, 2017), VGGNet (Simonyan & Zisserman, 2014), ResNet (Szegedy et al., 2017), Inception (Soria Poma, Riba & Sappa, 2020), and MobileNet (Wang et al., 2020) have seen broad adoption across diverse tasks. It is noteworthy that, although CNNs are predominantly linked with computer vision tasks, their application extends beyond. They have been successfully employed in processing other types of data, such as one-dimensional signals in speech and audio processing.

XGBoost

The XGBoost (Chen & Guestrin, 2016) technique leverages a decision tree-based ensemble learning strategy to combine forecasts from numerous weak learners. Each tree, using a gradient-boosting framework, corrects faults caused by its ancestors. The efficacy of XGBoost is based on its regularization methods and parallel processing efficiency. Aside from optimization, regularization and gradient boosting can improve performance. The XGBoost model predicts using intricate relationships between input and target patterns. To improve the objective function, the XGBoost method employs an incremental training strategy. Due to an immense count of parameters requiring adjustment when tuning XGBoost, the trial and error technique is impractical. Given the intricacy of some situations, a strong model is required. The primary characteristics of a good model are speed, generalization, and accuracy.

To produce the best outcomes, the model should be trained iteratively. The objective function of the XGBoost is described by the Eq. (1)

(1) $o b j (Θ) = L (θ) + Ω (Θ),$ where $T h e t a$ is the collection of XGBoost hyperparameters, $L (T h e t a)$ is the loss function, and $O m e g a (T h e t a)$ is the regularization term. The last parameter controls the model’s complexity. The loss function depends on the specific problem being addressed.

(2) $L (Θ) = \sum_{i} (y_{i} - {\hat{y}}_{i})^{2},$ in which the $y_{i}$ is the predicted value, while the predicted target for each iteration $i$ is ${\hat{y}}_{i}$ .

(3) $L (Θ) = \sum_{i} [y_{i} \ln (1 + e^{- {\hat{y}}_{i}}) + (1 - y_{i}) \ln (1 + e^{{\hat{y}}_{i}})] .$

The purpose of this procedure is to distinguish between real and expected values. The total loss function is minimized to enhance classification.

Metaheuristics optimization

The realm of metaheuristics optimizers gained popularity due to their efficiency in resolving NP-hard tasks. The biggest hurdle is discovering solutions within an acceptable time frame and upholding manageable hardware demands. These methods may be separated into distinctive groups, but there is no strict definition. One classification commonly acknowledged by the majority of scientists is distinguishing them concerning the phenomena that inspire these algorithms. Thus, these distinctive families comprise light, swarm, genetic, physics, human, and the most recent addition in the form of mathematically inspired approaches. For example light-based methods are inspired by light propagation properties (Alatas & Bingol, 2020), and the notable techniques include ray optimization algorithm (RO) (Kaveh & Khayatazad, 2012) and optics-inspired optimization (OIO) (Kashan, 2015; Bingol & Alatas, 2020).

Drawing motivation from breeds that thrive in huge swarms and benefit from collective behavior, swarm-inspired algorithms are particularly effective when a sole individual is insufficient to accomplish a task. The swarm family of algorithms was established as highly effective in solving NP-hard problems, but to optimize their performance, it is recommended to hybridize them with similar algorithms. The challenge with these stochastic population-based methods lies in their tendency to favor either exploration or exploitation. This can be addressed by incorporating mechanisms from different solutions. Notable approaches include particle swarm optimization (PSO) (Eberhart & Kennedy, 1995), genetic algorithm (GA) (Mirjalili & Mirjalili, 2019), sine cosine algorithm (SCA) (Mirjalili, 2016), firefly algorithm (FA) (Yang & Slowik, 2020), grey wolf optimizer (GWO) (Faris et al., 2018), reptile search algorithm (RSA) (Abualigah et al., 2022), red fox algorithm (Połap & Woźniak, 2021), polar bear algorithm (Polap & Woźniak, 2017), and the COLSHADE algorithm (Gurrola-Ramos, Hernàndez-Aguirre & Dalmau-Cedeño, 2020).

Swarm algorithms find practical applications across a diverse array of real-world challenges. These applications span various domains, including glioma MRI classification (Bezdan et al., 2020), credit card fraud detection (Jovanovic et al., 2022a; Petrovic et al., 2022), global optimization problems (Strumberger et al., 2019; Zamani, Nadimi-Shahraki & Gandomi, 2022; Nadimi-Shahraki & Zamani, 2022). Additionally, swarm metaheuristics are successfully employed in cloud computing (Predić et al., 2023; Bacanin et al., 2019), enhancing the audit opinion forecasting (Todorovic et al., 2023) predicting the number of COVID-19 cases (Zivkovic et al., 2021), software engineering (Zivkovic et al., 2023), feature selection (Bezdan et al., 2021a; Jovanovic et al., 2022b; Stankovic et al., 2022), security and intrusion detection (Savanović et al., 2023; Jovanovic et al., 2022c; Salb et al., 2023), and enhancing wireless sensor networks (Zivkovic et al., 2020a, 2020b).

The authors in Ahmadpour, Ghadiri & Hajian (2021) devised a genetic algorithm-grounded method for monitoring patients’ blood pressure, leading to a significant enhancement in their overall quality of life and enabling the early diagnosis of some preventable illnesses. Khan & Algarni (2020) investigated an IoT environment that facilitates continuous monitoring of patients’ conditions, resulting in substantial improvements in their cardiovascular health. Illustrative instances of AI-assisted medical diagnosis encompass the detection of diabetic retinopathy (Gupta & Chhikara, 2018), classification of skin lesions (Mahbod et al., 2019), categorization of lung cancer (Ren, Zhang & Wang, 2022), and applications such as magnetic resonance imaging (MRI) and X-ray imaging within the medical field (Zivkovic et al., 2022; Budimirovic et al., 2022).

Materials and Methods

This section begins by presenting the basic sinh cosh optimizer, followed by the suggested modifications that improve the performance of the original implementation.

The original sinh cosh optimizer

SCHO is a recent metaheuristics algorithm developed by Bai et al. (2023). It is a mathematically inspired method, as it relies on the properties of sinh and cosh. Hyperbolic functions encompass common trigonometric functions, with sinh and cosh being fundamental examples. Metaheuristic algorithms may benefit from two key characteristics of cosh and sinh. Firstly, cosh values consistently exceed one, serving as a crucial threshold between exploration and exploitation. Secondly, sinh values fall within the interval [−1, 1], approaching zero, thereby enhancing both exploration and exploitation aspects.

As a metaheuristic relying on population-based methods, the algorithm sets an initial population characterized by a considerable degree of randomness, as illustrated in Eq. (4).

(4) $A = [\begin{matrix} a_{1, 1} . . . a_{1, j} . . . a_{1, D} \\ a_{1, 2} . . . a_{2, j} . . . a_{2, D} \\ a_{N, 1} . . . a_{N, j} . . . a_{N, D} \end{matrix}]$

In this context, P represents a group of solutions, where the position $A_{i, j}$ of each agent is calculated according to Eq. (5). Here, the variables D and N signify the dimensional space of the solution and the number of solutions, respectively.

(5) $a = r n d (N, D) \times (u b, l b) + l b$ where $r n d$ represents an arbitrary value, while $u b$ and $l b$ mark the upper and lower boundaries of the search domain.

After the initialization phase, the algorithm must strike a balance between exploration and exploitation, directing solutions toward promising regions within the search space. Exploration is divided into two strategies, and the equilibrium is controlled by Eq. (6):

(6) $S = f l o o r (\frac{T}{c t})$ here T represents the maximum count of rounds, and $c t$ denotes a control parameter with value empirically determined to be $3.6$ .

In the exploration phase, solutions are updated as defined by Eq. (7):

(7) $A_{(i, j)}^{t + 1} = {\begin{matrix} A_{b e s t}^{(j)} + r_{1} \times W_{1} \times A_{(i, j)}^{t} & r_{2} > 0.5 \\ A_{b e s t}^{(j)} - r_{1} \times W_{1} \times A_{(i, j)}^{t} & r_{2} ∖ l t 0.5 \end{matrix}$

In this context, $t$ represents the iteration number, $A^{t + 1} (i, j)$ describes the $j$ -th dimension of the $i$ -th agent, and $A^{(j)} b e s t$ denotes the best agent in the $j$ dimension. Random values within the range of $[0, 1]$ are chosen for $r_{1}$ and $r_{2}$ . The coefficient $W_{1}$ signifies a weighted coefficient for the specific agent and can be calculated as follows:

(8) $W_{1} = r_{3} \times b_{1} \times (c o s h r_{4} + μ \times s i n h r_{4} - 1)$

The value of $b_{1}$ is progressively reduced throughout the iterations, while $r_{3}$ and $r_{4}$ are randomly chosen within limits $[0, 1]$ . Additionally, a sensitivity parameter is defined and denoted as $μ$ .

During exploration, the second strategy involves the application of Eq. (9)

(9) $A_{(i, j)}^{t + 1} = {\begin{cases} A_{b e s t}^{(j)} + | ϵ \times W_{2} \times A_{b e s t}^{(j)} - A_{i, j}^{(t)} | & r_{5} > 0.5 \\ A_{b e s t}^{(j)} - | ϵ \times W_{2} \times A_{b e s t}^{(j)} - A_{i, j}^{(t)} | & r_{5} < 0.5 \end{cases}$

Within this equation, $ϵ$ is configured to the recommended value of $0.003$ from the original publication. The weight coefficient $W_{2}$ is calculated as follows:

(10) $W_{2} = r_{6} \times b_{2}$ where $r_{6}$ represents a random number drawn from $[0, 1]$ , while $b_{2}$ represents a slowly decreasing value.

Another significant phase of the optimization process is exploitation, when solutions concentrate on promising regions of the search realm, taking more refined steps in the direction of the optima. One more time, the metaheuristic employs two techniques. The initial stage utilizes Eq. (11).

(11) $A_{(i, j)}^{t + 1} = {\begin{matrix} A_{b e s t}^{(j)} + r_{7} \times W_{3} \times A_{(i, j)}^{t} & r_{8} > 0.5 \\ A_{b e s t}^{(j)} - r_{7} \times W_{3} \times A_{(i, j)}^{t} & r_{8} ∖ l t 0.5 \end{matrix}$ the parameters $r_{7}$ and $r_{8}$ are selected within limits $[0, 1]$ , while $W_{3}$ is established as follows:

(12) $W_{3} = r_{9} \times b_{1} \times (c o s h r_{10} + μ \times s i n h r_{10})$ where $r_{9}$ and $r_{10}$ denote arbitrarily chosen values within $[0, 1]$ .

The second technique relies on the Eq. (13):

(13) $A_{(i, j)}^{t + 1} = A_{(i, j)}^{t} + r_{11} \times \frac{s i n g r_{12}}{c o s h r_{1} 2} | W_{2} \times A_{b e s t}^{t} - A_{i, j}^{t} |$ where $r_{11}$ and $r_{12}$ are arbitrarily chosen within $[0, 1]$ .

The modified SCHO algorithm

While the basic SCHO algorithm performs well, being a relatively new method, it has plenty of space for improvement. Exploration inside this algorithm is lacking, according to evaluation utilizing CEC (Jiang et al., 2018) standard assessment methodologies. The improved version seeks to address this restriction by incorporating two more strategies.

The first technique is based on the artificial bee colony (ABC) (Karaboga & Basturk, 2008) algorithm. When tired solutions fail to improve, they are discarded and replaced with newly created solutions. Because there are only two iterations in this experiment, solutions that fail to demonstrate improvement are discarded after two iterations if no progress is seen. This method has been shown to improve exploration.

The second approach mentioned is quasi-reflective learning (QRL) (Fan, Chen & Xia, 2020), which is used to generate unique solutions and advance research. This approach is also used to produce potential solutions during the algorithm’s early stages. A given solution X’s quasi-reflected counterpart $z$ is calculated as follows:

(14) $X_{z}^{q r} = r a n d (\frac{l b_{z} + u b_{z}}{2}, x_{z})$ here lb and ub represent the lower and upper limitations of the search realm and rand marks an arbitrary value inside the observed interval. The suggested method is labeled simply modified SCHO (MSCHO). The pseudocode of the proposed optimizer is exhibited in Algorithm 1.

Algorithm 1 :

Pseudocode of the suggested MSCHO algorithm.

1: Set initial parameter values

2: Initialize agent populace P by applying QRL

3: while T > t do

4: Utilized appropriate SCHA search technique to reposition agents

5: Compute agent objective function outcome

6: for each agent A in P do

7: if A has not shown improvement in 2 iterations then

8: Initialize a new agent a by applying QRL

9: end if

10: end for

11: end while

12: return Best attained agent in P

DOI: 10.7717/peerj-cs.2031/table-18

Introduced framework

In this work, a two-layer framework is introduced to reduce computational demands while still maintaining a holistic observation of patient gait. Each patient’s walking gait recordings consist of several sensor readings over time. It can be difficult to observe this data using a simple approach without the use of feature engineering. This can in turn reduce the amount of information available to the algorithm when making a decision. An additional drawback of this approach is that different studies may require the application of different techniques depending on the sensors used and procedures followed during the study.

A flowchart of the framework can be observed in Fig. 1. The framework is capable of automatically adapting to the given task. The two layers in the framework form a collaborative unit that seeks to improve overall accuracy through feature selection and parameter optimization. The first layer of the framework leverages a CNN for feature selection. The second layer of the framework applies the XGBoost algorithm optimized by several contemporary metastatic tasked with demonstrating the best objective function outcomes.

Figure 1: Flowchart of the introduced framework.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-1

The CNN parameters of the first layer of the framework are optimized by the introduced MSCHO algorithm. Respective optimization ranges empirically determined and for this experiment are as follows; learning rate $[0.01, 0.0001]$ , count of training epochs $[5, 25]$ , count of convolutions layers $[1, 3]$ , units in each the convolutional layer $[8, 64]$ number of output features $[8, 16]$ . Models are tuned and trained until a predetermined accuracy is surpassed. In this work, an accuracy threshold of 90% is used. The final constructed model architecture is comprised of one convolutional layer with $16$ neurons. A learning rate of $0.001225$ , $20$ training epochs are selected by the MSCHO algorithm to provide the best outcomes. Pooling layers are not optimized in this study. The optimization aims to attain a lighter CNN architecture suitable for feature selection while maintaining low resource demands. A depiction of the best architecture is provided in Fig. 2.

Figure 2: Framework layer 1 CNN model architecture.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-2

The reduced feature set is used by the second layer of the framework to further enhance performance. An interpretation of the best constructed CNN utilizing the SHAP (Lundberg & Lee, 2017) model on a selection of 10 random samples is provided in Fig. 3. Input images were zoomed in to improve readability. Once a reduced feature space is determined several metaheuristics are tasked with constructive well-tuned XGBoost models. The specifics of the setup for XGBoost are provided in detail in the following section.

Figure 3: Best constructed CNN model feature importance according to SHAP explainer.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-3

Experimental setup

To assess the introduced approach for Parkinson’s detection from patient gait data from publicly available sources is used (https://physionet.org/content/gaitpdb/1.0.0/). The utilized dataset encompasses three clinical studies (Hausdorff et al., 2007; Frenkel-Toledo et al., 2005; Yogev et al., 2005). Gait pattern measurements for 93 patients with a clinical diagnosis of Parkinson’s are present in the dataset. Additionally, gait measurements for 73 healthy individuals are included in the control group. Patient gait data is collected using a network of 16 sensors positioned on the patient’s shoe soles as depicted in Fig. 4.

Figure 4: Shoe sole sensor positions.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-4

Each sensor output is digitized at a sampling rate of 100 Hz. An additional two columns are conducted in the dataset that represents the sum of all the sensors for each foot. This is done to account for variance in patient weight and variance in balance. More precisely, these columns account for patient weight balance on each foot, as different patients have different weights, therefore the intensity they exert on accelerometers (primarily on impact with the floor) will be slightly different. Additionally, the patients do not have the same center of mass, as they gravitate toward a slightly different balance point in their gait (due to injury, poor posture, and other factors). Therefore, the sum of all values on a given foot at a given moment is added together and introduced as a final column. Using this data, it is possible to examine the force record in relation to time and location, generate metrics that represent the center of pressure over time, and establish timing metrics for each foot.

To enable efficient processing of the available data via CNN-s, patient gait recordings are converted into a 2-dimensional black and white image. It has a similar format to the image shown in Fig. 5. However, the figure shown in the manuscript is resized and stretched to improve readability. The images used for the dataset have significantly lower sizes approximately 20 by 100 pixels with a small variance depending on patient pace.

Figure 5: Sample generated gait image.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-5

Each patient’s gait recordings are separated into batches of 20 steps per sample and appropriately labeled as per patient diagnosis. Each dataset is then separated into training and testing sections. Of the available data 70% is used to train and optimize models, and the remaining 30% is withdrawn for testing. Three experiments are conducted with each of the available datasets.

Simulations in the second layer of the framework involve applying several metaheuristics to the optimization of XGBoost hyperparameters. The optimized hyperparameters are selected due to their high influence on model performance and their respective ranges include the learning rate $[0.1, 0.9]$ , minimum child weight $[1, 10]$ , subsample $[0.1, 1.0]$ , colsample by tree $[0.01, 1.00]$ , max depth $[3, 10]$ and $γ$ $[0, 0.8]$ . Respective ranges have been empirically determined.

Several algorithms are included in a comparative analysis to determine the effectiveness of the introduced approach. These include the original SCHO (Bai et al., 2023) algorithm, SCA (Mirjalili, 2016), GA (Mirjalili & Mirjalili, 2019), PSO (Eberhart & Kennedy, 1995), FA (Yang & Slowik, 2020), WOA (Mirjalili & Lewis, 2016), BSO (Shi, 2011), RSA (Abualigah et al., 2022), COA (Jia et al., 2023) and COLSHADE (Gurrola-Ramos, Hernàndez-Aguirre & Dalmau-Cedeño, 2020) algorithms. All metaheuristics are implemented under identical conditions using parameters suggested in the works that originally introduced the algorithm. The population size of 10 agents is used by each algorithm with 15 iterations allocated to improve outcomes. To account for the randomness inherent in metaheuristics algorithms simulations are carried out through 30 experiments. The restricted number of iterations was selected due to the high computational requirements of the simulations. Nevertheless, during empirical trials, it was observed that all algorithms converged successfully within 15 iterations, and additional rounds would not yield significant improvements. In addition to optimized models, a baseline XGBoost implementation without optimization is included in the comparison.

As the evaluated algorithms optimized classification models, several metrics are included in the assessment to ensure a thorough evaluation. Apart from the standard accuracy, precision, sensitivity, and F1-measure metrics shown in Eqs. (15)–(18), the Cohen’s kappa (Warrens, 2015) metric is also tracked during experimentation calculated according to Eq. (19). These metrics are crucial in the evaluation of the model’s performance level. For example, precision is very useful in the cases of imbalanced datasets and where the cost of false positives is high, as it focuses solely on the positively classified entries. On the other hand, sensitivity is vital in scenarios where it is important to correctly identify positive entries, where high values of sensitivity indicate that the model is efficient in capturing the majority of relevant instances. Based on precision and sensitivity together, the F1-measure is utilized to establish the classification threshold. The larger values of the F1-measure suggest that the regarded model has an appropriate balance betwixt precision and sensitivity, thus being capable of efficiently classifying entries belonging to both classes, without favoring one over another. Finally, Cohen’s kappa value measures the inter-rater agreement between classifiers, and it is particularly useful when dealing with imbalanced datasets, as it is considered to be a very robust and reliable indicator.

(15) $A c c u r a c y = \frac{T P + T N}{T P + F P + T N + F N}$

(16) $P r e c i s i o n = \frac{T P}{T P + F P}$

(17) $S e n s i t i v i t y = \frac{T P}{T P + F N}$

(18) $F - 1 s c o r e = \frac{2 \cdot P r e c i s i o n \cdot S e n s i t i v i t y}{P r e c i s i o n + S e n s i t i v i t y}$ here the TP, TN, FP, and FN, denote true positive, true negative, false positive, and false negative values respectively.

(19) $κ = \frac{p_{o} - p_{e}}{1 - p_{e}} = 1 - \frac{1 - p_{o}}{1 - p_{e}}$ where $p_{o}$ represents an observed value while $p_{e}$ is the expected. By utilizing Cohen’s kappa score the optimization challenge is stated as a maximization task.

Simulation outcomes

Simulations are carried out through three experiments, each with one of the available datasets. In each simulation, metaheuristics are tasked with accurately classifying patient gait. Following the simulations, the attained scores are methodically statistically verified.

Yogev et al. (2005) dataset simulations

Comparisons in terms of best, worst mean and median executions for the objective function are shown in Table 1 and in terms of indicator (Cohen’s kappa) are similarly shown in Table 2. Stability comparisons are shown in terms of Std and Var.

Table 1:

Yogev et al. (2005) dataset objective function outcomes.

Method	Best	Worst	Mean	Median	Std	Var
CNN-XG-MSCHO	0.029499	0.032448	0.030973	0.030973	0.001475	2.18E−06
CNN-XG-SCHO	0.032448	0.038348	0.037365	0.038348	0.002199	4.83E−06
CNN-XG-SCA	0.029499	0.032448	0.030482	0.029499	0.001391	1.93E−06
CNN-XG-GA	0.029499	0.038348	0.032448	0.030973	0.003406	1.16E−05
CNN-XG-PSO	0.029499	0.038348	0.033432	0.032448	0.003679	1.35E−05
CNN-XG-FA	0.029499	0.035398	0.031465	0.030973	0.002199	4.83E−06
CNN-XG-WOA	0.029499	0.032448	0.030973	0.030973	0.001475	2.18E−06
CNN-XG-BSO	0.032448	0.035398	0.032940	0.032448	0.001099	1.21E−06
CNN-XG-RSA	0.029499	0.035398	0.032940	0.033923	0.002648	7.01E−06
CNN-XG-COA	0.029499	0.038348	0.034415	0.035398	0.002781	7.73E−06
CNN-XG-COLSHADE	0.032448	0.035398	0.034415	0.035398	0.001391	1.93E−06

DOI: 10.7717/peerj-cs.2031/table-1

Table 2:

Yogev et al. (2005) dataset indicator (Cohen’s kappa) outcomes.

Method	Best	Worst	Mean	Median	Std	Var
CNN-XG-MSCHO	0.934485	0.927779	0.931132	0.931132	0.003353	1.12E−05
CNN-XG-SCHO	0.927779	0.914279	0.916467	0.914279	0.005061	2.56E−05
CNN-XG-SCA	0.934485	0.928088	0.932301	0.934485	0.003090	9.55E−06
CNN-XG-GA	0.934485	0.914279	0.927651	0.930976	0.007824	6.12E−05
CNN-XG-PSO	0.934485	0.914279	0.925353	0.927623	0.008423	7.10E−05
CNN-XG-FA	0.934485	0.921043	0.930061	0.931287	0.004986	2.49E−05
CNN-XG-WOA	0.934485	0.928088	0.931287	0.931287	0.003199	1.02E−05
CNN-XG-BSO	0.927779	0.920702	0.926547	0.927779	0.002617	6.85E−06
CNN-XG-RSA	0.934485	0.921043	0.926703	0.924580	0.005981	3.58E−05
CNN-XG-COA	0.934485	0.914279	0.923273	0.921043	0.006408	4.11E−05
CNN-XG-COLSHADE	0.927779	0.921043	0.923405	0.921550	0.002992	8.95E−06

DOI: 10.7717/peerj-cs.2031/table-2

In Table 1, CNN-XG-MSCHO exhibits the lowest Best value, indicating strong potential in optimal scenarios. CNN-XG-BSO stands out for its consistency, having the lowest standard deviation and variance, suggesting predictable performance.

In Table 2, the CNN-XG-MSCHO, CNN-XG-SCA, and CNN-XG-WOA methods consistently show high Best, Mean, and Median Kappa values, suggesting they are highly effective. The CNN-XG-GA and CNN-XG-PSO methods have wider ranges in performance (noted in their higher standard deviation and variance), indicating less consistent but potentially high-quality outcomes. This suggests a trade-off between achieving peak performance and maintaining consistency across different evaluations.

A graphical comparison for the objective and indicator functions outcome distributions is provided in Fig. 6.

Figure 6: Yogev et al. (2005) objective and Cohen kappa distributions plots.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-6

Distribution plots in Fig. 6 indicate that a high stability is demonstrated by the introduced algorithm in terms of objective and indicator function outcomes suggesting a strong reliability of the tested algorithms. An interesting observation can be made on the violin plots of the objective function. Multiple peaks in the objective function suggest that two local minimums might be present in the optimization space for this problem. However the SCA algorithm, as well as the introduced modified version managed to overcome this local optima attaining better overall outcomes, something the original SCHO algorithm did not manage to overcome. The improvements introduced by the modified mechanisms are therefore evident. A strong convergence hindered several algorithms in the simulation. Algorithms such as the original SCHO, BSOA, and COLSHADE showcased a limited exploration ability. Despite the BSO algorithm attaining the highest stability it also fails to locate the best solution within the search space relative to those located by other algorithms.

The final execution outcomes of each algorithm are shown in swarm plots for the objective and indicator evaluations in Fig. 7. As shown in Fig. 7 solution clustering for each algorithm does happen around the previously mentioned local minimum however, the original algorithm fails to field a solution that meets the performance of the solutions located by other algorithms. Nevertheless, the introduced modified algorithm manages to outperform the original and meet the performance of the best-performing competing metaheuristics.

Figure 7: Yogev et al. (2005) objective and Cohen kappa swarm plots.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-7

Convergence rate comparisons between optimizers are provided in Fig. 8. Convergence demonstrated during optimizations through the interactions depicted in Fig. 8 brings to light several factors associated with each optimizer. Notably the exploration and exploitation rations between algorithms. Algorithms that converge too quickly often fail to find an optimal solution and get stuck in a local optimal such as the the case with the BSO. Similarly, algorithms with very slow convergence, fail to locate a solution within the allocated optimization period, yielding poor results such as the case with the COLSHADE algorithm. The modified algorithm showcases a good balance between these mechanisms locating an optimal solution after around 50% of the total allocated iterations.

Figure 8: Yogev et al. (2005) objective and Cohen kappa convergence graphs.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-8

Detailed metrics comparisons in terms of precision, sensitivity, F1-measure, and accuracy between the best-performing algorithms are provided in Table 3. The table presents detailed metrics for the best-performing models in detecting Parkinson’s disease using CNN-XGBoost methods. Notably, several methods (CNN-XG-MSCHO, SCA, GA, PSO, FA, WOA, RSA, COA) exhibit remarkable consistency across precision, sensitivity, and F1-measure for both control and Parkinson’s disease (PD) patients, with high accuracy. These models demonstrate excellent precision (above 0.94 for control and above 0.98 for PD patients) and sensitivity (above 0.96 for both groups), leading to high F1-measures (above 0.95), indicating their robustness and reliability in classification tasks. The results reflect the effectiveness of these CNN-XGBoost methods in accurately diagnosing Parkinson’s disease.

Table 3:

Yogev et al. (2005) dataset detailed metrics for best-constructed models.

Method	Metric	Control	PD patients	Accuracy	Macro avg.	Weighted avg.
CNN-XG-MSCHO	Precision	0.940678	0.986425	0.970501	0.963552	0.971041
	Sensitivity	0.973684	0.968889	0.970501	0.971287	0.970501
	F1-measure	0.956897	0.977578	0.970501	0.967238	0.970623
CNN-XG-SCHO	Precision	0.940171	0.981982	0.967552	0.961076	0.967922
	Sensitivity	0.964912	0.968889	0.967552	0.966901	0.967552
	F1-measure	0.952381	0.975391	0.967552	0.963886	0.967653
CNN-XG-SCA	Precision	0.940678	0.986425	0.970501	0.963552	0.971041
	Sensitivity	0.973684	0.968889	0.970501	0.971287	0.970501
	F1-measure	0.956897	0.977578	0.970501	0.967238	0.970623
CNN-XG-GA	Precision	0.940678	0.986425	0.970501	0.963552	0.971041
	Sensitivity	0.973684	0.968889	0.970501	0.971287	0.970501
	F1-measure	0.956897	0.977578	0.970501	0.967238	0.970623
CNN-XG-PSO	Precision	0.940678	0.986425	0.970501	0.963552	0.971041
	Sensitivity	0.973684	0.968889	0.970501	0.971287	0.970501
	F1-measure	0.956897	0.977578	0.970501	0.967238	0.970623
CNN-XG-FA	Precision	0.940678	0.986425	0.970501	0.963552	0.971041
	Sensitivity	0.973684	0.968889	0.970501	0.971287	0.970501
	F1-measure	0.956897	0.977578	0.970501	0.967238	0.970623
CNN-XG-WOA	Precision	0.940678	0.986425	0.970501	0.963552	0.971041
	Sensitivity	0.973684	0.968889	0.970501	0.971287	0.970501
	F1-measure	0.956897	0.977578	0.970501	0.967238	0.970623
CNN-XG-BSO	Precision	0.940171	0.981982	0.967552	0.961076	0.967922
	Sensitivity	0.964912	0.968889	0.967552	0.966901	0.967552
	F1-measure	0.952381	0.975391	0.967552	0.963886	0.967653
CNN-XG-RSA	Precision	0.940678	0.986425	0.970501	0.963552	0.971041
	Sensitivity	0.973684	0.968889	0.970501	0.971287	0.970501
	F1-measure	0.956897	0.977578	0.970501	0.967238	0.970623
CNN-XG-COA	Precision	0.940678	0.986425	0.970501	0.963552	0.971041
	Sensitivity	0.973684	0.968889	0.970501	0.971287	0.970501
	F1-measure	0.956897	0.977578	0.970501	0.967238	0.970623
CNN-XG-COLSHADE	Precision	0.940171	0.981982	0.967552	0.961076	0.967922
	Sensitivity	0.964912	0.968889	0.967552	0.966901	0.967552
	F1-measure	0.952381	0.975391	0.967552	0.963886	0.967653
	Support	114	225

DOI: 10.7717/peerj-cs.2031/table-3

A comparison between the best-constructed models as well as the base XGBoost method in terms of error rate is provided in Table 4. The table compares the error rates of various CNN-XGBoost models in detecting Parkinson’s disease. Most models, including CNN-XG-MSCHO, SCA, GA, PSO, FA, WOA, RSA, and COA, exhibit remarkably low error rates of 0.029499. In contrast, CNN-XG-SCHO, BSO, and COLSHADE show slightly higher rates at 0.032448. Notably, the base XGBoost method (CNN-XG) has a significantly higher error rate of 0.056047. This comparison highlights the enhanced accuracy of the CNN-XGBoost methods over the base model, demonstrating the effectiveness of integrating CNN with XGBoost for this application.

Table 4:

Yogev et al. (2005) error rate compassion between the best-constructed models.

Method	Best model error rate
CNN-XG-MSCHO	0.029499
CNN-XG-SCHO	0.032448
CNN-XG-SCA	0.029499
CNN-XG-GA	0.029499
CNN-XG-PSO	0.029499
CNN-XG-FA	0.029499
CNN-XG-WOA	0.029499
CNN-XG-BSO	0.032448
CNN-XG-RSA	0.029499
CNN-XG-COA	0.029499
CNN-XG-COLSHADE	0.032448
CNN-XG	0.056047

DOI: 10.7717/peerj-cs.2031/table-4

Finally, parameter selections made by each algorithm for the best-performing models are shown in Table 5. The table showcases the parameter selections for the best-performing CNN-XGBoost models in Parkinson’s disease detection. A key observation is the preference for a high learning rate (0.9) in several methods like CNN-XG-MSCHO, SCA, GA, PSO, FA, WOA, indicating a faster learning process. Another notable aspect is the variation in ‘Min Child Weight’, ‘Max Depth’, and ‘Gamma’ across methods, reflecting the different strategies for handling overfitting and model complexity. For example, CNN-XG-GA and PSO opt for a ‘Gamma’ of 0, suggesting less regularization, while others like SCA and FA choose a higher ‘Gamma’ for more. This diversity in parameters underlines the customization and optimization involved in each method to achieve the best results.

Table 5:

Yogev et al. (2005) dataset best model parameter selections.

Method	Learning rate	Min child W.	Subsample	Colsample by tree	Max depth	Gamma
CNN-XG-MSCHO	0.900000	7.689301	1.000000	0.793686	6.000000	0.628655
CNN-XG-SCHO	0.726800	4.427460	0.404638	0.667566	6.225021	0.542562
CNN-XG-SCA	0.900000	7.409894	1.000000	1.000000	8.000000	0.800000
CNN-XG-GA	0.900000	10.000000	1.000000	1.000000	5.000000	0.000000
CNN-XG-PSO	0.900000	10.000000	1.000000	1.000000	6.000000	0.000000
CNN-XG-FA	0.900000	7.435905	1.000000	1.000000	9.000000	0.800000
CNN-XG-WOA	0.900000	7.488228	1.000000	1.000000	9.000000	0.800000
CNN-XG-BSO	0.563712	9.720787	0.809317	1.000000	4.000000	0.235176
CNN-XG-RSA	0.598110	10.000000	1.000000	1.000000	10.000000	0.205231
CNN-XG-COA	0.641151	10.000000	1.000000	1.000000	6.000000	0.118814
CNN-XG-COLSHADE	0.416614	1.000000	0.795437	1.000000	9.000000	0.522865

DOI: 10.7717/peerj-cs.2031/table-5

Hausdorff et al. (2007) dataset simulations

Comparisons in terms of best, worst mean and median executions for the objective function are shown in Table 6 and in terms of indicator (Cohen’s kappa) are similarly shown in Table 7. Stability comparisons are shown in terms of Std and Var. Notably, CNN-XG-GA and CNN-XG-BSO exhibit remarkable consistency (zero standard deviation and variance) with identical best, worst, mean, and median values, suggesting highly stable performance. In contrast, CNN-XG-MSCHO, PSO, and FA show a wider range of outcomes but still maintain low error rates, indicating effectiveness albeit with less consistency. This comparison highlights differences in stability and performance range among the methods, underscoring the balance between achieving low error rates and maintaining consistent outcomes.

Table 6:

Hausdorff et al. (2007) dataset objective function outcomes.

Method	Best	Worst	Mean	Median	Std	Var
CNN-XG-MSCHO	0.018088	0.023256	0.020672	0.020672	0.002110	4.45E−06
CNN-XG-SCHO	0.020672	0.023256	0.021533	0.020672	0.001218	1.48E−06
CNN-XG-SCA	0.020672	0.023256	0.021102	0.020672	0.000963	9.27E−07
CNN-XG-GA	0.020672	0.020672	0.020672	0.020672	0.000000	0.000000
CNN-XG-PSO	0.018088	0.023256	0.020241	0.020672	0.001776	3.15E−06
CNN-XG-FA	0.018088	0.023256	0.020672	0.020672	0.001492	2.23E−06
CNN-XG-WOA	0.020672	0.023256	0.021102	0.020672	0.000963	9.27E−07
CNN-XG-BSO	0.020672	0.020672	0.020672	0.020672	0.000000	0.000000
CNN-XG-RSA	0.018088	0.020672	0.019811	0.020672	0.001218	1.48E−06
CNN-XG-COA	0.018088	0.020672	0.019811	0.020672	0.001218	1.48E−06
CNN-XG-COLSHADE	0.018088	0.020672	0.020241	0.020672	0.000963	9.27E−07

DOI: 10.7717/peerj-cs.2031/table-6

Table 7:

Hausdorff et al. (2007) dataset indicator (Cohen’s kappa) outcomes.

Method	Best	Worst	Mean	Median	Std	Var
CNN-XG-MSCHO	0.942407	0.925198	0.933817	0.933846	0.007026	4.94E−05
CNN-XG-SCHO	0.933846	0.925198	0.931074	0.933846	0.004162	1.73E−05
CNN-XG-SCA	0.934509	0.925198	0.932515	0.933846	0.003281	1.08E−05
CNN-XG-GA	0.933846	0.933846	0.933846	0.933846	0.000000	0.000000
CNN-XG-PSO	0.942407	0.925198	0.935369	0.934178	0.005886	3.46E−05
CNN-XG-FA	0.942407	0.925198	0.933832	0.933846	0.004968	2.47E−05
CNN-XG-WOA	0.933846	0.925198	0.932405	0.933846	0.003223	1.04E−05
CNN-XG-BSO	0.933846	0.933846	0.933846	0.933846	0.000000	0.000000
CNN-XG-RSA	0.942407	0.933846	0.936700	0.933846	0.004036	1.63E−05
CNN-XG-COA	0.942407	0.933846	0.936700	0.933846	0.004036	1.63E−05
CNN-XG-COLSHADE	0.942407	0.933846	0.935273	0.933846	0.003190	1.02E−05

DOI: 10.7717/peerj-cs.2031/table-7

Table 7 displays the Cohen’s Kappa outcomes for various CNN-XGBoost methods, with CNN-XG-GA and CNN-XG-BSO showing remarkable stability (zero standard deviation and variance). CNN-XG-MSCHO, PSO, and FA exhibit higher Kappa values, suggesting superior agreement in certain scenarios, but with greater variability in results. The range of best-to-worst values across methods is narrow, indicating overall good and consistent performance in classification accuracy. This balance between high performance and stability is crucial in applications like medical diagnostics.

Additionally, a graphical comparison for the objective and indicator functions outcome distributions is provided in Fig. 9. Distribution plots in Fig. 9 indicate a strong presence of a local optimum within the search space. As evident, many algorithms fail to locate a relative best solution due to this local optima. The original SCHO, SCA, GA, WOA, and BSO all overly focus on local best solutions suggesting a lack of diversification within these methods. The introduced algorithm effectively covers to wards an optimal matching of the performance of the best algorithms. While high stability is demonstrated by several algorithms, these fail to locate a relatively best solution in this simulation. The improvements introduced by the modified mechanisms are therefore an effective way of overcoming the exploration limitations observed in the original.

Figure 9: Hausdorff et al. (2007) objective and Cohen kappa distributions plots.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-9

The final execution outcomes of each algorithm are shown in swarm plots for the objective and indicator evaluations in Fig. 10. As shown in Fig. 10 solution clustering for each algorithm does happen around the previously mentioned local minimum however, the original algorithm fails to field a solution that meets the performance of the solutions located by other algorithms. Nevertheless, the introduced modified algorithm manages to outperform the original and meet the performance of the best-performing competing metaheuristics. Additionally, many of the solutions are located closed to the true optima indicating a strong reliability.

Figure 10: Hausdorff et al. (2007) objective and Cohen kappa swarm plots.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-10

Convergence rate comparisons between optimizers are provided in Fig. 11. Convergence demonstrated during optimizations through the interactions depicted in Fig. 11 brings to light several factors associated with each optimizer. Notably the exploration and exploitation rations between algorithms. Algorithms that converge too quickly often fail to find an optimal solution and get stuck in a local optimal such as the case with the GA. Similarly, algorithms with very slow convergence, fail to locate a solution within the allocated optimization period, yielding poor results such as the case with the BSO algorithm. The modified algorithm showcases a good balance between these mechanisms in this simulataion as well. While a slower converges is evident it remains steady though the iterations with a local optima located in the final few iterations.

Figure 11: Hausdorff et al. (2007) objective and Cohen kappa convergence graphs.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-11

Detailed metrics comparisons in terms of precision, sensitivity, F1-measure, and accuracy between the best-performing algorithms are provided in Table 8. The table reveals that most CNN-XGBoost methods demonstrate exceptionally high performance in precision, sensitivity, and F1-measure for both control and PD patients, with accuracy consistently above 97%. Notably, methods like CNN-XG-MSCHO, PSO, FA, RSA, COA, and COLSHADE show remarkable precision and sensitivity values above 94% and 96% respectively, leading to F1-measures above 95%. This uniformity in high performance across different metrics indicates the robustness and reliability of these methods in accurately classifying PD patients.

Table 8:

Hausdorff et al. (2007) dataset detailed metrics for best-constructed models.

Method	Metric	Control	PD patients	Accuracy	Macro avg.	Weighted avg.
CNN-XG-MSCHO	Precision	0.947368	0.990354	0.981912	0.968861	0.982023
	Sensitivity	0.960000	0.987179	0.981912	0.973590	0.981912
	F1-measure	0.953642	0.988764	0.981912	0.971203	0.981958
CNN-XG-SCHO	Precision	0.946667	0.987179	0.979328	0.966923	0.979328
	Sensitivity	0.946667	0.987179	0.979328	0.966923	0.979328
	F1-measure	0.946667	0.987179	0.979328	0.966923	0.979328
CNN-XG-SCA	Precision	0.935065	0.990323	0.979328	0.962694	0.979614
	Sensitivity	0.960000	0.983974	0.979328	0.971987	0.979328
	F1-measure	0.947368	0.987138	0.979328	0.967253	0.979431
CNN-XG-GA	Precision	0.946667	0.987179	0.979328	0.966923	0.979328
	Sensitivity	0.946667	0.987179	0.979328	0.966923	0.979328
	F1-measure	0.946667	0.987179	0.979328	0.966923	0.979328
CNN-XG-PSO	Precision	0.947368	0.990354	0.981912	0.968861	0.982023
	Sensitivity	0.960000	0.987179	0.981912	0.973590	0.981912
	F1-measure	0.953642	0.988764	0.981912	0.971203	0.981957
CNN-XG-FA	Precision	0.947368	0.990354	0.981912	0.968861	0.982023
	Sensitivity	0.960000	0.987179	0.981912	0.973590	0.981912
	F1-measure	0.953642	0.988764	0.981912	0.971203	0.981958
CNN-XG-WOA	Precision	0.946667	0.987179	0.979328	0.966923	0.979328
	Sensitivity	0.946667	0.987179	0.979328	0.966923	0.979328
	F1-measure	0.946667	0.987179	0.979328	0.966923	0.979328
CNN-XG-BSO	Precision	0.946667	0.987179	0.979328	0.966923	0.979328
	Sensitivity	0.946667	0.987179	0.979328	0.966923	0.979328
	F1-measure	0.946667	0.987179	0.979328	0.966923	0.979328
CNN-XG-RSA	Precision	0.947368	0.990354	0.981912	0.968861	0.982023
	Sensitivity	0.960000	0.987179	0.981912	0.973590	0.981912
	F1-measure	0.953642	0.988764	0.981912	0.971203	0.981958
CNN-XG-COA	Precision	0.947368	0.990354	0.981912	0.968861	0.982023
	Sensitivity	0.960000	0.987179	0.981912	0.973590	0.981912
	F1-measure	0.953642	0.988764	0.981912	0.971203	0.981958
CNN-XG-COLSHADE	Precision	0.947368	0.990354	0.981912	0.968861	0.982023
	Sensitivity	0.960000	0.987179	0.981912	0.973590	0.981912
	F1-measure	0.953642	0.988764	0.981912	0.971203	0.981958
	Support	75	312

DOI: 10.7717/peerj-cs.2031/table-8

A comparison between the best-constructed models as well as the base XGBoost method in terms of error rate is provided in Table 9. The table compares the error rates of various CNN-XGBoost methods with the base XGBoost (CNN-XG) method in detecting Parkinson’s disease. The CNN-XG-MSCHO, PSO, FA, RSA, COA, and COLSHADE methods exhibit notably low error rates of 0.018088. In contrast, CNN-XG-SCHO, SCA, GA, WOA, and BSO show slightly higher rates at 0.020672. The base XGBoost model has a higher error rate of 0.025840. This comparison highlights the superior accuracy of the CNN-XGBoost methods over the base model, indicating the effectiveness of integrating CNN with XGBoost in this application.

Table 9:

Hausdorff et al. (2007) error rate comparison between the best-constructed models.

Method	Best model error rate
CNN-XG-MSCHO	0.018088
CNN-XG-SCHO	0.020672
CNN-XG-SCA	0.020672
CNN-XG-GA	0.020672
CNN-XG-PSO	0.018088
CNN-XG-FA	0.018088
CNN-XG-WOA	0.020672
CNN-XG-BSO	0.020672
CNN-XG-RSA	0.018088
CNN-XG-COA	0.018088
CNN-XG-COLSHADE	0.018088
CNN-XG	0.025840

DOI: 10.7717/peerj-cs.2031/table-9

Finally, parameter selections made by each algorithm for the best-performing models using the (Hausdorff et al., 2007) dataset are shown in Table 10. There is a noticeable diversity in parameter choices. For instance, several models (CNN-XG-MSCHO, WOA, RSA) opt for a high learning rate of 0.9, indicating a preference for faster learning, while others like CNN-XG-SCHO, SCA, and GA choose lower rates for gradual learning. The ‘Min Child Weight’ and ‘Max Depth’ parameters vary significantly, reflecting different strategies to control model complexity and prevent overfitting. The ‘Gamma’ values also differ, suggesting varying degrees of regularization among the models. This diversity underscores the tailored approach each method takes to optimize performance.

Table 10:

Hausdorff et al. (2007) dataset best model parameter selections.

Method	Learning rate	Min child W.	Subsample	Colsample by tree	Max depth	Gamma
CNN-XG-MSCHO	0.900000	3.259372	0.891252	1.000000	7.555943	0.800000
CNN-XG-SCHO	0.191214	1.666386	0.348368	0.914698	4.000000	0.353234
CNN-XG-SCA	0.100000	1.000000	0.954390	1.000000	10.000000	0.000000
CNN-XG-GA	0.318375	1.000000	1.000000	1.000000	7.000000	0.800000
CNN-XG-PSO	0.100000	1.000000	0.989769	1.000000	5.000000	0.138171
CNN-XG-FA	0.100000	1.000000	1.000000	1.000000	5.000000	0.000000
CNN-XG-WOA	0.900000	1.000000	1.000000	1.000000	10.000000	0.243771
CNN-XG-BSO	0.323362	1.000000	0.965132	1.000000	7.000000	0.708029
CNN-XG-RSA	0.900000	1.073230	0.193228	0.901294	6.000000	0.613811
CNN-XG-COA	0.100000	3.423273	0.993274	1.000000	10.000000	0.405371
CNN-XG-COLSHADE	0.100000	2.174713	1.000000	0.901547	7.000000	0.198174

DOI: 10.7717/peerj-cs.2031/table-10

Frenkel-Toledo et al. (2005) dataset simulations

Comparisons in terms of best, worst mean and median executions for the objective function are shown in Table 11 and in terms of indicator (Cohen’s kappa) are similarly shown in Table 12. Stability comparisons are shown in terms of Std and Var. CNN-XG-MSCHO and CNN-XG-SCA stand out with the lowest ‘Best’ values, indicating potential for optimal performance. However, CNN-XG-FA shows the highest variability and worst outcomes, suggesting less stability. This contrast in performance and consistency across methods highlights the importance of selecting the right algorithm and parameters for specific applications to achieve the most effective results.

Table 11:

Frenkel-Toledo et al. (2005) dataset objective function outcomes.

Method	Best	Worst	Mean	Median	Std	Var
CNN-XG-MSCHO	0.046875	0.052083	0.051215	0.052083	0.001941	3.77E−06
CNN-XG-SCHO	0.052083	0.062500	0.056424	0.054688	0.004675	2.19E−05
CNN-XG-SCA	0.046875	0.062500	0.054688	0.054688	0.004987	2.49E−05
CNN-XG-GA	0.057292	0.062500	0.058160	0.057292	0.001941	3.77E−06
CNN-XG-PSO	0.052083	0.062500	0.057292	0.057292	0.003007	9.04E−06
CNN-XG-FA	0.052083	0.072917	0.059028	0.057292	0.007158	5.12E−05
CNN-XG-WOA	0.046875	0.062500	0.052951	0.052083	0.004675	2.19E−05
CNN-XG-BSO	0.052083	0.062500	0.057292	0.057292	0.003007	9.04E−06
CNN-XG-RSA	0.046875	0.062500	0.056424	0.057292	0.004675	2.19E−05
CNN-XG-COA	0.046875	0.062500	0.055556	0.057292	0.004910	2.41E−05
CNN-XG-COLSHADE	0.046875	0.062500	0.055556	0.057292	0.004910	2.41E−05

DOI: 10.7717/peerj-cs.2031/table-11

Table 12:

Frenkel-Toledo et al. (2005) dataset indicator (Cohen’s kappa) outcomes.

Method	Best	Worst	Mean	Median	Std	Var
CNN-XG-MSCHO	0.9055118	0.895322	0.897157	0.895527	0.003739	1.40E−05
CNN-XG-SCHO	0.8953222	0.874387	0.886696	0.890257	0.009332	8.71E−05
CNN-XG-SCA	0.9058824	0.874387	0.890141	0.890145	0.010027	1.01E−04
CNN-XG-GA	0.8842867	0.874387	0.883052	0.884627	0.003894	1.52E−05
CNN-XG-PSO	0.8949097	0.874387	0.884747	0.884967	0.005930	3.52E−05
CNN-XG-FA	0.8953222	0.853738	0.881454	0.884854	0.014274	2.04E−04
CNN-XG-WOA	0.9056974	0.874387	0.893596	0.895322	0.009382	8.80E−05
CNN-XG-BSO	0.8949097	0.874387	0.884785	0.884854	0.005929	3.52E−05
CNN-XG-RSA	0.9056974	0.874387	0.886432	0.884514	0.009397	8.83E−05
CNN-XG-COA	0.9055118	0.874633	0.888244	0.884854	0.009766	9.54E−05
CNN-XG-COLSHADE	0.9055118	0.874387	0.888312	0.884967	0.009820	9.64E−05

DOI: 10.7717/peerj-cs.2031/table-12

Table 12 shows the Cohen’s Kappa outcomes for various CNN-XGBoost methods from the Frenkel-Toledo et al. (2005) dataset. CNN-XG-MSCHO, SCA, and WOA exhibit higher ‘Best’ Kappa values, indicating strong agreement in certain scenarios. However, CNN-XG-FA shows the largest range in outcomes, suggesting variability in its performance. The relatively narrow spread between the best and worst values across all methods indicates consistent performance, with a balance between achieving high agreement and maintaining consistency in different evaluations.

Additionally, a graphical comparison for the objective and indicator functions outcome distributions is provided in Fig. 12. Distribution plots in Fig. 12 indicate a strong stability of several algorithms such as the GA, PS, WOA, BSO, and RSA, however, these algorithms fail to locate a true optima. The interesting observation is that the introduced algorithm also demonstrates high stability. However, the introduced modified algorithm heavily converges on the best-located solution relative to the other algorithms.

Figure 12: Frenkel-Toledo et al. (2005) objective and Cohen’s kappa distributions plots.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-12

The final execution outcomes of each algorithm are shown in swarm plots for the objective and indicator evaluations in Fig. 13. Swarm diagrams in Fig. 13 further enforce the previous observations. Many of the solutions provided by the modified algorithm fall near the located optimal. An improvement over the original version of the algorithm is evident, as many of the solutions of the original algorithm focus on a local optimal as opposed to the best solution located by competing algorithms.

Figure 13: Frenkel-Toledo et al. (2005) objective and Cohen’s kappa swarm plots.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-13

Convergence rate comparisons between optimizers are provided in Fig. 14. The introduced algorithm showcases a high rate of convergence in comparison to competing algorithms as shown in Fig. 14. Nevertheless, the convergence is justified as an optimum is located that matches solutions located by other algorithms. It is important to emphasize that the NFL states that no single solution works equally well for all presented problems. While the quick convergence may be beneficial for this specific problem, other applications may prefer a stronger focus on exploration.

Figure 14: Frenkel-Toledo et al. (2005) objective and Cohen’s kappa convergence graphs.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-14

Detailed metrics comparisons in terms of precision, sensitivity, F1-measure, and accuracy between the best-performing algorithms are provided in Table 13. The table reveals that most CNN-XGBoost methods achieve high precision, sensitivity, and F1-measures for both control and PD patients, with overall accuracy consistently above 94%. Notably, methods like CNN-XG-MSCHO, SCA, and COA demonstrate exceptional precision and sensitivity, leading to high F1-measures, indicating their robustness and reliability in classification. This uniformity in high performance across different metrics underscores the effectiveness of these methods in accurately diagnosing PD.

Table 13:

Frenkel-Toledo et al. (2005) dataset detailed metrics for best-constructed models.

Method	Metric	Control	PD patients	Accuracy	Macro avg.	Weighted avg.
CNN-XG-MSCHO	Precision	0.943182	0.961538	0.953125	0.952360	0.953221
	Sensitivity	0.954023	0.952381	0.953125	0.953202	0.953125
	F1-measure	0.948571	0.956938	0.953125	0.952755	0.953147
CNN-XG-SCHO	Precision	0.923077	0.970297	0.947917	0.946687	0.948900
	Sensitivity	0.965517	0.933333	0.947917	0.949425	0.947917
	F1-measure	0.943820	0.951456	0.947917	0.947638	0.947996
CNN-XG-SCA	Precision	0.923913	0.980000	0.953125	0.951957	0.954586
	Sensitivity	0.977011	0.933333	0.953125	0.955172	0.953125
	F1-measure	0.949721	0.956098	0.953125	0.952909	0.953208
CNN-XG-GA	Precision	0.941860	0.943396	0.942708	0.942628	0.942700
	Sensitivity	0.931034	0.952381	0.942708	0.941708	0.942708
	F1-measure	0.936416	0.947867	0.942708	0.942142	0.942679
CNN-XG-PSO	Precision	0.942529	0.952381	0.947917	0.947455	0.947917
	Sensitivity	0.942529	0.952381	0.947917	0.947455	0.947917
	F1-measure	0.942529	0.952381	0.947917	0.947455	0.947917
CNN-XG-FA	Precision	0.923077	0.970297	0.947917	0.946687	0.948900
	Sensitivity	0.965517	0.933333	0.947917	0.949425	0.947917
	F1-measure	0.943820	0.951456	0.947917	0.947638	0.947996
CNN-XG-WOA	Precision	0.933333	0.970588	0.953125	0.951961	0.953707
	Sensitivity	0.965517	0.942857	0.953125	0.954187	0.953125
	F1-measure	0.949153	0.956522	0.953125	0.952837	0.953183
CNN-XG-BSO	Precision	0.942529	0.952381	0.947917	0.947455	0.947917
	Sensitivity	0.942529	0.952381	0.947917	0.947455	0.947917
	F1-measure	0.942529	0.952381	0.947917	0.947455	0.947917
CNN-XG-RSA	Precision	0.933333	0.970588	0.953125	0.951961	0.953707
	Sensitivity	0.965517	0.942857	0.953125	0.954187	0.953125
	F1-measure	0.949153	0.956522	0.953125	0.952837	0.953183
CNN-XG-COA	Precision	0.943182	0.961538	0.953125	0.952360	0.953221
	Sensitivity	0.954023	0.952381	0.953125	0.953202	0.953125
	F1-measure	0.948571	0.956938	0.953125	0.952755	0.953147
CNN-XG-COLSHADE	Precision	0.943182	0.961538	0.953125	0.952360	0.953221
	Sensitivity	0.954023	0.952381	0.953125	0.953202	0.953125
	F1-measure	0.948571	0.956938	0.953125	0.952755	0.953147
	Support	87	105

DOI: 10.7717/peerj-cs.2031/table-13

A comparison between the best-constructed models as well as the base XGBoost method in terms of error rate is provided in Table 14. The table shows the error rates for various CNN-XGBoost models compared to the base XGBoost model in the Frenkel-Toledo et al. (2005) dataset. CNN-XG-MSCHO, SCA, WOA, RSA, COA, and COLSHADE show notably low error rates of 0.046875. CNN-XG-SCHO, PSO, FA, and BSO have slightly higher rates at 0.052083, while CNN-XG-GA is at 0.057292. The base XGBoost model (CNN-XG) has the highest error rate of 0.067708. This highlights the enhanced accuracy of the CNN-XGBoost methods over the base model, demonstrating the effectiveness of integrating CNN with XGBoost.

Table 14:

Frenkel-Toledo et al. (2005) error rate comparison between the best-constructed models.

Method	Best model error rate
CNN-XG-MSCHO	0.046875
CNN-XG-SCHO	0.052083
CNN-XG-SCA	0.046875
CNN-XG-GA	0.057292
CNN-XG-PSO	0.052083
CNN-XG-FA	0.052083
CNN-XG-WOA	0.046875
CNN-XG-BSO	0.052083
CNN-XG-RSA	0.046875
CNN-XG-COA	0.046875
CNN-XG-COLSHADE	0.046875
CNN-XG	0.067708

DOI: 10.7717/peerj-cs.2031/table-14

Finally, parameter selections made by each algorithm for the best-performing models are shown in Table 15. The table shows varied parameter selections for the best-performing models in the Frenkel-Toledo et al. (2005) dataset. Most methods opt for a high ‘Min Child Weight’, indicating a preference for more complex models. Learning rates vary significantly, with some models choosing moderate rates (around 0.5–0.7), while others like CNN-XG-SCHO and FA go for a high rate of 0.9. The ‘Max Depth’ is consistently low (mostly around 3), except for CNN-XG-WOA and RSA, suggesting an overall preference for shallower trees. ‘Gamma’ values also vary, indicating different degrees of regularization across methods.

Table 15:

Frenkel-Toledo et al. (2005) dataset best model parameter selections.

Method	Learning rate	Min child W.	Subsample	Colsample by tree	Max depth	Gamma
CNN-XG-MSCHO	0.553152	9.981037	1.000000	0.010000	3.000000	0.397994
CNN-XG-SCHO	0.900000	4.303456	0.694402	1.000000	3.000000	0.454477
CNN-XG-SCA	0.648538	1.000000	1.000000	1.000000	3.000000	0.800000
CNN-XG-GA	0.541834	10.000000	1.000000	0.010000	3.000000	0.037293
CNN-XG-PSO	0.566946	9.337266	1.000000	0.013087	3.000000	0.535184
CNN-XG-FA	0.900000	3.747522	1.000000	1.000000	9.000000	0.800000
CNN-XG-WOA	0.712243	10.000000	1.000000	0.010000	10.000000	0.800000
CNN-XG-BSO	0.530657	6.792584	1.000000	0.045359	3.000000	0.492458
CNN-XG-RSA	0.737744	10.000000	1.000000	0.015460	10.000000	0.691594
CNN-XG-COA	0.574955	10.000000	1.000000	0.040267	3.000000	0.758948
CNN-XG-COLSHADE	0.571372	10.000000	1.000000	0.010000	3.000000	0.709069

DOI: 10.7717/peerj-cs.2031/table-15

Statistical outcome validation

As experimental data is often insufficient to establish the superiority of one algorithm over its competitors, scientists in current computer research must assess the statistical significance of proposed advancements. According to Eftimov, Korošec & Seljak (2016), literature recommendations suggest that statistical tests in such scenarios should involve creating a representative collection of outcomes for each method. However, when dealing with outliers from a non-normal distribution, this strategy may be ineffective, potentially leading to misleading results. The unresolved dispute highlighted by Eftimov, Korošec & Seljak (2016) revolves around whether employing the mean objective function value in statistical tests is suitable for comparing stochastic techniques. Notwithstanding these potential drawbacks, the objective function of the classification error rate was averaged over 30 distinct runs to evaluate the performance of the optimizer.

Following the execution of the Shapiro-Wilk test (Shapiro & Francia, 1972) for single-problem analysis employing the specified procedure, a determination was reached. A data sample was compiled for each algorithm and each problem by aggregating the results of each run, and the corresponding $p$ -values were computed for all method-problem combinations. The resulting $p$ -values are presented in Table 16.

Table 16:

Shapiro-Wilk test scores for the single-problem analysis.

Algorithm	Yogev et al. (2005)	Hausdorff et al. (2007)	Frenkel-Toledo et al. (2005)
MSCHO	0.024	0.019	0.016
SCHO	0.026	0.025	0.028
SCA	0.017	0.019	0.018
GA	0.021	0.026	0.025
PSO	0.029	0.030	0.030
FA	0.035	0.029	0.025
WOA	0.031	0.027	0.023
BSO	0.017	0.020	0.024
RSA	0.024	0.028	0.024
COA	0.021	0.025	0.022
COLSHADE	0.039	0.035	0.038

DOI: 10.7717/peerj-cs.2031/table-16

These findings are further supported by Fig. 15, illustrating the distributions of objective function outcomes for each optimizer across 30 independent runs.

Figure 15: Objective function KDE for each simulation.

Download full-size image

DOI: 10.7717/peerj-cs.2031/fig-15

The rejection of the null hypothesis is warranted as the $p$ -values in Table 16 are all below the predetermined significance threshold, denoted as $α$ , set at $0.05$ . Consequently, the data samples for solutions do not all conform to a Gaussian distribution, indicating that usage of the average objective value in future statistical tests is inappropriate. Hence, the best results were selected for further statistical analysis in this study. Due to the non-met normalcy assumption, parametric tests were deemed to be unsuitable.

Next, the non-parametric Wilcoxon signed-rank test (Wilcoxon, 1992) has been employed. It can be applied to the identical data series containing the best scores achieved in each metaheuristic run. During the test, the proposed approach was employed as the control method, and the Wilcoxon signed-rank test was conducted on the provided data series. For each of the three instances observed, the calculated p-values were lower than $0.05$ . Utilizing the significance threshold of $0.1$ ( $α = 0.1$ ), the results indicate that the new algorithm statistically surpassed all contending techniques considerably. The comprehensive outcomes of the Wilcoxon signed-rank test are presented in Table 17.

Table 17:

Wilcoxon signed-rank test values exhibiting p-values for experiments (MSCHO vs. others).

Algorithm	Yogev et al. (2005)	Hausdorff et al. (2007)	Frenkel-Toledo et al. (2005)
SCHO	0.037	0.041	0.033
SCA	0.032	0.032	0.027
GA	0.018	0.017	0.015
PSO	0.003	0.001	0.008
FA	0.040	0.036	0.033
WOA	0.027	0.029	0.024
BSO	0.022	0.025	0.023
RSA	0.031	0.033	0.035
COA	0.033	0.030	0.031
COLSHADE	0.028	0.028	0.022

DOI: 10.7717/peerj-cs.2031/table-17

The Wilcoxon signed-rank test results indicate that the developed algorithm (MSCHO) statistically significantly outperformed the other algorithms in all three datasets (Yogev et al., 2005; Hausdorff et al., 2007; Frenkel-Toledo et al., 2005). The p-values for all comparisons are below 0.05, well within the significance threshold of 0.1. This consistency across different datasets suggests that the MSCHO algorithm consistently offers supreme performance in comparison to the contending methods evaluated in this study.

Summarizing, and determining the best algorithm based on the provided results involves considering multiple factors like error rates, Cohen’s kappa, precision, sensitivity, F1-measure, and Shapiro-Wilk test results. Algorithms like CNN-XG-MSCHO, SCA, and WOA consistently showed low error rates across different datasets, indicating high accuracy. Similarly, in terms of Cohen’s kappa, precision, sensitivity, and F1-measures, these algorithms demonstrated robust performance, suggesting effective classification capability. However, the Shapiro-Wilk testing outcomes suggest a non-normal distribution in performance metrics, suggesting variability in algorithm performance across different problems. Therefore, while CNN-XG-MSCHO, SCA, and WOA generally exhibit strong performance, the best choice may depend on specific dataset characteristics and application requirements.

Conclusions

In conclusion, neurodegenerative conditions considerably affect the patient’s quality of life, often lacking a cure but allowing for slowed progression with timely intervention. Unfortunately, many patients only seek a diagnosis when the condition has advanced to a stage significantly affecting their quality of life. The development of effective, non-invasive, and readily accessible methods for early diagnosis holds great potential to enhance the quality of life for persons impacted by neurodegenerative conditions.

This study specifically explores the use of convolutional neural networks to identify gait freezing associated with Parkinson’s disease in patients. Leveraging sensor data from wearable gyroscopes placed in the soles of patients’ shoes to capture walking patterns, the research utilizes convolutional networks for the accurate detection of abnormal gait. The proposed approach undergoes evaluation using a publicly available real-world dataset from individuals affected by Parkinson’s and a control group. To enhance classification accuracy, a modified variant of the recent crayfish optimization metaheuristics has been introduced and compared with contemporary optimization metaheuristics.

A unique approach is taken in this work of reformatting patient sensor recordings into image data to account for relations between the sensor data. A two-layer framework is introduced to tackle this challenging task. The first layer of the system comprises a CNN optimized by the introduced MSCHO algorithm. Once the CNN attains an acceptable accuracy (exceeding 90%) it is used to perform feature selection. Additionally, the CNN is interpreted using the SHAP approach. The reduced feature set is used to train XGBoost models that are optimized by several metaheuristics.

The simulation outcomes indicate that all tested metaheuristics eventually attained the same best outcomes. However, the introduced algorithm, while not always the best, manages to attain the best mean and median outcomes across the conducted experiments. The attained outcomes are meticulously statistically validated to ensure a statistically significant improvement. The introduced approach showcases an improvement over preceding works that tackle this challenge consistently exceeding 95% accuracy across all three conducted simulations on publicly available datasets. However, there are certain limitations with this work that need to be noted. Due to the high computational demand of optimization, only a subset of optimization algorithms is considered in the comparative simulation and analysis. Additionally, limited population sizes are used. The study only tackles optimization within the second layer of the framework, optimizing XGBoost hyperparameters, while CNN architectures are optimized by the introduced MSCHO algorithm.

Future endeavors will target additional improvements of the introduced algorithm, and testing a wider range of metaheuristics algorithms for Parkinson’s disease predictions. Moreover, the possible applications of the suggested method for other neurodegenerative conditions will be explored, as well as the applications outside of the medical domain, like intrusion detection, waste classification, renewable power production forecasting, and cloud computing.

Supplemental Information

Code.

DOI: 10.7717/peerj-cs.2031/supp-1

Download

Image dataset.

DOI: 10.7717/peerj-cs.2031/supp-2

Download

[1] Abbas F, Zhang F, Ismail M, Khan G, Iqbal J, Alrefaei AF, Albeshr MF. 2023. Optimizing machine learning algorithms for landslide susceptibility mapping along the karakoram highway, Gilgit Baltistan, Pakistan: a comparative study of baseline, bayesian, and metaheuristic hyperparameter optimization techniques. Sensors 23(15):6843

[2] Abiyev RH, Ma’aitaH MKS. 2018. Deep convolutional neural networks for chest diseases detection. Journal of Healthcare Engineering 2018:1-11

[3] Abualigah L, Abd Elaziz M, Sumari P, Geem ZW, Gandomi AH. 2022. Reptile search algorithm (RSA): a nature-inspired meta-heuristic optimizer. Expert Systems with Applications 191(11):116158

[4] Ahmadpour MR, Ghadiri H, Hajian SR. 2021. Model predictive control optimisation using the metaheuristic optimisation for blood pressure control. IET Systems Biology 15(2):41-52

[5] Alatas B, Bingol H. 2020. Comparative assessment of light-based intelligent search and optimization algorithms. Light & Engineering 28(6)

[6] Albawi S, Mohammed TA, Al-Zawi S. 2017. Understanding of a convolutional neural network.

[7] Alowais SA, Alghamdi SS, Alsuhebany N, Alqahtani T, Alshaya AI, Almohareb SN, Aldairem A, Alrashed M, Bin Saleh K, Badreldin HA, Al Yami MS, Al Harbi S, Albekairy AM. 2023. Revolutionizing healthcare: the role of artificial intelligence in clinical practice. BMC Medical Education 23(1):689

[8] Anikwe CV, Nweke HF, Ikegwu AC, Egwuonwu CA, Onu FU, Alo UR, Teh YW. 2022. Mobile and wearable sensors for data-driven health monitoring system: state-of-the-art and future prospect. Expert Systems with Applications 202(5):117362

[9] Armstrong MJ, Okun MS. 2020. Diagnosis and treatment of Parkinson disease: a review. JAMA 323(6):548-560

[10] Aza A, Gómez-Vela M, Badia M, Begoña Orgaz M, González-Ortega E, Vicario-Molina I, Montes-López E. 2022. Listening to families with a person with neurodegenerative disease talk about their quality of life: integrating quantitative and qualitative approaches. Health and Quality of Life Outcomes 20(1):1-12

[11] Bacanin N, Bezdan T, Tuba E, Strumberger I, Tuba M, Zivkovic M. 2019. Task scheduling in cloud computing environment by grey wolf optimizer.

[12] Bacanin N, Zivkovic M, Antonijevic M, Venkatachalam K, Lee J, Nam Y, Marjanovic M, Strumberger I, Abouhawwash M. 2023. Addressing feature selection and extreme learning machine tuning by diversity-oriented social network search: an application for phishing websites detection. Complex & Intelligent Systems 9:7269-7304

[13] Bai J, Li Y, Zheng M, Khatir S, Benaissa B, Abualigah L, Wahab MA. 2023. A sinh cosh optimizer. Knowledge-Based Systems 282(1):111081

[14] Balaban S. 2015. Deep learning and face recognition: the state of the art. Biometric and Surveillance Technology for Human and Activity Identification XII 9457:68-75

[15] Balaji E, Brindha D, Balakrishnan R. 2020. Supervised machine learning based gait classification system for early detection and stage classification of Parkinson’s disease. Applied Soft Computing 94(5):106494

[16] Basile LJ, Carbonara N, Pellegrino R, Panniello U. 2023. Business intelligence in the healthcare industry: the utilization of a data-driven approach to support clinical decision making. Technovation 120(8):102482

[17] Belić M, Bobić V, Badža M, Šolaja N, Ðurić-Jovičić M, Kostić VS. 2019. Artificial intelligence for assisting diagnostics and assessment of Parkinson’s disease—a review. Clinical Neurology and Neurosurgery 184:105442

[18] Bernardo LS, Damaševičius R, De Albuquerque VHC, Maskeliūnas R. 2021. A hybrid two-stage squeezenet and support vector machine system for Parkinson’s disease detection based on handwritten spiral patterns. International Journal of Applied Mathematics and Computer Science 31(4):549-561

[19] Bernardo LS, Damaševičius R, Ling SH, de Albuquerque VHC, Tavares JMRS. 2022. Modified squeezenet architecture for Parkinson’s disease detection based on keypress data. Biomedicines 10(11):2746

[20] Bezdan T, Cvetnic D, Gajic L, Zivkovic M, Strumberger I, Bacanin N. 2021a. Feature selection by firefly algorithm with improved initialization strategy.

[21] Bezdan T, Milosevic S, Venkatachalam K, Zivkovic M, Bacanin N, Strumberger I. 2021b. Optimizing convolutional neural network by hybridized elephant herding optimization algorithm for magnetic resonance image classification of glioma brain tumor grade.

[22] Bezdan T, Zivkovic M, Tuba E, Strumberger I, Bacanin N, Tuba M. 2020. Glioma brain tumor grade classification from mri using convolutional neural networks designed by modified fa.

[23] Bhat S, Acharya UR, Hagiwara Y, Dadmehr N, Adeli H. 2018. Parkinson’s disease: cause factors, measurable indicators, and early diagnosis. Computers in Biology and Medicine 102(8):234-241

[24] Bianchi VE, Herrera PF, Laura R. 2021. Effect of nutrition on neurodegenerative diseases. A systematic review. Nutritional Neuroscience 24(10):810-834

[25] Bingol H, Alatas B. 2020. Chaos based optics inspired optimization algorithms as global solution search approach. Chaos, Solitons & Fractals 141(3):110434

[26] Bíngol H, Alatas B. 2021. Classification of brain tumor images using deep learning methods. Turkish Journal of Science and Technology 16(1):137-143

[27] Blasiak A, Khong J, Kee T. 2020. Curate. AI: optimizing personalized medicine with artificial intelligence. SLAS TECHNOLOGY: Translating Life Sciences Innovation 25(2):95-105

[28] Bochinski E, Senst T, Sikora T. 2017. Hyper-parameter optimization for convolutional neural network committees based on evolutionary algorithms.

[29] Boina R. 2022. Assessing the increasing rate of Parkinson’s disease in the us and its prevention techniques. International Journal of Biotechnology Research and Development 3(1):IJBTRD_03_01_001

[30] Budimirovic N, Prabhu E, Antonijevic M, Zivkovic M, Bacanin N, Strumberger I, Venkatachalam K. 2022. Covid-19 severity prediction using enhanced whale with salp swarm feature classification. Computers, Materials & Continua 72(1):1685-1698

[31] Cai L, Gao J, Zhao D. 2020. A review of the application of deep learning in medical image classification and segmentation. Annals of Translational Medicine 8(11):713

[32] Chandrabhatla AS, Pomeraniec IJ, Ksendzovsky A. 2022. Co-evolution of machine learning and digital technologies to improve monitoring of Parkinson’s disease motor symptoms. NPJ Digital Medicine 5(1):32

[33] Chen T, Guestrin C. 2016. Xgboost: a scalable tree boosting system.

[34] Chou J-S, Liu C-Y, Prayogo H, Khasani RR, Gho D, Lalitan GG. 2022. Predicting nominal shear capacity of reinforced concrete wall in building by metaheuristics-optimized machine learning. Journal of Building Engineering 61(4):105046

[35] Chou J-S, Nguyen N-M, Chang C-P. 2022. Intelligent candlestick forecast system for financial time-series analysis using metaheuristics-optimized multi-output machine learning. Applied Soft Computing 130(1):109642

[36] Christidi F, Migliaccio R, Santamaría-García H, Santangelo G, Trojsi F. 2018. Social cognition dysfunctions in neurodegenerative diseases: neuroanatomical correlates and clinical implications. Behavioural Neurology 2018(3):1849794

[37] Dai Y, Tang Z, Wang Y. 2019. Data driven intelligent diagnostics for Parkinson’s disease. IEEE Access 7:106941-106950

[38] Dembrower K, Wåhlin E, Liu Y, Salim M, Smith K, Lindholm P, Eklund M, Strand F. 2020. Effect of artificial intelligence-based triaging of breast cancer screening mammograms on cancer detection and radiologist workload: a retrospective simulation study. The Lancet Digital Health 2(9):e468-e474

[39] Di Biase L, Di Santo A, Caminiti ML, De Liso A, Shah SA, Ricci L, Di Lazzaro V. 2020. Gait analysis in Parkinson’s disease: an overview of the most accurate markers for diagnosis and symptoms monitoring. Sensors 20(12):3529

[40] Domínguez-Fernández C, Egiguren-Ortiz J, Razquin J, Gómez-Galán M, De las Heras-García L, Paredes-Rodríguez E, Astigarraga E, Miguélez C, Barreda-Gómez G. 2023. Review of technological challenges in personalised medicine and early diagnosis of neurodegenerative disorders. International Journal of Molecular Sciences 24(4):3321

[41] Dugger BN, Dickson DW. 2017. Pathology of neurodegenerative diseases. Cold Spring Harbor Perspectives in Biology 9(7):a028035

[42] Eberhart R, Kennedy J. 1995. Particle swarm optimization.

[43] Eftimov T, Korošec P, Seljak BK. 2016. Disadvantages of statistical comparison of stochastic optimization algorithms.

[44] Esmaeili H, Bidgoli BM, Hakami V. 2022. Cmml: combined metaheuristic-machine learning for adaptable routing in clustered wireless sensor networks. Applied Soft Computing 118(4):108477

[45] Fan Q, Chen Z, Xia Z. 2020. A novel quasi-reflected Harris hawks optimization algorithm for global optimization problems. Soft Computing 24(19):14825-14843

[46] Faris H, Aljarah I, Al-Betar MA, Mirjalili S. 2018. Grey wolf optimizer: a review of recent variants and applications. Neural Computing and Applications 30(2):413-435

[47] Francis A, Rajan V, Pandian IA. 2022. Implementation of deep learning approaches for early detection of Parkinson’s disease from mri images. In: Advancement, Opportunities, and Practices in Telehealth Technology. Hershey, Pennsylvania: IGI Global. 187-197

[48] Frenkel-Toledo S, Giladi N, Peretz C, Herman T, Gruendlinger L, Hausdorff JM. 2005. Treadmill walking as an external pacemaker to improve gait rhythm and stability in Parkinson’s disease. Movement Disorders: Official Journal of the Movement Disorder Society 20(9):1109-1114

[49] Fröhlich H, Bontridder N, Petrovska-Delacréta D, Glaab E, Kluge F, Yacoubi ME, Marín Valero M, Corvol J-C, Eskofier B, Van Gyseghem J-M, Lehericy S, Wnikler J, Klucken J. 2022. Leveraging the potential of digital technology for better individualized treatment of Parkinson’s disease. Frontiers in Neurology 13:788427

[50] Gaba DM. 2018. Human error in dynamic medical domains 1. In: Human Error in Medicine. Boca Raton, Florida: CRC Press. 197-224

[51] Gao C, Liu J, Tan Y, Chen S. 2020. Freezing of gait in Parkinson’s disease: pathophysiology, risk factors and treatments. Translational Neurodegeneration 9(1):1-22

[52] García J-C, Bustos R-H. 2018. The genetic diagnosis of neurodegenerative diseases and therapeutic perspectives. Brain Sciences 8(12):222

[53] Ghislieri M, Agostini V, Rizzi L, Knaflitz M, Lanotte M. 2021. Atypical gait cycles in Parkinson’s disease. Sensors 21(15):5079

[54] Gómez-Río M, Caballero MM, Gorriz Saez JM, Mínguez-Castellanos A. 2016. Diagnosis of neurodegenerative diseases: the clinical approach. Current Alzheimer Research 13(5):469-474

[55] Greco L, Percannella G, Ritrovato P, Tortorella F, Vento M. 2020. Trends in IoT based solutions for health care: moving AI to the edge. Pattern Recognition Letters 135(3):346-353

[56] Gu J, Wang Z, Kuen J, Ma L, Shahroudy A, Shuai B, Liu T, Wang X, Wang G, Cai J, Chen T. 2018. Recent advances in convolutional neural networks. Pattern Recognition 77(11):354-377

[57] Gupta A, Chhikara R. 2018. Diabetic retinopathy: present and past. Procedia Computer Science 132(11):1432-1440

[58] Gupta R, Kumari S, Senapati A, Ambasta RK, Kumar P. 2023. New era of artificial intelligence and machine learning-based detection, diagnosis, and therapeutics in Parkinson’s disease. Ageing Research Reviews 90(22):102013

[59] Gurrola-Ramos J, Hernàndez-Aguirre A, Dalmau-Cedeño O. 2020. Colshade for real-world single-objective constrained optimization problems.

[60] Haleem A, Vaishya R, Javaid M, Khan IH. 2020. Artificial intelligence (AI) applications in orthopaedics: an innovative technology to embrace. Journal of Clinical Orthopaedics and Trauma 11(Suppl 1):S80

[61] Hansson O. 2021. Biomarkers for neurodegenerative diseases. Nature Medicine 27(6):954-963

[62] Hashim FA, Neggaz N, Mostafa RR, Abualigah L, Damasevicius R, Hussien AG. 2023. Dimensionality reduction approach based on modified hunger games search: case study on Parkinson’s disease phonation. Neural Computing and Applications 35(29):21979-22005

[63] Hausdorff JM, Lowenthal J, Herman T, Gruendlinger L, Peretz C, Giladi N. 2007. Rhythmic auditory stimulation modulates gait variability in Parkinson’s disease. European Journal of Neuroscience 26(8):2369-2375

[64] Hauser RA, Lew MF, Hurtig HI, Ondo WG, Wojcieszek J, Fitzer‐Attas CJ, on behalf of the TEMPO Open‐label Study Group. 2009. Long-term outcome of early versus delayed rasagiline treatment in early Parkinson’s disease. Movement Disorders 24(4):564-573

[65] Hou Y, Dan X, Babbar M, Wei Y, Hasselbalch SG, Croteau DL, Bohr VA. 2019. Ageing as a risk factor for neurodegenerative disease. Nature Reviews Neurology 15(10):565-581

[66] Hunter B, Hindocha S, Lee RW. 2022. The role of artificial intelligence in early cancer diagnosis. Cancers 14(6):1524

[67] Jankovic J. 2015. Gait disorders. Neurologic Clinics 33(1):249-268

[68] Javaid M, Khan IH. 2021. Internet of Things (IoT) enabled healthcare helps to take the challenges of covid-19 pandemic. Journal of Oral Biology and Craniofacial Research 11(2):209-214

[69] Jia H, Rao H, Wen C, Mirjalili S. 2023. Crayfish optimization algorithm. Artificial Intelligence Review 56(Suppl 2):1919-1979

[70] Jiang S, Yang S, Yao X, Tan KC, Kaiser M, Krasnogor N. 2018. Benchmark functions for the cec’2018 competition on dynamic multiobjective optimization. Technical report, Newcastle University.

[71] Johnson M, Albizri A, Simsek S. 2022. Artificial intelligence in healthcare operations to enhance treatment outcomes: a framework to predict lung cancer prognosis. Annals of Operations Research 308:275-305

[72] Jovanovic D, Antonijevic M, Stankovic M, Zivkovic M, Tanaskovic M, Bacanin N. 2022a. Tuning machine learning models using a group search firefly algorithm for credit card fraud detection. Mathematics 10(13):2272

[73] Jovanovic L, Jovanovic D, Antonijevic M, Zivkovic M, Budimirovic N, Strumberger I, Bacanin N. 2022c. The xgboost tuning by improved firefly algorithm for network intrusion detection.

[74] Jovanovic D, Marjanovic M, Antonijevic M, Zivkovic M, Budimirovic N, Bacanin N. 2022b. Feature selection by improved sand cat swarm optimizer for intrusion detection.

[75] Junaid M, Ali S, Eid F, El-Sappagh S, Abuhmed T. 2023. Explainable machine learning models based on multimodal time-series data for the early detection of Parkinson’s disease. Computer Methods and Programs in Biomedicine 234(3):107495

[76] Karaboga D, Basturk B. 2008. On the performance of artificial bee colony (ABC) algorithm. Applied Soft Computing 8(1):687-697

[77] Kashan AH. 2015. A new metaheuristic for optimization: optics inspired optimization (OIO) Computers & Operations Research 55:99-125

[78] Katsuno M, Sahashi K, Iguchi Y, Hashizume A. 2018. Preclinical progression of neurodegenerative diseases. Nagoya Journal of Medical Science 80(3):289-298

[79] Kaveh A, Khayatazad M. 2012. A new meta-heuristic method: ray optimization. Computers & Structures 112:283-294

[80] Khaliq F, Oberhauser J, Wakhloo D, Mahajani S. 2023. Decoding degeneration: the implementation of machine learning for clinical detection of neurodegenerative disorders. Neural Regeneration Research 18(6):1235

[81] Khan MA, Algarni F. 2020. A healthcare monitoring system for the diagnosis of heart disease in the IoMT cloud environment using msso-anfis. IEEE Access 8:122259-122269

[82] Khoury N, Attal F, Amirat Y, Oukhellou L, Mohammed S. 2019. Data-driven based approach to aid parkinson’s disease diagnosis. Sensors 19(2):242

[83] Kishor A, Chakraborty C. 2022. Artificial intelligence and internet of things based healthcare 4.0 monitoring system. Wireless Personal Communications 127(2):1615-1631

[84] Kobylecki C. 2020. Update on the diagnosis and management of Parkinson’s disease. Clinical Medicine 20(4):393-398

[85] Kovacs GG. 2018. Concepts and classification of neurodegenerative diseases. In: Handbook of Clinical Neurology. Amsterdam: Elsevier. 145:301-307

[86] Krishnamoorthy S, Dua A, Gupta S. 2023. Role of emerging technologies in future IoT-driven healthcare 4.0 technologies: a survey, current challenges and future directions. Journal of Ambient Intelligence and Humanized Computing 14(1):361-407

[87] Krizhevsky A, Sutskever I, Hinton GE. 2012. ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems 25:1097-1105

[88] Krizhevsky A, Sutskever I, Hinton GE. 2017. ImageNet classification with deep convolutional neural networks. Communications of the ACM 60(6):84-90

[89] Kumar AV, Kumar S, Garg VK, Goel N, Hoang VT, Kashyap D. 2023. Future perspectives for automated neurodegenerative disorders diagnosis: challenges and possible research directions. In: Data Analysis for Neurodegenerative Disorders. Berlin: Springer. 255-267

[90] Lee D, Yoon SN. 2021. Application of artificial intelligence-based technologies in the healthcare industry: opportunities and challenges. International Journal of Environmental Research and Public Health 18(1):271

[91] Lewis SJ, Factor SA, Giladi N, Hallett M, Nieuwboer A, Nutt JG, Przedborski S, Papa SM, Committee MSI. 2022. Addressing the challenges of clinical research for freezing of gait in Parkinson’s disease. Movement Disorders 37(2):264-267

[92] Li Z, Liu F, Yang W, Peng S, Zhou J. 2021. A survey of convolutional neural networks: analysis, applications, and prospects. IEEE Transactions on Neural Networks and Learning Systems 33(12):6999-7019

[93] Lin C-H, Chiu S-I, Chen T-F, Jang J-SR, Chiu M-J. 2020. Classifications of neurodegenerative disorders using a multiplex blood biomarkers-based machine learning model. International Journal of Molecular Sciences 21(18):6914

[94] Liu Y, Liu Z, Luo X, Zhao H. 2022. Diagnosis of Parkinson’s disease based on shap value feature selection. Biocybernetics and Biomedical Engineering 42(3):856-869

[95] Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, Katz R, Himmelfarb J, Bansal N, Lee S-I. 2020. From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence 2(1):56-67

[96] Lundberg SM, Lee S-I. 2017. A unified approach to interpreting model predictions. In: Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R, eds. Advances in Neural Information Processing Systems 30. United Kingdom: Curran Associates, Inc.. 4765-4774

[97] Lång K, Hofvind S, Rodríguez-Ruiz A, Andersson I. 2021. Can artificial intelligence reduce the interval cancer rate in mammography screening? European Radiology 31:5940-5947

[98] Lång K, Josefsson V, Larsson A-M, Larsson S, Högberg C, Sartor H, Hofvind S, Andersson I, Rosso A. 2023. Artificial intelligence-supported screen reading versus standard double reading in the mammography screening with artificial intelligence trial (MASAI): a clinical safety analysis of a randomised, controlled, non-inferiority, single-blinded, screening accuracy study. The Lancet Oncology 24(8):936-944

[99] Mahbod A, Schaefer G, Wang C, Ecker R, Ellinge I. 2019. Skin lesion classification using hybrid deep neural networks.

[100] Maneu V, Lax P, Cuenca N. 2022. Current and future therapeutic strategies for the treatment of retinal neurodegenerative diseases. Neural Regeneration Research 17(1):103

[101] Marcante A, Di Marco R, Gentile G, Pellicano C, Assogna F, Pontieri FE, Spalletta G, Macchiusi L, Gatsios D, Giannakis A, Chondrogiorgi M, Konitsiotis S, Fotiadis DI, Antonini A. 2021. Foot pressure wearable sensors for freezing of gait detection in Parkinson’s disease. Sensors 21(1):128

[102] McFall GP, Bohn L, Gee M, Drouin SM, Fah H, Han W, Li L, Camicioli R, Dixon RA. 2023. Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and tree shap interpretation. Frontiers in Aging Neuroscience 15:18

[103] Meng D, Jin Z, Gao L, Wang Y, Wang R, Fang J, Qi L, Su Y, Liu A, Fang B. 2022. The quality of life in patients with Parkinson’s disease: focus on gender difference. Brain and Behavior 12(3):e2517

[104] Mirelman A, Bonato P, Camicioli R, Ellis TD, Giladi N, Hamilton JL, Hass CJ, Hausdorff JM, Pelosin E, Almeida QJ. 2019. Gait impairments in Parkinson’s disease. The Lancet Neurology 18(7):697-708

[105] Mirjalili S. 2016. Sca: a sine cosine algorithm for solving optimization problems. Knowledge-Based Systems 96(63):120-133

[106] Mirjalili S, Lewis A. 2016. The whale optimization algorithm. Advances in Engineering Software 95(12):51-67

[107] Mirjalili S, Mirjalili S. 2019. Genetic algorithm. In: Evolutionary Algorithms and Neural Networks: Theory and Applications. 43-55

[108] Moreira-Neto A, Ugrinowitsch C, Coelho DB, de Lima-Pardini AC, Barbosa ER, Teixeira LA, Amaro E, Horak FB, Mancini M, Nucci MP, Silva-Batista C. 2022. Freezing of gait, gait initiation, and gait automaticity share a similar neural substrate in parkinson’s disease. Human Movement Science 86(Pt 9):103018

[109] Morel T, Cleanthous S, Andrejack J, Barker RA, Blavat G, Brooks W, Burns P, Cano S, Gallagher C, Gosden L, Siu C, Slagle AF, Trenam K, Boroojerdi B, Ratcliffe N, Schroeder K. 2022. Patient experience in early-stage Parkinson’s disease: using a mixed methods analysis to identify which concepts are cardinal for clinical trial outcome assessment. Neurology and Therapy 11(3):1319-1340

[110] Morris ME, Huxham F, McGinley J, Dodd K, Iansek R. 2001. The biomechanics and motor control of gait in Parkinson disease. Clinical Biomechanics 16(6):459-470

[111] Mortberg MA, Vallabh SM, Minikel EV. 2022. Disease stages and therapeutic hypotheses in two decades of neurodegenerative disease clinical trials. Scientific Reports 12(1):17708

[112] Mughal H, Javed AR, Rizwan M, Almadhor AS, Kryvinska N. 2022. Parkinson’s disease management via wearable sensors: a systematic review. IEEE Access 10:35219-35237

[113] Muhammad L, Islam MM, Usman SS, Ayon SI. 2020. Predictive data mining models for novel coronavirus (COVID-19) infected patients’ recovery. SN Computer Science 1(4):206

[114] Munavalli JR, Boersma HJ, Rao SV, Van Merode G. 2021. Real-time capacity management and patient flow optimization in hospitals using AI methods. In: Artificial Intelligence and Data Mining in Healthcare. 55-69

[115] Murman DL. 2012. Early treatment of Parkinson’s disease: opportunities for managed care. American Journal of Managed Care 18(7):S183

[116] Nadimi-Shahraki MH, Zamani H. 2022. Dmde: Diversity-maintained multi-trial vector differential evolution algorithm for non-decomposition large-scale global optimization. Expert Systems with Applications 198:116895

[117] Nematzadeh S, Kiani F, Torkamanian-Afshar M, Aydin N. 2022. Tuning hyperparameters of machine learning algorithms and deep neural networks using metaheuristics: a bioinformatics study on biomedical and biological cases. Computational Biology and Chemistry 97(3):107619

[118] Noor MBT, Zenia NZ, Kaiser MS, Mahmud M, Al Mamun S. 2019. Detecting neurodegenerative disease from MRI: a brief review on a deep learning perspective.

[119] Pagan FL. 2012. Improving outcomes through early diagnosis of Parkinson’s disease. American Journal of Managed Care 18(7):S176

[120] Pardoel S, Kofman J, Nantel J, Lemaire ED. 2019. Wearable-sensor-based detection and prediction of freezing of gait in Parkinson’s disease: a review. Sensors 19(23):5141

[121] Paul S, Maindarkar M, Saxena S, Saba L, Turk M, Kalra M, Krishnan PR, Suri JS. 2022. Bias investigation in artificial intelligence systems for early detection of Parkinson’s disease: a narrative review. Diagnostics 12(1):166

[122] Perumal SV, Sankar R. 2016. Gait and tremor assessment for patients with parkinson’s disease using wearable sensors. ICT Express 2(4):168-174

[123] Petrovic A, Bacanin N, Zivkovic M, Marjanovic M, Antonijevic M, Strumberger I. 2022. The adaboost approach tuned by firefly metaheuristics for fraud detection.

[124] Petrovic A, Damaševičius R, Jovanovic L, Toskovic A, Simic V, Bacanin N, Zivkovic M, Spalević P. 2023. Marine vessel classification and multivariate trajectories forecasting using metaheuristics-optimized extreme gradient boosting and recurrent neural networks. Applied Sciences 13(16):9181

[125] Pistacchi M, Gioulis M, Sanson F, De Giovannini E, Filippi G, Rossetto F, Marsala SZ. 2017. Gait analysis and clinical correlations in early Parkinson’s disease. Functional Neurology 32(1):28

[126] Polap D, Woźniak M. 2017. Polar bear optimization algorithm: meta-heuristic with fast population movement and dynamic birth and death mechanism. Symmetry 9(10):203

[127] Połap D, Woźniak M. 2021. Red fox optimization algorithm. Expert Systems with Applications 166:114107

[128] Popa-Wagner A, Dumitrascu DI, Capitanescu B, Petcu EB, Surugiu R, Fang W-H, Dumbrava D-A. 2020. Dietary habits, lifestyle factors and neurodegenerative diseases. Neural Regeneration Research 15(3):394

[129] Pozzi NG, Canessa A, Palmisano C, Brumberg J, Steigerwald F, Reich MM, Minafra B, Pacchetti C, Pezzoli G, Volkmann J, Isaias IU. 2019. Freezing of gait in Parkinson’s disease reflects a sudden derangement of locomotor network dynamics. Brain 142(7):2037-2050

[130] Predić B, Jovanovic L, Simic V, Bacanin N, Zivkovic M, Spalevic P, Budimirovic N, Dobrojevic M. 2023. Cloud-load forecasting via decomposition-aided attention recurrent neural network tuned by modified particle swarm optimization. Complex & Intelligent Systems 10:2246-2269

[131] Priya SJ, Rani AJ, Subathra M, Mohammed MA, Damaševičius R, Ubendran N. 2021. Local pattern transformation based feature extraction for recognition of Parkinson’s disease based on gait signals. Diagnostics 11(8):1395

[132] Qolomany B, Maabreh M, Al-Fuqaha A, Gupta A, Benhaddou D. 2017. Parameters optimization of deep learning models using particle swarm optimization.

[133] Radder DL, Lennaerts HH, Vermeulen H, van Asseldonk T, Delnooz CC, Hagen RH, Munneke M, Bloem BR, de Vries NM. 2020. The cost-effectiveness of specialized nursing interventions for people with parkinson’s disease: the nice-pd study protocol for a randomized controlled clinical trial. Trials 21(1):1-11

[134] Rajpurkar P, Chen E, Banerjee O, Topol EJ. 2022. AI in health and medicine. Nature Medicine 28(1):31-38

[135] Ranjan R, Sankaranarayanan S, Castillo CD, Chellappa R. 2017. An all-in-one convolutional neural network for face analysis.

[136] Rashid J, Batool S, Kim J, Wasif Nisar M, Hussain A, Juneja S, Kushwaha R. 2022. An augmented artificial intelligence approach for chronic diseases prediction. Frontiers in Public Health 10:860396

[137] Reichmann H, Klingelhoefer L, Bendig J. 2023. The use of wearables for the diagnosis and treatment of Parkinson’s disease. Journal of Neural Transmission 130(6):783-791

[138] Ren Z, Zhang Y, Wang S. 2022. A hybrid framework for lung cancer classification. Electronics 11(10):1614

[139] Rosqvist K, Schrag A, Odin P, CLaSP Consortium. 2022. Caregiver burden and quality of life in late stage Parkinson’s disease. Brain Sciences 12(1):111

[140] Salb M, Jovanovic L, Bacanin N, Antonijevic M, Zivkovic M, Budimirovic N, Abualigah L. 2023. Enhancing internet of things network security using hybrid CNN and xgboost model tuned via modified reptile search algorithm. Applied Sciences 13(23):12687

[141] Savanović N, Toskovic A, Petrovic A, Zivkovic M, Damaševičius R, Jovanovic L, Bacanin N, Nikolic B. 2023. Intrusion detection in healthcare 4.0 internet of things systems via metaheuristics optimized machine learning. Sustainability 15(16):12563

[142] Schalkamp A-K, Peall KJ, Harrison NA, Sandor C. 2023. Wearable movement-tracking data identify Parkinson’s disease years before clinical diagnosis. Nature Medicine 29(8):2048-2056

[143] Senturk ZK. 2020. Early diagnosis of Parkinson’s disease using machine learning algorithms. Medical Hypotheses 138(4):109603

[144] Shapiro SS, Francia R. 1972. An approximate analysis of variance test for normality. Journal of the American statistical Association 67(337):215-216

[145] Shi Y. 2011. Brain storm optimization algorithm.

[146] Shusharina N, Yukhnenko D, Botman S, Sapunov V, Savinov V, Kamyshov G, Sayapin D, Voznyuk I. 2023. Modern methods of diagnostics and treatment of neurodegenerative diseases and depression. Diagnostics 13(3):573

[147] Simonyan K, Zisserman A. 2014. Very deep convolutional networks for large-scale image recognition. ArXiv preprint

[148] Singh G, Vadera M, Samavedham L, Lim EC-H. 2019. Multiclass diagnosis of neurodegenerative diseases: a neuroimaging machine-learning-based approach. Industrial & Engineering Chemistry Research 58(26):11498-11505

[149] Soh EML, Neo S, Saffari SE, Wong ASY, Ganesan G, Li W, Ng HL, Xu Z, Tay KY, Au WL, Tan KB, Tan LCS. 2022. Longitudinal healthcare utilization and costs in Parkinson’s disease: pre-diagnosis to 9 years after. Journal of Parkinson’s Disease 12(3):957-966

[150] Soria Poma X, Riba E, Sappa A. 2020. Dense extreme inception network: towards a robust CNN model for edge detection.

[151] Spetlík R, Franc V, Matas J. 2018. Visual heart rate estimation with convolutional neural network.

[152] Stankovic M, Antonijevic M, Bacanin N, Zivkovic M, Tanaskovic M, Jovanovic D. 2022. Feature selection by hybrid artificial bee colony algorithm for intrusion detection.

[153] Stern MB. 1993. Parkinson’s disease: early diagnosis and management. Journal of Family Practice 36(4):439-447

[154] Stewart J, Goudie A, Lu J, Dwivedi G. 2023. AI in emergency medicine. In: AI in Clinical Medicine: A Practical Guide for Healthcare Professionals. 117-128

[155] Strumberger I, Tuba E, Zivkovic M, Bacanin N, Beko M, Tuba M. 2019. Dynamic search tree growth algorithm for global optimization.

[156] Szegedy C, Ioffe S, Vanhoucke V, Alemi AA. 2017. Inception-v4, inception-ResNet and the impact of residual connections on learning.

[157] Tăuţan A-M, Ionescu B, Santarnecchi E. 2021. Artificial intelligence in neurodegenerative diseases: a review of available tools with a focus on machine learning techniques. Artificial Intelligence in Medicine 117:102081

[158] Tang KJW, Ang CKE, Constantinides T, Rajinikanth V, Acharya UR, Cheong KH. 2021. Artificial intelligence and machine learning in emergency medicine. Biocybernetics and Biomedical Engineering 41(1):156-172

[159] Thapa S, Singh P, Jain DK, Bharill N, Gupta A, Prasad M. 2020. Data-driven approach based on feature selection technique for early diagnosis of Alzheimer’s disease.

[160] Ting FF, Tan YJ, Sim KS. 2019. Convolutional neural network improvement for breast cancer classification. Expert Systems with Applications 120(6):103-115

[161] Todorovic M, Stanisic N, Zivkovic M, Bacanin N, Simic V, Tirkolaee EB. 2023. Improving audit opinion prediction accuracy using metaheuristics-tuned XGBoost algorithm with interpretable results through SHAP value analysis. Applied Soft Computing 149(1):110955

[162] Van der Schaar M, Alaa AM, Floto A, Gimson A, Scholtes S, Wood A, McKinney E, Jarrett D, Lio P, Ercole A. 2021. How artificial intelligence and machine learning can help healthcare systems respond to COVID-19. Machine Learning 110(1):1-14

[163] van Halteren AD, Munneke M, Smit E, Thomas S, Bloem BR, Darweesh SK. 2020. Personalized care management for persons with Parkinson’s disease. Journal of Parkinson’s Disease 10(s1):S11-S20

[164] Von Coelln R, Gruber-Baldini A, Reich S, Armstrong M, Savitt J, Shulman L. 2021. The inconsistency and instability of Parkinson’s disease motor subtypes. Parkinsonism & Related Disorders 88(9680):13-18

[165] Wang K, Boonpratatong A, Chen W, Ren L, Wei G, Qian Z, Lu X, Zhao D. 2023. The fundamental property of human leg during walking: linearity and nonlinearity. IEEE Transactions on Neural Systems and Rehabilitation Engineering 31:4871-4881

[166] Wang W, Li Y, Zou T, Wang X, You J, Luo Y. 2020. A novel image classification approach via dense-mobilenet models. Mobile Information Systems 2020:7602384

[167] Wang Y, Zhang H, Zhang G. 2019. cPSO-CNN: an efficient pso-based algorithm for fine-tuning hyper-parameters of convolutional neural networks. Swarm and Evolutionary Computation 49(4):114-123

[168] Warrens MJ. 2015. Five ways to look at Cohen’s kappa. Journal of Psychology & Psychotherapy 5(4):1000197

[169] Wilcoxon F. 1992. Individual comparisons by ranking methods. In: Breakthroughs in Statistics: Methodology and Distribution. Berlin: Springer. 196-202

[170] Wolpert DH, Macready WG. 1997. No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation 1(1):67-82

[171] Wu P, Cao B, Liang Z, Wu M. 2023. The advantages of artificial intelligence-based gait assessment in detecting, predicting, and managing Parkinson’s disease. Frontiers in Aging Neuroscience 15:496

[172] Xu J, Zhang M. 2019. Use of magnetic resonance imaging and artificial intelligence in studies of diagnosis of parkinson’s disease. ACS Chemical Neuroscience 10(6):2658-2667

[173] Yamasaki T, Honma T, Aizawa K. 2017. Efficient optimization of convolutional neural networks using particle swarm optimization.

[174] Yamashita R, Nishio M, Do RKG, Togashi K. 2018. Convolutional neural networks: an overview and application in radiology. Insights into Imaging 9(4):611-629

[175] Yang W, Hamilton JL, Kopil C, Beck JC, Tanner CM, Albin RL, Ray Dorsey E, Dahodwala N, Cintina I, Hogan P, Thompson T. 2020. Current and projected future economic burden of Parkinson’s disease in the us. NPJ Parkinson’s Disease 6(1):15

[176] Yang X-S, Slowik A. 2020. Firefly algorithm. In: Swarm Intelligence Algorithms. Boca Raton, Florida: CRC Press. 163-174

[177] Yeasmin S. 2019. Benefits of artificial intelligence in medicine.

[178] Yogev G, Giladi N, Peretz C, Springer S, Simon ES, Hausdorff JM. 2005. Dual tasking, gait rhythmicity, and Parkinson’s disease: which aspects of gait are attention demanding? European Journal of Neuroscience 22(5):1248-1256

[179] Zamani H, Nadimi-Shahraki MH, Gandomi AH. 2022. Starling murmuration optimizer: a novel bio-inspired algorithm for global and engineering optimization. Computer Methods in Applied Mechanics and Engineering 392:114616

[180] Zhang J. 2022. Mining imaging and clinical data with machine learning approaches for the diagnosis and early detection of Parkinson’s disease. NPJ Parkinson’s Disease 8(1):13

[181] Zhou J, Jangili P, Son S, Ji MS, Won M, Kim JS. 2020. Fluorescent diagnostic probes in neurodegenerative diseases. Advanced Materials 32(51):2001945

[182] Zivkovic M, Bacanin N, Antonijevic M, Nikolic B, Kvascev G, Marjanovic M, Savanovic N. 2022. Hybrid CNN and xgboost model tuned by modified arithmetic optimization algorithm for covid-19 early diagnostics from x-ray images. Electronics 11(22):3798

[183] Zivkovic M, Bacanin N, Tuba E, Strumberger I, Bezdan T, Tuba M. 2020a. Wireless sensor networks life time optimization based on the improved firefly algorithm.