Skip to main content

Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults

Abstract

Background

Obstructive sleep apnea (OSA) is a globally prevalent disease with a complex diagnostic method. Severe OSA is associated with multi-system dysfunction. We aimed to develop an interpretable machine learning (ML) model for predicting the risk of severe OSA and analyzing the risk factors based on clinical characteristics and questionnaires.

Methods

This was a retrospective study comprising 1656 subjects who presented and underwent polysomnography (PSG) between 2018 and 2021. A total of 23 variables were included, and after univariate analysis, 15 variables were selected for further preprocessing. Six types of classification models were used to evaluate the ability to predict severe OSA, namely logistic regression (LR), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), adaptive boosting (AdaBoost), bootstrapped aggregating (Bagging), and multilayer perceptron (MLP). All models used the area under the receiver operating characteristic curve (AUC) was calculated as the performance metric. We also drew SHapley Additive exPlanations (SHAP) plots to interpret predictive results and to analyze the relative importance of risk factors. An online calculator was developed to estimate the risk of severe OSA in individuals.

Results

Among the enrolled subjects, 61.47% (1018/1656) were diagnosed with severe OSA. Multivariate LR analysis showed that 10 of 23 variables were independent risk factors for severe OSA. The GBM model showed the best performance (AUC = 0.857, accuracy = 0.766, sensitivity = 0.798, specificity = 0.734). An online calculator was developed to estimate the risk of severe OSA based on the GBM model. Finally, waist circumference, neck circumference, the Epworth Sleepiness Scale, age, and the Berlin questionnaire were revealed by the SHAP plot as the top five critical variables contributing to the diagnosis of severe OSA. Additionally, two typical cases were analyzed to interpret the contribution of each variable to the outcome prediction in a single patient.

Conclusions

We established six risk prediction models for severe OSA using ML algorithms. Among them, the GBM model performed best. The model facilitates individualized assessment and further clinical strategies for patients with suspected severe OSA. This will help to identify patients with severe OSA as early as possible and ensure their timely treatment.

Trial registration

Retrospectively registered.

Peer Review reports

Background

Obstructive sleep apnea (OSA) refers to apnea and hypoventilation caused by repeated collapse and obstruction of the upper airway during sleep. OSA leads to frequent hypoxemia, hypercapnia, and sleep architecture disorders [1]. Epidemiological studies have shown that OSA has a high prevalence in adults, with approximately one billion people worldwide suffering from OSA, and the number of cases of moderate to severe OSA, for which treatment is generally recommended, is estimated to be almost 425 million [2]. As OSA progresses, the patient develops symptoms such as daytime sleepiness, hypomnesia, and inattention, which reduce their quality of life, impair work performance, and result in a higher risk of car accidents [3]. Moreover, severe OSA could cause dysfunction of multiple organ systems, such as the cardiovascular, endocrine, and nervous systems [4]. Due to the increasing prevalence and mortality of OSA and related complications [5], timely detection and treatment are crucial, especially in patients with severe OSA.

Because of the lack of symptom specificity and the complexity of diagnostic methods, OSA diagnosis remains difficult despite its high prevalence. One study showed that approximately 80% of patients with clinical symptoms of OSA were not diagnosed promptly [2]. Polysomnography (PSG) is the gold standard for OSA diagnosis [6]. However, PSG has many limitations: (i) The test is not widely available, waiting times are long, and the test takes a substantial amount of time. (ii) PSG technology is complex and specialized, and the obtained data are numerous and highly variable, so data interpretation is affected by personal factors and errors. (iii) The monitoring cost is high, and the economic burden is heavy. Therefore, it is crucial to develop a more convenient and accurate method to detect OSA and determine its severity.

Previous studies used various physiological data to develop predictive models of OSA to compensate for the defects of PSG, including clinical information [7,8,9,10,11,12,13], electrocardiograms [14,15,16], electroencephalograms [17,18,19], and oxyhemoglobin saturation [20,21,22]. However, the acquisition of partial physiological information requires evaluation by professional physicians and specialized equipment. In this respect, it does not minimize the preliminary screening process. Therefore, information that can be self-reported by patients is the most readily available and more suitable for inclusion in predictive models. Many parameters that are associated with the occurrence and severity of OSA have been identified in previous studies. The most well-recognized parameters are gender, age, body mass index (BMI), waist circumference, neck circumference, unhealthy lifestyle habits, and symptoms suggestive of OSA [23,24,25,26]. However, there is currently a lack of methods that use these parameters to rapidly and robustly diagnose OSA and determine its severity.

Machine learning (ML), a type of artificial intelligence, has been widely used in medical data analysis in recent years [27]. ML can be employed to detect complex relationships between predictors and outcomes and to improve the accuracy of analysis through continuous online learning. Because of its high efficiency, low cost, and convenience, ML has become a popular method for disease risk prediction and preliminary screening [28], including sleep medicine [29, 30]. ML shows the potential to overcome the challenges in OSA diagnosis.

Therefore, in the present study, we propose an interpretable ML model for predicting the risk of severe OSA in adults and analyzing the risk factors. The determination of OSA severity is important in early screening in a clinical context. This model provides individual strategies for severe OSA patients and aids in timely diagnosis and treatment.

Methods

Study population

This was a retrospective study. All patients (n = 1656) presented at the Second Affiliated Hospital of Xi’an Jiaotong University, China, between 2018 and 2021. Inclusion criteria were as follows: (i) age > 18 years old; (ii) with symptoms suggestive of OSA and related complications such as snoring, breathing cessations during sleep, and hypertension; and (iii) willingness to undergo PSG. Subjects were excluded (i) if they had craniofacial abnormalities/disorders; (ii) if they had neuromuscular disorders; (iii) if they were diagnosed with central sleep apnea syndrome based on PSG; (iv) if they had previously received sleep treatment; or (v) in case of long-term usage of medications known to affect sleep. This study was approved by the Ethics Committee of the Second Affiliated Hospital of Xi’an Jiaotong University. All participants provided informed consent for data collection and analysis. All data were anonymized.

Data collection and feature selection

All subjects underwent PSG in a temperature-controlled and sound-attenuated room, supervised by the night-shift staff. Two experienced sleep physicians scored the PSG findings according to the American Academy of Sleep Medicine (AASM) criteria [31]. The apnea-hypopnea index (AHI) was defined as the total number of apneas and hypopneas per hour of sleep. OSA was defined as AHI ≥ 5 and severe OSA was defined as AHI ≥ 30.

A literature search was conducted to identify collected variables, including demographic data, lifestyle habits, medical history, symptoms suggestive of OSA, and sleep questionnaires. Demographic data included gender, age, BMI, neck circumference, and waist circumference. Lifestyle habits included smoking, drinking, lack of exercise, poor sleep, emotional instability, and stress. Hypertension, family history of hypertension, heart disease (self-reported arrhythmia, angina, coronary artery disease, or heart failure), diabetes, and hypothyroidism were self-reported medical history questions. Symptoms suggestive of OSA included snoring, breathing cessations during sleep, hypomnesia, and inattention. Sleep questionnaires included the Epworth Sleepiness Scale (ESS) [32], the Berlin questionnaire (BQ) [33], and the STOP-BANG questionnaire (SBQ) [34].

By comparing the clinical characteristics of the severe and non-severe OSA groups, the risk factors for predicting severe OSA were analyzed using the univariate analysis, and they were incorporated into ML models as characteristic variables. Additionally, the multivariate logistic regression analysis were also used to obtain independent predictors associated with severe OSA.

Machine learning models

Six advanced ML algorithms were used to detect severe OSA, namely adaptive boosting (AdaBoost) [35], logistic regression (LR) [36], multilayer perceptron (MLP) [37], bootstrapped aggregating (Bagging) [38], gradient boosting machine (GBM) [39], and extreme gradient boost (XGBoost) [40].

Subjects were randomly divided into a training set and a test set with a ratio of 7:3. The training set was used to establish the prediction models. Each model was trained using 10-fold cross-validation, where each repetition was used as the test to overcome sampling bias. And the models were applied and validated in the test set.

The grid search and internal cross-validation were used to find the best hyperparameters of models in the training set (Table 1). Further analysis was performed using Python v3.10.9 with the pandas v1.5.3, streamlit v1.22.0, numpy v1.23.5, imblearn v0.10.1, matplotlib v3.7.0, sklearn v1.2.1, xgboost v1.7.5, shap v0.41.0, seaborn v0.12.2 packages (the source code is available on https://github.com/Wu-Shi-Nan/SOSA).

Table 1 The hyperparameters for proposed six machine learning models

Statistical analysis

All analyses were performed using R software (version 3.6.0). ML models and web calculators were built using Python (version 3.8). Continuous variables are presented as the median with interquartile range (IQR). Categorical variables are presented as numbers with proportions. Differences between groups were compared by the Wilcoxon rank-sum test, the Chi-squared test, or Fisher’s exact test. Univariate and multivariate analyses were conducted to analyze the risk factors for predicting severe OSA. P < 0.05 was set as the threshold for statistical significance. The prediction performance of the models was evaluated based on the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, and specificity. SHapley Additive exPlanations (SHAP) plots were drawn to interpret predictive results and to analyze the relative importance of risk factors. An online calculator based on the model with the best AUC value was capable of estimating the risk of severe OSA in individuals. The proposed approach is shown in Fig. 1.

Fig. 1
figure 1

Schematic overview of the proposed method

OSA, obstructive sleep apnea

Results

Comparisons between patients with and without severe OSA

According to the PSG results, 1656 enrolled OSA patients were divided into the non-severe OSA group (AHI < 30 per hour) and the severe OSA group (AHI ≥ 30 per hour).

The differences between the two groups in terms of gender, age, BMI, waist circumference, neck circumference, smoking, drinking, lack of exercise, hypertension, snoring, breathing cessations during sleep, hypomnesia, ESS, BQ, and SBQ (hypomnesia P < 0.05, other P < 0.001) were statistically significant. Compared with the non-severe OSA patients, the severe OSA patients were more likely to be male, to be older, to have a higher BMI, to have a larger waist circumference, to have a larger neck circumference, to smoke, to drink, and to not do exercise. Severe OSA patients had more pronounced symptoms of hypertension, snoring, breathing cessations during sleep, and hypomnesia, and their sleep questionnaire results were more serious (Table 2).

Table 2 Comparisons of variables between patients with and without severe OSA

Univariate and multivariate logistic regression analyses

A total of 1159 patients were assigned to the training set (70% of the total population). In the training set, univariate analysis revealed that male sex, age, BMI, waist circumference, neck circumference, smoking, drinking, lack of exercise, hypertension, snoring, breathing cessations during sleep, hypomnesia, ESS, BQ, and SBQ (hypomnesia P < 0.05, other P < 0.001) were significantly associated with severe OSA. All these parameters were included in multivariate LR analysis. The results showed that male sex (OR: 1.509, 95%CI: 1.014–2.245, P < 0.05), age (OR: 1.017, 95%CI: 1.005–1.029, P < 0.05), BMI (OR: 1.073, 95%CI: 1.005–1.146, P < 0.005), waist circumference (OR: 1.045, 95%CI: 1.020–1.072, P < 0.001), smoking (OR: 1.539, 95%CI: 1.172–2.020, P < 0.05), snoring (OR: 5.859, 95%CI: 1.597–21.418, P < 0.05), breathing cessations during sleep (OR: 1.656, 95%CI: 1.206–2.273, P < 0.05), moderate ESS (OR: 1.956, 95%CI: 1.389–2.754, P < 0.001), high ESS (OR: 2.692, 95%CI: 1.738–4.170, P < 0.001), BQ (OR: 3.782, 95%CI: 2.217–6.452, P < 0.001), and SBQ (OR: 2.108, 95%CI: 1.379–3.223, P = 0.001) were independent predictors of severe OSA (Table 3).

Table 3 Univariate and multivariate logistic regression analyses

Performance of machine learning models

We designed six classification models and evaluated their ability to predict severe OSA. Each model was trained by 10-fold cross-validation on the training group. The GBM model had the highest average accuracy with an AUC of 0.861 (Fig. 2). The results of the validation group also showed that the AUC values of the six ML models ranged from 0.765 to 0.857, while the GBM model showed the best performance (Fig. 3). The proposed GBM model had an accuracy, sensitivity, and specificity of 0.766, 0.798, and 0.734, respectively (Table 4). Therefore, we chose the GBM model as the best prediction model for conducting further analyses.

Fig. 2
figure 2

Ten-fold cross-validation of six ML models on the training group

AUC, area under the receiver operating characteristic curve; AdaBoost, adaptive boosting; LR, logistic regression; Bagging, bootstrapped aggregating; MLP, multilayer perceptron; GBM, gradient boosting machine; and XGBoost, extreme gradient boost

Fig. 3
figure 3

ROC curves of six ML models on the validation group

ROC, receiver operating characteristic; AUC, area under the receiver operating characteristic curve; AdaBoost, adaptive boosting; LR, logistic regression; Bagging, bootstrapped aggregating; MLP, multilayer perceptron; GBM, gradient boosting machine; and XGBoost, extreme gradient boost

Table 4 Performance of six ML models

Relative importance of variables in the GBM model

To further analyze the relative importance of risk factors, we drew a SHAP summary plot. The importance of the 10 features that have the largest effect on the model output was explained. Waist circumference, neck circumference, ESS, age, and BQ were found to be the top five critical variables contributing to the diagnosis of severe OSA (Fig. 4).

Fig. 4
figure 4

SHAP summary plot to illustrate the model predicting severe OSA at the feature level

The features were ranked according to the sum of the SHAP values for all patients, and the SHAP values are used to show the distribution of the effect of each feature on the GBM model outputs. Each dot represents a case in the dataset. The color of a dot indicates the value of the feature, with blue indicating low values and red indicating high values. Only the top 10 important predictors are shown in the plot. SHAP, SHapley Additive exPlanations; ESS, Epworth Sleepiness Scale; BQ, Berlin questionnaire; BMI, body mass index; and SBQ, STOP-BANG questionnaire

Interpretation of the GBM Model at the individual level

The SHAP force plot was used to illustrate individual risk profiles. Two typical cases were analyzed to interpret the contribution of each variable to the predicted outcome in a single patient (Fig. 5). In Fig. 5A, the predicted probability for severe OSA was relatively high (0.87) due to many conditions, including high BMI (32.05 kg/m2) and waist circumference (114 cm), age (31 years), smoking, breathing cessations during sleep, and high SBQ and BQ. In contrast, in Fig. 5B, the probability of severe OSA was low (0.48) due to many conditions, including low waist circumference (95 cm) and neck circumference (38 cm), the absence of hypertension, and low ESS.

Fig. 5
figure 5

SHAP force plot to illustrate the model predicting severe OSA at the individual level

Red and blue bars, respectively, represent the positive and negative effects of each predictor contributing to the occurrence of the outcome. The extent of the impact is represented by the size of the bar. ESS, Epworth Sleepiness Scale; BQ, Berlin questionnaire; BMI, body mass index; and SBQ, STOP-BANG questionnaire

Online calculator

An online calculator (https://hypertension-in--patients-with-osa-pmgz9gtjaaqzp4bi7iewp3.streamlit.app/) was developed based on the GBM model (Fig. 6). The risk of severe OSA is calculated by entering simple information. This can help clinicians treat patients with suspected severe OSA.

Fig. 6
figure 6

Online calculator predicting severe OSA in adults

Discussion

In the present study, we propose an interpretable ML model for predicting the risk of severe OSA. We used clinical features and sleep questionnaires from 1656 individuals for modeling, and then compared the prediction performance of six advanced ML models (LR, GBM, MLP, Bagging, AdaBoost, and XGBoost). Moreover, we analyzed the relative importance of risk factors by SHAP plots and interpreted the best model at the individual level through two typical cases. This demonstrated the application of a realistic database and interpretable ML method to develop a physician-understandable risk prediction model for severe OSA. This ML-based model facilitates early risk stratification and intervention for suspected OSA patients, making individualized medicine possible.

Previous studies on OSA prediction had two main characteristics: predicting the risk of OSA in the general population and selecting multiple types of input variables. Holfinger et al. utilized ML algorithms to make OSA prediction tools in clinical and community-based samples, and the ML models performed better than traditional logistic regression. It proved the feasibility of ML methods for OSA prediction [12]. Huo et al. made a ML derived questionnaire for screening OSA based on clinical information in two public datasets, and the model achieved AUC 0.78and 0.76, respectively [10]. Li et al. extracted features from multiple types of input signals, including oxygen saturation, electrocardiograms, airflow, thoracic, and abdominal signals. They used a multi-error-reduction classification system and gained the accuracy of 94.66% [41]. In the present study, we also considered these two aspects. When selecting the population and the target of prediction, we focused on people with suspected OSA symptoms who have a higher risk and severity of OSA. Due to the diverse symptoms of OSA, most patients are not diagnosed in time, leading to the detection of the disease in a severe stage [2]. Moreover, OSA can cause damage to multiple organ systems in several ways [4]. Common complications include hypertension, arrhythmia, cognitive disorders, and diabetes. Considering the importance and urgency, we selected the population with suspected OSA symptoms to predict severe OSA, a group with higher prevalence, complications and mortality. When selecting the input variables, the primary considerations were scientific rationality and easy accessibility. Therefore, we selected many questions that could be answered by the participants themselves. The questions were determined by experts based on the literature and their knowledge.

A total of 23 variables were collected in this study. After univariate analysis, 15 variables were selected for further model construction. The multivariate LR analysis showed that 10 variables were independent predictors of severe OSA. These variables could be classified into general characteristics (gender and age), body size parameters (BMI and waist circumference), lifestyle habits (smoking), sleep symptoms (snoring and breathing cessations during sleep), and sleep questionnaires suggesting a high risk of OSA (ESS and SBQ). This was generally consistent with previous studies. OSA was more common in men with high BMI and with increasing age [24]. The association could be explained through multiple mechanisms, including increased upper airway collapsibility and impaired neuromuscular control of upper airway patency due to local fat deposition. Waist circumference was also an important risk factor and predictor of OSA, along with BMI, and was significantly correlated with OSA severity [25]. Adverse lifestyles could also have a role in the rising prevalence of OSA. Smoking-related airway inflammation and disease may increase susceptibility to OSA [42]. Snoring, one of the most typical clinical symptoms of OSA, is very important in OSA diagnosis [43]. Breathing cessations during sleep indicate the severity of OSA. ESS and SBQ are used to evaluate the degree of daytime sleepiness and the risk of OSA, respectively; they are widely used for OSA screening [44].

We established six advanced ML models to predict the risk of severe OSA. The performance of models was ranked based on the AUC as follows: GBM (0.857) > XGBoost (0.840) > AdaBoost (0.825) > Bagging (0.814) > MLP (0.773) > LR (0.765). The GBM classification model showed the best performance with an accuracy of 0.766, a sensitivity of 0.798, and a specificity of 0.734. These ML models showed satisfactory performance, confirming the efficacy and applicability of ML in assessing the risk of severe OSA. In addition, most of the models tended to have higher sensitivity than specificity. In clinical practice, it is more desirable for predictive models to have better sensitivity to detect the disease in time. Moreover, the GBM model we proposed has advantages over similar models in previous studies in terms of number of participants and prediction performance. Huang et al. proposed a predictive model for severe OSA based on a support vector machine, which showed AUC, sensitivity, and specificity values of 0.780, 0.702, and 0.703, respectively [11]. He et al. established an LR model to determine the presence and severity of OSA. This model had a satisfactory performance in predicting the presence of OSA with an accuracy of 0.812. However, the accuracy was only 0.416 in predicting the severity of OSA [45]. Liu et al. established a support vector machine model to predict severe OSA; in males and females, the AUC was 0.765 and 0.830, respectively [46]. Laharnar et al. developed a scoring system for predicting severe OSA with an AUC of 0.900 and an accuracy, sensitivity, and specificity of 0.820. This system showed excellent performance but suffered from a small sample size and a low number of variables [47]. Zhang et al. developed a ML model to screen moderate to severe OSA based on the Chinese population faciocervical and anthropometric measurements. This model gained an AUC of 0.824, and was especially suitable for people without significant daytime sleepiness [48]. Kuan et al. proposed OSA predictions using age, sex and BMI incorporating ML algorithms. The validation AUCs of the two models in large populations were 0.806 and 0.807, respectively [49]. To better use the model, we further built an online calculator estimating the probability of severe OSA. Using online and self-reported data to screen for disease risk can improve clinical efficiency by providing an automated screening strategy.

Interpretability is an integral part of medical ML models. The goal of interpretable models is to make decisions similar to human behavior by providing explanations [50]. The currently proposed measures for explanation mainly analyze the model after training, i.e., post hoc interpretability [51]. We further used SHAP values to interpret the model. As shown in the SHAP summary plot, waist circumference, neck circumference, ESS, age, and BQ were among the most important prediction factors. At the individual level, high waist circumference, neck circumference, and BMI, smoking, breathing cessations during sleep, and a high-risk level of SBQ and BQ were risk factors of severe OSA; in contrast, low waist circumference, low neck circumference, and low ESS were protective factors. These results were in line with the multivariate LR analysis and previous findings. These results suggest that in addition to symptoms, we need to pay attention to the body size of patients in clinical work. In the case of insufficient diagnostic conditions, the scales have an important auxiliary judgment value. Weight loss may also be important for OSA patients.

Compared with previous studies, the present study is significant in terms of the research process and results. Unlike previous cases based only on public databases, we recruited subjects from northwestern China and collected their clinical information. Therefore, this study reflects the real situation of severe OSA patients, improving its applicability. Moreover, the included variables were all self-reported by subjects, facilitating generalization. In addition, previous studies only predicted OSA using one model, whereas we applied and compared various advanced ML methods and conducted an interpretability analysis of the most appropriate ML methods. Our proposed simplified model provides a screening platform for severe OSA, facilitates individualized risk stratification and formulates diagnostic decisions. It will help to identify patients with severe OSA as early as possible and provide timely treatment, especially in economically underdeveloped areas. Meanwhile, this model explains the importance of variables, which helps improve insights into severe OSA risk factors for clinicians.

There were several limitations in this study. First, the model we established was developed to detect severe OSA, and thus subjects with suspected OSA in whom the prevalence of OSA was high were enrolled. Therefore, the results may not apply to the general population, where the prevalence of OSA is much lower. Second, the prevalence of OSA varies greatly among regions and ethnicities [52]. This study was a single-center study, and all subjects were Chinese, so this model needs to be validated in populations of multiple ethnicities. Third, this was a retrospective study without follow-up data. Further prospective validation studies should include larger samples to obtain more reliable results. Fourth, the accuracy and stability of the model need to be evaluated and improved. In summary, the model must be improved before being generalized by increasing the sample size, adding more relevant variables, and optimizing the algorithm.

Conclusions

In the present study, we developed six types of ML models to predict the risk of severe OSA, namely LR, MLP, Bagging, GBM, AdaBoost, and XGBoost. All six models showed high prediction performance. The GBM model performed best. Furthermore, we interpreted the model at the domain, feature, and individual levels. The findings of this study demonstrate the potential of a personalized screening model for individuals with a high risk of severe OSA using self-reported information. This will help to identify patients with severe OSA as early as possible and provide timely treatment, especially in economically underdeveloped areas. In the future, ML models are expected to make greater contributions to disease screening.

Data Availability

Research data are not publicly available but can be obtained from the corresponding author on request after approval from the institutional review boards of all participating institutions.

Abbreviations

AASM:

American Academy of Sleep Medicine

AdaBoost:

Adaptive boosting

AHI:

Apnea-hypopnea index

AUC:

Area under the receiver operating characteristic curve

Bagging:

Bootstrapped aggregating

BMI:

Body mass index

BQ:

Berlin questionnaire

CI:

Confidence intervals

ESS:

Epworth sleepiness scale

GBM:

Gradient boosting machine

IQR:

Interquartile range

LR:

Logistic regression

ML:

Machine learning

MLP:

Multilayer perceptron

OR:

Odds ratio

OSA:

Obstructive sleep apnea

PSG:

Polysomnography

ROC:

Receiver operating characteristic

SBQ:

STOP-BANG questionnaire

SHAP:

SHapley Additive exPlanations

XGBoost:

Extreme gradient boosting

References

  1. Gottlieb D, Punjabi NJJ. Diagnosis and management of obstructive sleep apnea: a review. JAMA. 2020;323(14):1389–400.

    Article  PubMed  Google Scholar 

  2. Benjafield AV, Ayas NT, Eastwood PR, Heinzer R, Ip MSM, Morrell MJ, et al. Estimation of the global prevalence and burden of obstructive sleep apnoea: a literature-based analysis. Lancet Respir Med. 2019;7(8):687–98.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Young T, Peppard PE, Gottlieb DJ. Epidemiology of obstructive sleep apnea: a population health perspective. Am J Respir Crit Care Med. 2002;165(9):1217–39.

    Article  PubMed  Google Scholar 

  4. Trzepizur W, Blanchard M, Ganem T, Balusson F, Feuilloy M, Girault J-M et al. Sleep apnea–specific hypoxic burden, symptom subtypes, and risk of cardiovascular events and all-cause mortality. 2022;205(1):108–17.

  5. Rundo JV. Obstructive sleep apnea basics. Cleve Clin J Med. 2019;86(9 Suppl 1):2–9.

    Article  PubMed  Google Scholar 

  6. Kapur VK, Auckley DH, Chowdhuri S, Kuhlmann DC, Mehra R, Ramar K, et al. Clinical practice Guideline for Diagnostic Testing for Adult Obstructive Sleep Apnea: an American Academy of Sleep Medicine Clinical Practice Guideline. J Clin Sleep Med. 2017;13(3):479–504.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Surani S, Sheta A, Turabieh H, Park J, Mathur S, Katangur A. Diagnosis of Sleep Apnea using Artificial neural network and binary particle Swarm optimization for feature selection. Chest. 2019;156(4).

  8. Kim YJ, Jeon JS, Cho SE, Kim KG, Kang SG. Prediction models for obstructive sleep apnea in Korean adults using machine learning techniques. Diagnostics (Basel). 2021;11(4).

  9. Jonassen TM, Bjorvatn B, Saxvig IW, Eagan TM, Lehmann S. Clinical information predicting severe obstructive sleep apnea: a cross-sectional study of patients waiting for sleep diagnostics. Respir Med. 2022;197:106860.

    Article  PubMed  Google Scholar 

  10. Huo J, Quan SF, Roveda J, Li A. BASH-GN: a new machine learning-derived questionnaire for screening obstructive sleep apnea. Sleep Breath. 2022.

  11. Huang WC, Lee PL, Liu YT, Chiang AA, Lai F. Support vector machine prediction of obstructive sleep apnea in a large-scale Chinese clinical sample. Sleep. 2020;43(7).

  12. Holfinger SJ, Lyons MM, Keenan BT, Mazzotti DR, Mindel J, Maislin G, et al. Diagnostic performance of machine learning-derived OSA Prediction Tools in large clinical and community-based samples. Chest. 2022;161(3):807–17.

    Article  PubMed  Google Scholar 

  13. Bozkurt S, Bostanci A, Turhan M. Can statistical machine learning algorithms help for classification of obstructive sleep apnea severity to optimal utilization of Polysomnography resources? Methods Inf Med. 2017;56(4):308–18.

    Article  PubMed  Google Scholar 

  14. Srinivasulu A, Mohan S, H T, S P, R Y, editors. Apnea Event Detection Using Machine Learning Technique for the Clinical Diagnosis of Sleep Apnea Syndrome. 2021 3rd International Conference on Signal Processing and Communication (ICPSC); 2021 13–14 May 2021.

  15. Rajesh K, Dhuli R, Kumar TS. Obstructive sleep apnea detection using discrete wavelet transform-based statistical features. Comput Biol Med. 2021;130:104199.

    Article  PubMed  Google Scholar 

  16. Lin CY, Wang YW, Setiawan F, Trang NTH, Lin CW. Sleep apnea classification Algorithm Development using a machine-learning Framework and Bag-of-features derived from Electrocardiogram Spectrograms. J Clin Med. 2021;11(1).

  17. Sabil A, Vanbuis J, Baffet G, Feuilloy M, Le Vaillant M, Meslier N et al. Automatic identification of sleep and wakefulness using single-channel EEG and respiratory polygraphy signals for the diagnosis of obstructive sleep apnea. 2019;28(2):e12795.

  18. Onargan A, Gavcar B, Çalışkan G, Akan A, editors. Prediction of Sleep Apnea using EEG signals and machine learning algorithms. IEEE; 2021. 2021 Medical Technologies Congress (TIPTEKNO).

  19. Almuhammadi WS, Aboalayon KA, Faezipour M, editors. Efficient obstructive sleep apnea classification based on EEG signals. 2015 Long Island Systems, Applications and Technology. IEEE; 2015.

  20. Magalang UJ, Dmochowski J, Veeramachaneni S, Draw A, Mador MJ, El-Solh A et al. Prediction of the apnea-hypopnea index from overnight pulse oximetry. 2003;124(5):1694–701.

  21. Ganglberger W, Bucklin AA, Tesh RA, Da Silva Cardoso M, Sun H, Leone MJ et al. Sleep apnea and respiratory anomaly detection from a wearable band and oxygen saturation. 2022;26(3):1033–44.

  22. Yu H, Deng C, Sun J, Chen Y, Cao YJS, Breathing. Cascading detection model for prediction of apnea-hypopnea events based on nasal flow and arterial blood oxygen saturation. 2020;24(2):483–90.

  23. Young T, Skatrud J, Peppard PEJJ. Risk factors for obstructive sleep apnea in adults. 2004;291(16):2013–6.

  24. Senaratna CV, Perret JL, Lodge CJ, Lowe AJ, Campbell BE, Matheson MC, et al. Prevalence of obstructive sleep apnea in the general population: a systematic review. Sleep Med Rev. 2017;34:70–81.

    Article  PubMed  Google Scholar 

  25. Tom C, Roy B, Vig R, Kang DW, Aysola RS, Woo MA, et al. Correlations between Waist and Neck circumferences and obstructive sleep apnea characteristics. Sleep Vigil. 2018;2(2):111–8.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Senaratna CV, English DR, Currier D, Perret JL, Lowe A, Lodge C, et al. Sleep apnoea in Australian men: Disease burden, co-morbidities, and correlates from the Australian longitudinal study on male health. BMC Public Health. 2016;16(Suppl 3):1029.

    Article  PubMed  Google Scholar 

  27. Deo RC. Machine learning in Medicine. Circulation. 2015;132(20):1920–30.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Ngiam KY, Khor WJTLO. Big data and machine learning algorithms for health-care delivery. 2019;20(5):e262–e73.

  29. Bandyopadhyay A, Goldstein C. Clinical applications of artificial intelligence in sleep medicine: a sleep clinician’s perspective. Sleep Breath. 2022.

  30. Watson NF, Fernandez CRJSMR. Artificial intelligence and sleep: advancing sleep medicine. 2021;59:101512.

  31. Berry R, Brooks R, Gamaldo C, Harding S, Lloyd R, Quan S et al. AASM Scoring Manual Updates for 2017 (Version 2.4). 2017;13(5):665–6.

  32. Johns MW. A New Method for Measuring Daytime Sleepiness: the Epworth Sleepiness Scale. Sleep. 1991;14(6):540–5.

    Article  CAS  PubMed  Google Scholar 

  33. Netzer N, Stoohs R, Netzer C, Clark K, Strohl KJA. Using the Berlin Questionnaire to identify patients at risk for the sleep apnea syndrome. 1999;131(7):485–91.

  34. Chung F, Abdullah H, Liao PJC, STOP-Bang Questionnaire. A practical Approach to screen for obstructive. Sleep Apnea. 2016;149(3):631–8.

    Google Scholar 

  35. Bjurgert J, Valenzuela P. Rojas CJItonn, systems l. Adapt Boosting Syst Identif. 2018;29(9):4510–4.

    Google Scholar 

  36. Meurer W, Tolles JJJ. Logistic Regression Diagnostics: Understanding How Well a Model Predicts Outcomes. 2017;317(10):1068-9.

  37. Silva F, Sanz M, Seixas J, Solano E, Omar YJNINNS. Perceptrons from Memristors. 2020;122:273–8.

    Google Scholar 

  38. Hothorn T, Lausen B, Benner A. Radespiel-Tröger MJSim. Bagging Survival Trees. 2004;23(1):77–91.

    Google Scholar 

  39. Zhang Z, Zhao Y, Canes A, Steinberg D, Lyashevska O. JAotm. Predictive analytics with gradient boosting in clinical medicine. 2019;7(7):152.

  40. Torlay L, Perrone-Bertolotti M, Thomas E, Baciu MJB. Machine learning-XGBoost analysis of language networks to classify patients with Epilepsy. 2017;4(3):159–69.

  41. Li X, Leung FHF, Su S, Ling SH. Sleep apnea detection using multi-error-reduction classification system with multiple bio-signals. Sens (Basel). 2022;22(15).

  42. Zeng X, Ren Y, Wu K, Yang Q, Zhang S, Wang D et al. Association between smoking behavior and obstructive sleep apnea: a systematic review and meta-analysis. Nicotine Tob Res. 2022.

  43. Maimon N, Hanly PJJJ. Does snoring intensity correlate with the severity of obstructive sleep apnea? 2010;6(5):475–8.

  44. Chiu HY, Chen PY, Chuang LP, Chen NH, Tu YK, Hsieh YJ, et al. Diagnostic accuracy of the Berlin questionnaire, STOP-BANG, STOP, and Epworth sleepiness scale in detecting obstructive sleep apnea: a bivariate meta-analysis. Sleep Med Rev. 2017;36:57–70.

    Article  PubMed  Google Scholar 

  45. He S, Li Y, Xu W, Han D. Using clinical data to predict obstructive sleep apnea. J Thorac Dis. 2022;14(2):227–37.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Liu WT, Wu HT, Juang JN, Wisniewski A, Lee HC, Wu D, et al. Prediction of the severity of obstructive sleep apnea by anthropometric features via support vector machine. PLoS ONE. 2017;12(5):e0176991.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Laharnar N, Herberger S, Prochnow LK, Chen NH, Cistulli PA, Pack AI, et al. Simple and unbiased OSA prescreening: introduction of a new morphologic OSA prediction score. Nat Sci Sleep. 2021;13:2039–49.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Zhang L, Yan Y, Li S, Li H, Lin Y, Li N et al. Moderate to severe OSA screening based on support vector machine of the Chinese population faciocervical measurements dataset: a cross-sectional study. 2021;11(9):e048482.

  49. Kuan Y, Hong C, Chen P, Liu W, Chung CJMb MBE. e. Logistic regression and artificial neural network-based simple predicting models for obstructive sleep apnea by age, sex, and body mass index. 2022;19(11):11409-21.

  50. Rudin C. Stop Explaining Black Box Machine Learning Models for high stakes decisions and use interpretable models instead. Nat Mach Intell. 2019;1(5):206–15.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Gunning D, Stefik M, Choi J, Miller T, Stumpf S, Yang GZ. XAI-Explainable artificial intelligence. Sci Robot. 2019;4(37).

  52. Hnin K, Mukherjee S, Antic NA, Catcheside P, Chai-Coetzer CL, McEvoy D, et al. The impact of ethnicity on the prevalence and severity of obstructive sleep apnea. Sleep Med Rev. 2018;41:78–86.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

The authors thank the Department of Otorhinolaryngology Head and Neck Surgery, The Second Affiliated Hospital of Xi’an Jiaotong University for the help of this study.

Funding

This work was supported by the National Natural Science Foundation of China (grant no. 62076198) and the Key Research and Development Program in the Social Development Field of Shaanxi, China (2020GXLH-Y005, 2021SF-286). The funding bodies played no role in the design of the study and collection, analysis, interpretation of data, and in writing the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

WL, SW, and XR designed the research. YS, YZ and CZ wrote the manuscript. YY, XN, YS, YX and HL collected the data. XC, LX and XH performed data curation. SW and WL analyzed and processed the data. XR directed the research. All authors reviewed the results and approved the final version of the manuscript. XR is the guarantor of the paper who takes full responsibility for the integrity of the work, from its inception to the published article.

Corresponding authors

Correspondence to Shinan Wu, Wenle Li or Xiaoyong Ren.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee of the Second Affiliated Hospital of Xi’an Jiaotong University (No.2020 − 1122). All methods involved in the collection of these data were performed in accordance with the relevant guidelines and regulations. All participants provided informed consent for data collection and analysis. All data were anonymized.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shi, Y., Zhang, Y., Cao, Z. et al. Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults. BMC Med Inform Decis Mak 23, 230 (2023). https://0-doi-org.brum.beds.ac.uk/10.1186/s12911-023-02331-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12911-023-02331-z

Keywords