Deep learning for detecting and early predicting chronic obstructive pulmonary disease from spirogram time series

Deep learning for detecting and early predicting chronic obstructive pulmonary disease from spirogram time series

Introduction

Chronic Obstructive Pulmonary Disease (COPD) is a progressively worsening lung disease that leads to difficulty breathing, limited activity, and a decline in quality of life1,2. As the disease progresses, COPD may also increase the risk of cardiovascular diseases3 and even lead to premature death. Therefore, timely and accurate COPD detection is crucial to reduce patient health risks4. Previous studies have shown a strong correlation between detecting the disease at an early stage and the success of its treatment. Failing to identify the disease during this crucial period will worsen its severity5,6.

Clinical diagnosis often involves identifying COPD patients by determining if the FEV1/FVC ratio is below 70%7,8,9. Researchers have discovered that the FEV1/FVC ratio method is not always accurate when used with people of different ages10,11. This means that some patients might not get the right diagnosis at the right time, missing out on early and personalized treatment options.

In recent years, researchers have used deep learning technologies12 to identify COPD characteristics by analyzing the Volume-Flow curve. This method helps to overcome the limitations of traditional approaches but cannot effectively predict an individual’s latent risk of developing COPD. Moreover, introducing deep learning technology has resulted in models lacking transparency, making it difficult to gain the trust of medical professionals and patients13. Therefore, developing an artificial intelligence interpretable algorithm that can accurately detect patients with COPD and early predict an individual’s latent risk of COPD is crucial for slowing disease progression and preventing patient mortality.

However, most existing methods for COPD risk analysis mainly face the following four challenges:

  • In order to determine the degree of airflow obstruction, it is necessary to generate Volume-Flow curves based on the original Time-Volume curves. Nevertheless, current techniques for generating Volume-Flow curves can yield unstable curves, potentially resulting in incorrect predictions by the model.

  • The length of the Volume-Flow curve varies among individuals due to differences in exhalation durations. Nevertheless, existing techniques commonly handle Volume-Flow curves with varying lengths by either filling in missing values or truncating through downsampling. The former can readily add extraneous noise, whereas the latter could sacrifice important data dependencies.

  • Existing deep learning models are still black-box models that can only produce detection outcomes without offering an explainer for those results. Models lacking transparency may find it challenging to gain the trust of medical professionals and patients.

  • Currently, existing methods can only detect patients who have already had COPD based on obvious characteristics displayed on the spirogram (In this article, the spirogram specifically involves measuring Volume-Flow curve time series). However, these methods fail to early predict an individual’s probability of COPD in the future based on changes in the spirogram.

To address the aforementioned challenges, we propose DeepSpiro, a method based on deep learning for early prediction of future COPD risk (Fig. 1). Specifically, this paper makes four major contributions:

  • We apply a method for constructing Volume-Flow curves guided by Time-Volume instability smoothing (SpiroSmoother), which uses a curve smoothing algorithm to precisely enhance the stability of the Volume-Flow curve while retaining the essential physiological information from the original Volume-Flow data.

  • We develop a COPD identification method based on learning from varied-length key patch variability-pattern (SpiroEncoder). This algorithm can dynamically calculate the “key patch” number that is best suited for each time series data patch, thereby unifying the time series representation and extracting key physiological information from the original high-dimensional dynamic sequence into a unified low-dimensional time series representation.

  • We propose a method for explaining the model based on volume attention and heterogeneous feature fusion (SpiroExplainer). This method combines the probability of having COPD with demographic data like age, sex, and probability. This information is then used as input for the model, which outputs a COPD risk assessment and offers an explainer for the model’s decisions regarding the individual.

  • We developed a method for predicting the risk of COPD based on the variability pattern of key patch concavity (SpiroPredictor) for the first time. Our method can precisely forecast the probability of disease onset in undiagnosed high-risk patients over the next 1–5 years and beyond. Additionally, it can accurately categorize these patients, thereby addressing the current deficiency in early predicting future COPD risk.

Fig. 1: Our input module uses the raw Time–Volume curve time series collected from hospitals and patient demographic data, which are then passed into our AI-based model module.
Deep learning for detecting and early predicting chronic obstructive pulmonary disease from spirogram time series

The AI-based model module is divided into four tasks (see the section “Overview” for details). After processing through the AI-based model module, the output data is handled by the output module. If the AI-based model diagnoses the individual as a COPD patient, we will output their diagnosis results and the interpretability figure of the model. If the AI-based model diagnoses the individual as a non-COPD patient, we will output their risk of developing COPD over the next 1-5 years.

Full size image

Our work has made COPD detection and early risk prediction more accurate, ultimately contributing to improved clinical decision-making and patient prognosis. By providing interpretable results and predicting future risks, DeepSpiro has the potential to become a valuable early screening tool. This could help delay disease progression and may potentially reduce patient mortality.

Results

Model detection and prediction performance

In this study, we aim to detect COPD and validate DeepSpiro superiority by comparing it to a ResNet18 baseline model. Previously, Justin Cosentino et al. proposed a baseline model in a paper published in Nature Genetics12. It was trained and evaluated on the UK Biobank dataset, with evaluation metrics including AUROC, AUPRC, and F1-score. To ensure a fair comparison, we used the same dataset and evaluation criteria. In the COPD detection task, DeepSpiro achieved an AUROC of 83.28%, an AUPRC of 35.70%, and an F1-score of 39.50% as shown in Fig. 2. DeepSpiro outperformed the baseline model in these metrics by 1.16%, 1.24%, and 1.36%, respectively.

Fig. 2: Evaluation comparison.
figure 2

a–c Comparison of receiver operating characteristic (ROC) curves for three prediction methods—DeepSpiro COPD risk predictions, Volume-Flow ResNet18 COPD risk predictions, and FEV1/FVC ratio-based risk—across three evaluation tasks: All COPD patients (left), future COPD-related hospitalization (center), and COPD-related death (right). d-f Comparison of precision-recall (PR) curves across three evaluation tasks—All COPD patients (left), future COPD-related hospitalization (center), and COPD-related death (right)—for three prediction methods: DeepSpiro COPD risk predictions, Volume-Flow ResNet18 COPD risk predictions, and FEV1/FVC ratio-based risk. The error bars represent bootstrapped 95% confidence intervals (n = 100 bootstrapping samples).

Full size image

Table 1 presents a comparison of different metrics prior to and following signal enhancement. Based on the experimental findings, our signal enhancement technique effectively improves the stability of the Volume-Flow curve without compromising the essential physiological information of the original Volume-Flow data. This enhancement greatly aids in evaluating patients with COPD.

Table 1 Evaluation comparison table
Full size table

DeepSpiro emphasizes data integrity and the preservation of key information, in contrast to the Volume-Flow ResNet18 method (Baseline model). Based on the experimental findings, DeepSpiro betters the Volume-Flow ResNet18 method in all categories, as evidenced by higher values of key metrics such as AUROC, AUPRC, and F1-score.

In terms of parameters, DeepSpiro also performs exceptionally well. In order to demonstrate the specific advantages of our proposed DeepSpiro with respect to certain parameters, a comparative study is carried out between DeepSpiro and the Volume-Flow ResNet18 model. Specifically, Pytorch is used to build both models and utilize the top library to calculate both models’ parameter count and floating-point operations. The calculation results are shown in Supplementary Table 4. The results indicate that DeepSpiro’s parameter count and floating-point operations are significantly lower than those of the Volume-Flow ResNet18 model.

By leveraging demographic information, the model can gain a more comprehensive understanding of the patient’s background14 and on this basis, improve its prediction accuracy. Additionally, incorporating FEV1/FVC, the gold standard for COPD diagnosis, into the model can enhance the model’s capability in diagnosing COPD. To more comprehensively reflect the improvement in the model’s diagnostic capability by demographic information and the FEV1/FVC diagnostic gold standard15,16,17, we concatenate the probabilities outputted by module II (Fig. 8II), demographic information, and the FEV1/FVC diagnostic gold standard into a new vector and input it into a logistic regression model. Figure 3 is a nomogram18, from which we can intuitively see that the added demographic information and the FEV1/FVC diagnostic gold standard enhance the model’s diagnostic accuracy. Therefore, the model explainer method based on volume attention and heterogeneous feature fusion has a promotive effect on the diagnosis of COPD.

Fig. 3: The nomogram of COPD detection.
figure 3

The nomogram illustrates the contribution of demographic information and the FEV1/FVC diagnostic gold standard to the model’s diagnostic accuracy for COPD. The nomogram allows for visual estimation of the probability of COPD diagnosis by assigning weighted scores to each variable. The “threshold” in the figure represents the cutoff value at which the predicted probability indicates a positive diagnosis for COPD. For instance, a threshold of 0.5 means that a predicted probability greater than 0.5 would be considered indicative of COPD.

Full size image

In summary, traditional pulmonary function assessment indicators cannot fully reflect the disease’s complexity and patients’ overall health status. As shown in Table 1, DeepSpiro is more advanced than traditional pulmonary function assessment indicators and the Volume-Flow ResNet18 model.

Future risk prediction analysis

To better explain the results, we categorize the curves into two groups: early-phase curves (PEF~FEF25 and FEF25~FEF50) and late-phase curves (FEF50–FEF75 and FEF75+). As shown in Fig. 4a, by observing the trend of changes in the two types of curves, we can see a significant difference in the degree of concavity. The concavity of the early-phase curves decreases as disease risk decreases, while the concavity of the late-phase curves increases as disease risk decreases. By observing Fig. 4a, we find that individuals with higher disease risk (e.g., 1 year) are more likely to experience curve collapse in the early phases, whereas individuals with lower risk (e.g., Non-COPD) tend to exhibit curve collapse in the late phases. As the risk of disease increases, the phase in which the curve collapse occurs tends to shift earlier.

Fig. 4: Future COPD prediction overview.
figure 4

a The left vertical axis represents the predicted probability of COPD, while the right vertical axis indicates the concavity degree based on the directed area metric for each phase. Each plot displays the mean concavity degree for the respective phase, illustrating how the concavity changes over time in relation to COPD risk. b This figure illustrates the probabilities of not being diagnosed with COPD over time for the high-risk and low-risk groups, as predicted by the model. The X-axis represents the time since the pulmonary function test, while the Y-axis shows the probability of not being diagnosed with COPD at each time point. Due to right censoring, not all high-risk patients are diagnosed within the observation period, resulting in probabilities that remain above zero. c As the onset time progresses, the concavity of the patient’s Volume-Flow curve decreases year by year.

Full size image

To more intuitively demonstrate the relationship between disease risk and the timing of curve collapse, we designed a concavity trend measurement model. This model uses the formula: ({mathscr{C}}(PEF sim FEF25)) + ({mathscr{C}}(FEF25 sim FEF50))({mathscr{C}}(FEF50 sim FEF75))({mathscr{C}}(FEF75+)) to measure the concavity trend of the curve. From the formula, it can be seen that a larger concavity trend value indicates that the curve collapse (the phase where expiratory flow rate drops most significantly) occurs in the earlier phases, whereas a smaller concavity trend value indicates that the curve collapse occurs in the later phases.

Figure 4b includes the future disease risk of high-risk and low-risk populations. The model predicts individuals as a high-risk group for those identified as at risk of disease and as a low-risk group for those predicted not to have the disease. We plotted the changes in disease risk for both groups as curves, showing the proportion diagnosed with COPD over time. The X-axis represents the time elapsed since the pulmonary function test, and the Y-axis represents the proportion of patients diagnosed with COPD at a specific timestamp within the population, i.e., the risk of disease. From the figure, it can be seen that from the time of the pulmonary function test, the disease risk for the model-predicted high-risk group shows an exponential increase over time, while the disease risk for the low-risk group remains near zero, essentially unchanged with a slight and weak increase. High-risk and low-risk groups also exhibit significant differences, with a p-value of <0.001.

In order to obtain the expected Volume-Flow curve for different periods, we used the k-median algorithm to extract features from the model for samples corresponding to different risk periods (1 year, 2 years, 3 years, 4 years, 5+ years, Non-COPD). We then determined the median sample for each risk period and used it as the expected Volume-Flow curve for that stage. Similarly, to obtain the expected Volume-Flow curve for different subgroups, we subdivided the groups according to their classifications and used k-median analysis to find the median sample for each subgroup, using it to represent the expected Volume-Flow curve for that subgroup.

As shown in Fig. 4c, as the onset time progresses, the concavity of the patient’s Volume-Flow curve decreases year by year. This trend also indicates that the impact of the disease on lung function is more significant in the later stages, and the closer it is to the onset time, the more apparent the disease becomes.

From the predictive distribution probabilities over future times and the future disease risk scenarios for high-risk and low-risk populations, it can be seen that DeepSpiro can effectively predict the future development trends of the disease, thereby providing better treatment and management plans for patients.

Result of subgroup analysis

To further explore the diagnostic accuracy of the model across different subsets of the population, subgroup analyses for various age groups, sex, and smoking statuses are conducted.

The subgroup analysis results for smokers illustrate the model’s application scenarios. Among patients with COPD, the prevalence rate in smokers is higher than in non-smokers19. For our model, as shown in Fig. 5a, the prediction probability for smokers is higher than for non-smokers, and the prediction range for smokers is also wider than that for the non-smoking population. This is consistent with clinical significance. Regarding the p-value, the model demonstrates a strong discriminative ability between the two groups.

Fig. 5: Subgroup analysis overview.
figure 5

a The subgroup analysis for smoking. b The subgroup analysis by sex. c The subgroup analysis by age. d As the onset time progresses, an individual’s concavity measure gradually decreases. Compared to non-smokers, smokers show significantly higher lung function concavity measures. e As the onset time progresses, an individual’s concavity measure gradually decreases. Compared to females, males show significantly higher lung function concavity measures. f As the onset time progresses, an individual’s concavity measure gradually decreases. Compared to younger patients, older patients show significantly higher lung function concavity measures.

Full size image

The subgroup analysis results for sex showcase the model’s application scenarios. The 2018 China Adult Lung Health Study, which surveyed 50,991 individuals across 10 provinces and cities, has shown that the incidence rate among men is higher than that among women, with men at 11.9% and women at 5.4%. For our model, as shown in Fig. 5b, the prediction probability for men is higher than for women, and the prediction range for men is also greater than that for women. This aligns with clinical significance. Regarding the p-value, the model demonstrates a strong discriminative ability between the two groups.

Age is a significant factor influencing the development of COPD. Typically, the high-incidence age range for COPD is between 45 and 80 years. Within this age range, patients’ lung functions tend to decline gradually. Particularly after age 45, the trend of declining lung function becomes more apparent. After reaching 80 years, lung function further declines, and patients’ immune systems weaken, making them relatively more susceptible to diseases. Therefore, prevention and early identification of COPD are especially important for this age group. We divide patients into Youth (18–44), Middle (45–55), and Elderly (55 and above) for subgroup analysis, as shown in Fig. 5c. The results indicate that DeepSpiro’s prediction probability for smokers is higher than for non-smokers, and the prediction range for smokers is also larger than for the non-smoking population, consistent with clinical significance. Regarding the p-value, the model shows strong discriminative ability between the two groups.

Additionally, we compare the AUROC of DeepSpiro, the Volume-Flow ResNet18 model, and the FEV1/FVC metric across different subgroups, as shown in Table 2. Across all subgroups, DeepSpiro’s AUROC is the highest, demonstrating DeepSpiro’s applicability in different subgroups.

Table 2 Comparison of subgroup evaluations
Full size table

We also conducted a subgroup analysis for individuals across different future periods. As shown in Fig. 5d, as the onset time progresses, an individual’s concavity measure gradually decreases. Compared to non-smokers, smokers show significantly higher lung function concavity measures, further confirming the impact of smoking on COPD. This trend is consistent with clinical observations, highlighting the association between long-term smoking and lung function decline.

As shown in Fig. 5e, as the onset time progresses, an individual’s concavity measure gradually decreases. Compared to females, males show significantly higher lung function concavity measures, which are typically related to higher smoking rates, different lifestyle habits, and physiological differences in men. This aligns with the expected results from clinical research.

As shown in Fig. 5f, as the onset time progresses, an individual’s concavity measure gradually decreases. Compared to younger patients, older patients show significantly higher lung function concavity measures, which may be related to natural lung function decline with age, weakened immunity, and other factors. This is consistent with clinical observations and expected results. (Note: Due to the small sample size in the Youth (18–44) group, it was not possible to subdivide and compare across different stages, so only the Middle (45–55) and Elderly (55+) groups were compared.)

Result of SHAP analysis

To further analyze whether the model’s predictions hold clinical significance, we conduct a SHAP analysis20.

As shown in the beeswarm plot in Fig. 6a, the relative importance of features is revealed. The figure shows that the higher the COPD risk value, the greater its impact on the model. Smoking, being male, and older age all influence the model’s judgment. DeepSpiro’s findings are consistent with related research conclusions, indicating that smokers, males, and older patients have a higher risk of being predicted as having COPD.

Fig. 6: SHAP analysis overview.
figure 6

a Brighter colors mean that the feature has a more positive impact on the model’s predictions. The blow ratio represents the FEV1/FVC value, an important indicator of lung function. b Relationship between smoking status and predicted risk of COPD. The figure illustrates how the risk value predicted by DeepSpiro correlates with smoking status, indicating a higher risk value is associated with a larger proportion of smokers. This result is consistent with clinical findings, as smokers are more likely to develop COPD. c Relationship between age and predicted risk of COPD. The figure demonstrates that the predicted risk value of DeepSpiro increases with age, indicating that older individuals have a higher risk of developing COPD. This observation is consistent with clinical findings, which show that older patients are more susceptible to COPD.

Full size image

We examine the relationship between smoking and risk values as well as age and risk values and present our results with dependency graphs. Figure 6b shows the relationship between smoking and the risk of COPD. From the figure, it is evident that the predicted risk value of DeepSpiro is closely related to smoking status. The higher the risk value, the larger the proportion of smokers. This aligns with clinical significance. For COPD patients, smokers are more susceptible to the disease.

Figure 6c illustrates the relationship between age and the risk of COPD. From the figure, it can be seen that the predicted risk value of DeepSpiro is closely related to age. The higher the risk value, the older the age. This aligns with clinical significance. For COPD patients, older patients are more susceptible to the disease.

Model explainer analysis

Healthcare professionals often struggle to accurately identify and locate disease information directly from long sequence data in COPD classification tasks. To address this issue, we utilize SpiroExplainer, automatically focusing on the diseased regions of the Volume-Flow data.

The proposed method can automatically capture anomalies from long sequence Volume-Flow data, providing specific patches with abnormalities. Figure 7 shows that the model can automatically focus on key patches that distinguish COPD patients. Brighter colors indicate higher attention in the corresponding areas, while darker colors indicate lower attention.

Fig. 7: The explainer of the model.
figure 7

The brighter the color, the more attention the model pays. a A model interpretability figure for cases where non-COPD patients detected as non-COPD by the model. b A model interpretability figure for cases where non-COPD patients detected as COPD by the model. c A model interpretability figure for cases where COPD patients detected as non-COPD by the model. d A model interpretability figure for cases where COPD patients detected as COPD by the model.

Full size image

An interesting phenomenon is that for non-COPD patients, as shown in Fig. 7a, the Volume-Flow curve in the FEF25–FEF50 area appears “full,” without the small airway collapse phenomenon near FEF75 at the tail of the curve. DeepSpiro primarily focuses on these two areas near FEF25–FEF50 and FEF75, which match the characteristics of the non-COPD patient group. On the other hand, for COPD patients, as shown in Fig. 7d, the curve’s tail near the FEF75 area collapses due to a small airway collapse resulting from COPD. DeepSpiro focuses on the area near FEF75, aligning with the characteristics of the COPD patient group. Thus, DeepSpiro can effectively differentiate and assess the features of different populations.

In analyzing cases where the model made incorrect predictions, we found that for some individuals who might have asthma, as shown in Fig. 7b, due to insufficient inhalation, the curve after FEF75 shows a weak exhalation characteristic similar to the small airway collapse seen in COPD patients. Since the model mainly focuses on the area near FEF75, it leads to lower prediction accuracy for these asthma patients. For some individuals who might have COPD, as shown in Fig. 7c, due to uneven exhalation effort or mid-breath inhalation by COPD patients, the Volume-Flow curve might present double or multiple peaks, appearing “full” in the FEF25–FEF50 range similar to non-COPD patients. DeepSpiro primarily focuses on areas near FEF25–FEF50 and FEF75, which also leads to lower prediction accuracy for these COPD patients. We believe the model’s predictions are medically interpretable, and due to the complexity and overlap of disease characteristics, it is challenging to avoid prediction errors completely.

Discussion

Recent research on COPD has primarily focused on three areas: detection, prediction, and genetic studies. The application of machine learning and deep learning has achieved significant progress in disease detection. Researchers have used advanced deep learning models, such as convolutional neural networks (CNNs), to analyze various types of data like e-nose signals21, spirometry data12, lung sounds22, and CT images23, playing a vital role in COPD detection and classification. To improve diagnostic accuracy, efforts have also been made to integrate multi-source data, such as combining multi-omics data like proteomics and transcriptomics with disease-specific protein/gene interaction information24 or integrating imaging data with questionnaire data25, providing a more comprehensive diagnostic foundation. In data processing, specialized preprocessing workflows have been developed to eliminate noise and enhance dataset quality, such as denoising and feature extraction for lung sounds26. In the field of prediction, machine learning models have been employed to forecast COPD disease progression and survival rates27,28, including the use of deep neural networks to predict COPD stages29, and applying anomaly detection methods to identify COPD manifestations in chest CT scans30, which have shown remarkable effectiveness in detecting lung function impairment and disease severity. In genetic research, bioinformatics approaches have been used to identify key genes and genetic markers associated with COPD31,32,33,34, deepening our understanding of COPD pathogenesis and providing valuable insights for early detection and potential therapeutic target development.

In this paper, we designed a deep-learning method for detecting and early-predicting COPD from the Volume-Flow curve time series. Specifically, we use SpiroSmoother, thereby enhancing the stability of the Volume-Flow curve. We propose SpiroEncoder, which unifies the temporal representation of “key patches” and achieves the conversion of key physiological information from high to low dimensions. Utilizing SpiroExplainer, we fuse diagnostic probabilities with demographic information to improve the accuracy of COPD risk assessment and the model’s explainer. The results and explainer of the model can provide patients with timely COPD risk reports. Moreover, we propose SpiroPredictor, by incorporating key information such as the degree of concavity, the model can accurately predict the future disease probability of high-risk patients who have not yet been diagnosed, thereby enabling early intervention and treatment to slow the progression of the disease. Our proposed DeepSpiro model has two primary functions. First, it can detect COPD with high performance, achieving AUC and other performance metrics that surpass those of current SOTA models. Second, our model can predict the risk of developing COPD over the next 1–5 years using key patch concavity information, an innovative method we have proposed for the first time. Experimental results show that our novel method has high accuracy in predicting future COPD risk.

Our work does have some limitations. First, although DeepSpiro has shown good performance in detecting and predicting COPD using spirogram time series in the UK Biobank, its generalizability to other population groups remains uncertain. The characteristics of the spirogram may vary among different populations, which could potentially affect the model’s performance.

Second, while DeepSpiro demonstrates high accuracy in predicting COPD using spirogram time series in a research setting, further validation in real clinical environments is crucial. The transition from research to practical application may present unforeseen challenges, such as variations in data quality, increased noise, and differences in clinical workflows. These factors could impact the model’s performance, necessitating additional adjustments and optimizations in future work.

Third, although our model demonstrated high AUROC values across most prediction intervals, its performance in terms of AUPRC was less satisfactory, particularly for predicting COPD onset within 1–2 years. We attribute this to the inherent class imbalance in the dataset (see Supplementary Table 1), where the proportion of new COPD cases is extremely low. Since AUPRC is highly sensitive to class imbalance, it may underestimate the true performance of the model in such cases. To investigate whether this issue stemmed from our methodology or the inherent challenges of the task, we compared our model with the approach proposed by Cosentino et al.12. The comparison results (see Supplementary Table 2 and Supplementary Fig. 1) revealed that under similar experimental conditions, their model also exhibited low AUPRC values, highlighting the general difficulty of identifying rare COPD onset events in highly imbalanced datasets. Nevertheless, we observed that for certain prediction windows, such as those exceeding 5 years, our method achieved comparable or superior AUPRC values. This indicates that incorporating concavity features from the volume-flow curve alongside demographic information helps capture critical features over longer time horizons. From a clinical perspective, the importance of early COPD detection often outweighs the risk of false positives. Individuals flagged as high-risk can undergo further confirmatory assessments, such as chest imaging. Additionally, the robust AUROC performance of our model across various prediction windows demonstrates its strong discriminative capability, making DeepSpiro a valuable early screening tool. This, in turn, has the potential to improve patient care quality and optimize resource allocation in real-world medical practice.

In the future, we will focus on training and validating the model using datasets from diverse geographic locations and population backgrounds. This will help assess the impact of demographic and environmental factors on model performance and improve its generalizability. Additionally, we plan to develop user-friendly software tools that allow doctors to conveniently apply the model in clinical settings and obtain reliable and understandable results to assist in diagnosing and treating patients. Finally, we aim to improve the model’s generalizability by incorporating more high-quality datasets, enabling it to provide effective predictions in a wider range of scenarios.

Methods

Problem definition

For a COPD dataset D = {X, Y} composed of N instances, where X = {x1, x2, …, xj, …, xN} represents the monitoring data for N instances, and Y = {y1, y2, …, yj, …, yN} represents the COPD disease labels for the N instances. The disease label for the jth instance, yj {0, 1}, with 0 indicating no disease and 1 indicating the presence of disease. The monitoring data xj = {sj, dj, aj} includes the spirogram data sj, demographic information dj = {dgj, daj, dsj, …}, and key concavity information aj = {aj,pef−fef25, aj,fef25−fef50, aj,fef50−fef75, aj,fef75+}. Here, the spirogram data is a varied-length sequence sj = {sj,1, sj,2, …sj,i, …} representing airflow variation over time during exhalation. In the demographic information, dgj denotes sex, daj represents age, and dsj indicates smoking status. The key patch concavity information includes aj,pef−fef25 for the concavity information between PEF and FEF25, aj,fef25−fef50 for the concavity information between FEF25 and FEF50, aj,fef50−fef75 for the concavity information between FEF50 and FEF75, and aj,fef75+ for the concavity information beyond FEF75.

DeepSpiro takes the monitoring data X of N instances as input and produces predicted disease labels (hat{Y}={{hat{y}}_{1},{hat{y}}_{2},ldots ,{hat{y}}_{j},ldots ,{hat{y}}_{N}}), explainer (hat{E}={{hat{e}}_{1},{hat{e}}_{2},ldots ,{hat{e}}_{j},ldots ,{hat{e}}_{N}}), COPD risk values R and the risk coefficients contributed by each monitoring input dimension Rs, Rdg, Rda, Rds, etc., as well as the probabilities of future disease occurrence for high-risk undiagnosed COPD patients Fcopd, F1year, F2year, F3year, F4year, F5year+, Fnon−copd. Here, yj {0, 1}, where 0 indicates no disease, and 1 indicates the presence of disease.

Overview

Figure 8 illustrates the architecture of DeepSpiro, where the blocks represent different components of the framework. DeepSpiro is primarily divided into two tasks: COPD detection and early COPD prediction. The inputs to DeepSpiro always include spirometry data and demographic information (such as age, gender, smoking status, etc.). For the COPD detection task, the model outputs interpretability data and a COPD risk score. For the early prediction task, it outputs the probability of the individual developing COPD within the next 1–5 years. It is important to note that DeepSpiro only proceeds with the early prediction task if the detection task determines that the individual does not currently have COPD. Therefore, if the detection results indicate that the patient has COPD, the model’s final output is whether the patient has COPD. If not, the output includes both the current COPD status and the probability of developing COPD in the next 1–5 years.

Fig. 8: Framework overview.
figure 8

It is divided into four modules. In the first module, we process the varied-length original volume curve and convert it into a smoothed Volume-Flow curve. In the second module, we extract features from the varied-length Volume-Flow curve. In the third module, combined with demographic information, we output the COPD detection results and model explainer. In the fourth module, based on concavity information, we output the risk values for COPD at different future periods.

Full size image

DeepSpiro can be divided into four main modules: (1) SpiroSmoother (see Fig. 8I): constructing Volume-Flow curves guided by Time-Volume instability smoothing: This enhances the stability of key physiological information in the original Volume-Flow data. (2) SpiroEncoder (see Fig. 8II): extracting critical features from the variability-pattern of varied-length key patches: This dynamically identifies “key patches” and unifies the time-series representation, achieving the conversion of key physiological information from high-dimensional to low-dimensional. (3) SpiroExplainer (see Fig. 8III): explaining model based on volume attention and heterogeneous feature fusion: This combines diagnostic probabilities with demographic information to assess COPD risk and provides an explainer of the model’s decisions. (4) SpiroPredictor (see Fig. 8IV): predicting the risk of COPD based on the variability pattern of key patch concavity: This integrates key concavity information to assess COPD risk for various future periods.

Specifically, the original spirogram data is processed with signal enhancement technology to obtain smoothed Volume-Flow data. The smoothed Volume-Flow data is used as input in DeepSpiro for varied-length time series data in order to extract essential physiological information. Subsequently, the model combines the important physiological details and relevant demographic information using volume attention and heterogeneous feature fusion techniques. This process enables the model to output a COPD risk assessment and provide an explainer for its decisions. By using the COPD risk assessment values and concavity information obtained from the Volume-Flow signals in the feature fusion model, we can make precise predictions about the probability of future disease in high-risk patients who have not yet received a diagnosis.

SpiroSmoother: constructing Volume-Flow curves guided by Time-Volume instability smoothing

The current state-of-the-art (SOTA) method for detecting COPD from the Volume-Flow curve using deep learning has been proposed by J. Cosentino et al.12. This method involves converting original Time-Volume curves into Volume-Flow curves, which can result in instability in the curves (as shown in Fig. 9C). Extensive experimental validation has been demonstrated that these instabilities can impact the final risk assessment for COPD patients. To address this, we employ SpiroSmoother. This approach precisely enhances the stability of the original Volume-Flow curves while preserving key physiological signals within the original Volume-Flow data.

Fig. 9: Spirometry curve and smoothing effects overview.
figure 9

a Examples of Volume-Flow diagrams for different populations. b The conversion of the original Time-Volume curve. To obtain the degree of airflow limitation, we need to use finite difference methods on the original Time-Volume curve to calculate the corresponding Volume-Flow curve. c Smoothing comparison, the original curve shows fluctuations in some areas when converting to Time-Flow and Volume-Flow curves. After smoothing, the curves in the same positions become more stable.

Full size image

As shown in Fig. 9c, we observe instability when converting the Time-Volume curve into a Time-Flow curve. Therefore, we believe that smoothing the original Time-Volume curve could help enhance the stability of both the Time-Flow curve and the Volume-Flow curve.

Gaussian filtering is employed to implement instability smoothing guidance for the original Time-Volume curves. This method achieves smoothing by taking a weighted average of the areas around the data points, where the weight of each data point is determined by a Gaussian function. For a given data point xi its smoothed value yi can be defined as

$${y}_{i}=frac{mathop{sum }nolimits_{j = -k}^{k}{x}_{i+j}cdot g(j)}{mathop{sum }nolimits_{j = -k}^{k}g(j)}$$
(1)

In this formula, g(j) represents the Gaussian function, k is the filter’s window size, and xi+j represents the original data point and its adjacent points. The formula calculates the weighted average of data points around xi, using weights derived from the Gaussian function g(j). This function is centered at xi, and its standard deviation determines both the range and the degree of smoothing. Through this method, Gaussian filtering effectively smooths the data while preserving important feature information.

To convert the smoothed Time-Volume curve into a Volume-Flow curve, we utilize the finite difference method to approximate the first derivative of the volume data with respect to time. Simply put, it involves calculating the volume change rate at consecutive time points to obtain corresponding flow data. The flow Q(t) can be obtained by calculating the derivative of volume V(t) with respect to time t, where flow Q(t) is defined as

$$Q(t)approx frac{V(t+Delta t)-V(t)}{Delta t}$$
(2)

where V(t) represents a set of Time-Volume curve data, Δt is the time interval between two adjacent time points, and t denotes the time points.

We linearly interpolate the calculated flow data to ensure that the Time-Flow curve is consistent with the original Time-Volume curve in terms of time data. In this way, we can construct the Volume-Flow curves ({mathscr{F}}(v)) using the Time-Flow curves Q(t) and the Time-Volume curves V(t). The Volume-Flow curves ({mathscr{F}}(v)) is defined as

$${mathscr{F}}(v)=Q({V}^{-1}(v))$$
(3)

As shown in Fig. 9c, the Time-Flow curve and the Volume-Flow curve have been constructed using the Time-Volume instability smoothing guidance technique and have effectively eliminated the original fluctuations, significantly enhancing the data’s stability. In subsequent experiments, we have used these stable Volume-Flow curves to replace the original data for model training. Experimental results have confirmed that the curves processed through instability smoothing guidance demonstrate superior performance in risk assessment.

SpiroEncoder: extracting critical features from the variability-pattern of varied-length key patches

Existing methods for modeling varied-length time series data generally use missing value imputation or downsampling truncation. The former introduces external noise, while the latter will likely lose meaningful data dependencies. We aim to preserve the key signals of the original data and adapt them to model input. To this end, we propose SpiroEncoder.

First, we need to reconstruct the original data. We have adopted an adaptive temporal decomposition method, which dynamically calculates the most suitable “key patches” count for each time series patch, accurately capturing the key Volume-Flow information within each time patch. By introducing the hyperparameter k, we can calculate the number of key patches based on the ratio of the sequence length to k, ensuring that each patch contains enough information for subsequent analysis. For a given length of time series L, the number of key patches S can be defined as

$$S=leftlceil frac{L}{k}rightrceil$$
(4)

where represents the ceiling function, ensuring that meaningful data dependencies are not lost even when L is not divisible by k.

We can obtain the number of key patches for each sequence using the above-mentioned method. Thus, we can divide each sequence into several key patches. We feed all the key patches into the Net1D network35 to extract the spatial features between the patches and the temporal features within each patch. The Net1D network extracts spatial features for us. We must compress the high-dimensional spirogram space into the same low-dimensional key patch to adapt these features for our subsequent modules.

First, we determine the length of the most extended sequence in the dataset and use this length along with the preset hyperparameter k. Calculate the number of key patches all sequences should have, thereby normalizing the temporal representation of the data. For the longest sequence length ({L}_{max }) in the dataset, the maximum number of patches is defined as

$${N}_{max }=leftlceil frac{{L}_{max }}{k}rightrceil$$
(5)

We perform encoding for the key features of each sequence. This encoding marks the key patches and records them as a mask. For each sequence, we create a mask vector. Parts of the vector where key patches exist are masked as 1, and parts extending from the end of the existing key patches to the maximum number of patches are masked as 0.

We construct a zero tensor (Oin {{mathbb{R}}}^{Ntimes Stimes C}), where N represents the number of samples, S represents the number of key patches, and C represents the number of output channels. Subsequently, we selectively apply these encodings to the zero tensor O using the mask vector. For each sample, the mask vector indicates where to insert the features of key patches. Specifically, if the mask vector is one at a certain position, then we insert the encoding at the corresponding position in the zero tensor O; if the mask vector is 0 at a position, the zero tensor remains unprocessed at that position. (This part also constitutes the padding of 0s for the sequence.) Thus, the zero tensor O transforms into a fixed-length sequence containing encoding information, and we refer to the transformed zero tensor O as the feature tensor of key patches O. For each sample i and its key patches j, we use the mask vector Mi,j to decide whether to apply encoding. Oi,j,: can be defined as

$${O}_{i,j,:}={M}_{i,j}cdot {E}_{i,j,:}$$
(6)

where the features extracted by Net1D for each key patch are ({E}_{i,j,:}in {{mathbb{R}}}^{C}.)

Because the process of constructing the key patch feature tensor O involves padding with zeros, this padded information may introduce unnecessary noise when captured by the Bidirectional Long Short-Term Memory Network (Bi-LSTM). To address this, we aim to effectively ignore this portion of padded information before entering the Long Short-Term Memory Network.

First, we calculate the effective length Lengthi for each sequence, which is the number of valid key patches in the sequence i. The effective length Lengthi can be defined as

$${rm {Lengt{h}}}_{i}=mathop{sum }limits_{j=1}^{S}{M}_{i,j}$$
(7)

Then, we create a new tensor (Pin {{mathbb{R}}}^{Ttimes C}), where (T=mathop{sum }nolimits_{i = 1}^{N}{rm {Lengt{h}}}_{i}), that is, the sum of the effective lengths of all sequences, and C is the number of output channels. Following the sequence order, we copy the features of valid key patches from the key patch feature tensor O into the tensor P. After the above steps, the tensor P contains only valid key patch features.

After the tensor P enters the Bidirectional Long Short-Term Memory network (Bi-LSTM)36, the variability-pattern relationships among key patches are obtained.

SpiroExplainer: explaining model based on volume attention and heterogeneous feature fusion

Existing deep learning models still function as black boxes, capable only of producing diagnostic outcomes without offering an explainer for those results. In order to establish credibility with medical professionals and patients, the model must offer a decision-making process that is more open and easily understood by both parties. Therefore, through the dynamic volume attention integration method, we accurately highlight the time patches crucial for model predictions using a method for explaining models based on volume attention and heterogeneous feature fusion.

Data enters the first linear transformation layer as it passes through the volume attention layer. This layer provides an initial feature representation for each time step, which is specifically manifested as

$${X}^{{prime} }={W}_{1}X+{b}_{1}$$
(8)

Here, W1 and b1 are the weight and bias of the linear layer, respectively.

To perform a nonlinear transformation, We follow up by applying the Swish activation function to the linearly transformed data ({X}^{{prime} }). The Swish activation function helps to improve the model’s ability to capture complex features. The transformed data after applying Swish activation function is given by

$${X}^{{primeprime} }={X}^{{prime} }cdot sigma ({X}^{{prime} })$$
(9)

where (sigma ({X}^{{prime} })=frac{1}{1+{{rm {e}}}^{-{X}^{{prime} }}}) is the sigmoid function applied element-wise to ({X}^{{prime} }). Next, we apply a bilinear transformation to the activated data X by using a bilinear weight matrix. This is specifically represented as

$${X}^{{primeprime} {prime} }={X}^{{primeprime} }{W}_{{rm{bil}}}$$
(10)

where Wbil is the bilinear weight matrix.

After passing through the bilinear transformation, the data ({X}^{{primeprime} {prime} }) goes through another linear layer to calculate the attention scores S for each time step. Finally, the S scores are normalized using the softmax function to obtain the volume attention weights.

We combine the volume attention information with the smoothed Time-Volume curve data. The decision-making process within the deep neural network model can be transformed into visualized graphs. The graphs provide a clear representation of the specific data regions that the model prioritizes when making decisions, thereby improving the transparency of the model and bolstering the credibility of its decision-making process.

Research indicates that there is a significant correlation between smoking, age, and other demographic information with COPD. Therefore, we aim to incorporate some demographic information to enhance the effectiveness of our model. For this purpose, after passing through the volume attention layer, we first conduct an initial COPD assessment through a fully connected layer, resulting in (hat{P}={{hat{p}}^{1},{hat{p}}^{2},ldots ,{hat{p}}^{j},ldots ,{hat{p}}^{N}}), where (hat{P}) represents a series of COPD assessment values for the samples, with ({hat{p}}^{j}in [0,1]) denoting the assessment value for the jth sample. In the subsequent steps, we use these evaluation values to generate the feature Fprobability, representing the probability of disease. Specifically, Fprobability is obtained by applying the Softmax function to (hat{P}), as follows:

$${F}_{{rm{probability}}}={text{Softmax}},({hat{P}})$$
(11)

Additionally, we extract features Fstruct from demographic and other structured data. Then, we combine these features with Fprobability using the following fusion operation:

$${F}_{{rm{fusion}}}={F}_{{rm{probability}}}oplus {F}_{{rm{struct}}}$$
(12)

where denotes the concatenation of features.

By utilizing the Gradient Boosting framework (Catboost) to process the fused features Ffusion, we predict the risk of COPD. The COPD risk assessment obtained in this manner outperforms the assessment results derived solely from unstructured varied-length Volume-Flow curve data.

SpiroPredictor: predicting the risk of COPD based on the variability-pattern of key patch concavity

In future risk prediction tasks, we used the degree of concavity in individual Volume-Flow graphs as one of the features for prediction. As shown in Fig. 9a, high-risk individuals and COPD patients exhibit more significant concavity, while low-risk individuals exhibit fewer concavity than high-risk individuals and COPD patients. The following provides a detailed explanation of how this feature of concavity is extracted.

Since the spirogram itself is not strictly monotonic, each phase consists of a set of concave and convex curve segments (such as the FEF50–FEF75 phase in Fig. 10a), making it difficult to simply judge the degree of collapse in the curve. Therefore, we have defined a concavity-based directed area metric to approximate the degree of collapse in such non-monotonic curves, in order to characterize the overall concave-convex nature exhibited by the curve.

Fig. 10: Volume-Flow curves illustrating phase-specific concavity and airway collapse.
figure 10

a A representative Volume-Flow curve divided into four phases: early (PEF–FEF25), mid-early (FEF25–FEF50), mid-late (FEF50–FEF75), and late (FEF75+), highlighting concave and convex segments relative to the baseline. b Volume-Flow curve of a healthy individual, showing airway collapse occurring in the late phase (FEF75+). c Volume-Flow curve of a COPD patient, showing early-stage airway collapse in the early phase (PEF–FEF25).

Full size image

First, given the Volume-Flow curve ({mathscr{F}}(v)) for a particular phase s, we define its baseline BL(s, v) as the straight line connecting the starting and ending points of the curve in that phase, where the starting volume of the phase curve is B(s) and the ending volume is G(s). The slope of the baseline BL(s, v), denoted as m(s), can be defined as:

$$m(s)=frac{{mathscr{F}}left.right((G(s))-{mathscr{F}}left.right((B(s))}{G(s)-B(s)}$$
(13)

The intercept b(s) of the baseline BL(s) can be defined as

$$b(s)={mathscr{F}}(B(s))-m(s)times B(s)$$
(14)

Thus, the baseline BL for a given phase s can be expressed as: where v is a specific volume value on the Volume-Flow curve. Next, we define a concavity measure ({mathscr{C}}(s)) based on directed area, which is the directed area of phase s:

$${mathscr{C}}(s)=mathop{sum }limits_{v=B(s)}^{G(s)}({rm {BL}}(s,v)-{mathscr{F}}(v))$$
(15)

Therefore, if the curve of phase s is convex, meaning that the Volume-Flow curve of this phase lies above its baseline (as in the PEF–FEF25 phase in Fig. 10a), the directed area is negative, i.e., the concavity measure is negative. Conversely, if the curve of phase s is concave, meaning that the Volume-Flow curve of this phase lies below its baseline (as in the FEF25–FEF50 phase in Fig. 10a), the directed area is positive, i.e., the concavity measure is positive. According to the above definition, the more concave a curve is, the larger the concavity measure, and conversely, the more convex a curve is, the smaller the concavity measure. The concavity measure of a given phase represents the concavity information for that phase.

Furthermore, for Volume-Flow curves in some phases that are neither strictly monotonic increasing nor monotonic decreasing, the total concavity measure of phase s depends on the sum of the concave area (positive value) and the convex area (negative value). If the concave area in phase s is larger, the concavity measure is greater (positive value), whereas if the convex area is larger, the concavity measure is smaller (negative value). For example, in the FEF50–FEF75 phase in Fig. 10a, the concave area is much larger than the convex area. Thus, by calculation, the concavity measures ({mathscr{C}}({rm {FEF50}}-{rm {FEF75}})) based on the directed area are positive, indicating that this phase exhibits a significant concave trend rather than a convex one.

If we divide the spirogram into four phases: early (PEF–FEF25), mid-early (FEF25–FEF50), mid-late (FEF50–FEF75), and late (FEF75+), we find that in healthy individuals, the spirogram typically remains full during the early phases, and collapse does not occur until the late phase (such as in the FEF75+ phase, as shown in Fig. 10b). In contrast, COPD patients often experience early-stage collapse (such as in the PEF–FEF25 phase, as shown in Fig. 10c). Therefore, we aim to use the proposed concavity measure based on directed area to study the relationship between the phase in which collapse occurs and future disease risk.

We fused the COPD Risk output from the SpiroExplainer module with the key concavity information of the Volume-Flow curve and input this combined data into a future disease risk predictor for assessing future risk. Our future disease risk predictor can be classifiers like Catboost37, Xgboost38, Random Forest39, etc. The model, having been obtained in this way, accurately predicts the probability of illness in the next 1–5 years, and beyond for high-risk patients who have not yet been diagnosed.

Dataset and preprocessing

The data used in our study was obtained from the UK Biobank. We specifically focused on information from 453,558 patients who underwent their initial pulmonary function test (The test outputs the Time-Volume time-series curve). It is worth noting that due to factors such as genetics, environment, or lifestyle, the spirogram characteristics may vary among different population groups. Therefore, in our data processing, we chose to conduct experiments only on the largest population group in the UK Biobank, which is of European descent. The DeepSpiro model relies on detailed and precise patterns in spirogram time series to detect and predict COPD. Therefore, incomplete or distorted data may lead to errors in the model’s predictions. To mitigate the issue of inaccurate predictions caused by poor data quality, we have implemented several data preprocessing measures. These measures ensure that only high-quality and reliable data is used for model training and validation, thereby enhancing the robustness and accuracy of the DeepSpiro model.

The original Time-Volume (expiratory volumes over a period of time) measurements are extracted from UK Biobank field 3066, which contains expiratory volumes recorded every 10 ms in ml. The Time-Volume measurements are used from each participant’s first visit. To ensure the Time-Volume data are valid, we consult UK Biobank field 3061; if the value in field 3061 is either 0 or 32, the expiration is considered valid. If multiple valid expirations are available, we choose the first one in sequence. To control the quality of the expirations, we review the UK Biobank fields 3062 (FVC), 3063 (FEV1), and 3064 (PEF); if any of these are in the top or bottom 0.5% of the observed values, any expiratory data will discard from that patient. The original expiratory volume measurements are converted from milliliters to liters, and the corresponding flow curves are calculated by approximating the first derivative with respect to time using finite differences. As shown in Fig. 9b, the Time-Flow and Time-Volume curves combine to generate a one-dimensional Volume-Flow curve time series.

We refer to the labeling methods in Justin Cosentino et al.’s research12. Specifically, the labels are generated by combining information from various sources in the UK Biobank. We use a binary COPD label that is determined through self-reporting, hospital admissions, and primary care reports to train DeepSpiro. Self-reported COPD status is derived from codes 1112, 1113, and 1472 in the UK Biobank field 20002. Hospital-reported COPD status comes from codes like J430 and others in the UK Biobank field 41270, and codes like 4920 and others in field 41271. The COPD status from primary care reports is required using TRUD to map the UK Biobank’s gp-clinical to read codes from UK Biobank field 42040, with mapped fields including codes like J430, among others. For specific extraction of field codes, please refer to Table 3. In addition, death information is used in subsequent analyses. Death information is recorded in UK Biobank field 40000. If any value is present in this field, we consider the patient to have died.

Table 3 The following table lists the required fields and corresponding codes during label extraction
Full size table

After completing the necessary data processing, the data were categorized into three groups: All (the complete dataset), Hospitalization (patients reporting hospitalization), and Death (patients who passed away). The number of COPD cases in each category is summarized in the Table 4.

Table 4 The number of COPD cases under different categories
Full size table

Currently, the dataset includes a total of 348,039 participants, divided into training and testing sets at an 8:2 ratio. Specifically, the training set contains 278,431 participants, while the testing set comprises 69,608 participants.

Evaluations

To evaluate the performance of this method, we used a validation set to calculate the model’s area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), and F1 scores.

AUROC: The area under the receiver operating characteristic curve (AUROC) is a widely used metric for evaluating the performance of classification models, especially in binary classification problems. It is calculated by varying the classification threshold to determine the true positive rate (({rm {TPR}}=frac{{rm {TP}}}{{rm {TP+FN}}})) and the false positive rate (({rm {FPR}}=frac{{rm {FP}}}{{rm {FP+TN}}})). The ROC curve shows the relationship between the true positive rate and the false positive rate. The value of AUROC is the area under the ROC curve, which can be approximated as ({rm {AUROC}}=mathop{int}nolimits_{!0}^{1}R(F),{rm {d}}F), where F represents the false positive rate and R(F) is the corresponding true positive rate. Numerical methods generally estimate the FPR values f1, f2, …, fn and the corresponding TPR values t1, t2, …, tn. The trapezoidal area approximation of the AUROC is: ({rm {AUROC}}approx mathop{sum }nolimits_{i = 1}^{n-1}frac{({f}_{i+1}-{f}_{i})times ({t}_{i}+{t}_{i+1})}{2}).

AUPRC: The area under the precision-recall curve (AUPRC) is a commonly used metric to evaluate models for classification problems. The PR curve is derived by varying the classification threshold to calculate precision (({rm {precision}}=frac{{rm {TP}}}{{rm {TP+FP}}})) and recall (({rm {recall}}={rm {TPR}}=frac{{rm {TP}}}{{rm {TP}}+{rm {FN}}})). It can be approximated as ∫P(r)dR(r), where P(r) represents precision, R(r) denotes recall, and r represents the decision threshold.

F1 score: The F1 score is a crucial metric for evaluating the accuracy of a model. It is the harmonic mean of precision and recall ((F1=2times frac{,text{precision}times text{recall}}{text{precision}+text{recall},})), providing a single metric that considers both the accuracy and completeness of the model.

These metrics collectively help us comprehensively assess the model’s performance across different aspects. In this experiment, the AUROC, AUPRC, and F1 scores are calculated using functions from the scikit-learn library and code from Justin Cosentino et al.12.

Compared methods

To assess the effectiveness of DeepSpiro, we compared it against several baseline methods, including:

  • FEV1/FVC ratio: The FEV1/FVC ratio is the GOLD standard in clinical practice for determining whether an individual has COPD. If an individual’s ratio is less than 70%, they are considered to have COPD. Therefore, this method also be used as our baseline.

  • ResNet18 (Nature Genetics): The ResNet18 convolutional neural network (CNN) is capable of efficiently learning features from large-scale datasets. In a paper published in Nature Genetics, Justin Cosentino et al. utilized this model for the task of COPD detection12. Therefore, we also adopt this model as the primary baseline.

Related Articles

A 7-point evidence-based care discharge protocol for patients hospitalized for exacerbation of COPD: consensus strategy and expert recommendation

Acute exacerbations of COPD (ECOPD) are an important event in the life of a COPD patient as it causes significant deterioration of physical, mental, and social health, hastens disease progression, increases the risk of dying and causes a huge economic loss. Preventing ECOPD is therefore one of the most important goals in the management of COPD. Before the patient is discharged after hospitalization for ECOPD, it is crucial to offer an evidence-based care bundle protocol that will help minimize the future risk of readmissions and death. To develop the content of this quality care bundle, an Expert Working Group was formed, which performed a systematic review of literature, brainstormed, and debated on key clinical issues before arriving at a consensus strategy that could help physicians achieve this goal. A 7-point consensus strategy was prepared, which included: (1) enhancing awareness and seriousness of ECOPD, (2) identifying patients at risk for future exacerbations, (3) optimizing pharmacologic treatment of COPD, (4) identifying and treating comorbidities, (5) preventing bacterial and viral infections, (6) pulmonary rehabilitation, and (7) palliative care. Physicians may find this 7-point care bundle useful to minimize the risk of future exacerbations and reduce morbidity and mortality.

Probabilistic machine learning for battery health diagnostics and prognostics—review and perspectives

Diagnosing lithium-ion battery health and predicting future degradation is essential for driving design improvements in the laboratory and ensuring safe and reliable operation over a product’s expected lifetime. However, accurate battery health diagnostics and prognostics is challenging due to the unavoidable influence of cell-to-cell manufacturing variability and time-varying operating circumstances experienced in the field. Machine learning approaches informed by simulation, experiment, and field data show enormous promise to predict the evolution of battery health with use; however, until recently, the research community has focused on deterministic modeling methods, largely ignoring the cell-to-cell performance and aging variability inherent to all batteries. To truly make informed decisions regarding battery design in the lab or control strategies for the field, it is critical to characterize the uncertainty in a model’s predictions. After providing an overview of lithium-ion battery degradation, this paper reviews the current state-of-the-art probabilistic machine learning models for health diagnostics and prognostics. Details of the various methods, their advantages, and limitations are discussed in detail with a primary focus on probabilistic machine learning and uncertainty quantification. Last, future trends and opportunities for research and development are discussed.

Iron homeostasis and ferroptosis in muscle diseases and disorders: mechanisms and therapeutic prospects

The muscular system plays a critical role in the human body by governing skeletal movement, cardiovascular function, and the activities of digestive organs. Additionally, muscle tissues serve an endocrine function by secreting myogenic cytokines, thereby regulating metabolism throughout the entire body. Maintaining muscle function requires iron homeostasis. Recent studies suggest that disruptions in iron metabolism and ferroptosis, a form of iron-dependent cell death, are essential contributors to the progression of a wide range of muscle diseases and disorders, including sarcopenia, cardiomyopathy, and amyotrophic lateral sclerosis. Thus, a comprehensive overview of the mechanisms regulating iron metabolism and ferroptosis in these conditions is crucial for identifying potential therapeutic targets and developing new strategies for disease treatment and/or prevention. This review aims to summarize recent advances in understanding the molecular mechanisms underlying ferroptosis in the context of muscle injury, as well as associated muscle diseases and disorders. Moreover, we discuss potential targets within the ferroptosis pathway and possible strategies for managing muscle disorders. Finally, we shed new light on current limitations and future prospects for therapeutic interventions targeting ferroptosis.

Type 2 immunity in allergic diseases

Significant advancements have been made in understanding the cellular and molecular mechanisms of type 2 immunity in allergic diseases such as asthma, allergic rhinitis, chronic rhinosinusitis, eosinophilic esophagitis (EoE), food and drug allergies, and atopic dermatitis (AD). Type 2 immunity has evolved to protect against parasitic diseases and toxins, plays a role in the expulsion of parasites and larvae from inner tissues to the lumen and outside the body, maintains microbe-rich skin and mucosal epithelial barriers and counterbalances the type 1 immune response and its destructive effects. During the development of a type 2 immune response, an innate immune response initiates starting from epithelial cells and innate lymphoid cells (ILCs), including dendritic cells and macrophages, and translates to adaptive T and B-cell immunity, particularly IgE antibody production. Eosinophils, mast cells and basophils have effects on effector functions. Cytokines from ILC2s and CD4+ helper type 2 (Th2) cells, CD8 + T cells, and NK-T cells, along with myeloid cells, including IL-4, IL-5, IL-9, and IL-13, initiate and sustain allergic inflammation via T cell cells, eosinophils, and ILC2s; promote IgE class switching; and open the epithelial barrier. Epithelial cell activation, alarmin release and barrier dysfunction are key in the development of not only allergic diseases but also many other systemic diseases. Recent biologics targeting the pathways and effector functions of IL4/IL13, IL-5, and IgE have shown promising results for almost all ages, although some patients with severe allergic diseases do not respond to these therapies, highlighting the unmet need for a more detailed and personalized approach.

Effect of Baduanjin exercise on health and functional status in patients with chronic obstructive pulmonary disease: a community-based, cluster-randomized controlled trial

Baduanjin has been shown to be an effective method of exercise for promoting body function in patients with chronic obstructive pulmonary disease (COPD). However, this method is rarely used by rural doctors in rural communities. The aim of this study was to compare the therapeutic effects of Baduanjin exercise with those of conventional pulmonary rehabilitation (CPR) implemented by rural doctors in patients with COPD. A total of 472 COPD patients from 18 village clinics were randomly allocated to the Baduanjin group (n = 158), CPR group (n = 154), or control group (n = 160). The Baduanjin group received six months of Baduanjin training, the CPR group received six months of rehabilitation training, and the control group received usual care. The primary outcomes were changes in health status, as measured by the COPD Assessment Test (CAT), and dyspnoea, as assessed via the modified Medical Research Council (mMRC) dyspnoea scale. The secondary outcomes included six-minute walking distance (6MWD) performance and anxiety and depression scores on the Hospital Anxiety and Depression Scale (HADS-A and HADS-D, respectively). Baseline data were collected before randomization, and outcomes were ascertained at the six-month follow-up by blinded assessors. After six months of training, compared with the control group, the Baduanjin group presented significant improvements in CAT scores (t = 5.121, P<0.001), mMRC scale scores (t = 5.719, P < 0.001), 6MWD performance (t = 12.608, P < 0.001), HADS-A scores (t = 6.210, P< 0.001), and HADS-D scores (t = 2.945, P = 0.010). The CPR group also presented significant improvements in CAT scores (t = 4.725, P < 0.001), mMRC scale scores (t = 5.499, P < 0.001), 6MWD performance (t = 11.662, P < 0.001), HADS-A scores (t = 5.697, P < 0.001), and HADS-D scores (t = 2.671, P = 0.023). In contrast, the control group participants showed no significant changes in CAT scores (t = –0.252, P = 0.801), mMRC scale scores (t = –0.085, P = 0.932), 6MWD performance (t = –0.463, P = 0.644), HADS-A scores (t = –2.75, P = 0.783), or HADS-D scores (t = –0.281, P = 0.779). No adverse events were reported during the study. The positive effects of Baduanjin exercise on the subjective symptoms of COPD patients are similar to those of CPR. The trial was registered with the Chinese Clinical Trials Registry (reference: ChiCTR-TRC-12001958) (22/02/2012).

Responses

Your email address will not be published. Required fields are marked *