Abstract
The area under the expiratory flow-volume (AEX-FV) loop has been evaluated before as a spirometric tool for assessing respiratory functional impairment. We computed the AEX-FV curves in spirometry tests performed on 20,313 participants in the National Health and Nutrition Examination Survey (NHANES) study.
We analyzed 108,939 spirometry tests performed between 2007 and 2012 (5964 children; 14,349 adults). In these tests, we computed the three areas from existing NHANES raw data on instantaneous expiratory flows measured at 0.01 s intervals.
Mean best-trial measurements for AEX-FV were 3.4 in boys, 2.8 in girls, 11.8 in men and 7.7 L2/s in women. We characterized indices of central tendency and dispersion of the measurements (eg, means and fifth percentiles—lower limits of normal) by age group (children vs adults), gender, race or ethnicity group and effort grading. Simple regression equations using logarithmic transformations of the above areas and using age, gender and height as inputs provided good predictive ability for the variable AEX-FV.
Regular, digital spirometry could and should make available to clinicians and researchers the area under the curves for flow versus volume graph, providing additional tools in our armamentarium to evaluate ventilatory impairments and patterns, and possibly respiratory disability.
Significance of this study
What is already known about this subject?
Traditional spirometric measurements provide clinically meaningful information: defining normal versus abnormal values, severity stratification of functional defects and the pattern of impairment (eg, obstruction, restriction, mixed defects, small airway disease).
In an era of digital signal processing and significant computational capabilities, alternative spirometric measurements, such as the area under the expiratory flow-volume curve, became available.
Several studies found that area under the expiratory flow-volume loop may become more useful in respiratory functional assessment. So far, no population-based normative data (ie, predicted normal values or lower limits of normal (LLN)) for this measurement have been published.
What are the new findings?
We computed on a large cohort of US population (National Health and Nutrition Examination Survey (NHANES) study) the areas under flow-volume curves. In addition, we characterized their means and LLNs (fifth percentiles), in both adults and children, and we developed simple regression models that predict these functional parameters by age, gender and height.
How might these results change the focus of research or clinical practice?
Current study provides normative data for these alternative spirometric measurements. While the value of area under the expiratory flow-volume curve for diagnosis and severity stratification is currently better understood, further characterization of the other two measurements is warranted in both normal individuals and various disease states.
Introduction
Normative values or lower limits of normal (LLN) for respiratory function are generally dependent on the subjects’ demographic and anthropometric characteristics such as race, gender, age, height and weight.1–5 In practice, for every measured lung volume or flow, calculated volume or capacity, values below the fifth percentiles (or z scores <−1.645) of gender and race-referenced healthy individuals define the LLN. The most important spirometric parameters that are validated and widely used in respiratory physiology, clinical practice and trial assessments are derived from the expiratory phase of testing: (forced) vital capacity (FVC or VC), forced expiratory volume in 1 s (FEV1), forced expiratory volume in 6 s (FEV6), instantaneous isovolumic flows at 25%, 50% or 75% of FVC, or FEF25, FEF50 and FEF75, respectively, and occasionally, forced expiratory flow between 25% and 75% of FVC or FEF25–75. Some of these became important functional parameters more than a century and a half ago (eg, Hutchinson’s VC6), while others have been developed and used more extensively in the 20th century. Their importance in pulmonary function testing (PFT) has been quite significant, yet it may have reached a phase of diminishing returns in providing useful and actionable information on respiratory physiology impairments. Nevertheless, several additional measurements are still available from simple spirometry testing, and their usefulness has not been fully evaluated to date.
We previously evaluated an alternative spirometric measurement, area under the expiratory flow-volume (AEX-FV) loop and its approximations computed from FEF25, FEF50 and FEF75,7–13 as global tools for diagnosis of obstructive, restrictive or mixed ventilatory impairments, for identification of small airway disorders or bronchodilator responsiveness and for severity stratification of respiratory functional impairments. The area approximations10 were found to be good surrogates of the actual area, fact especially relevant when the PFT software did not provide the actual results of the integral function of flow by volume.
In this paper, we expand on prior assessments by computing and determining the LLNs of the area delineated by forced expiratory flow-volume loop (AEX or AEX-FV, figure 1), using an aggregate, large cohort of individuals from the National Health and Nutrition Examination Survey (NHANES), who underwent spirometry between 2007 and 2012 and for whom raw curve data were available.
Methodological approach for computing instantaneous exhaled volumes (Vk) from the matrix of instantaneous flows (φk) collected at δt=10−2 s. The computations for area under the expiratory flow-volume (AEX-FV) curve are shown. PEF, peak expiratory flow.
Methods
The NHANES is a large population-based study, designed to collect information on health and nutrition of the US population, with two distinct parts: a home interview and a health evaluation in a mobile examination center. The NHANES participants were selected using a random sampling method from the US household population. In this study, we performed computations and analyses on several publicly available NHANES datasets: 2007–2008, 2009–2010 and 2011–2012, which are made accessible to researchers by the Centers for Disease Control and Prevention (CDC, Atlanta, Georgia, USA).14 Demographic, anthropometric measurements, pre-bronchodilator spirometry/post-bronchodilator spirometry and especially raw curves data from the NHANES sets were extracted and matched using their unique identifiers.
We computed AEX-FV as the sum of all rectangles represented by the instantaneous flows and the (unequal) intervals of volume, the latter being derived from the formula: delta volume=instantaneous flow multiplied by the time interval.
The data for the raw spirometry curves in the variable SPXRAW were recorded at ATPS (ambient temperature and pressure saturated). For the purpose of these analyses, they were converted to BTPS (body temperature ambient pressure saturated) by using the correction factor provided, that is, multiplying by the variable SPAFACT from the NHANES datasets15 (online supplemental material).
Supplementary data
Descriptive statistical analysis of the available variables was performed. Categorical variables were summarized as frequencies or percentages. Continuous variables were characterized by mean, median and 25th–75th IQR (expressed as the difference 75th minus 25th percentile values), as most distributions were non-Gaussian. The Anderson-Darling test16 was used for goodness of fit of continuous variables. Since usual transformations did not achieve fitting to normality, we used for comparisons mostly non-parametric methods with native or log-transformed variables (eg, Welch’s analysis of variance (ANOVA), Mann-Whitney or Wilcoxon rank score test or Kruskal-Wallis test, as appropriate). The Levene test was used as the default test to compare for unequal variances (an F test from an ANOVA where the response is the absolute value of the difference of each observation and the group mean17).
Statistical analyses were performed using JMP Pro16 software (SAS Institute). Institutional research oversight approvals were obtained to conduct the study.
Results
A total of 108,939 PFT sets from 20,313 participants (5964 children, ie, of age <18 years) were analyzed: 34,857, 36,421 and 37,661 tests from NHANES 2007–2008, 2009–2010 and 2011–2012 cohorts, respectively (figure 2). Mean age was 11 (range: 6–17, median 11, IQR 6) years in children (49% girls) and 44 (range: 18–79, median 45, IQR 30) years in adults (49% women).
Flow diagram of the study (N: number of participants, n: number of spirometry tests). ATS, American Thoracic Society; FET, forced expiratory time; NHANES, National Health and Nutrition Examination Survey.
The main anthropometric characteristics and PFT parameters of the combined cohorts are shown in table 1. Among children, race or ethnicity groups were represented as follows: 22%, 29%, 12%, 31% and 6% as non-Hispanic whites, non-Hispanic blacks, non-Hispanic Asians, Hispanics (both Mexican Americans and other Hispanics) and other or multiracial, respectively. Among adults, 36%, 27%, 14%, 21% and 3% were non-Hispanic whites, non-Hispanic blacks, non-Hispanic Asians, Hispanics (both Mexican Hispanics or other Hispanics) and other or multiracial, respectively.
Subjects’ characteristics of the combined NHANES 2009–2012 (14,349 adults; 5964 children) and pulmonary function testing characteristics
Approximately 37% of the adult participants interviewed about smoking habits reported lifetime smoking of >100 cigarettes cumulatively up to the survey date (18% missing response rate); they reported starting to smoke cigarettes at a median age of 17 (25th–75th percentiles: 15–20) years. Smoking history, quantified based on the report of number of cigarettes smoked per day at the time when they quit, was as follows: mean 20, median 10, 25th–75th percentiles: 2–29 pack-years.
Mean best-trial AEX-FV values were 3.4 in boys and 2.8 in girls (L2/s, p<0.0001); 11.8 in men and 7.7 in women (L2/s, p<0.0001). Figure 3 shows, in bar graph format, medians and LLNs for AEX-FV by gender and age groups, respectively. Figure 4 illustrates, for better illustration of distribution and dispersion, the same area by gender and age groups as box-and-whisker plots (with means) and side histograms. AEX-FV linear fit versus FEV1, FVC and FEF25%–75% was characterized by R2 of 0.07, 0.02 and 0.10 in children and 0.01, 0.04 and 0.12 in adults, respectively. Even when analyzed by gender, age group and effort category, R2 remained below 0.33, which explains the added value of the new physiologic measurement.
Median values and lower limit of normal ((LLN) fifth percentile) for best area under the expiratory flow-volume (AEX-FV) curves by age group (<18 or >18 years) and gender. BTPS, body temperature ambient pressure saturated.
Box-and-whisker plots (means shown) and side histograms of best area under the expiratory flow-volume (AEX-FV) curves by gender and age group (<18 years in upper panel and >18 years of age in lower panel). BTPS, body temperature ambient pressure saturated.
For the AEX-FV variable, we found significant differences by gender and race or ethnicity group (table 2). The AEX variables were larger in black girls and boys, while in adults, they were largest in white men and women (bold values in table 2). Adults and children of Asian ancestry had among the lowest AEX values. Perhaps in keeping with the fact that Hispanics and whites shared the same predictive equations in the Global Lung Initiative (GLI) models for normal spirometric values,18 in our analyses the observed values for FEV1, FVC and FEV1/FVC were, indeed, relatively similar. Nevertheless, we found the normative data for these measurements to be both statistically and clinically different between different groups, as defined by race or ethnicity group in the NHANES study. Table 3 includes the AEX ranges, means, medians, IQRs and several quantiles, including the fifth percentiles (LLN, represented as asterisk) in children and adults, by gender, race or ethnicity group and American Thoracic Society (ATS) acceptability grades.
Mean best value for AEX-FV by gender and race or ethnicity group
Best AEX-FV (BTPS, L2/s)
Only 3/17,303 of the acceptable tests (ie, ATS grades A, B or C, plateau present and forced expiratory time (FET) >6 s) were done in children in the sitting position. Similarly, only 118/54,173 of the acceptable tests were performed on adults in the sitting position; the rest were done while standing. Mean AEX-FV were significantly higher in the sitting versus standing position: 10.7 vs 8.4 (L2/s, p<0.0001), respectively. The statistically significant differences (p<0.0001) remained in multivariate analyses, using native or logarithmic transformations of the three Y variables and after adjusting for FET, testing position and their interaction (in all models, R2 was 0.75 in both the derivation and the validation sets). Overall in these models, testing position contributed slightly less (variable importance main effect 0.46 for AEX-FV) than the main determinant of the Y variables, that is, FET (main effect 0.54 for AEX-FV by dependent resampled method using k nearest neighbors’ technique).
In adults and children with spirometry characterized by good effort, reproducible and acceptable curves by visual inspection (NHANES 2009–2012 selected group—figure 2), the calculated AEX-FV measurements were predicted by models that included the following parameters: age, gender and height, as weight and race or ethnicity group were trimmed off due to their minor, yet significant contribution (figure 5). More complex models based on generalized regression or neural network approaches did not provide considerable improvements to warrant their inclusion here (data not shown). Also, cigarette smoking status or intensity, as established from adult participants’ interviews, as well the serum concentrations of cotinine (a metabolite of nicotine) or urinary concentrations of NNAL (4-(methylnitrosamino)−1-(3-pyridyl)−1-butanonol, a metabolite of a tobacco-specific nitrosamine) did not influence the AEX-FV measurements (data not shown).
Model for the variable LogAEX-FV (natural logarithm of the area under the expiratory flow-volume curve) based on age (years), height (m) and gender (the coefficient is positive for males and negative for females). The main effects listed represent the variable contribution, as assessed by the dependent resampled inputs method (using k nearest neighbors’ technique). BTPS, body temperature ambient pressure saturated; R2, per cent of variance explained by the model; RASE, square root of the mean squared prediction error.
Among all valid tests (ATS grades A, B or C, plateau present and FET >6 s), 18,897 subjects were tested only with ‘baseline’ spirometry (n=66,307), while 1511 individuals underwent both pre-bronchodilator and post-bronchodilator testing (due to initial FEV1/FVC ratios <LLN, n=5290). After bronchodilator administration, best AEX-FV increased from 3.0 to 4.6 in children and from 8.7 to 17.0 L2/s in adults.
Discussion
In this investigation, we computed in a large cohort representative of the US population, the alternative spirometric measurement called AEX-FV curves. Additionally, we characterized indices of central tendency and we identified LLN (as fifth percentiles of the distributions) for various subgroups. Further, we developed a simple regression model for these areas, based on subjects’ age, height and gender, which could be used as predictive equations for this population. These new measurements, easily made available by any modern, digital spirometry software, could become new tools in our armamentarium to characterize global respiratory function impairments.
Several years ago, a few PFT vendors made available to clinicians and researchers, together with other spirometric measurements, the AEX for the flow-volume loop, which is, in fact, the integral function of expiratory flow by the variable volume. Either unnoticed, not well understood, or simply unexplored, it took a few years for the flow-volume loop to be studied in more detail and then used to assess various conditions.7–13 However, the idea is by no means completely new, as other investigators have entertained the idea of exploring in other ways flow-volume loop concavity, defects beyond the classic obstructive–restrictive–mixed-small airway disease categorization, more global assessment tools, etc.19–22
A few comments are warranted here. First, AEX-FV variables were the largest in black children of both genders, while in adults, they were the largest in whites (table 2). While still unclear why, it is our hypothesis that this may be due to different rates of truncal growth in various racial or ethnic groups (which directly influences their lung function). Second, both children and adults of Asian heritage had among the lowest AEX-FV values; however, this group was under-represented in the NHANES cohorts. Third, the differences between whites and Hispanics were both statistically and clinically relevant, while the FEV1, FVC and FEV1/FVC were very similar, perhaps in line with the fact that they share same GLI predictive equations. This points toward the fact that these new spirometric measurements may be able to better separate normal lung function in non-Hispanic whites versus Hispanics of either Mexican or non-Mexican origin.
Several authors have also derived and published the past linear regression-based predictive equations for normal AEX-FV, based on subjects’ age, gender and/or height.20 23 We also found in a PFT laboratory/clinic cohort, which combined normal participants from the Madrid area of Spain and individuals with normal spirometry from Cleveland, Ohio, USA, that a neural network approach (machine learning or artificial intelligence methods) may be superior to other modeling techniques in defining predicted values and LLNs.12 In the current study, performed using NHANES cohorts (which are more closely representative of the general US population), we aimed to accomplish acceptable model performance by using simple regression models. The overall R2 of our current regression models was approximately 0.52, suggesting that the models can be further refined or that factors other than those considered may be of importance. If a testing parameter, that is, the duration of the exhalation (FET), was added to the models, the R2 increased to ~0.79. While the NHANES cohorts did not exclude those with a history of past or present cigarette smoking, neither smoking status nor smoking intensity significantly influenced the normative data, nor the LLNs in the 2007–2012 surveys. In a previous analysis on acceptable flow-volume curves with the integral function flow by volume variable using best tests for comparison (data not published), we have found that the intraindividual coefficient of variation for best AEX-FV was comparable with that of FVC or FEV1, that is, <10% both before and after bronchodilator administration. Here, the mean interindividual coefficient of variation of the best AEX-FV was 2% in children and 9% in adults, respectively, also comparable with the interindividual coefficients of variation for FEV1 and FVC. These analyses confirm that the reproducibility of these tests is excellent and could be used confidently for functional assessments.
The present study has several strengths. First, this is a computation of these global measurements of respiratory physiology derived from simple spirometry signals. Second, the present derivation of LLN values was done on a very large cohort, representative of the US household population. Third, we derived predictive models for the AEX-FV by using the simplest regression models (ie, not resorting to optimized regression techniques or artificial neural networks), which makes it widely accessible. Fourth, our study found significant differences between various demographic groups, thereby offering to become a useful tool in defining normal respiratory function in various populations. Lastly, the evaluation of the effects of bronchodilator administration on these measurements, while limited to ‘only’ 1565 individuals (6531 tests), warrants further exploration, as it may become a useful global tool to assess bronchodilator responsiveness.
Several limitations of this investigation also warrant comment. First, to assure the generalizability of our observations, the computed AEX-FV measurements require further investigation in patients with different pathologies or disease pheno/endotypes, that is, in chronic obstructive pulmonary disease, asthma, small airway disease, restrictive disorders. Second, the 2007–2012 NHANES cohorts included relatively few individuals of Asian origin, hence their normative values may lack the precision of the other ethnic or racial groups. This limitation could be overcome in the future by extending the geographic coverage and the diversity of the pooled tests. Third, the NHANES study did not exclude those with a history of past or present cigarette smoking (an older criticism of the NHANES cohorts being representative of the ‘normal’ US population). While in our analyses on the 2007–2012 surveys, smoking status and its intensity did not significantly influence neither the normative data nor the LLNs, the effects of recall bias and/or secondary smoking exposure could not be completely eliminated. Fourth, standard spirometry is done today in the sitting position, which was not the default position in the NHANES examinations. Once made available on standard commercial PFT platforms, these assessments need to be reproduced in future cohorts and testing conditions. Fifth, the AEX measurements may not be easily available on most spirometry testing equipment at this time. However, digital PFT platforms could very easily resolve the calculation of the AEX for flow-volume curves by computing the integral function of the Y variables versus the X inputs and include them in the standard reporting systems. Lastly, the utility of different AEX measurements needs to be further explored over a broad spectrum of respiratory impairments and disease severities.
Conclusion
While traditional spirometry measurements inform clinical management in important ways, it is conceivable that newer measurements may add materially to the diagnostic value of spirometry. Specific questions include: how much actionable information can these new parameters provide about respiratory physiology impairments or levels of disability, and what is the separation between normal and abnormal by using ‘hard’ LLN? In an era of advanced digital signal processing, now is the time to explore new technologies and other innovative ways of assessing respiratory function. Fortunately, several new metrics are still available in spirometry, and their contribution to clinical assessment has yet to be fully evaluated. To address this gap, the current analysis computed actual and predicted values of AEX-FV curves, prompting consideration of how these new parameters can inform clinically meaningful distinctions in patients.
Data availability statement
Data are available in a public, open access repository. Availability of data and materials: yesRef 13:Centers for Disease Control and Prevention (CDC). National Center for Health Statistics (NCHS). National Health and Nutrition Examination Survey Data. https://wwwn.cdc.gov/nchs/nhanes/. U.S. Department of Health and Human Services, Centers for Disease Control and Prevention; 2020. Accessed November 2020.
Ethics statements
Patient consent for publication
Ethics approval
Emory IRB# 00049576/Atlanta VA R&D Ioachimescu-002.
Acknowledgments
These analyses were performed in accordance with the relevant rules, guidelines and regulations (and regulatory approvals obtained from institutional review boards).
Footnotes
Contributors OCI contributed to concept, data collection and analysis, manuscript writing; KMC and JKS contributed to manuscript writing; OCI is responsible for the overall content as guarantor.
Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Disclaimer OCI is employed by the US federal government; no specific copyright statement is required.
Competing interests OCI is a Journal of Investigative Medicine Editorial Board member. No other competing interests declared.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, an indication of whether changes were made, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.