Computer-aided analysis in evaluation and grading of interstitial lung diseases in correlation with CT-based visual scoring and pulmonary function tests
Egyptian Journal of Radiology and Nuclear Medicine volume 51, Article number: 80 (2020)
Interstitial lung diseases (ILDs) represent a large group of more than 200 different entities. High resolution computed tomography (HRCT) is accepted as the gold standard imaging modality in the diagnosis of ILD. The visual-based scoring offers an advantage in finding a specific type of ILD. Computer-aided CT attenuation histogram is another way of characterizing and quantifying diffuse lung disease. The histogram analysis (HIST) consists of calculating skewness, kurtosis, and mean lung density to quantify lung disease and monitor progression. The aim of our study was to investigate the value of computer-aided analysis of HRCT for interstitial lung diseases in correlation with scoring and pulmonary function tests.
This prospective study included 50 patients with suspected ILD. The mean age of patients was 46.7 years ± 12.5. Mean forced expiratory volume FEV1 was 63.6 ± 20.9. HRCT examination was done for all patients followed by CT-based visual scaling. Most of the studied patients (43.3%) had a CT visual semi-quantitative scoring ranged between 40 and 64. CT-based lung density histograms (LDH) were obtained for all patients using the 3D Slicer Software (Chest Imaging Platform). There was a significant difference between patient’s groups of different (mild, moderate, and severe) grades of ILD according to FEV1 regarding MLD, skewness, and kurtosis of corresponding CT-based density histograms (p values < 0.001). More significant and higher correlation was observed between computerized aided CT quantified mean lung densities (MLD) and (FEV1) (p value < 0.001 and r = − 0.570). The ROC curve analysis demonstrated good performance for CT visual scoring with PFT (AUC = 0.71); a cutoff scoring 15 or higher was associated with best sensitivity (75%) and specificity (100%). Meanwhile, ROC curve analysis for MLD and FEV1 demonstrated an excellent performance for computer-based CT quantification (AUC = 0.85) with a value of − 769 HU which increased sensitivity to 65% and specificity to 100%.
Visual-based scoring techniques offer an advantage in finding a specific type of ILD. Computer-based quantification system could be a means for accurately monitoring the disease progression or response to therapy.
Interstitial lung diseases (ILDs) represent a very large group of more than 200 different entities, many of which are rare or “orphan” diseases. Much remains unknown or debatable for many of these ILDs, notable issues of prevalence, incidence, and mortality rates .
This group of diseases is associated with substantial morbidity and mortality. Thus, a multi-disciplinary approach including clinical, pathological, and radiological correlation is required to reach accurate early diagnosis . High resolution computed tomography (HRCT) of the chest became accepted as the gold standard imaging modality in the diagnosis of ILD . The visual-based scoring offers an advantage in finding a specific type of ILD or in distinguishing other causes of increased lung density, such as infection or neoplasm .
The computer-aided CT attenuation histogram is another way of characterizing and quantifying diffuse lung disease. The attenuation of a voxel is determined by the relative contribution of air and blood within that voxel. The relative frequency of voxels with particular attenuation values can be calculated and expressed as a histogram . Normal lung tissue deviates from the Gaussian distribution; it is markedly skewed to the left and peaks at approximately − 900 HU. The histogram analysis (HIST) consists of calculating skewness, kurtosis, and mean lung density to quantify lung disease and monitor progression . The aim of our study was to investigate the value of computer-aided analysis of HRCT for interstitial lung diseases in correlation with semi-quantitative visual scoring and pulmonary function tests.
After approval from the institutional ethical committee and informing the individual patients, 50 patients were incorporated in this prospective clinical study. The patients were subjected to: 1-History taking including the type of work, bird, and animal breeding, drug history; 2-Pulmonary function tests; 3-HRCT of the chest without contrast.
Pulmonary function tests
Pulmonary function tests were performed using a 2130 spirometer (v max, Sensormedia, USA) which was calibrated daily. Results were obtained for forced vital capacity (FVC), forced expiratory volume in the first second (FEV1), and FEV1/FVC ratio. Restrictive ventilatory defect was defined on spirometric findings of FEV1/FVC ratio < 70% and FVC < 80% predicted .
The patients were classified functionally based on FEV1 which represents proportion of patients’ vital capacity that they are able to expire in the first second of forced expiration to the full forced vital capacity. FEV1 categorization: ≤ 70 → mild, 69–50 → moderate, and ≤ 49 → severe.
HRCT exams were performed at the Radiology Department, and the scanning protocol included only unenhanced scans to all the 50 patients with suspected ILD based on history, clinical examination, and pulmonary function tests. The examination was performed using bright speed MDCT 16 slices scanner (General Electric Medical Systems, Milwaukee, WI). The patient was trained on how to hold breath and how to listen and follow the instructions from the recorded voice in the machine. Patients who were unable to hold their breath were instructed to breath as shallow as possible during the acquisition. The images were reviewed for the following: ground glass opacities, fibrotic changes, reticulations, bronchiectatic changes, honey-combing, sub-pleural cysts.
CT-based visual scoring
CT-based semi-quantitative scoring was calculated according to number of lung segments affected on both sides. The finding on each segment was given a score from 1 to 4 as follows: score of (1) for ground glass opacities, score of (2) for reticulations and fibrotic changes, score of (3) for bronchiectatic changes, and score of (4) for honeycombing and sub-pleural cysts. If one segment has two findings or more, we consider the score of the higher finding . The extent of disease was obtained by counting the number of broncho-pulmonary segments involved for each abnormality. In each patient, “severity of disease” score was then calculated as total score with a maximum total score of 80 for whole lungs.
HRCT images were analyzed using the open-source 3D Slicer software (Version 4.8.1) for creating automated computer-based lung density histograms (LDH). It is a multi-platform open source software package for visualization, analysis, and post-processing of medical images. It is built through support from National Institutes of Health. The parenchymal analysis module is a part of chest imaging platform of 3D Slicer. The parenchyma analysis module performs densitometry in chest CT scans by isolating the lung region and computing different phenotypes based on the histogram of the density measurements. First, we select an input CT image. Then, we select Lung Label Map for a selected input CT. Then, filtering option is turned “on” to activate filter. Select whether to apply filter in 2D or 3D. The filtering strength was selected as smooth, medium, or heavy. The slow method takes more time to finish but is more accurate. Lastly, we choose to apply to start the parenchyma analysis.
SPSS version 20 was used for statistical analysis. The quantitative variables tested for normality by Kolmogorov-Smirnov test. The descriptive data were expressed as mean, median, and standard deviation (SD). One-way ANOVA test was used for comparison between different groups. The correlation analyses were performed using Pearson’s and Spearman’s correlation tests. Statistical significance was defined as a P value < 0.05.
This is a prospective study that included 50 patients of suspected interstitial lung disease. There were 12 males (24%) and 38 females (76%). The mean age of patients was (46.7 years ± 12.5), ranged from 24–75 years. The most common clinical presentations were cough (30%), dyspnea (33.33%), or both (36.33%).
Regarding special habits of examined patients, 9/12 male patients (75%) were ex-smokers, and 18/38 female patients (47.4%) were bird breeders. On PFTs, mean of FEV1 was 63.6 ± 20.9, ranged from 13–98. Most cases (46% of patients) had FEV1 (50–70), i.e., moderate as shown in Table 1.
HRCT examination was done for all studied patients followed by visual scaling of CT abnormalities. CT-based lung density histograms (LDH) were then obtained for all patients using the 3D Slicer Software (Chest Imaging Platform).
The most common HRCT patterns of ILD were ground-glass opacification, reticulation, traction bronchiectasis, and honeycombing. Most of the studied patients (43.3%) had a CT visual scoring ranged between 40/80 and 64/80, i.e., 50% and 80%. The distribution of visual scoring among different patients of the study is shown in Table 2.
Mean CT-based visual scoring of studied patients was 34/80 ± 18, i.e., 42.5% ± 22.5%. Mean lung densities (MLD) were − 744.9 ± 37.1 HU and ranged from 622 to − 879 HU. The mean skewness was 1.12 ± 0.43, ranged from 0.50–2.06. Mean kurtosis was 5.97 ± 1.2, ranged from 8.45–3.74 as shown in Table 3.
There was a significant difference between patient’s groups of different (mild, moderate, and severe) grades of ILD according to FEV1 regarding MLD, skewness, and kurtosis of corresponding CT-based density histograms (p value < 0.001).
There was a significant fair negative correlation between CT-based visual scoring of studied patients and PFT represented by forced expiratory volume 1s FEV1 (p value = 0.04 and r = − 0.190). More significant and higher correlation was observed between computerized aided CT quantified mean lung densities (MLD) and (FEV1) (p value < 0.001 and r = − 0.570) (Table 4).
The ROC curve analysis demonstrated a good performance for CT visual scoring with PFT (p value < 0.01, AUC = 0.71), a cutoff scoring 15/80 (18.75%) or higher that was associated with best sensitivity (75%) and specificity (100%). Meanwhile, ROC curve analysis for MLD and FEV1 demonstrated an excellent performance for computer-based CT quantification (p value < 0.001, AUC = 0.85) with a cutoff value of − 769 HU which increased sensitivity to 65% and specificity to 100% (Fig. 1). Illustrative cases of studied patients having different degrees of severity of ILD are shown in Figs. 2, 3, and 4.
Assessment of ILD involves not only accurate and early diagnosis but also evaluation of disease extent and severity which should be integrated into the care provided to ILD patients. However, to date, there are no generally accepted or validated staging systems .
The qualitative visual evaluation of HRCT is the base for detection and classification of the type of lung structural abnormalities. Nevertheless, this can be supplemented by visual scales for a semi-quantitative rating of the extent or severity of ILD . This semi-quantitative visual assessment of disease extent may be reported as poorly defined terms like mild, moderate, or severe, or it can be reported as a scoring system or even as an estimate of percentage of lung affected to the nearest 5%, 10%, or 25% .
In our work, parenchymal abnormalities on HRCT were coded and visually scored in all images according to Warrick et al. . We found a fair negative significant correlation between PFT and CT-based visual scoring (p value = 0.04 and r = − 0.190), indicating that decrease of FEV1 values is associated with an increase in visual scoring of lung abnormalities which is logic and expected. Similarly, Sverzellati et al.  found that visual score was a significant predictor of functional impairment with good correlation (p < 0.05, r = 0.60, r2 = 0.38).
However, visual-based assessments are subjective, with large inter-reader and intra-reader variation. A further difficulty is represented by complexity of integrating the extent of the different components of abnormalities seen on several HRCT slices and deriving a quantitative measure of the total extent of lung abnormality. Moreover, visual-based scoring is not reliable for follow-up of ILD patients [12, 13].
These variabilities and difficulties are a reason for automation in an attempt to provide more consistent indices for assessment of ILD . Multiple commercial software packages are available for lung densitometry and automated quantitative of ILD, but they are not widely used because they are complicated or expensive, even among experienced thoracic radiologists, making automated image analysis of ILD confined mainly to research work. The introduction of free softwares and the development of open-source platforms as well made access and use of lung densitometry relatively easy and potentially free .
In our study, we attempt to investigate structure function relationship between PFT and an open access computer-based CT quantification scoring (3D SLICER). This is to verify whether computer-based data could distinguish between patients who have normal lungs and ILD patients.
Mean lung density (MLD) is the simplest measurement which is utilized especially for pulmonary fibrosis . Threshold of − 900 HU, corresponding to attenuation values of a normal lung inflated by air, has been proposed . The skewness is a measure of the lack of symmetry of the density histogram, whereas kurtosis is a measure to which the distribution is peaked relative to a normal distribution .
MLD for our patients was − 744.9 ± 37.1 HU which is far above normal lung attenuation density. Mean skewness and kurtosis values were 1.12 ± 0.43 and 5.97 ± 1.2, respectively. Mean CT-based visual scoring of studied patients was 42.5% ± 22.5%. In concordance with our results, Sverzellati et al.  found that MLD, skewness, and kurtosis on frequency histograms of their patients were on average − 732.1 HU ± 71, 2 ± 0.5 HU, and 8.4 ± 2.2 HU, respectively. Histogram features, after comparing UIP and non-UIP groups, showed no significant statistical differences (p > 0.05). The extent of interstitial disease on visual score was 39.5% ± 21.2%.
In lung fibrosis (as in our results), collagen deposition caused increasing lung density, with subsequent rightward shift of CT frequency histogram, both skewness and kurtosis will typically decrease .
We observed a good significant correlation between computer-aided CT quantified MLD and FEV1 (p < 0.001, r = − 0.57). In agreement with our results, Salaffi et al. , found that computerized aided scores showed a moderate to high significant negative correlation with forced vital capacity (FVC) (r = − 0.490; p value < 0.0001), forced expiratory volume in 1s (FEV1) (r = − 0.675; p value < 0.0001), and single breath carbon monoxide diffusing capacity of the lung (DLco) (r = − 0.653; p value < 0.0001). Shin et al.  reported that mean lung attenuation best correlated with reticulation extent (p < 0.001, r = 0.42). Best et al.  also found moderate correlations existed between histogram features and PFT results, but kurtosis showed the greatest degree of correlation with physiologic abnormality (p < 0.01, r = 0.53). Ash et al.  found that CT densitometric measures and visual fibrosis score were strongly correlated with FVC% as follows: mean lung attenuation (r = − 0.78), skewness (r = 0.76), kurtosis (r = 0.71) with p values < 0.001.
However, Sverzellati et al.  found a poor correlation between histogram features and functional data in UIP group, which may be explained by more cystic dead spaces of honeycombing, but they confirmed that fibrosis ratio together with histogram features can differentiate fibrotic lung from normal lung . This is supported by a better correlation with DLCO than did visual score at the same study in patients with a predominant pattern of ground-glass and reticular opacities without honeycombing.
We also found that computer-aided CT quantification of lung density histograms showed a more significant, stronger correlation, and higher performance than visual-based semi-quantitative CT scaling (p < 0.01, AUC = 0.85 versus 0.71). Direct comparison of CT densitometry with visual score in patients with pulmonary fibrosis showed that the former is more reproducible and more sensitive . Jacob et al.  reported that baseline texture-based CT quantification of total disease extent was superior to visual scoring in clinic-functional models predictive of outcome in IPF (p = 0.002). Similarly, Salaffi et al.  on ROC curve analysis, computer-aided scores confirmed the highest performance (AUC = 0.861 versus 0.689; p = 0.011)
The qualitative visual assessment of the lung must always be performed before densitometry “eye first role,” since it is fundamental for diagnosis, classification, and decreasing risk of false interpretation of lung density values .
Lung densitometry might replace semi-quantitative visual rating of severity and extension of lung changes . In fact, lung densitometry has several advantages over visual semi-quantitative assessment of diffuse lung alterations. The visual assessment is subjective and shows intra- and inter-observer variations. Thanks to the automatic segmentation of lung tissue and negligible time for software computation of density, the time required for lung densitometry is generally shorter as compared to visual scoring.
In conclusion, taken together, a computer-based quantification system will be efficient and providing the best overall estimates of HRCT-measured lung disease, but they cannot be used alone. The visual-based scoring techniques offer an advantage in finding specific type of ILD or in distinguishing other causes of increased lung density, such as infection or neoplasm. The visual and computer-based quantitative scoring systems are complementary, rather than competitive. In combination with physiologic parameters, a computer-based quantification system could be a means for accurate monitoring of disease progression or response to therapy.
There are still some important topics that need to be addressed in the future. The QCT assessment of ILD might have a crucial role in evaluation of prognosis as well as mortality risk prediction models. It may have important implications for multi-center clinical trials that rely on accurate and reproducible quantitative analysis of CT images collected under varied conditions across multiple sites, scanners, and time points.
We are aware of some limitations in our study. The diagnosis of pulmonary fibrosis was not based on histopathologic examination. Another is that lung volume variations due to different levels of inspiration may represent a major limitation of any density based analysis of the lungs. Such a problem may be overcome by taking into account both lung volume and lung density which is not done in our work.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request
Diffusing capacity of the lungs for carbon monoxide
Forced expiratory volume 1st second
High resolution computed tomography
Interstitial lung disease
Multi-detector computed tomography
Mean lung density
Pulmonary function tests
Demedts M, Wells AU, Antó JM, Costabel U, Hubbard R, Cullinan P et al (2001) Interstitial lung diseases: an epidemiological overview. Eur Respir J Suppl 18:2 s–16 s
Huapaya JA, Wilfong EM, Harden CT, Brower RG, Danoff SK. Risk factors for mortality and mortality rates in interstitial lung disease patients in the intensive care unit. European Respiratory Review. Eur Resp Soc; 2018;27: 180061.
Buzan MTA, Pop CM (2015) State of the art in the diagnosis and management of interstitial lung disease. Clujul Med 88(2):116–123
Salaffi F, Carotti M, Di Donato E, Di Carlo M, Ceccarelli L, Giuseppetti G (2016) Computer-aided tomographic analysis of interstitial lung disease (ILD) in patients with systemic sclerosis (SSc). Correlation with pulmonary physiologic tests and patient-centred measures of perceived dyspnea and functional disability. PLoS One 11(3):e0149240
Silva M, Milanese G, Seletti V, Ariani A, Sverzellati N (2018) Pulmonary quantitative CT imaging in focal and diffuse disease: current research and clinical applications. Br J Radiol 91:20170644
Mascalchi M, Camiciottoli G, Diciotti S (2017) Lung densitometry: why, how and when. J Thorac Dis 9(9):3319–3345
Vandevoorde J, Verbanck S, Schuermans D, Kartounian J, Vincken W (2006) Obstructive and restrictive spirometric patterns: fixed cut-offs for FEV1/FEV6 and FEV6. Eur Respir J 27(2):378–383
Sverzellati N, Calabrò E, Chetta A, Concari G, Larici AR, Mereu M et al (2007) Visual score and quantitative CT indices in pulmonary fibrosis: relationship with physiologic impairment. Radiol Med 112(8):1160–1172
Meyer KC (2014) Diagnosis and management of interstitial lung disease. Transl Respir Med 2:4
Robbie H, Daccord C, Chua F, Devaraj A (2017) Evaluating disease severity in idiopathic pulmonary fibrosis. Eur Respir Rev 26(145):170051
Warrick JH, Bhalla M, Schabel SI, Silver RM (1991) High resolution computed tomography in early scleroderma lung disease. 18(10):1520–1528
Fernández Fabrellas E, Peris Sánchez R, Sabater Abad C, Juan SG (2018) Prognosis and follow-up of idiopathic pulmonary fibrosis. Med Sci 6(2):51
Ricardo Peris Sánchez EF-F, Samper GJ, Domingo ML, MLD M, Vilar LN (2018) Visual HRCT score to determine severity and prognosis of idiopathic pulmonary fibrosis. Int J Respir Pulm Med 5(2):084
Bartholmai BJ, Raghunath S, Karwoski RA, Moua T, Rajagopalan S, Maldonado F et al (2013) Quantitative computed tomography imaging of interstitial lung diseases. J Thorac Imaging 28(5):298–307
Lynch DA (2014) Progress in imaging COPD, 2004 - 2014. Chronic Obstr Pulm Dis 1(1):73–82
Walsh SLFLF, Nair A, Hansell DMM (2013) Post-processing applications in thoracic computed tomography. Clin Radiol 68(5):433–448
Salaffi F, Carotti M, Bosello S, Ciapetti A, Gutierrez M, Bichisecchi E et al (2015) Computer-aided quantification of interstitial lung disease from high resolution computed tomography images in systemic sclerosis: correlation with visual reader-based score and physiologic tests. Biomed Res Int 2015:834262
Shin KE, Chung MJ, Jung MP, Choe BK, Lee KS (2011) Quantitative computed tomographic indexes in diffuse interstitial lung disease: correlation with physiologic tests and computed tomography visual scores. J Comput Assist Tomogr 35(2):266–271
Best AC, Lynch AM, Bozic CM, Miller D, Grunwald GK, Lynch DA. Quantitative CT indexes in idiopathic pulmonary fibrosis: relationship with physiologic impairment. Radiology. 2003;228(2):407–414.
Ash SY, Harmouche R, Vallejo DLL, Villalba JA, Ostridge K, Gunville R et al (2017) Densitometric and local histogram based analysis of computed tomography images in patients with idiopathic pulmonary fibrosis. Respir Res 18(1):45
Camiciottoli G, Orlandi I, Bartolucci M, Meoni E, Nacci F, Diciotti S et al (2007) Lung CT densitometry in systemic sclerosis: correlation with lung function, exercise testing, and quality of life. Chest. 131(3):672–681
Jacob J, Bartholmai BJ, Rajagopalan S, Kokosi M, Nair A, Karwoski R et al (2017) Mortality prediction in idiopathic pulmonary fibrosis: evaluation of computer-based CT analysis with conventional severity measures. Eur Respir J 49(1):1601011
We gratefully acknowledge the hard work, efficiency, and devotion of our imaging technicians, which made this work possible.
No sources of funding
Ethics approval and consent to participate
This study was approved by the Research Ethics Committee of the Faculty of Medicine at Minia University in Egypt on June 2018 (reference number is not applicable). All patients included in this study gave written informed consent to participate in this research.
Consent for publication
All patients included in this research gave written informed consent to publish the data contained within this study.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Higazi, M.M., Abdelgawad, E.A., Kaseem, A.H. et al. Computer-aided analysis in evaluation and grading of interstitial lung diseases in correlation with CT-based visual scoring and pulmonary function tests. Egypt J Radiol Nucl Med 51, 80 (2020). https://doi.org/10.1186/s43055-020-00201-6