Original research
Relationship between high-resolution computed tomography quantitative imaging analysis and physiological and clinical features in antisynthetase syndrome-related interstitial lung disease
  1. Sangmee Sharon Bae (1),
  2. Fereidoun Abtin (2),
  3. Grace Kim (2),
  4. Daniela Markovic (3),
  5. Cato Chan (2),
  6. Siamak Moghadam-Kia (4),
  7. Chester V Oddis (4),
  8. Daniel Sullivan (5),
  9. Galina Marder (6),
  10. Swamy Venuturupalli (7),
  11. Paul F Dellaripa (8),
  12. Tracy J Doyle (9),
  13. Gary Matt Hunninghake (9),
  14. Jeremy Falk (10),
  15. Christina Charles-Schoeman (1),
  16. Donald P Tashkin (11),
  17. Jonathan Goldin (2) and
  18. Rohit Aggarwal (4)
  1. Division of Rheumatology, Department of Medicine, University of California Los Angeles, Los Angeles, California, USA
  2. Department of Radiology, University of California Los Angeles, Los Angeles, California, USA
  3. Department of Medicine Statistics Core, University of California Los Angeles, Los Angeles, California, USA
  4. Department of Medicine, Division of Rheumatology and Clinical Immunology, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
  5. Department of Medicine, Division of Pulmonology and Critical Care Medicine, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
  6. Department of Rheumatology, Northwell Health, New Hyde Park, New York, USA
  7. Department of Medicine, Division of Rheumatology, Cedars-Sinai Medical Center, Los Angeles, California, USA
  8. Division of Rheumatology, Inflammation, and Immunity, Brigham and Women's Hospital, Boston, Massachusetts, USA
  9. Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
  10. Department of Medicine, Division of Pulmonary and Critical Care Medicine, Cedars-Sinai Medical Center, Los Angeles, California, USA
  11. Department of Medicine, Division of Pulmonary and Critical Care Medicine, University of California Los Angeles, Los Angeles, California, USA

Correspondence to Dr Rohit Aggarwal; aggarwalr{at}upmc.edu

Abstract

Objectives To explore the association between the extent of CT abnormalities by quantitative imaging analysis (QIA) and clinical/physiological disease parameters in patients with antisynthetase syndrome-associated interstitial lung disease (ARS-ILD).

Methods We analysed 20 patients with antisynthetase antibodies and active ILD enrolled in the Abatacept in Myositis-Associated Interstitial Lung Disease study. High-resolution chest CT was obtained at weeks 0, 24 and 48 and QIA scored the extent of ground glass (quantitative score for ground glass), fibrosis (quantitative score for lung fibrosis, QLF) and total ILD (quantitative ILD, QILD). Mixed-effects models estimated longitudinal QIA scores over time. Associations between QIA scores with clinical/physiological parameters were analysed longitudinally using repeated-measures mixed-effects models.

Results Patients had a median age of 57 years; 55% were male and 85% were white. Higher (worse) baseline QIA scores correlated with lower baseline forced vital capacity (FVC) and diffusing capacity adjusted for haemoglobin (DLCO). Longitudinal QIA trajectories trended towards improving scores during the trial, whereas patients on O2 at baseline had worsening QIA trajectories that differed from those of patients who were not on O2. Longitudinal QIA scores demonstrated strong associations with both FVC and DLCO over time. Higher QILD scores over time were also associated with worse dyspnoea scores, pulmonary visual analogue scale, physician and patient global disease activity, health status in 6/8 domains of the Short Form-36 and higher oxygen requirements. Patients with significant radiographic improvement at 48 weeks had higher baseline QLF and QILD and worse baseline DLCO.

Conclusions Longitudinal QIA scores associate with lung physiology, patient perception of respiratory status, overall disease activity and quality of life over time in ARS-ILD. QIA may allow reproducible monitoring of disease progression and response to therapy over time.

Trial registration number NCT03215927.

  • Outcome Assessment, Health Care
  • Pulmonary Fibrosis
  • Dermatomyositis
  • Polymyositis

Data availability statement

Data are available upon reasonable request. The data underlying this article will be shared on reasonable request to the corresponding author.

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

WHAT IS ALREADY KNOWN ON THIS TOPIC

  • Interstitial lung disease (ILD) is a prominent feature of antisynthetase syndrome, and high-resolution chest CT plays an important role in the clinical evaluation of ILD. Computer-aided quantitative image analysis (QIA) provides reproducible quantification of ILD-related parenchymal abnormalities and has been used as a surrogate outcome measure in clinical trials of scleroderma-associated ILD and idiopathic pulmonary fibrosis.

WHAT THIS STUDY ADDS

  • Our results show that QIA scores over time have moderate to strong correlations with lung physiology as well as clinical outcome measures of respiratory status, quality of life and physician and patient perception of the disease in anti-aminoacyl-tRNA synthetase-related ILD (ARS-ILD).

HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

  • QIA has the potential to serve as an imaging biomarker that empowers objective and reliable monitoring of disease progression and response to therapy in ARS-ILD.

Introduction

Idiopathic inflammatory myopathies (IIMs, collectively known as myositis) are a heterogeneous group of disorders characterised by autoimmune-mediated inflammation primarily of the skeletal muscle along with a wide spectrum of systemic involvement. In patients with IIM with autoantibodies against aminoacyl-tRNA synthetases (ARS), interstitial lung disease (ILD) can be the first and most prominent feature1 and is an important prognostic factor associated with poor survival.2 3 Earlier diagnosis and effective management of ILD are vital in improving the prognosis in ARS-ILD.1 Despite increased awareness of the significance of ARS-ILD, only a few clinical trials have been performed in this population, probably due in part to the paucity of reproducible, objective respiratory measures whose application has been studied in this patient group.

High-resolution chest CT (HRCT) plays an important role in the evaluation of ILD and is a good surrogate marker of important histological findings with prognostic implications in scleroderma-related ILD (SSc-ILD), rheumatoid arthritis-related ILD (RA-ILD) and idiopathic pulmonary fibrosis (IPF).4–9 There has been increasing interest in using quantitative imaging analysis (QIA) for computer-aided quantification of the parenchymal abnormalities of ILD as a surrogate outcome measure in clinical trials of SSc-ILD and IPF.10–14 The complexity and nature of parenchymal abnormalities seen in myositis-ILD and ARS-ILD differ from those seen in SSc-ILD and other connective tissue disease-related ILD, and the applicability of QIA in this patient group is largely unknown.

In a recent retrospective observational study from a South Korean myositis-ILD cohort, cross-sectional QIA scores correlated well with lung physiological measures, and higher baseline QIA scores were predictive of higher risk of lung transplantation or death.15 However, longitudinal associations between QIA scores and physiological measures over time were not reported in that study. In addition, associations between the extent of parenchymal abnormalities on HRCT and patient-reported outcomes of dyspnoea, health-related quality of life (QOL) and overall disease activity have not been assessed in myositis-ILD or ARS-ILD.

In the current study, we explored the association between longitudinal quantitative CT scores using QIA and comprehensive physiological and patient-centred indices of disease over time in patients with active ARS-ILD from a 48-week multicentre randomised interventional study.

Methods

Patient selection

We analysed 20 patients enrolled in the Abatacept in Myositis-Associated Interstitial Lung Disease (Attack My-ILD) study (ClinicalTrials.gov identifier: NCT03215927). All subjects gave written informed consent for the study. Patients with antisynthetase antibodies and active ILD were enrolled across four centres in the USA. Antisynthetase antibodies were tested by commercial assays available at the patient’s local laboratory which typically used ELISA. Enrolled patients were randomised to a double-blind placebo period for 24 weeks assigned to either abatacept (n=9) or placebo (n=11), followed by an open label period in which all patients were on abatacept for 24 weeks. All patients were on standard of care immunosuppressive agents for ILD (glucocorticoid or steroid-sparing agent, either azathioprine or mycophenolate, or combination of steroid and one steroid-sparing agent) prior to the trial and were maintained on a stable standard of care regimen and a taper of glucocorticoids throughout the trial.

Active ILD was defined as new-onset ILD (within 3 months of ILD diagnosis) or recent worsening or flare-up of existing ILD requiring treatment, as defined by one or more of the following: (a) worsening of ground glass opacities, reticulation, honeycombing or fibrosis on chest HRCT as per local thoracic radiologist or pulmonologist within the previous 6 months, (b) relative decline in forced vital capacity (FVC) ≥10% within 3 months and/or (c) new oxygen requirement within 6 months. Criteria for active ILD were reviewed by an adjudication committee prior to enrolment. Patients with FVC≤30% predicted, on supplemental O2≥6 L/min for ≥1 month, listed for lung transplantation or with chronic stable ILD or end-stage fibrosis in which the investigator determined there was low potential for improvement were excluded from the trial.
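The activity definition above is essentially a disjunction of criteria. As a minimal sketch only, the logic could be expressed as follows; the record type, its field names and the reading of "within 3 months" as "3 months or less" are assumptions made for illustration and do not come from the trial protocol. The "requiring treatment" qualifier and the adjudication committee review are not modelled.

```python
from dataclasses import dataclass


@dataclass
class IldAssessment:
    """Hypothetical screening record; field names are illustrative only."""
    months_since_ild_diagnosis: float
    hrct_worsening_within_6_months: bool
    fvc_3_months_ago_ml: float
    fvc_now_ml: float
    new_oxygen_within_6_months: bool


def relative_fvc_decline(previous_ml: float, current_ml: float) -> float:
    """Relative decline in FVC as a fraction of the earlier value."""
    return (previous_ml - current_ml) / previous_ml


def has_active_ild(a: IldAssessment) -> bool:
    """True for new-onset ILD, or a flare meeting any of criteria (a) to (c)."""
    new_onset = a.months_since_ild_diagnosis <= 3  # assumed inclusive bound
    criterion_a = a.hrct_worsening_within_6_months
    criterion_b = relative_fvc_decline(a.fvc_3_months_ago_ml, a.fvc_now_ml) >= 0.10
    criterion_c = a.new_oxygen_within_6_months
    return new_onset or criterion_a or criterion_b or criterion_c
```

Note that criterion (b) uses a relative decline: a fall from 3000 mL to 2600 mL is a decline of about 13% and would qualify, whereas a fall to 2900 mL (about 3%) would not.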

HRCT image assessment by QIA and visual analysis

Chest HRCT scans obtained with standard non-contrast thin section protocols at each institution were analysed. HRCTs were performed at three timepoints (weeks 0, 24 and 48) during the trial and pre-baseline scans were also included when available. Image quality was assessed, and scans obtained with contrast protocol, >3 mm slice thickness, poor resolution or artefacts were excluded. Bilateral lungs (right, left) were each segmented into three zones (upper—apices to aortic arch, middle—aortic arch to inferior pulmonary veins, lower—below inferior pulmonary veins). We applied computer-aided QIA using previously published methods.16–18 QIA scores were assigned as a percentage of the extent of ground glass (quantitative score for ground glass, QGG), fibrotic patterns (quantitative score for lung fibrosis, QLF), honeycombing (quantitative score for honeycombing, QHC) and consolidations (quantitative score for consolidation, QCON). The quantitative ILD (QILD) score represents the sum of QGG+QLF+QHC+QCON. QIA scores were also reported as the absolute volume of the portion of the lung with abnormal parenchyma adjusted for the total lung volume (QIA in mL), in order to account for different levels of inspiration that may vary even within the same patient. QIA scores were calculated for whole lung and also for each zone. The zone of maximum involvement (ZM) was defined as the zone with the highest QILD score at baseline. Radiographic improvement in ILD was defined as QILD decline (improvement) of ≥2% of the whole lung, which was used as the anchor for minimal clinically important difference in SSc-ILD.19 20
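The score definitions in this paragraph are simple arithmetic and can be sketched in a few lines. The following is an illustrative sketch, not the authors' QIA software: function names and the example numbers are hypothetical, and the ≥2% threshold is read here as percentage points of whole lung, which is an interpretation of the text.

```python
from typing import Dict

# Component scores, each expressed as % of lung (or zone) volume.
PATTERNS = ("QGG", "QLF", "QHC", "QCON")


def qild(scores: Dict[str, float]) -> float:
    """QILD (%) as the sum QGG + QLF + QHC + QCON."""
    return sum(scores[p] for p in PATTERNS)


def zone_of_maximum_involvement(zone_scores: Dict[str, Dict[str, float]]) -> str:
    """Name of the zone with the highest baseline QILD."""
    return max(zone_scores, key=lambda zone: qild(zone_scores[zone]))


def radiographic_improvement(baseline_qild: float, followup_qild: float,
                             threshold: float = 2.0) -> bool:
    """True if whole-lung QILD fell by at least `threshold` percentage points."""
    return (baseline_qild - followup_qild) >= threshold


# Illustrative (made-up) baseline zone scores for one patient.
zones = {
    "right_upper": {"QGG": 5, "QLF": 2, "QHC": 0, "QCON": 0},
    "right_lower": {"QGG": 20, "QLF": 15, "QHC": 0, "QCON": 1},
}
zm = zone_of_maximum_involvement(zones)
```

With these made-up values, the right lower zone has the higher QILD (36% versus 7%) and would be taken as the zone of maximum involvement.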

Two thoracic radiologists (CC, FA) who were blinded to the patient’s clinical data visually assessed the CT images, which included overlays of the QILD as quantified by QIA. They confirmed that the computer-aided quantification matched the visually assessed extent of ILD with over 80% agreement. Additionally, the radiologists conducted a consensus review to identify the predominant CT features and visual ILD patterns using Fleischner Society glossary of terms.21 22 The readers determined fibrosis to be present if there was evidence for reticulation with architectural distortion with or without traction bronchiectasis and regional volume loss.

Pulmonary function test and clinical outcome measures

Pulmonary function tests (PFTs) and the clinical outcome measures were obtained at five timepoints (weeks 0, 12, 24, 36 and 48). Clinical respiratory outcome measures included dyspnoea score from the University of California San Diego Shortness of Breath Questionnaire (UCSD SOBQ dyspnoea),23 pulmonary visual analogue scale (VAS) from the Myositis Disease Activity Assessment Tool,24 supplemental oxygen (O2) use and a 6 min walk test (6MWT) that measures 6 min walk distance (6MWD)25 as well as pretest and post-test Borg scale for dyspnoea and fatigue.26

Myositis outcomes were also obtained at these visits including a physician global myositis disease activity VAS (MD global VAS 0–10), extramuscular VAS, manual muscle testing in eight muscle groups (MMT, 0–150) and patient-reported global myositis activity VAS (patient global VAS).24 QOL measures included Short Form-36 (SF-36)27 28 and Health Assessment Questionnaire-Disability Index (HAQ-DI).24

PFTs were obtained at each institution using standardised protocols to obtain FVC and diffusing capacity adjusted for haemoglobin (DLCO) in accordance with ATS/ERS recommended standards.29–32 Both measures were reported as % predicted values as well as in absolute units (mL for FVC, mL/min/mm Hg for DLCO). Per cent predicted values were calculated centrally from the age, height and haemoglobin collected in the trial using the Global Lung Function Initiative equations.33 34

Statistical analysis

Baseline associations between quantitative CT scores with various physiological and clinical parameters were analysed cross-sectionally using Spearman’s correlations.
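For readers less familiar with the method, Spearman's correlation is the Pearson correlation computed on ranks, with tied values sharing the mean of their ranks. The sketch below is a plain-Python illustration of that definition; it is not the JMP or SAS routine used in the study, it does not compute p values, and the function names are our own.

```python
from statistics import mean
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1  # mean of 1-based positions i..j
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))
```

Because only ranks enter the calculation, a perfectly monotonic inverse relationship (for example, QILD rising as FVC falls across patients) gives a coefficient of −1 regardless of the spacing of the values.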

Mixed-effects models were used to estimate longitudinal trajectories of whole lung QIA scores (in % and in mL) over time. We also tested interactions with treatment group, age, sex, baseline supplemental O2 use, baseline QIA scores, dyspnoea score, FVC and DLCO to determine whether CT score trajectories were different in patient subgroups. In patients who had pre-baseline scans, we compared mixed-effects model slopes of pre-baseline QIA score trajectories to trajectories during the trial (post-baseline) by treatment group using piecewise linear splines.
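The piecewise linear spline used for the pre-baseline versus post-baseline comparison can be illustrated by its time basis. The sketch below shows only the fixed-effects part of such a trajectory, assuming a single knot at baseline (month 0); the knot location, function names and numeric values are assumptions for illustration, and the random effects and treatment-group terms of the actual models are omitted.

```python
def spline_basis(months_from_baseline: float, knot: float = 0.0):
    """Split time into (pre-knot, post-knot) components of a linear spline."""
    pre = min(months_from_baseline, knot) - knot    # <= 0 before the knot, else 0
    post = max(months_from_baseline, knot) - knot   # >= 0 after the knot, else 0
    return pre, post


def predicted_score(months_from_baseline: float, intercept: float,
                    pre_slope: float, post_slope: float) -> float:
    """Score on a piecewise linear trajectory with separate slopes on each side."""
    pre, post = spline_basis(months_from_baseline)
    return intercept + pre_slope * pre + post_slope * post


# Illustrative (made-up) values: QILD rising by 0.5 %/month before baseline,
# then falling by 0.25 %/month during the trial, with QILD of 40% at baseline.
six_months_before = predicted_score(-6, 40.0, 0.5, -0.25)
twelve_months_after = predicted_score(12, 40.0, 0.5, -0.25)
```

Comparing the two slope parameters is what allows a change in trajectory at baseline to be tested; the two segments meet at the knot by construction.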

Repeated-measures analysis using mixed-effects linear models was performed to analyse the longitudinal associations between CT scores and physiological/clinical measures across multiple timepoints. The results, presented as standardised regression coefficients (β), indicate the expected change in the dependent variable in terms of SDs per unit change in the predictor, facilitating a comparison of the relative importance of the predictors.35 Fixed-effects models estimated the associations over time within individuals.36 Association strengths were classified as weak (<0.2), moderate (0.2–0.59) or strong (≥0.6). Interactions with age, baseline O2 use and muscle strength (MMT<150 vs MMT=150) were also tested. Changes in QIA from baseline were explored in relation to clinical parameter changes using unstandardised regression coefficients to estimate effect sizes. Responders to therapy were determined at 48 weeks based on radiographic ILD improvement from baseline. Baseline characteristics of responders were compared with non-responders using Student’s t-test or Wilcoxon rank-sum test for continuous variables and χ2 test for categorical variables.
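The strength classification above can be written as a small helper. In the sketch below, the cut-offs come from the text and are applied to the absolute value of β, since negative coefficients are described with the same labels elsewhere in the article. The rescaling function shows one common convention for standardising a slope (scaling by the SDs of both predictor and outcome); whether the authors used exactly this convention is not stated, so it should be read as an assumption.

```python
from statistics import stdev
from typing import Sequence


def standardised_beta(unstandardised_b: float,
                      predictor_values: Sequence[float],
                      outcome_values: Sequence[float]) -> float:
    """Rescale a slope to SD units: beta = b * SD(x) / SD(y) (assumed convention)."""
    return unstandardised_b * stdev(predictor_values) / stdev(outcome_values)


def association_strength(beta: float) -> str:
    """Classify |beta| as weak (<0.2), moderate (0.2 to <0.6) or strong (>=0.6)."""
    magnitude = abs(beta)
    if magnitude < 0.2:
        return "weak"
    if magnitude < 0.6:
        return "moderate"
    return "strong"
```

Applied to coefficients reported in the Results, β=−0.64 would be labelled strong, β=0.45 moderate and β=−0.18 weak.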

All statistical testing was two sided with a significance threshold of 0.05 except for exploratory analysis testing associations between changes from baseline in which p value threshold was 0.10 in view of the smaller number of follow-up scans. Statistical analysis was performed using JMP Pro V.16.0 (SAS Institute, Cary, North Carolina, USA) and SAS V.9.4 (SAS Institute).

Patient and public involvement

There was no patient or public involvement in the design or conduct of the study.

Results

Cross-sectional analysis at baseline visit

Patients were middle aged (median age 57 years), 55% male and 85% white with median ILD duration of 2 years (table 1). Anti-Jo1 antibody was the most common antisynthetase antibody (55%); among the remaining subjects, two had anti-PL-7 and one each had anti-PL-12, anti-OJ and anti-EJ antibodies. Nine (45%) patients had muscle involvement and MD global disease activity was moderate (VAS 4.0 (1.6–5.5), median (IQR)).

Table 1

Baseline characteristics

All patients had active ILD at baseline with moderate to severe physiological impairment based on a median FVC of 64% predicted and DLCO 48% predicted, and six (30%) patients were on supplemental O2. Baseline CT characteristics by visual analysis and QIA are presented in table 1. The most common ILD pattern was non-specific interstitial pneumonia (NSIP)-type pattern (47%), and fibrosis was the predominant CT feature in eight (47%) patients. On QIA, median QILD at baseline was 39% of the whole lung, which was mostly ground glass (QGG 19%) and fibrotic patterns (QLF 16%), while honeycombing (QHC 0%) and consolidation (QCON 0.2%) were absent or minimal. Maximum involvement (ZM) was most frequently seen in lower lung zones.

Spearman’s correlation of baseline characteristics with baseline CT scores showed that higher (worse) QILD scores had moderate to strong correlations with worse baseline FVC and DLCO (r=−0.51 to −0.64, p<0.05). Higher QGG correlated with worse FVC, and higher QLF correlated with worse DLCO. Higher QLF also correlated with shorter 6MWD at baseline (r=−0.54, p=0.03). Baseline QGG and QILD also correlated with patient global VAS (r=0.56 and 0.59, p=0.02 for both), but not with MD global VAS or extramuscular VAS. Worse health status on SF-36 domains for physical functioning, emotional role functioning, bodily pain and general health perceptions also correlated with higher baseline QIA scores cross-sectionally (online supplemental table 1 for details).

Longitudinal quantitative CT scores during the trial period

All baseline and follow-up CT scans that were adequate for QIA were analysed and the slope of the QIA trajectories during the trial period was estimated using mixed-effects linear models (figure 1 for QIA %, online supplemental figure 1 for QIA in mL adjusted for total lung volume). Estimated trajectories for QIA scores over time in the entire cohort trended towards overall improvement.

Figure 1

Quantitative CT scores (QIA-WL % scores) over time in all patients with HRCT scans (n=17). Bold lines are estimated trajectories of QIA (QGG, QLF, QILD) % whole lung scores over time (months) using mixed-effects models in 17 patients who had one or more HRCT scans adequate for QIA. Mean change±SE for QGG was −0.12±0.09 per month, p=0.21; for QLF −0.19±0.18 per month, p=0.30; for QILD −0.33±0.24 per month, p=0.19. HRCT, high-resolution chest CT; QGG, quantitative score for ground glass; QIA, quantitative image analysis; QILD, quantitative interstitial lung disease; QLF, quantitative score for lung fibrosis; WL, whole lung.

Longitudinal QIA scores over time were also estimated in patient subgroups to test whether QIA trajectories differ by certain baseline characteristics. Baseline O2 use was a significant effect modifier in which patients on O2 at baseline trended towards worsening QLF scores over time while patients not on O2 trended towards improved QLF scores over time (interaction p=0.04 in figure 2a for QIA %, p=0.03 in online supplemental figure 2a for QIA in mL). Patients with baseline organising pneumonia (OP)-type pattern trended towards greater improvement in QLF and QILD scores compared with non-OP-type patterns although the interaction was not statistically significant (figure 2b). Longitudinal QIA scores over time also did not have interactions with the following variables measured at baseline: age, sex, ILD disease duration, dyspnoea score, FVC or DLCO (interaction p value=NS for all).

Figure 2

Longitudinal quantitative CT scores in different patient subgroups. P value of interaction term. Estimated trajectories of QIA (QGG, QLF, QILD) % whole lung scores over time (months) in patient subgroups by (a) baseline supplemental oxygen use and (b) baseline visual ILD pattern. ILD, interstitial lung disease; NSIP, non-specific interstitial pneumonia; OP, organising pneumonia; QGG, quantitative score for ground glass; QIA, quantitative image analysis; QILD, quantitative ILD; QLF, quantitative score for lung fibrosis.

Longitudinal quantitative CT scores correlate with PFTs

Mixed-effects linear models were used to estimate the association between QIA scores and PFTs over all timepoints (table 2). Whole lung QGG, QLF and QILD in % all demonstrated moderate to strong associations with both FVC and DLCO (β=−0.35 to −0.64, p<0.05 for all). QIA in mL had similar associations with FVC and DLCO % predicted (online supplemental table 2), while QIA-ZM scores did not (online supplemental table 3). Fixed-effects models for within-person correlations showed DLCO had strong associations with QLF and QILD, while FVC associations with QLF and QILD were no longer statistically significant (online supplemental table 4).

Table 2

Associations between quantitative CT scores and clinical/physiological parameters over time using mixed-effects models

In order to estimate the relationship between change in QIA from baseline (visit QIA-baseline QIA) with change in PFT from baseline (visit PFT-baseline PFT), we performed mixed-effects models in patients who had a baseline and at least one follow-up QIA score. Improvement in QIA scores from baseline trended towards associations with improvement in FVC and DLCO from baseline (table 3). Each unit improvement in DLCO % predicted from baseline associated with 0.34 unit improvement from baseline in QLF (p=0.06), and 0.74 unit improvement from baseline in QILD (p=0.08).

Table 3

Association between changes in QIA whole lung scores from baseline and changes in clinical/physiological parameters from baseline

In order to test whether associations between QIA scores and PFTs differed by certain baseline characteristics, we tested interactions with age, baseline O2 use and muscle weakness (MMT) (online supplemental table 5). None of the interactions tested were statistically significant. However, the associations between QIA scores and PFTs were stronger in patients older than 50 (compared with those younger than 50) and in patients with normal muscle strength (compared with patients with MMT<150).

Longitudinal quantitative CT scores correlate with clinical respiratory outcomes and supplemental O2 use

We also analysed the associations of QIA scores (by % and mL) with clinical respiratory outcomes over all timepoints (table 2 for QIA %, online supplemental table 2 for QIA in mL, online supplemental table 3 for QIA-ZM). Higher whole lung QLF and QILD scores had moderate associations with worse UCSD dyspnoea scores and higher O2 use over time (by % and by mL). Higher QILD scores were also associated with worse pulmonary VAS over time. QIA scores were not associated with 6MWD or Borg scales of fatigue or dyspnoea over time.

To test whether clinical respiratory outcomes had stronger associations with QIA scores or with PFTs, we constructed mixed-effects models between PFT measures and the same respiratory outcomes (online supplemental table 6), and compared the resulting standardised coefficients with those in table 2. The UCSD dyspnoea score and pulmonary VAS had stronger coefficients with QIA scores than with PFT measures (for UCSD dyspnoea β=0.45 with QILD vs β=−0.23 to −0.31 with FVC and DLCO; for pulmonary VAS β=0.37 with QILD vs β=−0.18 to −0.27 with FVC and DLCO). O2 use had strong positive correlations with QLF and QILD but was not associated with PFTs (β=−0.14 to −0.36, p=NS for all). On the other hand, 6MWD and Borg scores for dyspnoea and fatigue had many weak but significant associations with PFTs, but no significant associations with QIA scores. It is noteworthy that the trial design included more PFT observations than HRCT observations.

Longitudinal quantitative CT scores correlate with global disease activity and patient-reported outcomes of QOL

Higher longitudinal QILD and QLF scores of whole lung and ZM had moderate associations with higher MD global VAS, extramuscular VAS and patient global VAS over time, but not with MMT (table 2, online supplemental tables 2 and 3). Whole lung QIA scores also associated with worse health status by 6/8 domains of the SF-36 over time, while HAQ-DI did not (table 2).

The associations with whole lung QILD (table 2) were stronger than associations with PFT (online supplemental table 6) for MD global VAS, extramuscular VAS, patient global VAS and the aforementioned 6/8 SF-36 domains.

Responders with improvement in QILD at 48 weeks

Twelve patients had HRCT scans at baseline and 48 weeks. In these patients, the median (IQR) change in QILD from baseline to 48 weeks was −4% (−13, 13). Seven (58%) of these patients had improvement in QILD by >2% and were grouped as ‘responders’ (QILD change from baseline −11 (−30, −6), median (IQR)).

We compared baseline predictors between responders and non-responders (table 4). Treatment assignment was not significantly different between the two groups. Responders were more often female, had lower baseline DLCO and higher baseline QLF and QILD. All responders had full muscle strength at baseline, which was significantly different from non-responders (median MMT 141). Among those with predominant CT features of fibrosis at baseline, 40% were responders compared with 71% of patients with predominantly non-fibrotic baseline CT features (p=0.27). Baseline QGG, FVC, visual ILD pattern type, age, disease duration, Jo1 versus non-Jo1, 6MWD, pulmonary VAS, Borg scores, UCSD dyspnoea, O2 use, MD global activity VAS, HAQ-DI and SF-36 were not different between responders and non-responders.

Table 4

Baseline characteristics of patients who responded to treatment based on improvement in QILD by >2% at 48 weeks.

Discussion

In the current work, we used a computer-aided QIA to measure the extent of ILD on CT images of patients with ARS-ILD to assess ILD severity over time, and to explore the relationship between QIA scores and clinically important functional and patient-reported outcome measures longitudinally. QIA scores demonstrated moderate to strong associations with both FVC and DLCO at baseline and over time. Higher QLF and QILD scores over time also associated with worse scores in multiple patient and physician-reported outcomes, including the UCSD dyspnoea score, pulmonary VAS, O2 use, MD global activity VAS, extramuscular VAS, patient global VAS and 6/8 domains of the SF-36 longitudinally. Moreover, these outcome measures had more robust associations with quantitative CT scores than with FVC or DLCO.

Higher whole lung QILD scores had moderate to strong correlations with worse FVC and DLCO in ARS-ILD. Correlations between QILD with PFTs were seen in a cross-sectional analysis within an observational myositis-ILD cohort from Korea.15 Our study also evaluated longitudinal associations, demonstrating that changes in QIA scores associated with changes in physiological measures over time, reflecting responsiveness of QILD. Although QIA has mostly been validated in SSc-ILD,16 37 our findings support that this tool may also be useful in myositis-ILD including ARS-ILD as an outcome measure.

Among specific ILD features, QLF scores correlated strongly with DLCO, more so than with FVC or total lung capacity, a pattern also observed in SSc-ILD where DLCO is the best predictor of fibrosis extent.37 In contrast, QGG scores showed stronger correlations with FVC. Ground glass opacities, while potentially indicating active inflammatory alveolitis, can also represent fine intralobular fibrosis or oedema in SSc-ILD.4 38 Longitudinal HRCT studies in SSc-ILD have shown that ground glass opacities may evolve into normal or fibrotic patterns, suggesting a decrease in QGG does not always indicate ILD improvement.10 In patients with NSIP, reductions in ground glass opacities correlated with improvements in FVC, more than in DLCO,39 similar to our findings. Future studies should explore the mapping of specific CT features and their temporal changes to enhance our understanding of their clinical implications in ARS-ILD.

Quantitative CT scores allowed for group-level analysis and comparisons which would have been difficult with visual assessment. We found that patients on O2 at baseline had worsening CT scores over time, unlike patients who were not on O2, suggesting that patients on O2 require closer monitoring and consideration for more aggressive interventions.

Physiological variables are indirect surrogates for the extent of structural disease and can be influenced by patient effort, gender and disease-related factors. In ARS-ILD, the presence of muscle weakness or pain, respiratory muscle weakness and diaphragmatic dysfunction may be additional factors that can impact PFT results. In our current work, PFTs had significant correlations with QIA scores in patients with full muscle strength (MMT=150), whereas correlations were weaker and mostly non-significant in patients with muscle weakness (MMT<150). Although FVC and DLCO are frequently used as a main outcome measure in ILD clinical trials, CT scores may offer a more direct and precise measure of the underlying structural pathological process.

In addition to its correlations with physiological measures, it is important to assess whether CT quantitation is providing a meaningful representation of physician and patient perception of the disease.28 In our current work, higher QILD scores over time correlated with worse respiratory status longitudinally by UCSD dyspnoea scores, pulmonary VAS and higher O2 requirement. The UCSD SOBQ measures the severity of dyspnoea during activities of daily living and has been validated in patients with IPF and fibrotic ILD.40 41 It is noteworthy that UCSD dyspnoea score and pulmonary VAS had more robust associations with QIA scores than with PFT measures. O2 use was not associated with PFTs but strongly correlated with QLF and QILD scores. Our work suggests that QIA scores reflect patient and physician perception of dyspnoea and respiratory status, perhaps more closely than PFTs.

On the other hand, 6MWD and Borg scores at rest and after exertion with a 6MWT were not correlated with QIA scores. A possible explanation for the discrepancy between the 6MWT and other clinical respiratory outcome measures is that many factors other than parenchymal lung disease are known to impact the 6MWD, such as age, body habitus, sex and comorbidities that cause pain or limitations in mobility, as well as muscle weakness in myositis.42 While Borg scales and 6MWT may be valuable measures of patient function and perception of respiratory impairments, our sample size may have been insufficient to adjust for confounders and detect significant relationships between QIA and these measures.

Higher QILD and QLF scores were also associated with higher global disease activity and worse QOL by the SF-36 over time. Association of QIA scores with measures of the patient’s perception of overall disease and health-related QOL adds to its value in measuring disease that impacts daily living.

Patients who had significant improvement in radiographic ILD at 48 weeks had higher baseline QLF and QILD scores and worse DLCO. Female sex and absence of muscle weakness also appeared to be associated with a favourable outcome. The use of a reliable, reproducible quantitation tool can be helpful in identifying prognostic factors in ILD. For example, higher baseline QLF scores correlated with a favourable treatment response in SSc-ILD9 and high baseline QLF scores were associated with higher 5-year mortality in RA-ILD.43 QIA can detect subtle progression of disease or a favourable response that may not be obvious with semiquantitative visual assessment, an advantage that can be particularly helpful for comparing cohorts that are already on background standard of care therapy.

Such advantages of QIA offer a means of reliably following patients over time in clinic. Its high reproducibility and sensitivity to change can be helpful when assessing response to treatment, and short-term changes can allow physicians to pinpoint patients with rapid/acute progression. QIA also has a theoretical advantage over PFTs or patient-reported outcome measures, as it evaluates the extent of parenchymal lung disease while being minimally affected by patient effort, muscle weakness or musculoskeletal pain.

The current work has limitations. First, our sample was small and mostly white, and the follow-up period was limited to 48 weeks. IIM is a rare disease,44 and the Attack My-ILD trial was a proof-of-concept study with strict inclusion criteria to only include patients with active ILD. The sample was recruited from four centres in the USA and may not be representative of other populations. Also, the small sample size and short follow-up time limited our ability to perform multivariate adjustments or robust evaluation of predictors of long-term outcomes, and the significance level was not adjusted for multiple testing given the exploratory nature of the study. However, our comprehensive assessment of the repeated measures over time and their associations with CT scores over the same time period was made possible by using a well-phenotyped clinical trial cohort. While observational datasets and convenience cohorts often have the advantage of a larger sample size, they pose problems in relation to the inherent heterogeneity in patient characteristics, treatments during the follow-up period and variable timing of outcome assessments.

Another limitation of our study is the lack of standardisation in HRCT protocols, such as the use of non-volumetric versus volumetric scanners, slice thickness of images and ensuring breath-holds for maximum inspiration. We attempted to minimise variability in the data by performing within-person models and adjusting QIA scores for the total lung volume at each visit, but standardisation of HRCT protocols would be needed for optimal QIA scores.
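As an illustration of why within-person analysis can reduce between-patient and between-scanner variability, the following minimal sketch centres each patient's repeated measurements on that patient's own mean before correlating them. It is not the statistical model used in this study, which is not specified in this section; the function names, the record fields (`patient`, `qild`, `fvc`) and all values are hypothetical and synthetic, and adjustment for total lung volume is omitted.

```python
from collections import defaultdict
from math import sqrt


def within_person_center(records, key):
    """Subtract each patient's own mean of `key` from every visit value."""
    by_patient = defaultdict(list)
    for r in records:
        by_patient[r["patient"]].append(r[key])
    means = {p: sum(v) / len(v) for p, v in by_patient.items()}
    return [r[key] - means[r["patient"]] for r in records]


def pearson(x, y):
    """Plain Pearson correlation coefficient of two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Synthetic visits (made-up numbers, not study data): within each patient the
# CT score falls as FVC rises, but patient B has higher values of both overall.
visits = [
    {"patient": "A", "qild": 30.0, "fvc": 60.0},
    {"patient": "A", "qild": 25.0, "fvc": 65.0},
    {"patient": "A", "qild": 20.0, "fvc": 70.0},
    {"patient": "B", "qild": 50.0, "fvc": 80.0},
    {"patient": "B", "qild": 45.0, "fvc": 85.0},
    {"patient": "B", "qild": 40.0, "fvc": 90.0},
]

# Pooled correlation mixes between-patient and within-patient variation.
r_pooled = pearson([v["qild"] for v in visits], [v["fvc"] for v in visits])

# Within-person correlation uses only each patient's change around their mean.
r_within = pearson(
    within_person_center(visits, "qild"),
    within_person_center(visits, "fvc"),
)

print(f"pooled r = {r_pooled:.2f}, within-person r = {r_within:.2f}")
```

In this toy example the pooled correlation is positive because of the baseline difference between the two patients, whereas the within-person correlation is negative, which is the relationship that repeated-measures analysis is intended to isolate.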

Myositis-ILD and ARS-ILD have morphological patterns that may not be fully captured by the current QIA software, which was largely designed for SSc-ILD.16 17 For instance, whereas a previous study noted air-space consolidations in 48% of patients with ARS-ILD45 and visual assessment in our cohort described consolidation as the predominant CT feature in 18% of scans, QIA identified a median QCON score of <1%, indicating potential limitations of the QIA in distinguishing consolidations from other patterns like fibrosis. QCON score was not included in our results and we postulate that changes in QLF values may in part represent improvement in consolidations. Nevertheless, visual assessments confirmed that the QILD score effectively differentiates and quantifies all ILD-related abnormalities in ARS-ILD. Also, the strong correlation of QILD with clinical measures underscores its utility. While QIA has been applied across various ILD conditions and in clinical trials,11–15 46 47 future work will aim to optimise the QIA algorithm for myositis-ILD and validate its use in larger prospective cohorts.

Summary and conclusions

In this initial exploratory study, we demonstrate that the extent of lung parenchymal abnormalities on chest HRCT using QIA associates with impairments in lung physiology as well as patient and physician-reported outcomes of respiratory status over time in patients with ARS-ILD. QIA has the potential to serve as an objective imaging biomarker in myositis-ILD and ARS-ILD that may allow objective and reliable monitoring of disease progression and response to therapy over time. Further work is needed to optimise and validate its use in myositis-ILD, determine whether specific CT features correspond to symptoms and/or treatment response and identify potential baseline predictors of long-term ILD outcomes.

Data availability statement

Data are available upon reasonable request. The data underlying this article will be shared on reasonable request to the corresponding author.

Ethics statements

Patient consent for publication

Ethics approval

This study involves human participants and was approved by the University of Pittsburgh Institutional Review Board (study number: 19030443). Participants gave informed consent to participate in the study before taking part.

Acknowledgments

We thank all the patients and participating study sites of the trial. SB is supported by NIAMS (K23 AR081423).

References

Footnotes

  • Contributors SB conceived, planned the project and wrote the manuscript with the support of RA, DPT, JG and CC-S who developed the theory and supervised the findings of this work. FA, GK and CC performed the analysis of the images. DS performed numerical calculations of PFT results. DM performed the statistical calculations and modelling. RA, SM-K, CVO, GM, SV, PFD, TJD, GMH and JF provided the CT and patient data that were analysed. All authors reviewed the results and contributed to the final manuscript. SB and RA are the guarantors of this work.

  • Funding This work was generated from the following grant: Bristol Myers Squibb Abatacept in the Treatment of Myositis-Associated Interstitial Lung Disease (NCT03215927) and the National Institute of Arthritis and Musculoskeletal and Skin Diseases (K23 AR081423) (PI: SB).

  • Competing interests GK and JG are on the UCLA patent for the quantitative imaging analysis. RA has received research grants from Boehringer Ingelheim, Bristol Myers Squibb, EMD Serono, Janssen, Mallinckrodt, Pfizer and Q32, and serves as a consultant for Actigraph, Alexion, ANI Pharmaceuticals, Argenx, AstraZeneca, Boehringer Ingelheim, Bristol Myers Squibb, Cabaletta Bio, Capella Bioscience, Corbus, CSL Behring, EMD Serono, Galapagos, Horizon Therapeutics, I-Cell, Janssen, Kezar, Kyverna, Merck, Novartis, Nuvig Therapeutics, Octapharma, Pfizer, Regeneron, Roivant, Sanofi, Teva, Artsome, Capstanx and Manta. CC-S has received research grants from Priovant, CSL Behring, Janssen, Octapharma, Pfizer, AbbVie and Bristol Myers Squibb, and serves as a consultant for Boehringer Ingelheim, Recludix, Octapharma, Pfizer, AbbVie and Bristol Myers Squibb. PFD is editor of UpToDate and a member of the FDA Advisory Committee. TJD has received support from Bayer and has been part of a clinical trial funded by Genentech, all unrelated to this study. GMH receives grant support from the NIH including R01 HL111024, R01 HL135142 and R01130974. He has performed consulting work for Boehringer Ingelheim and the Gerson Lehrman Group.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.