Article Text

Download PDFPDF

Original research
Distinction and prognosis of early arthritis phenotypes: an analysis in three European cohorts
  1. Alexandre Sepriano1,2,
  2. Bastiaan van Dijk2,
  3. Sofia Ramiro2,3,
  4. Annette van der Helm-van Mil2,
  5. Bernard Combe4,
  6. Dirkjan van Schaardenburg5,
  7. Maarten de Wit6,
  8. Alison Kent6,
  9. Elsa Mateus6 and
  10. Robert Landewé7,8
  1. 1NOVA Medical School, Universidade Nova de Lisboa, Lisboa, Portugal
  2. 2Rheumatology, Leiden University Medical Center, Leiden, Netherlands
  3. 3Rheumatology, Zuyderland Medical Centre Heerlen, Heerlen, Netherlands
  4. 4University of Montpellier, Montpellier, France
  5. 5Rheumatology, Reade and Amsterdam University Medical Center, Amsterdam, Netherlands
  6. 6EULAR Patient Research Partner Network, Zurich, Switzerland
  7. 7Amsterdam Rheumatology Center, AMC, Amsterdam, Netherlands
  8. 8Rheumatology, Zuyderland Medical Centre, Heerlen, Netherlands
  1. Correspondence to Dr Alexandre Sepriano; alexsepriano{at}gmail.com

Abstract

Objectives The objective of this study is to evaluate whether there are differences in the long-term prognosis across various phenotypes of early arthritis (EA).

Methods Three EA cohorts (Reade, Etude et Suivi des Polyarthrites Indifférenciées Récentes (ESPOIR) and Early Arthritis Clinic (EAC)) were analysed. Clinical data were collected up to 24 years. Hands and feet radiographs were scored according to the Sharp van der Heijde (SvdH) method. Latent class analysis was applied to determine the EA phenotypes at baseline. Each class received a label reflecting its most prominent features. Prognostic outcomes included Health Assessment Questionnaire (HAQ), Short Form 36 (SF36) and SvdH score. The association between class membership and outcomes over time was tested in multivariable models.

Results In total, 390 (Reade), 798 (ESPOIR) and 3991 (EAC) patients were analysed separately. Two classes with symmetrical polyarthritis emerged; one of these labelled as autoimmune inflammatory polyarthritis (AIPA), had high likelihood of acute phase reactants (APR) elevation and autoantibody positivity, while the other (mild-inflammatory polyarthritis; MIPA) had not. A third class had oligoarthritis of upper limbs (OAUL) and could be subdivided into autoimmune OAUL and mild-inflammatory OAUL. A fifth class had oligoarthritis of lower limbs. The SvdH scores were worse in patients with APR/autoantibodies (AIPA) than in those without (MIPA). No clinically meaningful differences across classes in HAQ or SF36 over time were found.

Conclusion Radiographic progression over time primarily occurs in EA patients with APR/autoantibodies. The absence of these markers, however, does not necessarily translate into better long-term function and quality of life. Clinicians should not only aim at preventing joint damage, but look beyond structural progression in order to further improve the lives of people with EA.

  • Arthritis, Rheumatoid
  • Outcome and Process Assessment, Health Care
  • Magnetic Resonance Imaging

Data availability statement

All data relevant to the study are included in the article or uploaded as online supplemental information.

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

WHAT IS ALREADY KNOWN ON THIS TOPIC

  • Prognostic markers for patients with early arthritis (EA) have been identified previously. However, whether there are different EA phenotypes with distinct prognostic significance is yet unclear.

WHAT THIS STUDY ADDS

  • Five EA phenotypes, distinguished on the basis of their pattern of joint involvement, autoantibody positivity and elevation of acute phase reactants (APR), could be identified across different cohorts.

  • Radiographic progression primarily occurs in EA patients with autoantibodies and APR elevation (autoimmune inflammatory polyarthritis). Phenotypes that deviate from this classical ‘RA construct’ and have less structural progression still show similar levels of disability and impairment of quality of life.

HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

  • Clinicians and researchers should aim at developing treatment strategies beyond those targeting the prevention of irreversible joint damage, in order to further improve the lives of people with EA.

Introduction

The field of rheumatoid arthritis (RA) has advanced significantly over the past decades. A better understanding of the mechanisms underlying the disease has translated into an exponential growth of therapeutic options. In this ever-evolving field, the rheumatology community has moved from testing new drugs in clinical trials of patients with ‘advanced’ RA to trials in patients with earlier forms of the disease.1 2 This paradigm shift was driven by the evidence that the earlier the start of a disease-modifying antirheumatic drug (DMARD) the better the outcomes were (eg, higher chances of remission and lower probability of irreversible joint damage) and the term ‘window of opportunity’ was coined.3–5

The concept of ‘window of opportunity’ motivated rheumatologists to pursue ways of earlier identification of RA. Examples of such efforts include the development of referral criteria,3 the implementation of early arthritis (EA) clinics6 and the validation of imaging methods for an earlier detection of synovitis (eg, by ultrasonography or MRI).7–11 A key contribution was the development of the 2010 American College of Rheumatology (ACR)/European Alliance of Associations for Rheumatology RA classification criteria,12 which are more sensitive to capturing the rheumatologist’s concept (Gestalt) of early RA than the more specific 1987 ACR classification criteria.13–15

One challenge of the early recognition of RA is that the earlier the disease is detected, the more heterogeneous and less ‘typical’ the disease manifestations will be. In fact, at disease presentation, it can be difficult to discern the patients with EA who will evolve into RA, or into another well-defined inflammatory rheumatic disease, from those who will spontaneously go into remission, remain undifferentiated forever or will continue as a ‘low-inflammatory disease’ (eg, inflammatory osteoarthritis; OA). Prognostic information can help clinicians in managing EA in clinical practice. Individual markers of persistent/erosive disease, such as a high number of swollen joints at presentation, positivity for rheumatoid factor (RF) and anticitrullinated protein antibodies (ACPA) have already been identified previously.3 However, it is yet unclear whether combinations thereof with other EA features translate into clinically meaningful patterns (or phenotypes) with prognostic significance.

Experienced clinicians will intuitively recognise different forms of disease presentation, but ‘intuitive phenotyping’ will often be influenced by preconceived ideas about which characteristics are more important to the Gestalt of RA. For instance, seropositive (ie, RF/ACPA positive) and seronegative (RF and ACPA negative) RA are frequently studied RA phenotypes.16 However, this split is based on the (prognostic) value attributed by rheumatologists to these autoantibodies. In addition, it is yet uncertain whether different phenotypes based on combinations of these, and other markers of joint damage will also translate into differences in clinical outcomes.

In this study, we propose to evaluate whether there are different phenotypes of EA, using an analytical approach that is less influenced by the rheumatologist’s opinion. In particular, with this study we aim to: (1) evaluate the phenotypes of EA identified by a data-driven approach; (2) determine if EA patients change from one phenotype to another over time; (3) assess how many patients within each EA-phenotype fulfil RA classification criteria and (4) evaluate the clinical and imaging outcomes of the different phenotypes over time.

Methods

Patients and study design

Patients from the Early Arthritis Cohort at Reade (Reade), from the Etude et Suivi des Polyarthrites Indifférenciées Récentes (ESPOIR) cohort and from the Leiden Early Arthritis Clinic (EAC) were included. The three cohorts have been previously described in detail.17–20 Briefly, Reade is a prospective cohort in which patients with ≥2 swollen joints (<2 years), or ≥1 swollen joint if positive for RF and/or ACPA, and who were naïve to DMARDs were included (patients with a diagnosis other than RA excluded). ESPOIR is a prospective cohort of patients with ≥2 swollen joints (≥6 weeks and ≤6 months), who are naïve to DMARDs and for whom the treating rheumatologists considered to have definite RA, probable RA or with the potential to become RA. Finally, the EAC is a prospective cohort of patients with arthritis (<2 years) confirmed by a rheumatologist (patients with a diagnosis of post-traumatic arthritis, crystal arthropathies or inflammatory OA excluded) recruited between 1993 and 2017. Databases were locked, for the current analysis, in June 2016 (Reade), October 2017 (ESPOIR) and February 2018 (EAC).

EA features

Clinical features

In Reade, clinical data were collected at baseline, every 3 months for 1 year, at 18 and 24 months, and yearly thereafter up to 13 years. In ESPOIR, clinical data were collected at baseline, every 6 months for 2 years, and yearly thereafter up to 12 years. In EAC, clinical data were collected at baseline, 2 weeks, 3 or 4 months, 1 year and then yearly up to 24 years.

The following clinical features were collected at each visit: (1) arthritis of small joints (≥1 swollen small joint vs swollen large joints only), (2) (sub)acute onset of arthritis (subacute or acute vs insidious onset), (3) symmetric arthritis (≥1 joint with symmetric arthritis vs asymmetric arthritis only), (4) duration of morning stiffness (≥60 min vs <60 min), (5) arthritis in upper limbs (≥1 swollen joint in the upper limbs vs swollen joints in the lower limbs only), (6) polyarthritis (≥5 swollen joints vs <5 swollen joints), (7) elevated acute phase reactants (APR) (positive C reactive protein (CRP) or erythrocyte sedimentation rate (ESR) vs CRP and ESR negative), (8) presence of autoantibodies (positive RF or ACPA vs RF and ACPA negative), (9) family history of RA (present vs absent), (10) comorbidities (≥1 comorbidity vs none), (11) smoking status (current smoker vs past or never-smoker) and (12) overweight (body mass index (BMI)≥25 kg/m2 vs BMI<25 kg/m2).

Imaging features

Radiographs of the hands and feet were assessed in ESPOIR by three central readers at baseline, 2, 5, 7 and 10 years. In Reade, radiographs were assessed by one central reader at baseline and yearly up to 13 years. In EAC, baseline radiographs were scored by one of two readers; follow-up radiographs were scored by a single reader. In all cohorts, the readers scored the radiographs according to the Sharp van der Heijde (SvdH) method with known chronological order and blinded to clinical data. The SvdH score measures erosions and joint space narrowing in 44 different joints.21 22 In each joint, the erosion score ranges from 0 to 5 in the hands and 0 to 10 in the feet. An erosion score of 1 is given if a clearly visible erosion is identified. The total score ranges from 0 to 448. Radiographic damage was defined as an erosion score ≥1 in ≥3 joints according to the agreement of ≥2 out of 3 readers in ESPOIR or according to one reader in Reade and EAC.

MRIs of the hands were available at the baseline of the EAC cohort only. Images were scored according to the Rheumatoid Arthritis MRI Score by two central readers, who were blinded to the clinical information. Bone marrow oedema (BME) and tenosynovitis on hand MRI was considered present if both readers agreed on its presence in ≥1 bone or tendon, respectively. In addition, lesions were only counted as positive if present at the same location in <5% of age-matched symptom-free controls.23

Radiographs and MRIs from different patients are available in the EAC. Radiographs were available for patients included between 1993 and 2007 (‘radiograph phase’) who fulfilled the 1987 RA classification criteria (baseline and follow-up radiographs) or had a diagnosis of undifferentiated arthritis (UA; baseline radiographs only). MRIs were available for those included between 2010 and 2017 (MRI phase). Scored imaging data were not available for EAC patients included between 2007 and 2010 and also for EAC-patients included between 1993 and 2007 without RA according to the 1987 RA criteria or UA.

Definition of a positive feature

At baseline, the 12 clinical features and the 3 imaging features (radiographic damage, BME and tenosynovitis) were considered positive if present at the time of the study visit. During follow-up, change in time-varying features (all except antibodies in ESPOIR and EAC and BME and tenosynovitis in EAC) was defined as ‘once-a-feature-always-a-feature’: patients positive at baseline remained positive in all follow-up visits, even if becoming negative or missing in between; patients negative at baseline, remained negative at follow-up if no switch to positive or if missing in between. A feature changed to positive if appearing anytime during follow-up.

Other variables

The following variables were collected at baseline: age, gender, fulfilment of the 1987 RA classification criteria (ESPOIR and EAC) and of the 2010 RA classification criteria. RA classification criteria were assessed in all patients in Reade and ESPOIR and in patients with a diagnosis of RA in EAC. The 28-joint Disease Activity Score with ESR (DAS28-ESR) was collected in each visit of each cohort. Treatment with non-steroidal anti-inflammatory drugs (NSAIDs), oral glucocorticoids, conventional synthetic DMARDs (csDMARDs) and biological DMARDs (bDMARDs) was collected in each visit of ESPOIR.

Outcomes

The following outcomes were assessed in each visit: functional disability, quantified by the Health Assessment Questionnaire (HAQ; range: 0–3),24 quality of life (QoL) quantified by the Short Form 36 Physical (SF36 PCS) and Mental (SF36 MCS) Component Scores (range: 0–100),25 presenteeism (percentage of productivity loss experienced by the patient during the past week; range: 0%–100%),26 absenteeism (percentage of work hours lost to absence during the past 3 months; range: 0%–100%),26 and the total SvdH score. SF36 was not available in Reade. In addition, SF36 and work outcomes were only available in the EAC for patients included from 2010 onwards.

Data analysis

Latent class analysis (LCA) was used to estimate the latent (ie, unobserved) Gestalt of EA with baseline data of each cohort separately. LCA splits patients into mutually exclusive classes (here called: phenotypes) based on the covariance of observed EA features.26 All patients from Reade and ESPOIR were included in the analysis. To allow cross-cohort comparisons, the main LCA model in the EAC included only patients from the radiograph phase (recruited 1993–2007). Thirteen EA features (12 clinical plus radiographic damage) available in all cohorts (only subacute onset missing in Reade) were used in the main LCA. These features were selected based on content knowledge without predefined weights. Missing data on each feature was considered to be negative.

Starting with a one-class LCA model, the number of classes was increased, one-by-one, until the best model was found. The best model was defined by statistical criteria previously described in detail27; and by clinically recognisable patterns within each class. The classes of the final model were interpreted according to the probability of each feature and labelled by us as a clinically recognisable entity. Features were defined as: across-class dominant (highest probability across classes); within-class dominant (probability >50% within each class) and not dominant across or within classes. Individual patients were classified based on their posterior probability of class membership (as belonging to the class with the highest probability), which allowed us to further describe the classes and to evaluate the percentage of patients within each class fulfilling the 1987 (only in ESPOIR) and the 2010 RA classification criteria at baseline.

Follow-up data (at 1, 3, 5 and 10 years) of each cohort were used to perform a latent transition analysis (LTA) that serves to estimate the consistency and likelihood of change across classes over time.28 LTA includes the same patients and variables as in each corresponding LCA model. The number of classes best fitting the baseline and follow-up LCA formed the basis of each LTA model. Classes at baseline and follow-up can be assumed as: having the same meaning (full invariance) or a different meaning (full non-invariance). The final LTA model had the number of classes at baseline and follow-up and class-(in)variance that best fitted the data, provided it was clinically meaningful.

To assess the prognosis of the latent classes, we tested the association between class membership at baseline and each outcome (HAQ, SF36 PC, SF36 MC, presenteeism, absenteeism and SvdH score) over time using generalised estimating equations models, adjusted for potential confounders that were chosen on clinical grounds: Age, gender, DAS28-ESR (all cohorts) and treatment with NSAIDs, glucocorticoids, csDMARDs and bDMARDs (only in ESPOIR).

Sensitivity analyses

Predefined sensitivity analyses were conducted to assess the robustness of the latent classes identified in the main analysis: (1) the same LCA model but assuming missing data to be at random (MAR) instead of assuming it to be negative by using full information maximum likelihood approach to estimate the model parameters including data from cases with missing values (all cohorts)29 30; (2) LCA with the two additional MRI features (BME and tenosynovitis) and with erosions defined on MRI (only in the EAC). Finally, to evaluate the possible impact of selecting patients in the EAC based on imaging availability, one additional analysis was performed and (3) LCA with no imaging features, which allowed to include all patients of the EAC. The analysis of prognosis was also performed in this population to allow the evaluation of work outcomes not available to patients included in the main analysis. LCA and LTA were performed in MPlus V.7. Stata V.15.1 was used for all other analyses.

Results

Baseline characteristics

In total, 390 (Reade), 798 (ESPOIR) and 3991 (EAC) patients were analysed. Of the total 3991 patients in EAC, 1878 were from the radiograph phase (1009 with scored baseline radiographs available), 1422 from the MRI phase (1039 with scored baseline MRIs available) and 691 from the no-imaging phase. Patients in Reade (73%) and ESPOIR (77%) were more often female than patients in EAC (59%). Symmetric polyarthritis and involvement of the small joints of the upper limbs was more frequent in ESPOIR than in the EAC and most frequent in Reade (table 1). A similar gradient was observed for the presence of autoantibodies (figure 1). Radiographic damage was less frequent in Reade (23/387; 6%) and in ESPOIR (12/445; 3%) than in the EAC (410/1009; 40%). BME (449/1038; 43%) and tenosynovitis (604/1030; 59%) were frequent in the EAC.

Figure 1

Most distinguishing characteristics of each cohort showing a spectrum between high heterogeneity in EAC and funnelling towards the classical RA phenotype in Reade. Values between brackets are percentages of each feature. EAC, early arthritis clinic; ESPOIR, Etude et Suivi des Polyarthrites Indifférenciées Récentes; RA, rheumatoid arthritis.

Table 1

Observed baseline clinical features in each cohort

Latent phenotypes of EA at baseline

In Reade, three latent classes could be distinguished (table 2 and box 1). Two classes had a high likelihood of symmetric polyarthritis of the small joints of the upper limbs. One of these, which we labelled as autoimmune inflammatory polyarthritis (AIPA), had a high likelihood of ESR/CRP elevation and autoantibodies positivity, while the other (labelled as mild inflammatory polyarthritis; MIPA) had not. The third class resembled AIPA, but with fewer joints involved (oligoarthritis of upper limbs, OAUL). The same classes were identified in ESPOIR and an additional fourth class of patients with oligoarthritis of the lower limbs (OALL) emerged. The LCA in EAC including patients from the radiograph phase (N=1878) was similar to that in ESPOIR, but here the OAUL-class allowed a further subdivision into autoimmune OAUL (AIOAUL) and mild inflammatory OAUL (MIOAUL). Details on model selection criteria in each cohort are provided in online supplemental tables S1–S3.

Box 1

Phenotypes of early arthritis

  • Autoimmune inflammatory polyarthritis (AIPA).

  • Mild inflammatory polyarthritis (MIPA).

  • Autoimmune inflammatory oligoarthritis of upper limbs (AIOAUL).

  • Mild inflammatory oligoarthritis of upper limbs (MIOAUL).

  • Oligoarthritis of lower limbs (OALL).

Table 2

LAC models in Reade, ESPOIR and EAC

Sensitivity analysis 1 (assuming MAR), yielded results consistent with the main analysis of each cohort (data not shown). Sensitivity analysis 2 in the EAC (adding MRI variables), yielded the same latent classes as in the main analysis (online supplemental table S4), but the distinction between AIPA and MIPA (and between AIOAUL and MIOAUL) was primarily driven by a higher likelihood of BME/tenosynovitis in the former than in the latter. The LCA model in EAC without imaging features (sensitivity analysis 3) also yielded the same classes as the main analysis, with the exception of the two oligoarticular classes of the upper limbs being split based on the symmetry of joint involvement (online supplemental table S5).

Observed characteristics and fulfilment of RA classification criteria across phenotypes at baseline

Even though the percentage of patients with elevated ESR/CRP was consistently higher for the AIPA (range: 100%–77%) than the MIPA (0%–66%) class, patients from both classes had a similarly high number of swollen joints at baseline in all cohorts (range 9–15) (table 3). In ESPOIR, the percentage of patients who received a b/csDMARD in ≥1 visit was higher in the AIPA class (89%) than in the MIPA (80%), OAUL (76%) or OALL (50%) classes. OALL patients were younger than in other classes (table 3) and had mono (ESPOIR: 45%; EAC: 64%) or oligoarthritis (ESPOIR: 55%; EAC: 36%) mostly in the lower limbs (ESPOIR 75%; EAC: 85%).

Table 3

Observed baseline characteristics across classes in each cohort

Patients in the AIPA-class more often fulfilled the 2010 RA classification criteria than those in the MIPA-class, in each cohort (Reade: 92% vs 61%; ESPOIR: 94% vs 87%; EAC: 86% vs 36%) (table 4). Fulfilment of the 2010 RA classification criteria was not uncommon for patients with OAUL (Reade: 48%; ESPOIR: 68%); but more likely (in EAC) if autoantibodies and elevation of ESR/CRP were present (AIOAUL: 66% vs MIOAUL: 17%). Patients with OALL were unlikely to fulfil the 2010 classification criteria for RA (ESPOIR: 16%; EAC: 5%). The results were similar for the 1987 ACR RA classification criteria, except that the OAUL class less often fulfilled the 1987 than the 2010 criteria (43% vs 68%).

Table 4

Fulfilment of RA classification criteria across latent classes of EA in each cohort

Transition probabilities over time

In total, 163 (Reade), 504 (ESPOIR) and 322 (EAC) patients attended the 10-year visit. The limited number of patients in Reade did not allow for LTA to be performed in this cohort. In ESPOIR, LTA models were possible to fit in all prespecified visits (1, 3, 5 and 10 years) and in the EAC, LTA was possible only at the 1-year visit (N=1261), due to the limited number of patients thereafter. Details on model selection criteria are provided in online supplemental table S6. The transition probabilities for the 1-year, 3-year and 5-year models in ESPOIR are shown in online supplemental table S7. In ESPOIR, patients from the OAUL-class were most likely to not transition (69%) at all, but some transitioned to AIPA (21%) or MIPA (10%) after 10 years (table 5). Similarly, in the EAC, there were transitions from each oligoarticular class to into their respective polyarticular classes already after 1 year (AIOAUL to AIPA: 14% and MIOAUL to MIPA: 3%). Patients from the AIPA and MIPA classes did not transition to other classes in both cohorts. The OALL class was not identified anymore at follow-up in ESPOIR and mostly did not transition to other classes after 1 year in EAC (92%).

Table 5

Latent transition analysis in ESPOIR (10 years) and EAC (1 year)

Prognosis

In all cohorts, the total SvdH score was consistently lower in the MIPA- than in the AIPA class over time (table 6). Patients in the MIPA-class had on average 7, 4 and 19 SvdH-units less over time than patients in the AIPA-class in Reade, ESPOIR and EAC, respectively (table 6). In ESPOIR, there was an improvement over time for all classes on the mean SF36 PCS (range at baseline 35–40; range at 12 years: 43–51), SF-36 MCS (baseline: 34–41; 12 years: 40–48) and HAQ (baseline: 0.7–1.2; 12 years: 0.3–0.6). Similar improvements in the HAQ score were observed in the other cohorts (data not shown). Contrary to the SvdH score, there were no meaningful differences across classes with regard to the mean HAQ scores and SF36 scores over time, also after adjustment for DMARD-therapy in ESPOIR (table 6). The analysis in EAC including all patients (sensitivity analysis 3) has shown no difference across classes also in terms of presenteeism and absenteeism over time (online supplemental table S8).

Table 6

Prognosis of EA phenotypes: impact of the different classes on outcomes over time

Discussion

In this study, we have analysed three independent EA cohorts and found that the Gestalt of EA comprises five recognisable clinical entities, distinguished by their pattern of joint involvement, autoantibody positivity and elevation of APR. Unlike previous studies, which mostly grouped patients according to individual features (eg, RF/ACPA positive vs negative), these phenotypes were here identified using a technique (LCA) that takes several EA features into account without predefined weights, therefore, avoiding circular reasoning. Expectedly, patients with phenotypes, at presentation, that are most consistent with the classical ‘RA-construct’ (ie, AIPA) fulfil the RA classification criteria most often and develop more radiographic damage over time than other EA phenotypes. However, patients with phenotypes that do not fit this construct (eg, MIPA), who have lower levels of systemic inflammation and joint damage over time, also often classify as RA. The absence of these markers, however, does not necessarily translate into better function and QoL on the long term. These data shed new light on the earliest clinical disease stages of RA and provide new prognostic information to further help clinicians to manage EA patients in clinical practice.

Synovitis, typically identified as a swollen joint at physical examination, is the defining feature of EA and is therefore the common inclusion criterion of the three cohorts in this study. Apart from this communal feature, patients with EA may present with a variety of clinical patterns. The more restrictive the inclusion criteria for a cohort are, the narrower is the range of possible patterns. In Reade, for instance, patients with features that in the opinion of the clinician were consistent with a diagnosis other than RA were excluded upfront. This restriction was not in place in the EAC cohort. Differences in study design can, therefore, explain the narrower phenotypical diversity in Reade at one end of the spectrum, as compared with the EAC at the other end, and with ESPOIR somewhat in between. The phenotype labelled by us as AIPA closely resembles what many clinicians will describe as a textbook example of RA. The finding that this phenotype was far more likely in Reade (71%) than in the EAC (26%), is therefore consistent with the background population of each cohort and adds credibility to our analysis.

Classification criteria for RA include features that are deemed important by experts for ‘capturing’ the Gestalt of RA. As mentioned, patients from the AIPA-class have several of these features (eg, symmetric polyarthritis, elevation of APR, autoantibodies) and therefore almost all (85%–94%) classify as RA, at presentation. Patients from the AIPA class had already more radiographic damage at inclusion and were more likely to experience radiographic progression over time than the other classes. This finding is in agreement with the literature, confirming that features that are common in these patients (eg, elevation of ESR/CRP) increase the likelihood of irreversible joint damage over time.31 32

One key finding of this study is that approximately one fourth of all EA patients present with an AIPA-like phenotype in terms of high number of swollen joints at study entry, but with a lower likelihood of objective signs of inflammation (eg, elevation of APR) and autoimmunity (RF/ACPA). These patients, which we labelled as MIPA may have an inflammatory rheumatic disease other than RA (such as: PsA or systemic lupus erythematosus). In fact, diseases other than RA could in principle be included in each cohort, especially in EAC which had the least restrictive inclusion criteria, and the features we have preselected do not necessarily capture the Gestalt of these other diseases as well as they do for RA. One alternative explanation is that MIPA patients have a disease with joint effusion within a wider-than-expected spectrum of what clinicians call RA. Expectedly, patients with the MIPA-phenotype often fulfil the 2010 classification criteria for RA (36%–87%), which heavily weigh the number of swollen joints,12 but to a lower extent than those with an AIPA phenotype, given the absence of other important classifying features (elevated APR and autoantibodies). A better understanding of the MIPA phenotype is warranted in future studies, perhaps created by researchers using molecular phenotyping of synovial tissue which may better differentiate clinically similar EA phenotypes and therefore provide better prognostic information.33 34

Of note, despite the clear difference in structural prognosis, patients from the AIPA classes and MIPA classes had similarly high levels of disability (HAQ: 0.3–0.6) and impaired QoL (SF36 PCS/MCS: 40–50) at the end of the follow-up, which were both worse than those reported for the general population (HAQ: 0.25; SF36: 64–93).35 36 These data are important to clinicians who manage patients with EA in clinical practice; the data suggest that EA patients presenting without known prognostically unfavourable features (eg, elevated APR and autoantibodies) do not necessarily have better clinical outcomes on the long-term as compared with those who have these markers at presentation. Most patients from the AIPA classes and MIPA classes were treated with DMARDs. It is possible that AIPA patients treated with anti-inflammatory treatment show mean values of HAQ and SF36 similar to those that the MIPA-patients would have achieved even without treatment. However, this study cannot provide resolution as it was not designed for testing treatment effects.

We have also identified two classes of EA-patients who resemble either AIPA or MIPA but with fewer joints involved at presentation (oligoarthritis of the upper limbs). Only a fraction (about 30% in 10 years) of these patients evolve into the more typical symmetric polyarthritis phenotype over time. It should be noted, however, that the phenotype with fewer affected joints did not translate, into a phenotype with more favourable outcomes over time. The fact that these oligoarticular cases are better captured by the 2010 RA classification criteria than the 1987 criteria adds to the body of evidence that shows the former are more sensitive than the latter in early disease.14

The least common of the five phenotypes was characterised by a younger group of patients who had mostly monarthritis or OALL, frequent elevation of APR and a low probability of positivity for RF or ACPA. These patients, who rarely fulfilled RA classification criteria, have a phenotype that is possibly consistent with the construct of (peripheral) spondyloarthritis.37 The fact that these patients are absent in Reade and are most common in the EAC is again concordant with the background population of the study cohorts.

Our study has some strengths and many limitations. While we aimed at ruling out the influence of expert opinion, this aim can only be partially achieved, as our analysis relies on the patients included in each cohort, based on their inclusion criteria, and on the opinion-based features used to derive the EA phenotypes. However, our unsupervised analytical approach is not dependent on the opinion of clinicians regarding a diagnosis. Moreover, the fact that similar phenotypes emerged from three cohorts with different source populations suggests a limited influence of clinician bias. Missing data could also have influenced our results, in particular for radiographic damage, which could only be assessed in a fraction of patients in the EAC. Also, it should be acknowledged that the maximum follow-up time was not the same across cohorts, being shortest in ESPOIR and longest in the EAC (up 12 and 24 years, respectively). Even though selection bias cannot be ruled out, extensive sensitivity analysis with different methods for handling missing data (completers vs assuming MAR) and including all patients from the EAC (without considering imaging features) revealed similar results, and therefore, support the robustness of the main analysis.

In summary, EA patients with elevated APR and autoantibodies (the phenotype most consistent with the ‘classic RA-construct’) develop more radiographic damage, but do not have worse function or QoL over time, than those without these markers. These data provide relevant prognostic information to clinicians managing patients with different EA phenotypes in clinical practice. In addition, this study suggests that future research should go beyond the prevention of irreversible joint damage towards optimising treatment strategies (pharmacological or non-pharmacological) with the ultimate goal of improving the lives of people living with arthritis.

Data availability statement

All data relevant to the study are included in the article or uploaded as online supplemental information.

Ethics statements

Patient consent for publication

Ethics approval

This study involves human participants and the study was conducted according to Good Clinical Practice guidelines and approved by the appropriate local ethical committees. Written informed consent was obtained from participating patients before inclusion in each cohort.

Acknowledgments

The authors sincerely thank all patients, the staff, and all recruiting rheumatologists of the Reade, ESPOIR and EAC cohorts.

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • Contributors All authors made contributions to conception and/or implementation of the study. AS and BTvD performed the analysis. All authors contributed to the interpretation of the results. AS wrote the first draft of the manuscript and all authors were involved in reviewing the manuscript and gave final approval to the version to be published.

  • Funding This study was supported by a grant of the Foundation for Research in Rheumatology (FOREUM), under the topic call on Innovative Medicine 2019 (project ID: 036). ESPOIR Funding: An unrestricted grant from Merck Sharp and Dohme (MSD) was allocated for the first 5 years of the cohort study. Two additional grants from INSERM supported part of the biological database. The French Society of Rheumatology, Abbvie, Pfizer, Lilly and more recently Fresenius and Galapagos supported the ESPOIR cohort study.

  • Competing interests AS: speaking and/or consulting fees from Abbvie, Novartis, UCB and Lilly. Bastiaan van Dijk: None. Sofia Ramiro: research grants and/or consultancy fees: AbbVie, Eli Lilly, Galapagos, MSD, Novartis, Pfizer, UCB, Sanofi. AvdH: None. BC: consulting fees from AbbVie, Celltrion, Chugaï, Eli-Lilly, Galapagos, Gilead, Janssen, Merck, Novartis, Pfizer. DvS: speaking fees from Janssen, Galapagos. MdW: Over the last three years Stichting Tools has received fees for lectures or consultancy provided by MdW from Celgene, Eli Lilly, Pfizer and UCB. Alison Kent: None. EF-M: Consulting fees from Boehringer Ingelheim Portugal. Over the last three years, the Portuguese League Against Rheumatic Diseases (LPCDR) received non-financial and financial support from AbbVie, A. Menarini Portugal—Farmacêutica, S.A., Amgen Biofarmacêutica, Grünenthal S.A., GSK, Lilly Portugal, Medac, MSD, Novartis, Pfizer, Roche. RL: Received consulting fees from AbbVie, BMS, Celgene, Eli-Lilly, Galapagos, Gilead, Glaxo-Smith-Kline, Janssen, Merck, Novartis, Pfizer, Roche, UCB and is Director of Rheumatology Consultancy bv

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.