Objective Health utilities represent preference values that persons attach to health states. This study aims to develop one general and six country-specific algorithms to calculate societal preference values for health of patients with spondyloarthritis (SpA), as assessed by the disease-specific Assessment of SpondyloArthritis international Society Health Index (ASAS HI).
Methods A survey was performed in random population samples from six European countries. In a best-worst choice experiment, subjects were asked to indicate repeatedly which of 4 random aspects of the 17-item ASAS HI was were most and least important. Bayesian analysis provided the relative importance of each of the 17 items. To rescale the relative importance scores on the absolute utility scale between 0 and 1, participants additionally completed two lead time trade-off experiments, one for ‘severe SpA’ and one for ‘best health’ without SpA. Six country-specific algorithms and one general algorithm were derived. The general algorithm was tested in 199 patients with axial SpA (axSpA).
Results 3039 subjects, mean age 47 years (SD 15) and 52% female completed the experiments. The population’s health utility value for SpA varied between − 0.24 for ‘worst’ SpA (country range −0.35 to 0.03), and 0.88 for ‘best’ health (country range 0.81 to 0.90). Among 199 patients with axSpA, the mean utility was 0.36 (SD 0.30, range −0.24 to 0.88) and discriminated well between patients having high (Bath Ankylosing Spondylitis Disease Activity Index (BASDAI) ≥ 4) or low (BASDAI < 4) disease activity (0.18 (SD 0.24) vs 0.51(SD 0.27), p<0.01).
Conclusion One general and six country-specific algorithms are available to convert scores from the ASAS HI into disease-specific societal utility values.
- societal utility value
- asas health index
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
Whatis already known about this subject?
The ASAS-HI offers the opportunity to develop disease-specific health utility-valuations for patients with spondyloarthritis.
What does this study add?
Several algorithms are proposed to convert scores on the ASAS-HI into societal health utilities. These algorithms perform equal in non-radiographic and radiographic SpA and discriminated between levels of disease activity.
How might this impact on clinical practice?
These algorithms can be used to derive societal health utilities in patients with spondyloarthritis.
The aim of diagnosing and treating patients with spondyloarthritis (SpA) is to increase the patients’ health for the remaining part of their life. Although no cure exists for patients with SpA, earlier diagnosis and innovations in pharmacological treatment substantially improved the outcome of SpA.1 To assess the impact of the disease and its treatment on overall functioning and health of patients with SpA, disease-specific instruments to assess overall health such as the Ankylosing Spondylitis Quality of Life Questionnaire and the Assessment of SpondyloArthritis international Society Health Index (ASAS HI) have been developed.2 3 However, these traditional outcome instruments do not reveal the ‘preferences’ or ‘values’ for different health states, that can be represented by the combinations of items in these questionnaires. Different (combinations of) items might not be equally important for functioning and health. Preference-based valuations of health are based on choice experiments that force respondents to indicate their preference for different health states against each other. For application in healthcare, health utilities are preference-based valuation methods that anchor the value of preference of health states on a 0 to 1 scale, in which 0 corresponds a state equivalent to death and 1 to full health. This common scaling between 0 and 1 allows comparison of health valuations between conditions. Health utilities are often integrated over time and quality-adjusted life years (QALYs) allow assessing the life impact of the disease. QALYs are useful when rational choices have to be made by decision makers when allocating resources to healthcare and to research,4 5 but are increasingly also considered in clinical care in the context of value-based healthcare.6
When allocating societal resources, it is considered appropriate to consider the societal point of view or perspective on the ‘value’ for health.7 In other words, persons that do not suffer from health problems are invited to value the health of persons with impairments. Arguments favouring a societal perspective indicate that those who do not have the disease did not adapt and can better judge the true value of health, are free of self-interest and have a democratic right to decide on distribution of (tax-based) societal resources.7 8 Of interest, previous research revealed that cultural differences exist in the value or preference for health and functioning, and recommend country-specific societal valuations.9
To assess societal health utilities in SpA, generic approaches are available among which the EuroQoL five dimensions (EQ-5D) and the Short Form six dimensions (SF-6D) are the best known.10 11 For these patient-reported measures of overall health, algorithms are available that can convert the patients’ reported scores into a societal health utility value.12 As these utility values are obtained by applying a societally derived algorithm to a patient-reported outcome (and not through direct descriptions of profiles of health impairments), these approaches to asses societal utility are called ‘indirect’ methods. However, using patient-reported measures as a basis for indirect health utility valuation may not accurately reflect the effect of interventions on specific impairments and limitations typically experienced by these patients. It is increasingly recognised that disease-specific measures of overall health better reflect health problems of patients with specific conditions.13 This holds especially true when choices between interventions within the same disease have to be made. In SpA, the ASAS HI has recently been developed to assess the broad influence of SpA on functioning and health. The ASAS HI therefore offers an appropriate starting point to develop a disease-specific utility index, as it was developed with the specific aim to reflect aspects of health important and typical for patients with axial SpA (axSpA).14 Moreover, the ASAS HI has been developed in parallel in several countries, and validated in additional countries.15
The objective of this study was to develop an algorithm to calculate the societal health utilities specific for SpA based on scores on the ASAS HI. As there can be cultural differences in preference for health, it was decided to develop a general as well as several country-specific algorithms. A secondary aim was to assess the new societal utility based on ASAS HI among patients with SpA.
Materials and methods
A cross-sectional multinational online survey was conducted among general population subjects 18 years or older from the UK, France, Germany, the Netherlands, Spain and Italy. Recruitment of subjects and distribution of the questionnaire was performed by IPSOS, an international institution specialised in data collection for market and population research.16 IPSOS approached subjects that were voluntarily subscribed to their panel. Participation for each subsequent survey is again on a completely voluntary basis. Surveys were conducted online through a password-protected website. The aim was to include 500 subjects per country to ensure a sufficient sample size. Within each country, inclusion was stratified by IPSOS for age categories, gender and region to guarantee a representative sample of each country’s general population. The online survey was distributed between March and May 2015. Subjects were informed that they participated in a survey that aimed to assess the value of disease burden of patients with a chronic condition, and were not aware of the specific inflammatory rheumatic disease that was considered. IPSOS complies to the International Code on Market, Opinion and Social Research and Data Analytics (ICC/ESOMAR code)17 on protection of personal data, but no ethics’ approval was required for this general population survey.
Sociodemographic background and health
Subjects provided sociodemographic characteristics including age, gender, region of residence, highest achieved educational level (university/non-university higher professional education, secondary school/low and middle professional education, or primary school) and work status (ie, having paid work or not). The presence of comorbidities was assessed with the Self-Administered Comorbidity Questionnaire modified for patients with SpA.18 The EuroQoL Visual Analogue Scale (EQ-VAS) was used to measure overall health,19 20 as we wanted to be informed about the self-perceived health (and not the societal utility valuation of reported health, as represented by the 5 domains of the EuroQoL (EQ-5D)). Finally, subjects completed the ASAS HI3 14 to assess presence of impairments typical and relevant for patients with SpA (0–17, 17=most/worst influence on functioning and health).
The survey included two experiments. First, a best-worst scaling (BWS) experiment estimated the relative importance of each item of the ASAS HI on a latent scale of functioning and health (preference-based valuation). The BWS experiment consists of 17 choice tasks, each with 4 randomly selected aspects of the 17 items contained in the ASAS HI (see example in the online supplementary figure 1). One additional choice task was added to check for reliability. In each choice task, subjects had to indicate which of the four presented items they considered the most important and least important for overall functioning and health. Two different questionnaire sets were generated by the Sawtooth Software using an efficient approach taking into account balanced frequency (each item appeared an equal number of times), orthogonality (each item is paired an equal number of times with each other’s attributes) and positional balance (each item appeared in each position across the choice sets, for example, first, second, third and fourth positions). Within each country, the two versions were randomly assigned to the subjects, but further equal distribution among age categories, gender and regional representation was not specifically accounted for.
Second, two lead time trade-off (LTTO) experiments were performed to anchor two health states of the ASAS HI on the 0 to 1 health utility scale. The first LTTO experiment (LTTO-1) aimed to reflect ‘severe SpA’ and the second (LTTO-2) a ‘common health’ of a person without SpA. In each LTTO experiment, subjects had to choose repeatedly whether they preferred life A, life B or were indifferent. Life A was characterised by 10 years in full health (lead time) followed by 10 years either in a severe (‘bad’) health state (LTTO-1), or 10 years in their own health (LTTO-2). Life B was also characterised by 10 years in full health (lead time), but followed by immediate death in the first choice task, but by an increasing number of life years in full health (max 10 years) in the following choice tasks. An example of the LTTO and the LTTO protocol is presented in online supplementary figure 2. The point of indifference or infliction between life A and life B is used to derive the person’s preference valuation or utility for health state in life A. It reflects the number of healthy years (out of 10 years life expectancy) the respondent is willing to give up, to avoid living 10 years in the health state described in life A.21 The ‘severe health state’ in LTTO-1 was defined as impairments in the eight aspects of health of the ASAS HI that mattered most to patients, as revealed in a preceding study: pain, sleeping, being exhausted, standing, motivation to do anything that requires physical effort, getting frustrated, problems with running and restrictions in travelling.22 Data of subjects that indicated no problems at all on the eight aspects of ASAS HI were used to assess the utility value of the participants’ ‘best health state’ in LTTO-2.
After finishing the experiments, subjects were also asked to rate each experiment’s difficulty (Numeric Rating Scale (NRS) 0–10, 10=extremely difficult)
The BWS experiment was analysed using the Sawtooth Software. Based on the raw coefficients of the preference function, the relative importance scores of each aspect of health were estimated using the hierarchical Bayes method. They represent the probability that a subject would choose the specific aspect as most important. Overall, the relative importance scores sum to 100. If all aspects of health would be equally important, each aspect’s relative importance score would be 5.88 (100/17). Fit statistics was used to identify unreliable responders, which should be higher than the chance that a patient gives random answers (>0.25).23 Reliability was also assessed by the number of patients that indicated the same ‘most/least’ important aspect as in the identical choice task.
To obtain health utility values on the QALY Scale for ‘severe SpA’ and ‘best health’, results of LTTO-1 and LTTO-2 were used, respectively. A person’s health utility was retrieved by identifying the point in which life B (giving up life years) was preferred, or in which the subject became indifferent for life A or life B. The health utility was than calculated as the number of years willing to give up (life B) divided by 10 (online supplementary figure 3).
Finally, to develop the algorithm to convert the ASAS HI into a utility value, the relative importance scores of the 17 items of the ASAS HI as derived by BWS were anchored on the health utility scale based on the LTTO absolute values for ‘severe SpA’ and ‘best health’ on the absolute health utility scale.
All steps described above were performed in the total sample and repeated in the country-specific subsamples. Country-specific values (of relative importance, anchors or utility) were compared using the one-way analysis of variance (ANOVA) test or Kruskall-Wallis test, using SPSS V.20.0. If adjustment for multiple testing was necessary, a Bonferroni correction was used.
To apply the algorithm in patients with axSpA, data from an international worldwide survey that comprised the ASAS HI were used.13 First, the distribution of utility values derived for the ASAS HI was explored (mean, SD, median, minimum, maximum, quartiles). Discriminative ability between patients with high or low disease activity (Bath Ankylosing Spondylitis Disease Activity Index (BASDAI)≤4 or BASDAI>4) or SpA phenotype (axSpA vs non-radiographic axSpA (nr-axSpA)) was explored using one-way ANOVA, with a significance level of 0.05.
All statistical analyses were performed using SPSS V.20.0.
In total, 3039 subjects were included. The mean age was 46.5 years (SD 15.2), and 1556 were women (51.2%) (table 1). Depression (18.6%), hypertension (24.0%), and pain in joints other than the back (25.1%) were the three most frequently reported health problems. EQ-VAS was 76.0 (SD 20.0) and the mean ASAS HI was 4.4 (SD 4.2). These results were comparable across countries.
Relative importance of the different aspects derived from ASAS HI
Of the 17 aspects of health typical and important to patients with SpA, the general population considered that pain and sleep were the most important impairments, followed by being exhausted, overcoming difficulties, concentrating, standing and experiencing financial changes (table 2). There was no aspect that was not considered important. The relative importance differed significantly for several aspects of health across countries (p<0.003) (table 2 and figure 1).
Subjects rated the difficulty of the BWS experiment as 3.7 (SD 2.8). The fit statistic was good (0.46). In the reliability test, 2102 (69.2%) and 1858 (61.1%) of the subjects indicated the same most important and least important items, respectively.
Anchor values for ‘best health’ and ‘severe SpA’
Of the 3039 participants, 715 (23%) indicated to prefer ‘immediate death’ when valuing their own health state. However, their ASAS HI, EQ-VAS and the EQ-5D utility were actually moderate to good and fully comparable to the EQ-VAS and EQ-5D of persons with similar good score on the ASAS HI but who did actually trade-off between immediate death or living longer. As this seemed implausible, these persons must have misunderstood the experiment and were excluded. Of note, excluded persons had attained less frequently a university or non-university higher professional degree (35.2% vs 42.6%), and had less frequently paid work (48.5% vs 53.0%).
After excluding the 715 (23%) persons with implausible data, the value for ‘best health’ in 462 subjects without a problem in any of the most relevant aspects of health of the ASAS HI was 0.88 (SD 0.18). The value for ‘severe SpA’ in 2.324 persons of was 0.29 (SD 0.34). Differences among the countries tested were statistically significant (table 3). Difficulty of the LTTO experiment was rated as 3.4 (SD 2.9).
Scaling the relative importance of the attributes using anchors
By scaling the relative importance of the 17 items of ASAS HI between the absolute ‘anchors’ of the health utility scale, the algorithms of the general and six country-specific utility valuations can be easily computed (table 4). The general population assigns a utility value of −0.24 (country range −0.35 to 0.03) to a hypothetical patient with problems on all aspects of ASAS HI. As an example, a patient with ‘mild’ SpA who would have problems with pain, sleeping and standing, but no limitations in any other item of ASAS HI, would be assigned a utility value of 0.59 (country range 0.54 to 0.64).
Application of the algorithm
The mean age of 199 patients with axSpA was 42.3 years (SD 13.6), mean disease duration 11.1 years (SD 11.2) and 117 (58.8%) of the patients were male. AS was present in 130 (65.3%) patients and 67 (33.7%) had nr-axSpA. The mean BASDAI was 3.8 (SD 2.3) and the mean ASAS HI 7.2 (SD 4.5). Figure 2 indicates a normal distribution of the utility values with a mean of 0.36 (SD 0.30, range −0.24 to 0.88). The average utility value was significantly higher in patients with BASDAI <4 (0.67 (SD 0.15) compared with patients with BASDAI ≥4 (0.49, (SD 0.14), p<0.01). Average utility values were not significantly different between patients with AS (0.37 (SD 0.31)) and nr-axSpA (0.34 (SD 0.29), p=0.56).
This study provides one general and six country-specific algorithms to convert patients’ scores on ASAS HI into societal health utility values. The theoretical range of the obtained health utilities varied from −0.24, for a patient with worst SpA, to 0.88, representing best health. When applying the algorithm in patients with axSpA, mean utility was 0.36 and the full range of the theoretical scale was represented. Comparable utility values were obtained for patients with AS and nr-axSpA indicating the algorithm can be applied in both phenotypes. Importantly, a good discriminative ability between patients with low and high disease activity was observed.
The availability of ASAS HI was essential for the development of the disease-specific utility algorithm, as description of health states ‘typical and relevant for SpA’ is a core aspect of a preference-based valuation. Of note, ASAS HI is grounded in the universally endorsed International Classification of Functioning and Health, and accounted in each step of the development from the perspective of the patients, ensuring all aspects of health important for patients with SpA are represented.3 14
BWS and LTTO were combined to compute the conversion algorithm. BWS is an elegant approach to understand the relative importance of a large number of aspects of health in valuations of health state.24 On that line, all 17 items of the ASAS HI could be considered. All 17 items were also shown to be relevant when valuing health and remained included in the utility algorithm. Further advantages of BWS are the ease of completion for respondents, absence of scale bias and effective discrimination between items. Recently, BWS was also used to value the child health utility, 9D.25 However, BWS provides the relative importance of the items on a latent scale, while the utility requires values that are expressed on an absolute scale in which ‘zero’ presents death and ‘one’ full health. Although a subject of discussion, conventional standard gamble and TTO are still the preferred options to define absolute anchors of the utility scale, as they remain consistent with the utility theory.26–29 In LTTO, trading between quality and quantity of life is postponed to a later period in life (after a lead time of 10 years in the current study), as this seems more realistic for persons with chronic diseases that are less likely to take immediate life risks. Notwithstanding, in the current online survey, a considerable number of subjects seemed not to have understood the experiment, as they chose to die immediately after the lead time, despite the absence of severe health impairments. Although, a previous study revealed that interview-based TTO provided overall similar results as an online unassisted experiment,30 we cannot exclude that LTTO is more challenging in an unassisted online environment. Of interest, persons with implausible choices had lower level of education in our study. Notwithstanding, by excluding subjects that chose immediate death despite ‘good health’, we cannot exclude the fact that some persons made their choice consciously and therefore our anchors are overestimated. However, during cognitive debriefing interviews among 12 persons without chronic disease and no impairments on the most important items of ASAS HI (50% female, average age 47 years (range 21years to 74 years)), the interviewer indicated that about 40% of subjects needed some support (sometimes a simple confirmation they were thinking correctly) and none of them chose to die immediately when valuing own health, providing support to the fact that our exclusion criteria was valid. Notwithstanding, (L)-TTO remains a challenging experiment, in which training of investigators and subjects is necessary to perform the experiment accurately. Cognitive debriefing seems insufficient as a precaution and a pilot study before the actual experiment might have pinpointed the misunderstanding.
The main interest of this study can be found in the availability of an algorithm that enables estimation of societal health utilities and QALYs specific for patients with SpA. Generic utilities, such as EQ-5D and SF-6D, might not capture all the aspects of health important for patients with SpA.31 Generic instruments might therefore discriminate less between relevant subgroups and be less sensitive to changes over time.31 Examples of other disease-specific utility measures are the preference-based index for visual function and the utility valuation based on the European Organisation for Research and Treatment of Cancer Core Quality of Life Questionnaire.32 33 Although, there might be doubts about the comparability of disease-specific utility values across disease, it can also be argued that the key towards comparability is found in a common methodological approach to (A) Select aspects of health that are most typical and relevant. (B) Value their relative importance. (C) Scale these aspects between absolute anchors on the health utility scale. Notwithstanding, the main application of disease-specific utilities would serve decisions within one disease. Further research is now required to compare the ASAS HI utility with generic utility measures regarding construct validity, sensitivity to change and influence on cost-effectiveness.
Our study is one of the first to study cultural differences in preferences for health using the same design and sampling across countries. We now provide algorithms on the societal perspective of six western European countries, and hopefully other countries are encouraged to use our methodology for new country-specific algorithms. We confirm differences in preference for health are related to the relative importance of the aspects of health, as well as to the absolute values for severe and best health on the ‘zero to one’ utility scale. It seems that these differences did not have a major influence on the value of mild health states, but impact on sensitivity to change and cost-effectiveness needs to be still explored. For the time being, we feel the country-specific utility algorithm should be used when available; otherwise the general algorithm seems appropriate.
In conclusion, one generic and six country-specific algorithms are now available to convert scores on ASAS HI into a utility of the societal perspective. This makes it possible to use disease-specific utilities and QALYs in decision-making processes when comparing treatment strategies among patients with SpA.
The authors thank Sandrine Bours (French), Victoria Navarro (Spanish) and Michelle Gillio (Italian) for their help with the translation of the instructions for the online survey.
Contributors IE and AB contributed to the development of the study, statiscal analyses and writing proces. MH contributed to the development of the study and statistical analyses. All other coauthors contributed to the development of the study and reviewed the manuscript.
Funding An unconditional ASAS grant was received to conduct this study.
Competing interests None declared.
Patient consent for publication Not required.
Provenance and peer review Not commissioned; externally peer reviewed.
Data availability statement Data are available upon reasonable request.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.