Objectives Ankylosing spondylitis (AS) is the most prevalent form of spondyloarthritis, with a known genetic association with the HLA-B27 molecule. The aim of this study was to assess the contribution of the HLA-G, HLA-E and HLA-F to AS susceptibility/protection in Portuguese patients with HLA-B27 AS and HLA-B27 unaffected controls.
Methods High-resolution typing of HLA-G, HLA - E and HLA - F was performed in 228 patients with HLA-B27 AS and 244 HLA-B27 unaffected controls. Allelic, genotypic and haplotypic frequencies were compared between cohorts. To replicate the results, single nucleotide polymorphisms (SNPs) in HLA-E and HLA-F genes were typed in Australian cohorts. For further confirmation, a group of European-descent patients with AS and unaffected controls were genotyped for Major Histocompatibility Complex SNPs using the Illumina microarray.
Results In the Portuguese population, no significant differences were found in HLA-G. For HLA-E, a significant difference was detected for the genotype HLA-E*01:01:01/01:03:01 (p=0.009; pc=0.009; OR=0.51), with a protection effect. For HLA-F, significant differences were detected in the allele HLA-F*01:01:02 (p=0.0049; pc=0.0098; OR=0.60) and corresponding SNP rs2075682 (p=0.0004; pc=0.0008; OR=0.53), suggesting protection and in the genotype HLA-F*01:01:01/01:03:01 (p=0.011; pc=0.043; OR=2.00), suggesting a susceptibility effect. Three G-E-F haplotypes with significant differences were detected but occur in a very small number of individuals. The only significant differences detected in the replication studies were for HLA-E rs1059510 in the Australians and for HLA-F rs1736924 in the European-descent cohorts.
Conclusion Our results reveal suggestive AS protective and susceptibility effects from both HLA-E and HLA - F loci, however with population differences. To our knowledge, this is the first study showing association of HLA-F with AS.
- ankylosing spondylitis
- autoimmune diseases
- gene polymorphism
This is an Open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
What is already known about this subject?
Ankylosing spondylitis (AS) is strongly associated with HLA-B27, with only 20% of the overall genetic risk. Previous studies raised the possibility that AS susceptibility could be related to other MHC genes.
What does this study add?
To our knowledge, this is the first study showing potential association of HLA-F with AS. AS protective and susceptibility effects from both HLA-E and HLA-F loci were detected, although with population differences.
How might this impact on clinical practice?
The better knowledge of AS genetics will help to unveil the molecular mechanisms of the disease and contribute to the design of novel therapeutics for disease treatment.
Spondyloarthropathies (MIM: 106300; SpAs) are the second most common group of chronic inflammatory rheumatic disorders among the adult Caucasian population.1–3 Ankylosing spondylitis (AS) is the most prevalent form of SpA, affecting 0.3%–0.5% of Europeans, with a worldwide prevalence of 0.1%–1.4%.3 4
Despite the poor understanding of this pathology, it has long been known that susceptibility to AS is strongly associated with the Major Histocompatibility Complex (MHC) and in particular with the HLA-B27 molecule. HLA-B27 confers the greatest known AS risk as it is found in over 90% of patients with AS of European ancestry, but only ≈8% of healthy controls.5 However, studies indicate that HLA-B27 is only responsible for ≈20% of the overall genetic risk, suggesting a contribution of other genes to disease susceptibility.4 6 Previous studies raised the possibility that AS susceptibility could be related to other MHC genes.7–9
The MHC is a large cluster of 128 genes and 96 pseudogenes, located on the short arm of chromosome 6, many of which with important roles in the immune system. Traditionally, the MHC is divided into classes containing groups of genes with related functions; the MHC class I and II genes encode for human leucocyte antigens (HLA), proteins that are displayed on the cell surface. In humans, MHC class I molecules comprise the classical (class Ia) HLA-A, HLA-B and HLA-C, and the non-classical (class Ib) HLA-E, HLA-F, HLA-G and HLA-H (HFE) molecules.10 Both categories are similar in their mechanisms of peptide binding and presentation and in the induced T-cell responses.11 The most remarkable feature of MHC class Ib molecules is their highly conserved nature12 exhibiting very low levels of allelic polymorphism.13 Understanding of the different roles played by class Ib molecules is rapidly increasing, with known roles including pathogen recognition, virus-induced immunopathology, tumour immune surveillance and regulation of autoimmunity.14
HLA-G biological features include restricted tissue expression, the presence of membrane-bound and soluble isoforms, generated by alternative splicing, limited protein variability, unique molecular structure, with a reduced cytoplasmic tail and modulation of the immune response.15 HLA-E is the best-characterised MHC class Ib molecule. With a low level of polymorphism,10 HLA-E molecules bind peptides derived from the signal sequences of classical MHC I molecules and present these to NK cells bearing CD94/NKG2 receptors.16 However, it has become apparent that HLA-E molecules can bind a wider selection of peptides than canonical MHC I leader sequence-derived peptides and that these peptides can be presented to CD8 T cells.17–20
The HLA-F gene was first identified in 199021 22 being, so far, the least characterised non-classical class I molecule23 24 and neither its native structure nor function is well known.25 Several studies confirmed HLA-F protein expression in a number of diverse tissues and cell lines, including bladder, skin and liver cell lines, but no surface expression was detected in the majority of them, except for activated lymphocytes.24 Just like other class Ib molecules, HLA-F restrictive tissue expression suggests specialised functions and tight transcriptional control of the gene26 with unique potential regulator motifs.21
The aim of this study was to assess the possible contribution of non-classical HLA-G, HLA-E and HLA -F to AS susceptibility/protection in cohorts of patients with HLA-B27-positive AS and unaffected controls, independently of the HLA-B27 effect.
Three sets of patients with HLA-B27-positive AS and unaffected controls were used in this study. The Portuguese set was composed by individuals of Portuguese ancestry, recruited from the Azores and mainland Portugal. Patients with AS (n=228) were diagnosed using the modified New York Criteria.27 The control group included 244 HLA-B27-positive unaffected subjects older than 35 years. One hundred individuals of each group were randomly selected for typing HLA-G 3′ UTR 14 bp indel. In order to replicate the results obtained in the Portuguese population, a group of single nucleotide polymorphisms (SNPs), in HLA-E and HLA-F genes, were genotyped in 222 Australian patients with HLA-B27-positive AS and 618 HLA-B27-positive unaffected controls. The third set were previously reported European-descent patients with AS and unaffected controls genotyped for MHC SNPs using the Illumina microarray.7 The control group frequencies were in Hardy-Weinberg equilibrium (HWE) for all the studied SNPs.
Genomic DNA was extracted from peripheral blood cells according to standard procedures. The amplification of specific fragments of HLA-G, HLA-E and HLA -F was performed with optimised protocols. For the Portuguese cohorts, a published protocol was used for HLA-G typing.28 For the amplification of HLA-G 3′ UTR 14 bp indel fragments, primers were designed using Primer3 software.29 PCR primers for HLA-E and HLA -F were designed, covering all the already known polymorphic regions in either exonic or intronic regions, with Primer3 software.29 Primer design for HLA-E and HLA -F fragments was based on sequences from IMGT/HLA Database (http://www.ebi.ac.uk/ipd/imgt/hla/align.html). Primer sequences and PCR conditions are available on request.
Sequencing was performed using an ABI 3130xl Genetic Analyzer with Big Dye Terminator V.1.1 and V.3.1 (Applied Biosystems). HLA-E and HLA -G genotype assignment was performed using Assign SBT 3.5+ software (Conexio Genomics, Fremantle, Australia). HLA-F allele assignment was performed manually, using the library compiled from the current ImMunoGeneTics database.30 31 HLA-G 3′ UTR 14 bp indel genotyping was performed by fragment analysis method using an ABI Prism 310 Genetic Analyzer (Applied Biosystems) with 310 GeneScan V.3.1.2 software. Allele designations are according to the WHO Nomenclature Committee.30
Australian cohorts were typed using two different approaches; HLA-E and HLA -F SNPs rs2075682, rs17875379, rs2076183, rs1059510 and rs1264457 were typed with the previously described sequence-based typing protocol. The remaining SNPs were genotyped using Taqman SNP genotyping assays (Applied Biosystems, Foster City, California, USA) performed according to the manufacturer’s instructions. Genotyping reactions were carried out with an ABI 7500 Fast Real-Time PCR System. Allele calls were obtained by the AB software V.2.0.5, by the analysis of allelic discrimination plots.
Arlequin software V.22.214.171.1242 was used to test for Hardy-Weinberg equilibrium, perform haplotype inference and measure linkage disequilibrium (LD) (D′ and r2). LD measuring for SNPs was performed using SNPClip from LDlink (https://analysistools.nci.nih.gov/LDlink/). Genotype and allele frequencies were directly enumerated. Inferred haplotypes were assessed for missingness. Haplotypes with >10% missingness were excluded. SNP association analysis was performed using Cochrane-Armitage test of trend as implemented in PLINK V.1.07 (http://pngu.mgh.harvard.edu/purcell/plink/). The Bonferroni correction was obtained using the formula (pcorrected=1−(1 p)n), where the number (n), used for correction, was the number of comparisons one or more of which shows a significant result (http://www.dorak.info/hla/stat.html). The correction procedure for SNPs included the removal of SNPs in LD (r2≈1), where the number (n) to use for correction was the remaining number of SNPs with significant differences between groups. ORs with a 95% CI were calculated for the minor alleles of the genotyped SNPs. OR >1 indicates a susceptibility allele and OR <1 indicates a protective allele. Haplotype association analysis was performed using Fisher’s exact test.
Preventive fraction was calculated using the following formula: [A/(A+B)]−[C/(C+D)], where A is the number of individuals positive for the SNP in patient cohort, B is the number of individuals positive for the SNP in control cohort, C is the number of individuals negative for the SNP in patient cohort and D is the number of individuals negative for the SNP in control cohort.
The HLA-G analysis was based on exons 2, 3 and 4 polymorphisms, allowing a four-digit high-resolution typing. The allelic and genotypic frequencies of patients with HLA-B27 AS and unaffected controls are shown in table 1. A total of five different alleles were identified; the most representative allele, with a frequency of 82% in both patients with AS and unaffected controls, was G*01:01. Ten different genotypes were identified. The most prevalent genotype was clearly G*01:01/G*01:01. In this locus, no significant differences were observed between patients with AS and unaffected controls.
The HLA-E analysis was based on the known intronic and exonic polymorphisms, allowing six-digit high-resolution typing. Only three alleles were identified; the most prevalent, in patients with AS and unaffected controls, was E*01:01:01, followed by E*01:03:02. Six genotypes were identified, being the most prevalent E*01:01:01/E*01:01:01 and E*01:01:01/E*01:03:02 (table 2).
No significant differences were detected in the HLA-E alleles between patients with AS and unaffected controls. However, a significant difference between groups was detected for the genotype E*01:01:01/E*01:03:01, suggesting a protective effect for AS. This genotype had a frequency of 0.12 in patients with AS, almost doubling in unaffected controls (0.21) with a preventive fraction of 16%. Significant differences were also detected in rs1059510, where carriage of the genotype TT more than doubled in unaffected controls; however, it was not significant after Bonferroni correction, and rs1264457 where genotype AG is augmented in the control group suggesting a protective effect.
The HLA-F analysis was based on 18 SNP differences, in both intronic and exonic areas, allowing a six-digit high-resolution typing. Fourteen presented significant differences between patients with AS and unaffected controls and seven maintained significance after the Bonferroni correction (table 3).
Five alleles were identified; the most prevalent allele was F*01:01:01. Fourteen genotypes were identified and F*01:01:01/F*01:01:01 was the most prevalent. Significant differences were detected in the alleles F*01:01:02 and F*01:03:01, respectively. The first allele is increased in unaffected controls with a protective fraction of 12%, suggesting a putative protective effect and the second is increased in patients with AS with an attributable risk of 10%. The significance of allele F*01:03:01 was not maintained after Bonferroni correction (table 3).
Four genotypes with significant differences were detected: F*01:01:01/F*01:03:01 and F*01:01:03/F*01:01:03 with 17% and 33% of attributable risk, respectively, which suggests a susceptibility effect. On the other hand, a protective effect is suggested by two other genotypes F*01:01:02/F*01:01:03 and F*01:01:02/F*01:03:01 with preventive fractions of 29% and 25%, respectively. After Bonferroni correction, genotype F*01:01:01/F*01:03:01 was the only that maintained the significance.
Significant differences were detected in variants rs2075682 and rs1736924, but only the first maintained significance after Bonferroni correction (table 3).
A total of 28 different HLA-G, HLA-E and HLA -F haplotypes were inferred for patients with AS and 26 for controls. The most frequent haplotypes, displaying the same frequency in patients with AS and unaffected controls, were G*01:01/E*01:01:01/F*01:01:01 (25%), G*01:01:01/E*01:03:02/F*01:01:01 (20%) and G*01:01/E*01:01:01/F*01:01:02 (11%). In addition, three low-frequency extended haplotypes were inferred. Haplotype G*01:03/E*01:01:01/F*01:01:02 (p=0.0003; pc=0.0008), with frequencies 0.012 and 0.053 in patients with AS and unaffected controls, respectively, integrates the allele F*01:01:02. On the other hand, the haplotype G*01:01/E*01:03:01/F*01:03:01 (p=0.0002; pc=0.0005) with frequency of 0.043 and 0.004 in patients with AS and unaffected controls, respectively, integrates the allele F*01:03:01. Another haplotype, G*01:01/E*01:03:01/F*01:01:01 (p=0.000003; pc=0.00001), showed significant differences between patients with AS and unaffected controls (0.015 and 0.074, respectively); although significant, these haplotypes were present in only a very small number of individuals.
In order to replicate the obtained results, 10 SNPs (two in HLA-E and eight in HLA-F loci) were typed in cohorts of Australian patients with HLA-B27-positive AS and unaffected controls. The results are displayed in tables 2 and 3. Significant differences were detected in HLA-E, for rs1059510 genotypes CC and CT.
For further confirmation, an Immunochip-typed dataset with 4466 patients with AS of European ancestry (Australian and UK samples) and 9753 UK unaffected controls was studied, with HLA-B*27 status imputed using SNP2HLA, and genotypes in the HLA region imputed using 1000G reference haplotypes. The results of the Immunochip dataset imputation for 18 SNPs (15 in HLA-F and 3 in HLA-E) showed four SNPs with significant differences (rs1736924, rs1632953, rs1736926 and rs1736925). However, as these were in LD, three of them were removed (rs1632953, rs1736926 and rs1736925) and only the SNP rs1736924 was maintained for further analysis (table 4). Association was performed by logistical regression using PLINK V.1.90b3.36. The p value of conditioning on HLA-B*27 obtained for rs1736924 is in the same order of magnitude of the one obtained for the Portuguese population (table 4).
The HLA-G, HLA-E and HLA-F genes encode for molecules involved in regulation of autoimmune disease.34
The presence of HLA-G molecules in both membrane-bound and soluble forms has been associated with tolerogenic functions against innate and adaptative immune system.35 Studies have indicated an immunoregulatory role of HLA-G wider than maintenance of tolerance on fetal–placental interface, describing the expression of HLA-G in several inflammatory myopathies, atopic dermatitis and cutaneous psoriasis.36 Taking into consideration this inflammatory disease association, one could hypothesise a putative role in AS. The presence of 14 bp sequence insertion in HLA-G 3′ UTR has been associated with significantly reduced mRNA levels and lower soluble HLA-G in healthy subjects’ serum.15 In this study, no significant differences were identified between patients with HLA-B27 AS and HLA-B27 unaffected controls.
The existence of two major HLA-E alleles with similar frequency in most populations, probably resulting from balancing selection, indicates that there may be a functional difference between the alleles.12 13 They differ at only one amino acid position (107), where an arginine ER (E*01:01) is replaced by a glycine EG (E*01:03).13 Both variants are indistinguishable in their structure and peptide binding features, although HLA-EG homozygous cells seem to express higher levels of HLA-E at the cell surface.12 13 A previous study investigating HLA-B27 extended haplotypes in Sardinia revealed, in patients with AS, a significantly higher frequency of ER.37 38 On the other hand, and for the same polymorphism, a markedly increased prevalence of heterozygosity among HLA-B27-positive unaffected controls was found, suggesting a protective role of EG in AS.37
The results of our investigation showed a highly significant difference in the genotype E*01:01:01/E*01:03:01 between patients with AS and unaffected controls. This genotype is twice as frequent in HLA-B27 unaffected controls when compared with patients with HLA-B27 AS. The biological meaning of this result, suggesting a putative protective effect with a protective fraction of 16%, is still unclear since no significant differences were observed in allelic frequencies between both groups. Further studies are necessary to clarify this fact. Furthermore, in this study, the rs1264457 genotype AG is augmented in the control group, with results similar to those previously obtained by Paladini et al,38 but not replicated in the Australian/British cohort. Significant differences in rs1059510 genotype TT were detected but not maintained after Bonferroni correction. In contrast, and showing population discrepancies for the same SNP, in the Australian cohorts the significant differences were detected in genotypes CC and CT, both maintained after Bonferroni correction, the first with an OR suggesting protection and the other with an OR suggesting susceptibility. Data obtained in a 2015 study also suggest that HLA-E polymorphisms influence rheumatoid arthritis (RA) susceptibility and modulate the clinical outcome of anti-tumour necrosis factor treatment in female RA cases.39
The HLA-F role in the immune system regulation has been gradually unveiled and the wide-range immunoregulatory capacity is becoming clear, despite the limited understanding of the underlying structural and biochemical properties that govern its function.40 It is known that HLA-F is expressed at the surface of activated lymphocytes; high level of HLA-F surface expression was observed in activated B, T and NK cells, with the exception of Treg,24 but it was generally thought that HLA-F does not present antigen and instead may function as an empty, open conformer (OC) that heterodimerises with other MHC-I OC molecules.25 41 However, a recent study establishes that HLA-F can exist both as an OC and peptide-presenting MHC molecule, with distinct NK cell receptor binding partners.40 Expressed during an inflammatory response as OC, both HLA-F and MHC-I OC have been implicated in a novel pathway for uptake of extracellular antigen for cross-presentation.42 HLA-F binds most allelic forms of MHC-I OC. This physical interaction likely stabilises the otherwise unstable MHC-I OC, contributing to their facility as ligands for a specific subset of killer Ig-like receptors (KIRs).43 The coincident upregulation of KIRs with the upregulation of the HLA-F-HLA I heavy chain complex suggests an immunoregulatory role of HLA-F in inflammatory response.44 Both KIR3DS145 46 and KIR3DL247 were previously associated with increased susceptibility to AS. Interestingly, HLA-F OCs were established as ligands of KIR3DS1, and it was also demonstrated that cell-context-dependent expression of HLA-F may explain the widespread influence of KIR3DS1 in human diseases, including AS.45 48 Speciﬁcally, KIR3DL2 interacts with HLA-B*27 expressed as a homodimer without peptide.49 Considering the strong association of HLA-B*27 with AS and the capacity of KIR3DL2 to detect HLA-F and MHC-I OC expressed as a homodimer raises the possibility that KIR3DL2 recognition of HLA-B*27 may represent an aberrant encounter that does not typically occur under resting conditions, leading to immune dysregulation. This possibility may relate to the expression of KIR3DL2 on Th17 CD4 T cells and their apparent increase in responsiveness in patients with AS.43 47 In our study, two HLA-F alleles and four genotypes displayed significant differences in the Portuguese cohorts. However, it should be taken into account the low frequencies of some of these genotypes. It was possible to group the SNPs with significant differences into each one of the alleles (F*01:01:02 and F*01:03:01). From this group of SNPs, two with potential interest were highlighted, given that they are located in HLA-F gene regions that may influence in one case expression levels and in the other case molecule conformation.
Variant rs2075682, allele F*01:01:02, with a highly significant difference between patients with AS and unaffected controls, is positioned in the SXY module X box of HLA-F promoter. It is the only allele with a thymine in that position, being T the ancestral allele. It is possible that this modification, in a promoter zone that controls gene transactivation, influences HLA-F expression, conferring a protective effect on unaffected HLA-B27-positive individuals.
HLA-F SXY module shows strong sequence homology to those of the classical MHC class I genes; it is composed by S, X and Y boxes and is bound by RFX, CREB/ATF and NFY protein complex, implicated in CIITA activity. Besides other induction pathways, HLA-F is induced by Class II Transactivator (CIITA), which is in agreement with the SXY module sequence conservation.26
Interestingly, the other variant (rs1736924) with a significant difference in both the Portuguese and Immunochip-typed Australian/British populations, although not significant enough to survive Bonferroni correction in the Portuguese population (pc=0.06), is part of the genomic sequence that is translated into the α3 domain of HLA-F protein. HLA α3 domain residue conservation is greater than α1 and α2, with virtually no regions of particular sequence variability. Seventy-eight per cent of the residues are totally conserved and 7% are conservatively replaced and similar percentages are found for the HLA-F.50 In rs1736924, a serine at residue 272 of α3 is substituted by a proline. Serine has an uncharged polar side chain and proline, with a non-polar side chain unique among the standard 20 amino acids, can sometimes substitute for other small amino acids, although it does not often substitute well.51 Taking into account previous studies, two hypotheses could be proposed to explain the possible effect of rs1736924 substitution: (1) conformational changes in the molecule could transform the flexible loop that clamps the CD8 homodimer to the α3 domain, compromising the binding of CD8, consequent HLA stabilisation and recognition by TCR.52 (2) The presence of a proline in position 272 may influence the binding of β2m to the α chain, stimulating the formation of HLA-F dimers. Although interesting, the statistical significance of our findings is not sufficiently strong to support our hypothesis, requiring further investigation.
Regarding the association discrepancies seen in the populations in this study, differences in the association of HLA and non-HLA genes with AS have already been reported between Portuguese and other Caucasian populations. An ERAP1 haplotype, conferring protective effect to AS in HLA-B27-positive individuals was described, for the first time, with this same cohort.53 An association study between IL23 and ERAP1 genes and AS in the Portuguese population reported that no association was established with IL23R rs11209032 in Portuguese or Spanish populations, which was strongly associated with AS in other Caucasian populations. In addition, the study did not demonstrate any protective effect against AS for the Arg381Gln SNP (rs11209026) in the IL23R gene showing deviations between the Portuguese and other Caucasian populations.54
This is the first study in which the non-classical HLA-G, HLA -E and HLA -F genes are meticulously investigated for possible association with AS, independently of the HLA-B27 risk contribution. The results are promising, revealing protective and susceptibility effects from both HLA-E and HLA- F loci. To our knowledge, this is the first study showing potential association of HLA-F with AS.
We wish to acknowledge all the individuals who accepted to participate in this study. We would also like to acknowledge Dr Maria José Peixoto, Dr Bruna Parreira and Dr Vanessa Faustino for organising, optimising and sequencing some of the samples.
Contributors MRS, MAB and JB-A conceived the study. FPS, JEF and JB-A provided clinical support to patient follow-up. MRS, IF, ZL, RM, LW and JP performed experiments. BFB, MRS, ZL, LW and IF performed statistical analysis. BFB, MRS, ZL, LW and MAB interpreted data. ARC, AM, HA, ML, MAB and JB-A gave critical revision of the manuscript for important intellectual content. All authors have read and accepted the submitted version of the manuscript.
Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests None declared.
Patient consent Obtained.
Ethics approval Comissão de Ética do Hospital de Santo Espírito da Ilha Terceira.
Provenance and peer review Not commissioned; externally peer reviewed.
Data statement No additional data are available.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.