Abstract
Introduction
Celiac disease (CeD) is a complex disease triggered by gluten consumption in genetically susceptible individuals. The disease can onset at any age with a wide range of symptoms, often leading to undiagnosed cases due to symptom overlap with other conditions or lack of awareness. Current diagnostic methods involve repeated serological examinations and invasive biopsies, with a strict gluten-free diet recommended upon diagnosis. Genetic screening for HLA DQ2/DQ8 haplotype is recommended for high-risk patients, but these tests have a low positive predictive value. The use of a polygenic risk score (PRS) is encouraged to assign a probability of disease development based on an individual’s genetic makeup, aiding in early intervention and potentially improving patient outcomes.
Methods
The study used data from the HUNT study, conducted in Nord-Trøndelag County, Norway between 2017-2019, with a focus on adults over 19 years old, yielding a 54% response rate and 56,939 participants, over 50% of whom were women. A serological test was used for both TG2 IgA and IgG antibodies. Confirmed cases of CeD were identified based on positive serology and biopsy results, with incident cases being those identified during the study and prevalent cases being those with a previous diagnosis. Genotyping was performed using Illumina HumanCoreExome arrays, and imputation was carried out using the Haplotype Reference Consortium (HRC) reference panel and the Positional Burrows Wheeler Transform method. Stratification of HLA risk groups was based on imputed HLA genotypes (data provided by NIDDK CR, a program of the National Institute of Diabetes and Digestive and Kidney Diseases.), with individuals classified into low-risk, high-risk, and intermediate risk groups based on their HLA-DQ2/DQ8 status. Similarly, the PRS was stratified into three groups. The study reproduced a previously developed PRS with the highest accuracy, incorporating a Bayesian regression framework for marker specific adaptive shrinkage. The overall discrimination of the genetic model was evaluated by calculating the area under the ROC curve for HLA only and combining HLA and the PRS, as well as the net reclassification improvement and the integrated discrimination improvement.
Results
The PRS assigns a probabilistic value of developing CeD, with individuals in the top quintile at a higher risk. However, it doesn’t predict the final outcome and its performance in differentiating potential cases from controls is poor. Removing the HLA variants from the PRS decreases its discrimination power, indicating that the major discrimination power between potential and confirmed cases lies with the HLA variants included in the PRS. The PRS can reclassify a significant percentage of individuals from the lower HLA risk group to a high PRS risk group, identifying populations at risk beyond HLA screening. The PRS could potentially be incorporated during newborn screening, saving the healthcare sector from unnecessary screening and testing, and sparing patients from discomfort and time if they are less likely to develop the disease.
Conclusion
Despite the limitations of the model, including non-HLA risk factors can reclassify a significant portion of the population into more accurate risk categories, which might help to make a better selection of those who need closer follow-up and repetitive antibody testing. This could lead to improved diagnosis of CeD and early intervention, alleviating the socioeconomic burden on patients, their families, and society.