Novel data-mining approach identifies biomarkers for diagnosis of Kawasaki disease

Tremoulet, Adriana H.; Dutkowski, Janusz; Sato, Yuichiro; Kanegaye, John T.; Ling, Xuefeng B.; Burns, Jane C.

doi:10.1038/pr.2015.137

Download PDF

Translational Investigation
Published: 03 August 2015

Novel data-mining approach identifies biomarkers for diagnosis of Kawasaki disease

Adriana H. Tremoulet^1,2^na1,
Janusz Dutkowski^3,4^na1,
Yuichiro Sato^1,2,
John T. Kanegaye^1,2,
Xuefeng B. Ling⁵ &
Jane C. Burns^1,2 on behalf of
; Pediatric Emergency Medicine Kawasaki Disease Research Group

Pediatric Research volume 78, pages 547–553 (2015)Cite this article

1479 Accesses
18 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Background:

As Kawasaki disease (KD) shares many clinical features with other more common febrile illnesses and misdiagnosis, leading to a delay in treatment, increases the risk of coronary artery damage, a diagnostic test for KD is urgently needed. We sought to develop a panel of biomarkers that could distinguish between acute KD patients and febrile controls (FC) with sufficient accuracy to be clinically useful.

Methods:

Plasma samples were collected from three independent cohorts of FC and acute KD patients who met the American Heart Association definition for KD and presented within the first 10 d of fever. The levels of 88 biomarkers associated with inflammation were assessed by Luminex bead technology. Unsupervised clustering followed by supervised clustering using a Random Forest model was used to find a panel of candidate biomarkers.

Results:

A panel of biomarkers commonly available in the hospital laboratory (absolute neutrophil count, erythrocyte sedimentation rate, alanine aminotransferase, γ-glutamyl transferase, concentrations of α-1-antitrypsin, C-reactive protein, and fibrinogen, and platelet count) accurately diagnosed 81–96% of KD patients in a series of three independent cohorts.

Conclusion:

After prospective validation, this eight-biomarker panel may improve the recognition of KD.

A novel nomogram model for differentiating Kawasaki disease from sepsis

Article Open access 13 August 2020

Bridging a diagnostic Kawasaki disease classifier from a microarray platform to a qRT-PCR assay

Article Open access 22 June 2022

A machine learning model for distinguishing Kawasaki disease from sepsis

Article Open access 02 August 2023

Main

The etiology of Kawasaki disease (KD), the leading cause of acquired heart disease in children, remains unknown, and there is no definitive diagnostic test (1). The diagnosis rests upon clinical criteria that are shared by other common pediatric illnesses (2). Clinical confusion can lead to a missed or delayed diagnosis, which increases the risk of coronary artery aneurysms (3,4). Between 15 to 30% of KD patients do not meet complete clinical criteria and are defined as having “incomplete” KD, which further contributes to delayed diagnosis (3,5,6,7,8). Treatment with intravenous immunoglobulin (IVIG) is effective in reducing the cardiovascular complications if administered within the first 10 d after the onset of fever (9). Without prompt treatment, ~25% of children with KD will develop coronary artery aneurysms, which can lead to myocardial infarction and other cardiovascular sequelae later in life. Thus, a diagnostic test for KD is urgently needed to help identify patients who require treatment.

Endothelial cell and cardiomyocyte injury, platelet activation, acute phase response, and immune activation are hallmarks of acute KD. Previous studies have evaluated candidate biomarkers in each of these pathways as tools for diagnosing acute KD, including vascular endothelial growth factor, amino-terminal pro brain natriuretic peptide (NT-proBNP), BNP, fibrinogen, α-1-antitrypsin (A1AT), thrombopoietin, matrix metalloproteinases, and eotaxin (10,11,12,13,14,15). No biomarker has demonstrated sufficient sensitivity or specificity when used alone to reliably identify KD patients. However, a combination of biomarkers representing different biologic pathways might improve diagnostic accuracy. Thus, our objective was to develop a panel of biomarkers that could distinguish between acute KD patients and febrile controls (FC) with sufficient accuracy to be clinically useful.

Results

Clinical Characteristics of the Study Population

There were no significant statistical differences in clinical and laboratory parameters among the KD subjects or among FC subjects in all three cohorts ( Table 1 ). Compared to FC subjects, KD subjects had higher levels of inflammation based on standard clinical laboratory testing, although there was substantial overlap between the groups ( Table 1 ).

Table 1 Clinical and laboratory characteristics of acute KD and FC subjects in Cohorts 1, 2, and 3

Full size table

Clustering and Pathway Analyses

Among the markers which discriminated between KD and FC in the univariate analysis of Cohorts 1 and 2 (Supplementary Tables S1 and S2 online), 16 and 11 biomarkers, respectively, were identified by unsupervised clustering ( Figure 1 ). The 16-biomarker panel ( Figure 1a ) correctly classified all but 1 febrile control, while the 11-biomarker panel misclassified five of the 44 KD subjects (11%). Unsupervised principal component analysis uncovered distinct groups corresponding to KD patients and FC subjects ( Figure 2 ). Supervised classification with Random Forests with 16 significant biomarkers identified in Cohort 1 yielded an area under the curve (AUC) of 0.84 when tested in Cohort 2, while the 11 biomarkers identified in Cohort 2 yielded an AUC of 0.93 when tested in Cohort 1 ( Figure 3a , b , red curve). Pathway analysis of the statistically significant markers revealed that these biomarkers pertained to several inflammatory pathways including IL-17, innate immunity, and T-cell signaling ( Table 2 ).

Table 2 Diagnoses of febrile controls

Full size table

Reducing the panel size to six biomarkers to include only those analytes commonly available in the hospital laboratory (absolute neutrophil count (ANC), erythrocyte sedimentation rate (ESR), concentrations of A1AT, C-reactive protein (CRP), and fibrinogen, and platelet count) identified by the unsupervised clustering analyses of both Cohorts 1 and 2, the AUCs were 0.91 and 0.94 when tested on Cohorts 2 and 1 ( Figure 3a , b , green curve). The addition of alanine amino transferase (ALT) and γ-glutamyl transferase (GGT), which are routinely acquired in the evaluation of possible KD patients, increased the AUC to 0.92 and 0.96, respectively ( Figure 3a , b , blue curve). In a third, independent cohort (Cohort 3), the AUC was 0.80 if trained on Cohort 1 and 0.75 if trained on Cohort 2 using ANC, ESR, concentrations of A1AT, CRP, and fibrinogen and platelet count ( Figure 4 ). This increased to an AUC of 0.83 and 0.81 when ALT and GGT were added.

Discussion

We found that a panel of biomarkers commonly available in the hospital laboratory (ANC, ESR, ALT, GGT, concentrations of A1AT, CRP, and fibrinogen, and platelet count) accurately diagnosed 81–96% of KD patients in a series of three independent cohorts. A central problem in the diagnosis of KD and the development of a diagnostic test is that the host response to inflammation involves pathways that are shared by many of the rash-fever illnesses that are in the differential diagnosis of KD, including adenovirus infection and scarlet fever (16). Thus, proper controls from children with rash-fever illnesses that mimic KD are essential to the development of a clinically useful diagnostic test. Previous studies have used either healthy children or children with febrile illnesses that do not mimic KD (such as pneumonia and bronchiolitis) as controls, thus ignoring the importance of pretest probability in the evaluation of a diagnostic test. Indeed, 89–100% of the FCs used in the study were actually referred by outside medical providers for evaluation of possible KD. The selection of FCs with diseases that mimic KD is critical as it is this population in which the test will eventually be used in the clinical setting.

We focused on validating markers of inflammation that are readily available in most hospital laboratories. The goal is to have a panel of biomarkers with a rapid turn-around time in order to diagnose and treat KD in a timely fashion. The ANC, ESR, concentration of CRP, and platelet count, which were significant discriminators and common to both Cohorts 1 and 2, are routinely measured in children being evaluated for possible KD. To this group of markers we added ALT and GGT as our previous study demonstrated that ALT and GGT are associated with markers of cardiomyocte strain (NT-proBNP and ST-2) and thus in part related to oxidative stress during acute KD (17).

We recognize several strengths and weaknesses of this study. This is the first study to use unsupervised clustering of a panel of biomarkers to distinguish between febrile controls and children with acute KD. This study demonstrates promise for a group of commonly available laboratory tests in the diagnosis of KD, which will have wide applicability once validated. Limitations of this study include its small size and the absence of a gold standard for KD diagnosis. Echocardiography to rule out coronary artery dilatation in the FC subjects would be an additional safeguard against misclassification but was not practical in the ED setting (18,19). While a Random Forest analysis reduces overfitting of the data and thus is thought to provide a highly accurate analysis of the data, the order of the variables that drives its predictions are not revealed and thus it is not possible to see how the biomarkers interrelate to lead to a diagnosis of KD. Thus, if validated, an online algorithm would have to be devised rather than a simple clinical decision rule.

Future studies will need to prospectively validate the eight-biomarker panel in additional independent cohorts and evaluate whether addition of other markers could improve the accuracy of diagnosing KD. Furthermore, future studies could assess if this panel can predict coronary artery abnormalities or IVIG-resistance. Thresholds for each individual marker will need to be determined that yield the best diagnostic accuracy. In addition, a prospective assessment of pretest probability of KD will need to be included so that the positive and negative predictive value of the biomarker panel can be calculated.

Methods

Subjects

Acute KD samples were from children who met American Heart Association criteria and had banked acute plasma samples (ethylenediaminetetraacetic acid (EDTA) and sodium citrate, obtained prior to IVIG) obtained within the first 10 d of illness (illness day 1 = first calendar day of fever) (2). The diagnosis of KD was established by one of two KD expert clinicians (A.H.T. and J.C.B.) at the KD Research Center according to an established protocol with standardized, prospective data collection.

FC subjects were previously healthy children who presented to the Emergency Department at Rady Children’s Hospital San Diego with ≥ 3 d of fever and at least one of the clinical signs of KD: rash, conjunctival injection, cervical lymphadenopathy, erythematous oral mucosa, or erythematous or edematous hands or feet. The diagnoses of the FC subjects, adjudicated by chart review by two expert clinicians (J.C.B. and J.T.K.) after all culture and laboratory data were available, included viral and bacterial infections and systemic drug reactions ( Table 3 ). We assembled three cohorts of sex- and age-matched (± 7 mo, 1:1 matching) acute KD and FC subjects: 28, 44, and 30 subjects each in Cohorts 1, 2, and 3, respectively ( Table 1 ). Of the FC subjects, 100% in Cohort 1, 91% in Cohort 2, and 89% in Cohort 3 were referred by primary care or nonpediatric emergency department physicians for evaluation of possible KD.

Table 3 Pathway analysis of biomarkers that were significantly different (P < 0.05) between KD and FC cohorts

Full size table

Signed informed consent was obtained from the parents of all subjects, and child or adolescent assent was obtained as appropriate. The protocol was approved by the Institutional Review Board at University of California San Diego and covered the use of all clinical data, the sampling of 12.5 ml of extra blood, and throat, nasopharyngeal, and rectal swabs for viral diagnosis.

For all subjects, we recorded age, sex, illness day at patient evaluation, and clinical laboratory data ( Table 1 ). We normalized the hemoglobin concentration for age (zHgb) to allow valid comparisons across the age spectrum of our subjects (20). For KD subjects, we recorded response to IVIG and echocardiographic data. IVIG-resistance was defined as persistent or recrudescent fever (temperature ≥ 38 °C) at least 36 h after completion of the IVIG infusion (2 g/kg). Coronary artery status was defined by Z score normalized for body surface area for the left anterior descending and right coronary arteries, with normal defined as <2.5 SD units and dilated as ≥ 2.5 SD from the mean, normalized for body surface area. An aneurysm was defined as focal dilation of an arterial segment >1.5 times the diameter of the adjacent segment (21).

Biomarker Assay

A total of 88 protein analytes (Supplementary Table S1 online) in inflammatory and cardiovascular pathways were measured in Cohorts 1 and 2 using the Luminex antibody-coated bead system using a previously published methodology (Human Map, version 1.6, Rules Based Medicine, Austin, TX). The dataset (Supplementary Table S2 online) used in this project can be downloaded from the iDASH repository (digital data repository based on open source MIDAS platform (San Diego, CA)) by making a request to the authors for access (22). Enzyme-linked immunosorbent assays for human fibrinogen and α1-antitrypsin (A1AT) were performed in Cohort 3 using commercially available kits (GenWay, San Diego, CA) following the manufacturer’s instructions.

Statistical Analysis

Demographic, clinical, and laboratory data were compared between KD and FC subjects using Wilcoxon rank sum test for continuous data and Fisher’s exact and chi-square tests for categorical data. We assessed the differences in the demographic, clinical, and laboratory data across the 3 KD cohorts and across the 3 FC cohorts using the Kruskal-Wallis test. We used two different approaches to devise a diagnostic test for KD: (i) unsupervised clustering of significant markers from the univariate analysis of the 88 candidate biomarkers or a smaller subset of markers commonly available from a hospital laboratory and (ii) supervised clustering of these markers using a Random Forest model.

For each cohort, discriminating markers were selected using the Wilcoxon rank sum test setting the P value threshold at ≤0.05. A heatmap was created for each cohort using only the significant markers for that cohort. Unsupervised hierarchical clustering using Ward’s method was performed on each cohort using the significant markers. Next, markers identified as significant from Cohort 2 were used to perform principal component analysis of Cohort 1 and, vice-versa. Supervised classification analysis was performed by training a Random Forest model, a machine-based learning method that works by constructing an ensemble of decision trees, on Cohort 1 and predicting the diagnosis of subjects in Cohort 2, and vice versa, and represented as receiver-operator characteristic curves (23). In each case, markers were selected for the training cohort from: (i) all significant markers on heatmap or (ii) a subset of markers available in most hospital clinical laboratories that were significant in both cohorts (ANC, ESR, concentrations of A1AT, CRP, and fibrinogen, and platelet count). All analyses were performed using the R language and environment for statistical computing.

To characterize the canonical pathways in which the biomarkers were involved, protein biomarkers with P values <0.05 in the unsupervised clustering analysis from Cohorts 1 and 2 were mapped to known entries in the Ingenuity Pathway Analysis canonical pathway database.

Statement of Financial Support

This work was supported in part by grants from the David Gordon Louis Daniel Foundation (McLean, Virginia) to J.C.B., the Mario Batali Foundation (New York, New York) to J.C.B., the National Institutes of Health, National Heart, Lung, Blood Institute HL69413 (Bethesda, Maryland) to J.C.B., The Hartwell Foundation (Memphis, Tennessee) to A.H.T., and The Harold Amos Medical Faculty Development Program/Robert Wood Johnson Foundation (Indianapolis, Indiana) to A.H.T. Database supported by the National Institutes of Health (Bethesda, Maryland) through the NIH Roadmap for Medical Research, Grant U54HL108460.

Disclosure

The authors have no conflicts of interest to report.

References

Taubert KA, Rowley AH, Shulman ST. Nationwide survey of Kawasaki disease and acute rheumatic fever. J Pediatr 1991;119:279–82.
Article CAS Google Scholar
Newburger JW, Takahashi M, Gerber MA, et al.; Committee on Rheumatic Fever, Endocarditis and Kawasaki Disease; Council on Cardiovascular Disease in the Young; American Heart Association; American Academy of Pediatrics. Diagnosis, treatment, and long-term management of Kawasaki disease: a statement for health professionals from the Committee on Rheumatic Fever, Endocarditis and Kawasaki Disease, Council on Cardiovascular Disease in the Young, American Heart Association. Circulation 2004;110:2747–71.
Article Google Scholar
Wilder MS, Palinkas LA, Kao AS, Bastian JF, Turner CL, Burns JC. Delayed diagnosis by physicians contributes to the development of coronary artery aneurysms in children with Kawasaki syndrome. Pediatr Infect Dis J 2007;26:256–60.
Article Google Scholar
Tremoulet AH, Best BM, Song S, et al. Resistance to intravenous immunoglobulin in children with Kawasaki disease. J Pediatr 2008;153:117–21.
Article CAS Google Scholar
Tsuchiya K, Imada Y, Aso S, Sonobe T. [Diagnosis of incomplete Kawasaki disease]. Nihon Rinsho 2008;66:321–5.
PubMed Google Scholar
Rowley AH. Incomplete (atypical) Kawasaki disease. Pediatr Infect Dis J 2002;21:563–5.
Article Google Scholar
Sonobe T, Kiyosawa N, Tsuchiya K, et al. Prevalence of coronary artery abnormality in incomplete Kawasaki disease. Pediatr Int 2007;49:421–6.
Article Google Scholar
Anderson MS, Todd JK, Glodé MP. Delayed diagnosis of Kawasaki syndrome: an analysis of the problem. Pediatrics 2005;115:e428–33.
Article Google Scholar
Newburger JW, Takahashi M, Beiser AS, et al. A single intravenous infusion of gamma globulin as compared with four infusions in the treatment of acute Kawasaki syndrome. N Engl J Med 1991;324:1633–9.
Article CAS Google Scholar
Takeshita S, Kawamura Y, Takabayashi H, Yoshida N, Nonoyama S. Imbalance in the production between vascular endothelial growth factor and endostatin in Kawasaki disease. Clin Exp Immunol 2005;139:575–9.
Article CAS Google Scholar
Dahdah N, Siles A, Fournier A, et al. Natriuretic peptide as an adjunctive diagnostic test in the acute phase of Kawasaki disease. Pediatr Cardiol 2009;30:810–7.
Article Google Scholar
Yu HR, Kuo HC, Sheen JM, et al. A unique plasma proteomic profiling with imbalanced fibrinogen cascade in patients with Kawasaki disease. Pediatr Allergy Immunol 2009;20:699–707.
Article Google Scholar
Ishiguro A, Ishikita T, Shimbo T, et al. Elevation of serum thrombopoietin precedes thrombocytosis in Kawasaki disease. Thromb Haemost 1998;79:1096–100.
Article CAS Google Scholar
Sakata K, Hamaoka K, Ozawa S, et al. Matrix metalloproteinase-9 in vascular lesions and endothelial regulation in Kawasaki disease. Circ J 2010;74:1670–5.
Article Google Scholar
Kuo HC, Wang CL, Liang CD, et al. Association of lower eosinophil-related T helper 2 (Th2) cytokines with coronary artery lesions in Kawasaki disease. Pediatr Allergy Immunol 2009;20:266–72.
Article Google Scholar
Barone SR, Pontrelli LR, Krilov LR. The differentiation of classic Kawasaki disease, atypical Kawasaki disease, and acute adenoviral infection: use of clinical features and a rapid direct fluorescent antigen test. Arch Pediatr Adolesc Med 2000;154:453–6.
Article CAS Google Scholar
Sato YZ, Molkara DP, Daniels LB, et al. Cardiovascular biomarkers in acute Kawasaki disease. Int J Cardiol 2013;164:58–63.
Article Google Scholar
Bratincsak A, Reddy VD, Purohit PJ, et al. Coronary artery dilation in acute Kawasaki disease and acute illnesses associated with Fever. Pediatr Infect Dis J 2012;31:924–6.
Article Google Scholar
Muniz JC, Dummer K, Gauvreau K, Colan SD, Fulton DR, Newburger JW. Coronary artery dimensions in febrile children without Kawasaki disease. Circ Cardiovasc Imaging 2013;6:239–44.
Article Google Scholar
Gunn L, Nechyba C. The Harriet Lane Handbook. 16th edn. Mosby: Philadelphia, 2002.
Google Scholar
Tremoulet AH, Jain S, Chandrasekar D, Sun X, Sato Y, Burns JC. Evolution of laboratory values in patients with Kawasaki disease. Pediatr Infect Dis J 2011;30:1022–6.
Article Google Scholar
Ohno-Machado L, Bafna V, Boxwala AA, et al.; iDASH team. iDASH: integrating data for analysis, anonymization, and sharing. J Am Med Inform Assoc 2012;19:196–201.
Article Google Scholar
Breiman L. Random forests. MLear 2001;45:5–32.
Google Scholar

Download references

Acknowledgements

The authors wish to thank Nicki Daniels and the San Diego KD pediatric parents group who raised money to help fund this research. Members of the Pediatric Emergency Medicine Kawasaki Disease Research Group included Jim R. Harley, MD, MPH, Simon J. Lucio, MD, Seema Shah, MD, and Stacey Ulrich, MD.

Author information

Adriana H. Tremoulet and Janusz Dutkowski: The first two authors contributed equally to this work and should be considered as joint first authors.

Authors and Affiliations

Department of Pediatrics, University of California San Diego, La Jolla, California
Adriana H. Tremoulet, Yuichiro Sato, John T. Kanegaye & Jane C. Burns
Rady Children’s Hospital San Diego, San Diego, California
Adriana H. Tremoulet, Yuichiro Sato, John T. Kanegaye & Jane C. Burns
Data4Cure, Inc., La Jolla, California
Janusz Dutkowski
Department of Medicine, University of California San Diego, La Jolla, California
Janusz Dutkowski
Department of Surgery, Stanford University, Palo Alto, California
Xuefeng B. Ling

Authors

Adriana H. Tremoulet
View author publications
You can also search for this author in PubMed Google Scholar
Janusz Dutkowski
View author publications
You can also search for this author in PubMed Google Scholar
Yuichiro Sato
View author publications
You can also search for this author in PubMed Google Scholar
John T. Kanegaye
View author publications
You can also search for this author in PubMed Google Scholar
Xuefeng B. Ling
View author publications
You can also search for this author in PubMed Google Scholar
Jane C. Burns
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

; Pediatric Emergency Medicine Kawasaki Disease Research Group

Corresponding author

Correspondence to Adriana H. Tremoulet.

Supplementary information

Supplementary Table S1

(XLS 484 kb)

Supplementary Table S2

(DOC 39 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tremoulet, A., Dutkowski, J., Sato, Y. et al. Novel data-mining approach identifies biomarkers for diagnosis of Kawasaki disease. Pediatr Res 78, 547–553 (2015). https://doi.org/10.1038/pr.2015.137

Download citation

Received: 05 January 2015
Accepted: 17 April 2015
Published: 03 August 2015
Issue Date: November 2015
DOI: https://doi.org/10.1038/pr.2015.137

This article is cited by

A machine learning model for distinguishing Kawasaki disease from sepsis
- Chi Li
- Yu-chen Liu
- Cheng Wu
Scientific Reports (2023)
From molecular mechanisms of prostate cancer to translational applications: based on multi-omics fusion analysis and intelligent medicine
- Shumin Ren
- Jiakun Li
- Bairong Shen
Health Information Science and Systems (2023)
Risk factors and implications of progressive coronary dilatation in children with Kawasaki disease
- Ming-Yu Liu
- Hsin-Min Liu
- Mei-Hwan Wu
BMC Pediatrics (2017)

Subjects

Abstract

Background:

Methods:

Results:

Conclusion:

Similar content being viewed by others

Main

Results

Clinical Characteristics of the Study Population

Clustering and Pathway Analyses

Discussion

Methods

Subjects

Biomarker Assay

Statistical Analysis

Statement of Financial Support

Disclosure

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

; Pediatric Emergency Medicine Kawasaki Disease Research Group

Corresponding author

Supplementary information

PowerPoint slides

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links