EFTA00315081
EFTA00315100 DataSet-9
EFTA00315114

EFTA00315100.pdf

DataSet-9 14 pages 13,438 words document
P17 V11 P22 V14 V16
Open PDF directly ↗ View extracted text
👁 1 💬 0
📄 Extracted Text (13,438 words)
Ience Rapid Whole-Genome Sequencing for Genetic Disease Diagnosis in Translational Neonatal Intensive Care Units Medicine AAAS Carol Jean Saunders et at Sci Trans! Med 4, 154ra135 (2012); DOI: 10.1126/scitranslmed.3004041 Editor's Summary Speed Heals The waiting might not be the hardest part for families receiving a diagnosis in neonatal intensive care units (NICUs), but it can be destructive nonetheless. While they wait on pins and needles for their newborn baby's diagnosis, parents anguish, nurture false hope, wrestle with feelings of guilt—and all the while, treatment and counseling are delayed. Now, Saunders et at describe a method that uses whole-genome sequencing (WGS) to achieve a differential diagnosis of genetic disorders in 50 hours rather than the 4 to 6 weeks. Many of the -3,500 genetic diseases of known cause manifest symptoms during the first 28 days of life, but full clinical symptoms might not be evident in newborns. Genetic screens performed on newborns are rapid, but are designed to unearth only a few genetic disorders, and serial gene sequencing is too slow to be clinically useful. Together, these complicating factors lead to the administration of treatments based on nonspecific or obscure symptoms, which can be unhelpful or dangerous. Often, either death or release from the hospital occurs before the 0 diagnosis is made. The new WGS protocol cuts analysis time by using automated bioinformatic analysis. Using their newly 2 developed protocol, the authors performed retrospective 50-hour WGS to confirm, in two children, known molecular diagnoses that had been made using other methods. Next, prospective WGS revealed a molecular diagnosis of a u- BRAT1-related syndrome in one newborn; identified the causative mutation in a baby with epidermolysis bullosa: ruled out the presence of defects in candidate genes in a third infants; and, in a pedigree, pinpointed BCL9L as a new co recessive gene (HTX6) that gives rise to visceral heterotaxy —the abnormal arrangement of organs in the chest and $6 abdominal cavities. WGS of parents or affected siblings helped to speed up the identification of disease genes in the icf prospective cases. These findings strengthen the notion that WGS can shorten the differential diagnosis process and E quicken to move toward targeted treatment and genetic and prognostic counseling. The authors note that the speed c and cost of WGS continues to rise and fall, respectively. However, fast WGS is clinically useful when coupled with fast and affordable methods of analysis. -a E E A complete electronic version of this article and other services, including high-resolution figures, -§ can be found at: http://stm.sciencemag.org/content/4/154/154ra135.full.html a Supplementary Material can be found in the online version of this article at: http://stm.sciencemag.org/content/suppl/2012/10/01/4.154.154ra135.DC1.html Related Resources for this article can be found online at: http://stm.sciencemag.org/content/scitransmed/5/194/194cm5.full.html http://stm.sciencemag.org/content/scitransmed/5/194/194ed10.full.html http://stm.sciencemag.org/content/scitransmed/5/198/198ed12.full.html http://www.sciencemag.org/content/sci/342/6155/197.full.html Information about obtaining reprints of this article or about obtaining permission to reproduce this article in whole or in part can be found at: http://www.sciencemag.org/about/permissions.dtl Science Translational Medicine (print ISSN 1946-6234; online ISSN 1946-6242) is published weekly, except the last week in December. by the American Association for the Advancement of Science, 1200 New York Avenue NW, Washington, DC 20005. Copyright 2012 by the American Association for the Advancement of Science; all rights reserved. The title Science Translational Medicine is a registered trademark of AAAS. EFTA00315100 RESEARCH ARTICLE DIAGNOSTICS Rapid Whole-Genome Sequencing for Genetic Disease Diagnosis in Neonatal Intensive Care Units Carol Jean Saunders,1,2,3,4,5* Neil Andrew MiIler,l '2'4* Sarah Elizabeth Soden,"4* Darrell Lee Dinwiddie,"31"* Aaron Noll,' Noor Abu Alnadi,4 Nevene Andraws,3 Melanie LeAnn Patterson," Lisa Ann Krivohlavek," Joel Ferns,' Sean Humphray,' Peter Saffrey,' Zoya Kingsbury,' Jacqueline Claire Weir,' Jason Betley," Russell James Grocock,' Elliott Harrison Margulies,' Emily Gwendolyn Farrow,' Michael Artman,2'4 Nicole Pauline Safina," Joshua Erin Petrikin," Kevin Peter Hall,' Stephen Francis Kingsmore'•2,3,4,st Monogenic diseases are frequent causes of neonatal morbidity and mortality, and disease presentations are often undifferentiated at birth. More than 3500 monogenic diseases have been characterized, but clinical testing is avail- able for only some of them and many feature clinical and genetic heterogeneity. Hence, an immense unmet need exists for improved molecular diagnosis in infants. Because disease progression is extremely rapid, albeit hetero- geneous, in newborns, molecular diagnoses must occur quickly to be relevant for clinical decision-making. We de- 0 scribe 50-hour differential diagnosis of genetic disorders by whole-genome sequencing (WGS) that features automated bioinformatic analysis and is intended to be a prototype for use in neonatal intensive care units. Ret- rospective 50-hour WGS identified known molecular diagnoses in two children. Prospective WGS disclosed potential molecular diagnosis of a severe G22-related skin disease in one neonate; BRAT/-related lethal neonatal rigidity and multifocal seizure syndrome in another infant identified Ba9L as a novel, recessive visceral heterotaxy gene (H7X6) in a a) pedigree; and ruled out known candidate genes in one infant. Sequencing of parents or affected siblings expedited the C identification of disease genes in prospective cases. Thus, rapid WGS can potentially broaden and foreshorten differ- 0 ential diagnosis, resulting in fewer empirical treatments and faster progression to genetic and prognostic counseling. rn til INTRODUCTION rapid, identify only a few genetic disorders for which inexpensive tests E Genomic medicine is a new, structured approach to disease diagnosis and cost-effective treatments exist (14, 15). Further complicating diag- 8 C tb and management that prominently features genome sequence infor- nosis is the fact that the full clinical phenotype may not be manifest •O mation (1). Whole-genome sequencing (WGS) by next-generation in newborn infants (neonates), and genetic heterogeneity can be im- N sequencing (NGS) technologies has the potential for simultaneous, mense. Thus, acutely ill neonates with genetic diseases are often dis- E comprehensive, differential diagnostic testing of likely monogenic ill- charged or deceased before a diagnosis is made. As a result, NICU E nesses, which accelerates molecular diagnoses and minimizes the du- treatment of genetic diseases is usually empirical, may lack efficacy, ration of empirical treatment and time to genetic counseling (2-7). may be inappropriate, or may cause adverse effects. Indeed, in some cases, WGS or exome sequencing provides molecular NICUs are also suitable for early adoption of genomic medicine diagnoses that could not have been ascertained by conventional single- because extraordinary interventional efforts are customary and inno- gene sequencing approaches because of pleiotropic clinical presenta- vation is encouraged. Indeed, NICU treatment is among the most tion or the lack of an appropriate molecular test (7-9). cost-effective of high-cost health care, and the long-term outcomes of Neonatal intensive care units (NICUs) are especially suitable for most NICU subpopulations are excellent (16-18). In genetic diseases 8 early adoption of diagnostic WGS because many of the 3528 mono- for which treatments exist, rapid diagnosis is critical for timely delivery genic diseases of known cause are present during the first 28 days of of interventions that lessen morbidity and mortality (14-17, 19, 20). life (10). In the United States, more than 20% of infant deaths are For neonatal genetic diseases without effective therapeutic interven- caused by congenital malformations, deformations, and chromosomal tions, of which there are many (21), timely diagnosis avoids futile inten- abnormalities that cause genetic diseases (11-13). Although this pro- sive care and is critical for research to develop management guidelines portion has remained unchanged for the past 20 years, the precise that optimize outcomes (22).In addition to influencing treatment, neo- prevalence of monogenic diseases in NICUs is poorly understood be- natal diagnosis of genetic disorders and genetic counseling can spare cause ascertainment rates are low. Serial gene sequencing is too slow parents diagnostic odysseys that instill inappropriate hope or perpetuate to be clinically useful for NICU diagnosis. Newborn screens, while needless guilt Two recent studies exemplify the diagnostic and therapeutic uses of NGS in the context of childhood genetic diseases. WGS of fraternal 'Center for Pediatrocancenc Medidne Children's Mercy Hospital• Kansas City,M0 69103, USA /Department of Pediatric,. Children's Mercy Hospital. Kansas City. MO 6910. IA& twins concordant for 3,4-dihydroxyphenylalanine (dopa)-responsive 'Department of Pathology. Children's Mercy Hosptat Kansas City. M069103. USA °School dystonia revealed known mutations in the sepiapterin reductase of Medicine. University of Missoun-Kansas Cny. Kansas City. MO 6410& USA. sUni.ersity of (SPR) gene (3). In contrast to other forms of dystonia, treatment with Kansas Medical Center. KansisCny. KS66160, USA'1Ilumina Inc,Chesterfad Research Park 5-hydroxytryptamine and serotonin reuptake inhibitors is beneficial in Little Chesterford. CBIO I XL Essex UK 'These authors contributed equally to this work patients with SPR defects. Application of this therapy in appropriate tic, whom correspondence should be addressed. E-mail stkingsrnore@cmhedu cases resulted in clinical improvement. Likewise, extensive testing wiwSdenceTranslatIonalMedlamorg 3 October 2012 Vol 4 Issue 154 154ral 3S 1 EFTA00315101 RESEARCH ARTICLE failed to provide a molecular diagnosis for a child with fulminant pan- per patient (maximum, 430; minimum, 5). Thus, SSAGA displayed colitis (extensive inflammation of the colon) (8), in whom standard sufficient sensitivity for the initial selection of known, recessive candi- treatments for presumed Crohn's disease—an inflammatory bowel date genes in children with specific clinical presentations. disease—were ineffective. NGS of the patient's exome, together with confirmatory studies, revealed X-linked inhibitor of apoptosis (XIAP) Rapid WGS deficiency. The treating physicians had not entertained this diagnosis To assess our ability to recapitulate known results, we performed rapid because X1AP mutations had not previously been associated with co- WGS retrospectively on DNA samples from two infants with molec- litis Hemopoietic progenitor cell transplant was performed, as indi- ular diagnoses that had previously been identified by clinical testing. cated for XIAP deficiency, with complete resolution of colitis. last, Then, to assess the potential diagnostic use of rapid WGS, we prospec- for —3700 genetic illnesses for which a molecular basis has not yet tively performed WGS in five undiagnosed newborns with clinical been established (10), WGS can suggest candidate genes for functional presentations that strongly suggested a genetic disorder as well as their and inheritance-based confirmatory research (23). siblings. The current cost of research-grade WGS is $7666 (24)-which is Automation of the five main components of WGS as well as similar to the current cost of commercial diagnostic dideoxy sequencing bioinformatics-based gene-variant characterization and clinical inter- of two or three disease genes. Within the context of the average cost per pretation, all in an integrated workflow, made possible —50-hour time day and per stay in a NICU in the United States (13), WGS in care- to differential molecular diagnosis of genetic disorders (Fig. 1). fully selected cases is acceptable and even potentially cost-saving (3-7). Specifically, sample preparation for WGS was shortened from 16 to However, the turnaround time for interpreted WGS results, such as 4.5 hours, while a physician simultaneously entered into SSAGA clin- that of dideoxy sequencing, is too slow to be of practical use for NICU ical terms that described the neonates' illnesses (fig. SI). For each sample, diagnoses or clinical guidance (typically -4 to 6 weeks) (2-4). Here, we rapid WGS [2 x 100 base pair (bp) reads, including on-board cluster report a system that permits WGS and bioinfonnatic analysis (largely generation and paired-end sequencing] was performed in a single run as as automated) of suspected genetic disorders within 50 hours, a time frame on the alumina HiSeq 2500 and took —26 hours. Base calling, genomic 2 that appears to be promising for emergency use in level 3 NICUs. sequence alignment, and gene variant calling took —15 hours. The HiSeq .o 2500 runs yielded 121 to 139 gigabases (GB) of aligned sequences (34- ts. to 4I-fold aligned genome coverage; 'fable I). Eighty-eight to 91% of 5 RESULTS bases had >99.9% likelihood of being correct (quality score >30, using ,c2) Illumina software equivalent to Phred) (31, 32). We detected 4.00 ± of Symptom- and sign-assisted genome analysis (SSAGA) is a new din- 0.20 million nudeotides that differed from the reference genome se- icopathological correlation tool that maps the clinical features of 591 quence (variants) (mean ± SD) in nine samples, one from each of nine E well-established, recessive genetic diseases with pediatric presentations infants (Table 1). C (table SI) to corresponding phenotypes and genes known to cause the N symptoms (2, 10). SSAGA was developed for comprehensive auto- Analytical metrics mated performance of the following two tasks: (i) WGS analyses re- In three samples, genome variants identified by 50-hour WGS were E stricted to a superset of gene-associated regions relevant to clinical compared with those identified by deep targeted sequencing of either E presentations, in accord with published guidelines for genetic testing exons and 20 intron-exon boundary nucleotides of a panel of 525 re- in children (25-28), and (ii) prioritization of clinical information to cessive disease genes [Children's Mercy Hospital Diagnostic panel assist in the interpretation of WGS results. SSAGA has a menu of 1 (CMH-Dxl)] or the exome (Table 2). CMH-Dx1 comprised 8813 -0 227 clinical terms arranged in nine symptom categories (fig. SI). Stan- exonic and intronic targets, totaling 2.1 million nucleotides (table SI) cu dardized clinical terms (29) have been mapped to 591 genetic diseases (2, 33). The exome and CMH-Dx1 methods, which used Illumina on the basis of authoritative databases (10, 30) and expert physician TruSeq enrichment and HiSeq 2000 sequencing, took —19 days. In reviews. Each disease gene is represented by an average of 8 terms and contrast, rapid WGS did not use target enrichment, was performed 8 at most 11 terms (minimum, I term, 15 disease genes; maximum, 11 with the HiSeq 2500 instrument, and took —50 hours. Samples terms, 3 disease genes). CMH064, UDT002, and UDTI73 were sequenced using these three To validate the feasibility of automated matching of clinical terms methods, and variants were detected with a single alignment method to diseases and genes, we entered retrospectively the presenting fea- [the Genomic Short-read Nucleotide Alignment Program (GSNAP)] tures of 533 children who have received a molecular diagnosis at (34) and variant caller [the Genome Analysis Tool Kit (GATK)] (35). our institution [Children's Mercy Hospital (CMH), Kansas City, MO] Rapid WGS detected —96% of the variants identified by a target en- within the last 10 years into SSAGA. Sensitivity was 99.3% (529), as richment method and —99.5% of the variants identified by both determined by correct disease and affected gene nominations. Failures methods had identical genotypes (Table 2), indicating that rapid induded a patient with glucose-6-phosphate dehydrogenase deficiency WGS is highly concordant with established clinical sequencing who presented with muscle weakness [which is not a feature men- methods (33). In contrast, analysis of the rapid WGS data set from tioned in authoritative databases (10, 30)]; a patient with Janus kinase sample CMH064 with three different alignment and variant detection 3 mutations who had the term "respiratory infection" in his medical methods [GSNAP/GATK, the alumina CASAVA alignment tool, and records rather than "increased susceptibility of infections," which is the Burrows-Wheeler Alignment (BWA) tool] revealed surprising dif- the description in authoritative databases; and a patient with cystic ferences between the variants detected. Only —80% of the variants de- fibrosis who had the term "recurrent infections" in his medical records tected using GATK/GSNAP or BWA were also detected with CASAVA rather than "respiratory infections," which is the description in au- (Table 2 and table S2) (36-41). This suggests that additional studies will thoritative databases. SSAGA nominated an average of 194 genes be needed to define optimal alignment methods for dinical sequencing. wwwSdenceTranslatIonalMediclnenrg 3 October 2012 Vol 4 Issue 154 154ra135 2 EFTA00315102 RESEARCH ARTICLE Nevertheless, there was good concordance between the genotypes of variants detected by rapid WGS (using the HiSeq 2500 and CASAVA) and targeted sequencing (using exome enrichment, the HiSeq 2000, and GATKIGSNAP)-99.5% (UDT002), 99.9% (UDT173), and 99.7% (CMH064) (Table 2)—further indicating that rapid WGS is highly con- cordant with an established genotyping method (33). In subsequent Obtain consent and blood sample studies, the rapid WGS technique used CASAVA for alignment and variant detection. Genomic variants were characterized with respect to functional consequence and zygosity with a new software pipeline [Rapid Understanding of Nucleotide variant Effect Software (RUNES), fig. S21 that analyzed each sample in 2.5 hours. Samples contained a mean of 4.0D ± 0.20 million (SD) genomic variants, of which a mean of 1.87 ± 0M9 million (SD) were se-striated with protein-encoding genes (Table 1). Prepare sequencing library Less than I% of these variants (mean, 10,848 ± 523 SD) were also of a functional class that could potentially be disease causative (Fable 1) Enter clinical findings into SSAGA (25-27). Of these, —14% (mean, 1530 ± 518 SD) had an allele fiequen- cy that was sufficiently low to be a candidate for being causative in an uncommon disease (<1% allele frequency in 836 individuals sequenced d at CMH) (42). Last, of these, —71% (mean, 1083 ± 240 SD) were also of S a functional class that was likely to be disease causative [American i• College of Medical Genetics (ACMG) categories Ito 31 (Table 1). This 2 set of variants was evaluated for disease causality in each patient, with priority given to variants within the candidate genes that had been Li- HiSeq 2500 2 x 100 bp sequencing nominated by an individual patient presentation. Es Retrospective analyses ° Patient UDT002 was a male who presented at 13 months of age with na hypotonia, developmental regression. Brain magnetic resonance imag- 5 ing (MRI) showed diffuse white matter changes suggesting leukodys- trophy. Three hundred fifty-two disease genes were nominated by .2 0 one of the three clinical terms hypotonia, developmental regression, or 01 CASAVA base calling leukodystrophy, 150 Aise-ase genes were nominated by two terms; and 9 disease genes were nominated by all three terms (table S3). Only E RUNES variant annotation 16 known pathogenic variants had allele frequencies in dbSNP and the 2 CMH cumulative database that were consistent with uncommon dis- ease mutations. Of these, only two variants mapped to the nine can- didate genes; the variants were both compound heterozygous (verified oc g, by parental testing) substitution mutations in the gene that encodes the t a subunit of the lysosomal enzyme hexosaminidase A [HEXA Chr 15:72,641,417c>C (gene symbol, chromosome number, chromosome coordinate, reference nucleotide > variant nucleotide), c.986+3A>G SSAGA-delimited variant analysis (transcript coordinate, reference nucleotide, variant nucleotide), and and interpretation Chr15:72,640,388C>T, c.1073+1G>A1. The c.986+3A>G alters a 5' exon-flanking nucleotide and is a known mutation that causes Tay-Sachs disease ('15D), a debilitating lysosomal storage disorder [Online Mendelian Inheritance in Man (OMIM) number 2728001. The variant had not previously been observed in our database of 651 individuals or dbSNP, which is relevant because mutation databases are contaminated with some common polymorphisms, and these can be distinguished from true mutations on the basis of allele frequency (33). The c1073+1G>A Verbal interim report of diagnosis variant is a known l'SD mutation that affects an exonic splice donor site pending CLIA confirmation (dbSNP r576173977). The variant has been observed only once before in our database of 414 samples, which is consistent with an allele frequen- cy of a causative mutation in an orphan genetic disease. Thus, the known Fig. 1. STAT-Sea. Summary of the steps and timing of STAT-Seq, result- diagnosis of 'Est) was confirmed in patient UDT002 by rapid WGS. ing in an interval of 50 hours between consent and delivery of a pre- Patient UDT173 was a male who presented at 5 months of age with liminary, verbal diagnosis. t, hours. developmental regression, hypotonia, and seizures. Brain MRI showed www.5cienceTranslationalMedlciae.org 3 October 2012 Vol 4 Issue 154 154ral 35 3 EFTA00315103 RESEARCH ARTICLE dysmyelination, hair shaft analysis revealed pill ford (kinky hair), and (B1nrlec SUbstitution Matrix) scores. The known diagnosis of Menkes serum copper and ceruloplasmin were low. On the basis of this clinical disease (OMIM number 309400) was recapitulated. As a further assess- presentation, 276 disease genes matched one of these clinical terms and ment of the reliability of variant detection of rapid WGS, samples 3 matched three terms (table S4). 'There were no previously reported UDT002 and UDTI73 were aligned to the reference genome with three disease-causing variants in these 276 genes. However, five of the candi- different alignment methods. The causative variants were recovered date genes contained either variants of a type that is expected to be with each method. disease-causing based on their predicted functional consequence or missense variants of unknown significance (VUS). One of these var- Prospective analyses iants was in a gene that matched all three clinical terms and was a Mutations in 35 genes can cause generalized, erosive dermatitis of the hemizygous substitution mutation in the gene that encodes the a poly- type found in CMH064 (table S5). The severe phenotype, negative peptide of copper-transporting adenosine triphosphatase (ATP7A Chr family history, and absence of consanguinity suggested dominant de )C77,271,307C>T, c.2555C>T, i, aberrant forms of which are novo or recessive inheritance. No known pathogenic mutations were known to cause Menkes disease, a copper-transport disorder. This identified in the candidate genes that had low allele frequencies in the variant—new to our database and dbSNP—specified a nonconserva- CMH cumulative genome and exome sequence database and similar tive substitution in an amino acid that was highly conserved across public datakcec Average coverage of the genomic regions corresponding species and had deleterious SIFT (Sons Intolerant From Tolerant sub- to the candidate genes was 38.9-fold, and 98.4% of candidate gene stitutions), PolyPhen2 (Polymorphism Phenotyping), and BLOSUM nucleotides had >I6x high-quality coverage (sufficient to rule out a n Table 1. Sequencing, alignment, and variant statistics of nine samples analyzed by rapid WGS. ACMG category 1 to 4 variants are a subset of gene- (5 associated variants. L'" ets ACMG ACMG n2 High- ACMG categories categories Candidate or Run Mitochondria Nuclear Gene- nor Sequence quality categories 1 to 4 1 to 3 Candidate gene Candidate c Sampletime genome )genome ° (hours) variants variants variants (%) variants frequency frequency 1 variants <1% <1% 0) to UDT002 255 133 91 33 4,014,761 1,888,650 10,733 1,989 1,330 352 (9) 2 0 E UDT173 255 139 89 40 3,977,062 1,859,095 10,501 2,190 1,296 347 (3) 0 I C CD CMH064 26.6 121 88 41 3,985,929 1,869,515 10,701 1.884 1,348 35 0 2 to CMH076 25.7 134 88 34 4,498,146 2,098,886 11,891 2,552 1,351 89 0 CMH172 265 113 91 39 3,759,165 1,749,868 10,135 1,456 982 174 0 N CMH184 265 137 90 37 3,921,135 1,840,738 10,883 833 12 0 0 E 1,168 2 CMH185 40 117 93 37 3,922,736 1,831,997 10,810 1,164 840 14 0 0 CMH186 255 113 93 37 3,933,062 1,827,499 10,713 1,202 868 14 § CMH2O2 40 116 93 39 3,947,053 1,849,647 10,805 1,283 901 C C 00 Table 2. Variants and genotypes. Comparisons of variants and genotypes for 523 genes and Merl 2000 sequencing. Average coverage of target nucleo- obtained In three samples using three target enrichment methods, two se- tides indicates the average aligned sequence depth over the corresponding quencing methods, and two alignment methods. The SO hour WGS (STAT-Seci) target panel. For WGS, the target is the genome; for come sequencing, the was not enriched and used HiSeq 2500 sequencing. CMHDx1 was erviched target is the exome and for CMH-Corl, the targets are 523 genes. Sample Target Sequencing Alignment Sequence Average coverage of Variants detected Genotypes Identical to enrichment method method (GB) target nucleotides by rapid WGS both methods (%) CMH064 Exorne HiSeq 2000 GATK/GSNAP 9.8 79 46,756 (96.0%) 99.4 None (WGS) HiSeq 2500 12.1 40 UDT173 CMH-Dxl HiSeq 2000 GATK/GSNAP 4.1 784 1539 (96.7%) 99.60 None (WGS) HiSeq 2500 13.9 46 UDT173 CMH-Dxl HiSeq 2000 GATK/GSNAP 4.1 784 1457 (83.0%) 99.9 None (WGS) HiSeq 2500 CASAVA 13.9 46 UDT002 CMH-Dxl HiSeq 2000 GATK/GSNAP 4.2 770 1341 (76.6%) 99.5 None (WGS) HiSeq 2500 CASAVA 13.3 44 www.ScienceTranslationalMedicine.org 3 October 2012 Vol 4 Issue 154 154ra135 4 EFTA00315104 RESEARCH ARTICLE heterozygous variant; table S6). Five candidate genes had 100% nu- (46). Dideoxy sequencing confirmed the variant to be homozygous in cleotides with >16-fold high-quality coverage and, thus, lacked a known CMH172 and heterozygous in both parents. pathogenic mutation in an exon or within 20 nucleotides of the intron- Rapid WGS was performed simultaneously on proband CMHI84 exon boundaries. Eighteen of the candidate genes had >99% nucleotides (male), affected sibling (brother) CMHI85, and their healthy parents, with >16-fold high-quality coverage, and 31 had >95% nucleotides with CMH186 and CMH2O2. Twelve genes have been associated with the at least this level of coverage. Furthermore, while 26 of the candidate clinical features of the brothers (heterotaxy and congenital heart dis- genes had pseudogenes, paralogs, and/or repeat segments (table S6) ease table S9). Co-occurrence in two siblings strongly suggested reces- that could potentially result in misalignment and variant miscalls, only sive inheritance. No known disease-causing variants or homozygous/ 0.03% of target nucleotides had poor alignment quality scores. compound heterozygous VUS with low allele frequencies were identi- Among the 35 candidate genes nominated by the phenotype, two fied in these genes. A genome-wide search of novel, homozygous/ rare heterozygous VUS were detected in CMH064; however, dideoxy compound heterozygous, likely pathogenic VUS that were common sequencing of both healthy parents exduded one, in the keratin 14 to the affected brothers and heterozygous in their parents yielded gene, as a de novo mutation. The exomes of both parents were sub- two nonsynonymous variants in the B cell CLL/ghc -like gene sequently sequenced, and variants were examined in the trio at length. (BCL9L, Chr 11:118,772,350G>A,c.2102G>A, and Chr Three likely de novo mutations with excellent sequence coverage were 11:118,774,140G>A, c.554C>T, i. Evidence supporting the identified in disease-causing genes. Of these, one was a candidate gene candidacy of BCL9L for heterotaxy and congenital heart disease is for CMH064. It was an in-frame deletion of three nucleotides in GIB2, presented below. (NM_004004), which encodes the connedn 26 protein. The variant, 0 c85_87de1, removes a highly conserved amino acid within 0 the first transmembrane helix (43). Dideoxy sequencing confirmed it DISCUSSION to be a de novo mutation. Dominant, de novo 6,1B2 mutations have a's been associated with severe neonatal lethal disorders of the skin, such Genomic medicine, empowered by WGS, has been heralded as trans- g as keratitis-ichthyosis-deafness syndrome (KIDS), that involve the formational for medical practice (2, 4, 5, 47). Over the last several y suprabasilar layers of the epidermis (OMIM number 148210) (44). The years, the cost of WGS has fallen markedly, potentially bringing it LI- phenotype of CMH064 was atypical for KIDS, and functional studies within the realm of cost-effectiveness for high-intensity medical prac- are in progress to determine causality definitively. tice, such as occurs in NICUs (3, 8, 23, 24). Furthermore, experience IF Diagnoses suggested by the presentation in CMH076 were mito- has been gained with clinical use of WGS that has instructed initial ; chondrial disorders, organic acidemia, or pyruvate carboxylase defi- guidelines for its use in molecular diagnosis of genetic disorders (9). ciency. Together, 75 nuclear genes and the mitochondrial genome However, a major impediment to the implementation of practical ge- cause these diseases (table S7). A negative family history suggested re- nomic medicine has been time to result cessive inheritance that resulted from compound heterozygous or hemi- This limitation has always been a problem for diagnosis of genetic .0 a) zygous variants or a heterozygous de novo dominant variant. Rapid disease& Time to result and cost have greatly constrained the use of 1.'1 WGS excluded known pathogenic mutations in the candidate genes. serial analysis of single-gene targets by dideoxy sequencing, Hitherto, One novel heterozygous VUS was found. However, de novo occur- clinical use of WGS by NGS has also taken at least a month: Sample rence of this variant was ruled out by exome sequencing of his healthy preparation has taken at least a day; clustering 5 hours; 2 x 100 nu- O parents. No homozygous or compound heterozygous VUS with suit- cleotide sequencing 11 days; alignment, variant calling, and genotyping ably low allele frequencies were identified in the known disease genes. 1 day, variant characterization a week and clinical interpretation at least Potential novel candidates included 929 nuclear genes that encode a week Although exome sequencing lengthens sample preparation by 113 mitochondrial proteins but have not yet been associated with a genetic several days, it decreases computation time somewhat and is less costly. t disease (45). Only one of these had a homozygous or compound het- For use in acute care, the turnaround time of molecular diagnosis, 3 erozygous VUS with an allele frequency in dbSNP and the CMH including analysis, must match that of medical decision-making, which database that was sufficiently low to be a candidate for causality in ranges from Ito 3 days for most acute medical care. Herein, we de- an uncommon inherited disease. Deep exome sequencing of both scribed proof of concept for 2-day genome analysis of acutely ill neo- parents excluded this variant and did not disclose any further poten- nates with suspected genetic disorders. tially causal variants. A total of 174 genes are known to cause epilepsy of the type found Automating medicine in CMH172 (table S8). A positive family history of neonatal epilepsy and Rapid WGS was made possible by two innovations. First, a widely evidence of shared parental ancestry strongly suggested recessive inheri- used WGS platform has been modified to generate up to 140 GB of tance. No known disease-causing variants or homozygous/compound sequence in less than 30 hours (HiSeq 2500): Sample preparation took heterozygous VUS with low allele frequencies were identified in these 4.5 hours, and 2 x 100 bp genome sequencing took 25.5 hours (Fig. 1). genes, which largely excluded them as causative in this patient A genome- The total "hands-on" time for technical staff was 5 hours. Modifica- wide search of homozygous, likely pathogenic VUS that were novel in tions included a new flowcell design and faster imaging and chemistry. the CMH database and dbSNP disclosed a frame-shifting insertion in Previously, NGS has either lacked sufficient sequence quantity, quality, the BRCA)-associated protein required for ATM activation-I (BRAT), or read lengths for clinical use of WGS or been too slow for use in acute Chr 7:2,583,573.2,583,574insATCITCTC,c453_454insATCITCTC, patient care. Rapid WGS generated -40-fold aligned genome coverage. . A literature search yielded a very recent study of The sequence quality was very similar to that obtained with its predeces- BRAT) mutations in two infants with lethal, multifocal seizures, hyper- sor (HiSeq 2000), as determined by quality scores and alignment rates tonia, microcephaly, apnea, and bradycardia (OMINI number 614498) (48). Genotypes of nucleotide variants were >99.5% concordant with wwwSdenceTranslatIonalMedicintorg 3 October 2012 Vol 4 Issue 154 154ral 35 5 EFTA00315105 RESEARCH ARTICLE those of very deeply sequenced, partial exomes (33). The accuracy of the features. In male UDT173, a hemizygous (X-linked) VUS was identi- latter has been extensively benchmariced and is >99.9% (33). fied in the single candidate gene matching all clinical features. The Second, we automated much of the onerous characterization of ge- variant, a nonsynonymous nucleotide substitution, was predicted to nome variation and facilitated interpretation by restricting and prior- be damaging. Rapid WGS also provided a definitive diagnosis in itizing variants with respect to allele frequency (42), likelihood of a one of four infants enrolled prospectively. In CMH172, with refractory functional consequence (25), and relevance to the prompting illness. epilepsy, rapid WGS disclosed a novel, homozygous frame-shifting Thus, rapid WGS, as described herein, was designed for prompt dis- insertion in a single candidate gene (BRATI). BRATI mutations were ease diagnosis rather than carrier testing or newborn screening. SSAGA very recently reported in two unrelated Amish infants who suffered mapped the clinical features in ill neonates and children to disease lethal, multifocal seizures (46). A molecular diagnosis was reached genes. Thereby, analysis was limited only to the parts of the genome within 1 hour of WGS data inspection in CMH172, even though a- relevant to an individual patient's presentation, in accord with guide- tant reference databases [Human Gene Mutation Database (HGMD) lines for genetic testing in children (25-28). This greatly decreased and OMIMI had not yet been updated with a BRATI disease associ- the number of variants to be interpreted. In particular, SSAGA caused ation. The diagnosis was made clinically reportable by resequencing most incidental (secondary) findings to be masked. In the setting of the patient and her parents. Had this diagnosis been obtained in real acute care in the NICU, secondary findings are anticipated to impede time, it may have expedited the decision to reduce or withdraw facile interpretation, reporting, and communication with physicians and support. The latter decision was made in the absence of a molecular patients greatly (9, 49, 50). SSAGA also assisted in test ordering, diagnosis after 5 weeks of ventilatory support, testing, and unsuc- itr permitting a broad selection of genes to be nominated for testing based cessful interventions to control seizures. Given high rates of NICU a on entry of the patients' clinical features with easy-to-use pull-down bed occupancy, accelerated diagnosis by rapid WGS has the potential menus. The versi
ℹ️ Document Details
SHA-256
3a0c128d499a89c23b9b73dbe61e4d4da2827da31858430392d87e6787a5cdfe
Bates Number
EFTA00315100
Dataset
DataSet-9
Document Type
document
Pages
14

Comments 0

Loading comments…
Link copied!