DATABASES AND GENOME PROJECTS
Am J Pharmacogenomics 2005; 5 (4): 233-246 1175-2203/05/0004-0233/$34.95/0 © 2005 Adis Data Information BV. All rights reserved.
The Autism Genome Project Goals and Strategies Diane Hu-Lince, David W. Craig, Matthew J. Huentelman and Dietrich A. Stephan Neurogenomics Division, Translational Genomics Research Institute, Phoenix, Arizona, USA
Contents Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233 1. Clinical Description of Autism . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 234 2. The Search for Candidate Genes in Autism . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235 2.1 Genetic Insight into Autism through Cytogenetically Visible Chromosomal Abnormalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 236 2.2 Hypothesis Driven Genetic Studies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237 2.3 Whole Genome Linkage Scan Results in Autism . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 238 3. The Autism Genome Project: Phase I . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 238 3.1 Whole-Genome Linkage Scanning in Complex Traits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239 3.2 Single Nucleotide Polymorphisms (SNP) Genotyping and Microsatellite Marker Genotyping . . . . . . . . . . . . . . . . . . . . . . . . . . . . 240 3.2.1 Accuracy and Reproducibility . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 240 3.2.2 Informativeness of SNPs versus Microsatellites . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241 3.2.3 Linkage Disequilibrium as a Confounding Factor in High Density SNP Linkage Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241 3.2.4 Availability of Analysis Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241 3.2.5 The GeneChip® DNA Analysis Software (GDAS) Port . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241 3.2.6 Common Existing Analysis Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242 3.2.7 New Analysis Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242 3.2.8 Workflow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242 4. The Autism Genome Project: Phase II . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242 5. Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243
Abstract
Autism is a complex neurodevelopmental disorder with a broad spectrum of symptoms and varying severity. Currently, no biological diagnosis exists. Although there has been a significant increase in autism genetics research recently, validated susceptibility genes for the most common, sporadic forms of autistic disorder, as well as familial autism, have yet to be identified. The identification of autism-susceptibility genes will not only assist in the identification and/or development of better medications that can help improve the health and neurodevelopment of children with autism, but will also allow for better perinatal diagnosis. The Autism Genome Project (AGP) is a large-scale, collaborative genetics research project initiated by the National Alliance for Autism Research and the National Institutes of Health, and is aimed at sifting through the human genome in search of autism-susceptibility genes. Phase I of the AGP will consist of genome-wide scans utilizing both SNP array and microsatellite technologies. Linkage analysis will subsequently be performed on approximately 1500 pedigrees as will downstream fine-mapping and sequencing of the critical linkage intervals. Ultimately, the vision will be to identify the exact nucleotide variants within genes which give rise to predisposition. The AGP intends to move the field of autism clinical management forward by answering questions about the causal mechanisms underlying the pathophysiology of autism. From this knowledge, therapeutic targets for drug treatments, and ultimately, a newborn screening diagnostic that would allow for early intervention, can begin to be developed.
234
In recent years, interest in deciphering autism has increased markedly. This may be attributed to a sudden reported increase in the incidence of autism, and better diagnosis. Autism now ranks as the leading childhood developmental disorder in the US, with an incidence of 1 in 166 births and a rate four times higher in boys than in girls.[1-3] There are currently no specific diagnostic criteria to determine which children are at risk before the onset of the disorder, and no effective therapeutic once the disorder strikes. Progress toward finding susceptibility genes for autism has been impeded because of the multigenic nature of the disorder, a lack of access to tissue at the site of pathology, and a lack of robust rodent models of the trait. Recently, the Autism Genome Project (AGP) has been nucleated by both the National Alliance for Autism Research (NAAR) and National Institutes of Health (NIH) in an attempt to identify the community-specific nucleotide variants that are implicated as influencing the phenotype. The AGP is a large-scale, collaborative genetics research project drawing on very large patient cohorts from around the globe to garner sufficient statistical power to tease out the multiple underlying genetic roots of this disorder. Over 40 academic and research institutions across North America and Europe (table I) will band together to contribute approximately 7000 samples from approximately 1500 multiplex families (a family with at least two children affected with autistic spectrum disorder) to this effort. This will be the largest genome scan for autism ever performed. The first phase of the AGP will consist of genome-wide scans utilizing both single nucleotide polymorphisms (SNP) and microsatellite technologies. The Translational Genomics Research Institute (TGen), a nonprofit biomedical research institute, in conjunction with Affymetrix, Inc., will perform the genome scan utilizing high-density SNP genotyping arrays. A concurrent microsatellite marker-based linkage scan on the identical cohort will be conducted by the Center for Inherited Disease Research (CIDR) [figure 1]. Linkage analysis will subsequently be performed on the approximately 1500 pedigrees by a data coordinating site. This article highlights the current state of knowledge regarding the genetics of autism. Many other excellent reviews are available in the literature.[4-7] 1. Clinical Description of Autism Autism is a neurodevelopmental disorder characterized by varying degrees of impairment in three behavioral domains: (i) social interaction; (ii) language, communication, and imaginative play; and (iii) range of interests and activities.[8,9] For the past 50 years diagnosis of autism has relied on clinical presentation only. Currently there is no single biological or clinical marker for autism. © 2005 Adis Data Information BV. All rights reserved.
Hu-Lince et al.
The Autism Diagnostic Interview-Revised (ADI-R) has been used extensively as a comprehensive interview to aid in the diagnosis and treatment of autism.[10] Developmental abnormalities are evident by the age of 3 years and persist into adulthood.[2,6] Though it is difficult to accurately estimate the population prevalence of autism, it is probably within the range of 4–6 per 10 000 and some speculate even in the range of 40 per 10 000.[3,11] This significant increase in prevalence rates may be a reflection of changes in case definition and improved recognition. For reasons unknown, the prevalence in males is four times that of females.[3] The first clinical descriptions of autism were brought forth by psychiatrist Leo Kanner and pediatrician Hans Asperger in the early 1940s. Evidence to suggest that autism is genetic stems from family and twin studies. Family studies support the recurrence risk to siblings to be 2–6%, significantly higher than the risk to the general population (<0.1–0.2%).[12] Twin studies have also shown the importance of genetic factors in the etiology of autism. Depending on either narrow or broad phenotype, the concordance rate when one monozygotic (MZ) twin is diagnosed with autism is 70–90%. In contrast, a 0–25% concordance rate is observed among same-sex dizygotic (DZ) twins.[13-16] The overlap of autism with known genetic disorders such as fragile X,[17,18] tuberous sclerosis,[19] Prader-Willi/Angelman syndrome,[20] and neurofibromatosis[21] further supports a genetic predisposition to autism. Moreover, cytogenic abnormalities associated with autism phenotypes have been found on practically every chromosome, most notably chromosomes 7, 15, and X.[22] Despite evidence from family and twin studies, few significant genetic linkages to autism have been identified. The plethora of cytogenetic abnormalities and linkage regions found to be associated with autism phenotypes suggest that genetic heterogeneity is likely to be frequent across sporadic patients. The rate by which autism frequency drops among first, second, and third degree relatives is another indication that disease susceptibility arises from the combined effects of multiple genes.[23,24] Whole-genome screens performed on multiplex families with semblances of Mendelian inheritance patterns suggest that as many as ten or more genes may be interacting individually or in concert to cause the autism phenotype.[8,25] Additionally, allelic heterogeneity is also likely to be frequent. An example of allelic heterogeneity can be found in the monogenic disorder cystic fibrosis, whereby many different mutations in a single gene (CFTR) have been found to contribute to the disorder.[26] Similarly, distinct defects in the same gene may lead to the same or different patterns of autism. This, and the multigenic nature of autism, poses a considerable challenge for genetic analysis. Am J Pharmacogenomics 2005; 5 (4)
The Autism Genome Project
235
Table I. Participants in the Autism Genome Project Autism Genetics Cooperative (AGC) The International Molecular Genetic The Collaborative Programs of Study of Autism Consortium (IMGSAC) Excellence (CPEA)
Autism Genetics Resource Exchange (AGRE)
Duke University Medical Center (Durham, NC, USA) University of North Carolina (Chapel Hill, NC, USA) University of South Carolina (Columbia, SC, USA) University of Iowa (Iowa City, IA, USA) Stanford University (Palo Alto, CA, USA) Mt Sinai Medical Center (New York, NY, USA) Vanderbilt University (Nashville, TN, USA) INSERM U513 (Creteil, France) Institut Pasteur (Paris, France) Goteborg University (Goteborg, Sweden) McMaster University (Hamilton, ON, Canada) University of Toronto (Toronto, ON, Canada) Trinity College of Dublin (Dublin, Ireland) University College of Dublin (Dublin, Ireland) Instituto Gulbenkian de Ciencia (Oeiras, Portugal)
University of California at Los Angeles (Los Angeles, CA, USA)
Institute of Psychiatry (London, England) Wellcome Trust Centre for Human Genetics University of Oxford (Oxford, England) University Section of Child and Adolescent Psychiatry University of Oxford (Oxford, England) Newcomen Centre at Guy’s Hospital (London, England) University of Newcastle (Newcastle Upon Tyne, England) University of Manchester (Manchester, England) Porton Down, ECACC (Wiltshire, England) Videnscenter og Centre for Autisme/ National Center for Autism (Virum, Denmark) Centre d’Etudes et de Recherches en Psychopathologie (CERPP) [Toulouse, France] Deutsches Krebsforschungszentrum (Heidelberg, Germany) J.W. Goethe Universitat (Frankfurt, Germany) “Agia Sophia” Children’s Hospital/ University Department of Child Psychiatry (Athens, Greece) University of Bologna (Bologna, Italy) AZU-Department of Child and Adolescent Psychiatry (Ultrecht, The Netherlands) McGill University/Montreal Children’s Hospital (Montreal, QC, Canada) University of Michigan (Ann Arbor, MI, USA) Yale University (New Haven, CT, USA) University of Chicago (Chicago, IL, USA)
2. The Search for Candidate Genes in Autism Diseases with complex inheritance are presumed to be caused by multiple genes interacting with one another and/or with environmental factors.[27] Examples of complex genetic diseases include asthma, diabetes mellitus, epilepsy, hypertension, bipolar disorder, schizophrenia, and autism. The genetic etiology of many complex diseases has been progressively studied in order to gain a better understanding of their pathogenesis with the eventual goal © 2005 Adis Data Information BV. All rights reserved.
Boston University (Boston, MA, USA) University of California at Los Angeles (Los Angeles, CA, USA) University of Rochester Medical Center (Rochester, NY, USA) University of Pittsburgh (Pittsburgh, PA, USA) University of California at Irvine (Irvine, CA, USA) Yale University (New Haven, CT, USA) University of Washington (Seattle, WA, USA) University of Utah (Salt Lake City, UT, USA)
of improving diagnostic tools, preventative strategies, and therapies.[28] Genetic mapping has been a very powerful tool for identifying human disease genes. In the last 20 years, family-based linkage analysis has led to the identification of many genes that control Mendelian traits but only a paucity of genes underlying complex diseases.[29] Unlike the predicted pattern of inheritance seen in autosomal or X-linked recessive and dominant disorders, the discovery of susceptibility genes in complex human diseases has Am J Pharmacogenomics 2005; 5 (4)
236
Hu-Lince et al.
43 institutions 7000 DNA samples 1500 multiplex families
Phase 1 Linkage analysis
Phase 2
Single nucleotide polymorphism array technology (TGen, Affymetrix, Inc.)
Fine mapping of linkage regions
Microsatellite technology (CIDR)
Nucleotide variant identification Mouse model generation
Fig. 1. The Autism Genome Project process (organized by the National Alliance for Autism Research [NAAR] and the National Institutes of Health [NIH]). CIDR = Center for Inherited Disease Research; TGen = Translational Genomics Research Institute.
been complicated by a host of factors including considerable etiological heterogeneity, the influence of any single gene-variant on diseases status having a small effect, and the associated requirement for large samples. In addition, mapping of human susceptibility loci can be hampered by any or combinations of the following: incomplete penetrance, high population frequency, phenocopies, genetic heterogeneity, possible epistasis, and pleiotropy.[27-30] Nonetheless, genetic linkage analysis has successfully identified a number of susceptibility genes for complex diseases such as type 1 diabetes mellitus[31-33] and Alzheimer disease.[34-36] Also, some rare Mendelian families can provide insight into the more common sporadic forms of the trait. In recent years several approaches have been employed to locate autism susceptibility genes. Among these are cytogenetic studies, hypothesis-driven studies of very specific biological processes, and genome-wide linkage scans. A brief overview of the results of each approach is given here. Table II gives a summary of the current positive associations found between candidate gene allelic variants and autism. 2.1 Genetic Insight into Autism through Cytogenetically Visible Chromosomal Abnormalities
Structural chromosomal abnormalities are usually of one of three types: (i) deletions, where material is lost from a single chromosome; (ii) inversions, where there are two breaks within a single chromosome and the broken segment inverts and reattaches to form a chromosome that is structurally out of sequence; and (iii) translocations, where there is an exchange of material between two or more chromosomes. It is thought that less than 10% of autism cases result from chromosomal abnormalities;[62,63] however, cytogenetic scans have provided some insight into locating specific genes or chromosomal loci potentially associated with autism. Cytogenetic abnormalities on chromosome 15 have been identified in several individuals with autism, indicating that this region may harbor potential susceptibility genes for autism.[22,49,63,64] Various studies have described duplications,[49,63,65-67] dele© 2005 Adis Data Information BV. All rights reserved.
tions,[53,63,68,69] and inversions[68,69] at locus 15q11-q13. Several chromosome 15 abnormalities resulting in features characteristic of autism are inherited duplications of maternal origin,[63,65] which suggests the possibility of an imprinting mechanism in the regulation of gene expression. This is characterized to varying degrees by language delay, ataxia, epilepsy, mental retardation, and facial dysmorphology. Hence, the phenotypic overlap with the autistic phenotype implicates this region as a potential autism hotspot. One of the potential candidate genes within this locus is the γaminobutyric acid (GABAA) receptor gene cluster, containing genes coding for the α5, β3 and γ3 receptor units (GABRA5, GABRB3, and GABRG3, respectively). GABA is expressed in early development and is the principal inhibitory neurotransmitter in the mammalian CNS, controlling excitability in the adult brain.[70] One initial study identified an association at 155CA-2, a microsatellite repeat polymorphism within the GABRB3 gene;[49] this finding was replicated in another study.[50] Several other studies, however, have not been able to replicate this finding.[71-73] In addition to the GABAA cluster, the ubiquitin protein ligase E3A gene (UBE3A) has also been implicated in the autistic phenotype.[53] This gene is expressed predominantly in the human brain and is regulated by mechanisms involved in imprinting and possibly silencing by antisense RNA transcribed from the paternal chromosome.[53,66,74] One sibling pair study revealed association at D15S122, a microsatellite marker located at the 5′ end of the UBE3A gene.[53] It is noteworthy to mention that individuals who possess an abnormality at chromosome 15q11-q13 do not necessarily develop autism. This finding further strengthens the multigenic nature of the disorder and suggests that multiple genes on different chromosomes may be acting individually or in concert. The chromosome 7q22-23 region has also been associated with chromosomal translocations.[22,75-77] Reelin, a protein that is encoded by the RELN gene at the site of the translocation at 7q22, is a large secreted glycoprotein potentially involved in neural migration during development. Specific mutations in RELN have been identified and cause autosomal recessive lissencephaly, a disorder of failed neuronal migration.[78] One study of autistic disorder reported transmission of longer triplet repeats (CGG) in the 5′UTR of RELN, conferring vulnerability to autistic disorder.[43] Another study concluded that the overall transmission disequilibrium was not significant but that ‘large’ alleles (≥11 repeats) were preferentially transmitted.[42] Further studies, however, have found no evidence for an association between the trait and the RELN trinucleotide repeat.[79-81] A plethora of other genes located at the 7q22-q33 locus have also shown an association with the autistic phenotype, though further validation is warranted.[40,77,82-84] Am J Pharmacogenomics 2005; 5 (4)
The Autism Genome Project
237
Table II. Positive associations found between candidate gene allelic variants and autism Genea
Physical location
Phenotype
Study size and design
Reference
SLC25A12
2q24-q33
Autism
411 Autistic families; linkage and association
37
HLA-DRB1
6p21
Autism
50 Patients versus 79 controls
38
GRIK2
6q21
Autism
107 trios and 57 multiplex families; linkage and association
39
LAMB1
7q31
Autism
48 individuals
40
30 families
41
NRCAM
7q31
Autism
48 individuals
40
RELN
7q22
ASD-with delayed phrase speech
126 families
42
7q22
Autism
95 individuals; linkage and association 172 trios; TDT
43
GRM8
7q31
Autism
196 families; TDT
44
WNT2
7q31-q33
Autism with severe 50 families language abnormality
45
7q32
Autism
170 multiplex families; TDT with 76 markers
46
D721804 b D7S2533
b
7q33
Autism
170 multiplex families; TDT with 76 markers
46
EN2
7q36
Autism
138 trios; TDT
47
HRAS
11p15
Autism
Case versus control (n = 55; n = 55)
48
GABRB3
15q11.2-q13
Autism
140 multiplex families; LD
49
GABRB3
15q11-q13
Autism
80 families; TDT
50
GABRA5
15q11.2-q13
Autism
123 multiplex families; LD
51
GABRG3
15q11.2-q13
Autism
226 families; PDT
52
UBE3A
15q11-q13
Autism
94 multiplex families; LD
53
OMG
17q11
Autism (DQ >30)
Case versus control (n = 37; n = 101)
54
BLMH
17q11
Autism
81 trios; TDT
55
SLC6A4
17q11-q12
Autism
86 trios; TDT
56
17q11-q12
Autism
52 trios; TDT
57
17q11-q12
Autism
33 families
58
17q11-q12
Autism
71 patients; TDT
59
ADA
20q13
Autism
118 patients versus 126 controls
60
DXS287 b
Xq23
Infantile autism
Case control
61
a
All genes are reported using approved symbols according to the Human Genome Nomenclature Committee database (see URL: http:// www.gene.ucl.ac.uk/cgi-bin/nomenclature/searchgenes.pl), except where otherwise indicated.
b
Chromosomal marker.
ASD = autistic spectrum disorder; DQ = development quotient; LD = linkage disequilibrium; PDT = pedigree disequilibrium test; TDT = transmission disequilibrium test.
2.2 Hypothesis Driven Genetic Studies
A priori knowledge of biologically plausible gene candidates and pathways is the basis for hypothesis driven research. These studies predict the involvement of certain candidate genes based on clinical and empirical evidence. Studies have shown that serotonin reuptake inhibitors, dopamine antagonists as well as androgenic drugs, have a positive effect on autistic behavior.[85] Hence, genes that code for neurotransmitters or receptors of these substances are potential candidates for autism. © 2005 Adis Data Information BV. All rights reserved.
Serotonin is a neurotransmitter involved in many functions, including mood, appetite, and sensory perception. Altered serotonin levels may contribute to the core behavioral characteristics of autism.[85,86] The serotonin transporter gene (SLC6A4, also known as 5-HTT) has been studied extensively by many researchers. One study found preferential inheritance of a short promoter variant of SLC6A4 in affected individuals, while others have reported that affected family members inherited the long promoter variant more frequently.[57,58] However, other autistic populations have not Am J Pharmacogenomics 2005; 5 (4)
238
Hu-Lince et al.
shown an association between autism and serotonin transporter promoter variants. Dopamine is a neurotransmitter of the catecholamine family, which also includes norepinephrine and epinephrine and is involved in emotional behavior. One study has shown that children with autism have abnormal dopaminergic activity in the low medial prefrontal cortex[87] as well as elevated levels of catecholamines in the blood, urine, and cerebrospinal fluid.[88,89] Studies of the dopamine receptors D2, D3, and D5, the tyrosine hydroxylase gene (TH) and the dopamine β hydroxylase gene (DBH), however, have given few positive results.[89,90] Other avenues of interest include the homeo box (HOX) genes that regulate hindbrain development, differentiation of the urogenital system and appendicular skeletal growth,[91,92] the glutamatergic system,[93,94] and the arginine-vasopressin system.[95] 2.3 Whole Genome Linkage Scan Results in Autism
In linkage analysis, the goal is to identify a region of the genome that co-segregates with a trait through multiple meiotic events. The strategy of linkage analysis is to utilize polymorphic markers (microsatellite markers or SNPs) to discriminate which alleles are inherited through each meiotic event. Nonparametric, or model-free, linkage analysis approaches are typically favored for complex traits because they make no assumptions about disease transmission. This approach also relies on the use of sibling pairs to identify increased sharing of alleles among affected family members.[96] To date, at least 13 whole genome scans and several follow-up linkage studies targeting particular regions of the genome have been published in efforts to identify susceptibility genes in autism.[8,97-108] With the exception of regions on 2q where two studies found strong linkage signals[97,102] and chromosome 16p, where three studies showed suggestive signals,[97,100,101] most of these studies did not report any significant results and only give indications to either possible genetic loci or mild-to-moderate evidence for linkage. Whole-genome linkage scans conducted between 1998 and 2001 show the strongest evidence for linkage on chromosome 7q, whereas suggestive linkage was reported in three studies.[97,98,101] Many prior reviews have summarized these whole-genome linkage studies in detail.[4,62,109] Unfortunately, though some linkage peaks increased with the addition of new samples, suggestive linkage peaks from the first study decreased. Interestingly, the overlap of regions between studies or even in follow-up studies performed by the same group using the identical polymorphic marker panels in those independent cohorts is rare. Despite concerted efforts, no single gene has been identified that explains a significant proportion of cases of autism. It is becoming © 2005 Adis Data Information BV. All rights reserved.
increasingly evident that better and more powerful strategies aimed at identifying the genetic underpinnings of autism are warranted. 3. The Autism Genome Project: Phase I The completion of the Human Genome Project has paved the way for the study of complex human diseases. Recent progress in genomics, especially the development of high-throughput platforms for large-scale genotyping, has the potential to dramatically accelerate our ability to identify rare or, more likely, common polymorphisms that may be causal in both Mendelian and complex diseases. In phase I of the AGP, linkage analysis will be performed in a huge cohort of more than 1500 multiplex pedigrees collected from around the world using both the standard microsatellite linkage panels (at CIDR/NIH) and the Affymetrix >10 000 SNP array (GeneChip® Mapping 10K Array, Affymetrix, Inc., Santa Clara, CA, USA) and all data will be analyzed in an integrated fashion. By genotyping thousands of SNPs across the genome, it is possible to track regions of chromosomes that co-segregate through a pedigree with a trait. Historically, in a whole-genome scan, multiple affected members in a pedigree segregating a single-gene defect (Mendelian inheritance pattern) are genotyped using a set of 300–400 microsatellite markers spaced at 10 centiMorgan (cM) intervals across the genome. Microsatellite markers are polymorphic stretches of DNA that consist of tandem repeats of a simple sequence of nucleotides, which vary in length throughout the population.[110] If a gene that is responsible for the disease exists somewhere in the genome, affected family members are expected to have inherited the same disease-predisposing allele at the locus, and markers that lie physically near this disease gene will be transmitted along with the disease allele. With increasing density of the marker panel, the likelihood that at least one of these markers will be in sufficient proximity to the disease gene is significantly increased. As mentioned previously, the detection of genes for complex phenotypes is hampered by the relatively low genetic relative risk conferred by each locus. Bolstering information content by either genotyping more families or adding more markers could potentially decrease false-positive results and increase the detection of true susceptibility loci. While the use of microsatellite markers spaced at intervals of ~10cM across the genome has been the traditional method employed in genome-wide linkage scans, SNPs are beginning to emerge as an alternative to whole-genome genotyping because of their ease of use and ability to perform massively parallel genotyping. SNPs are variations in DNA sequence that occur when a single nucleotide, adenine (A), thymine (T), cytosine (C), or Am J Pharmacogenomics 2005; 5 (4)
The Autism Genome Project
guanine (G), in the genome sequence is altered. For example, a SNP might change the DNA sequence from GAGCCTA to GTGCCTA. While 99.9% of the human genome is identical between two individuals, it is estimated that a SNP can be found approximately every kilobase.[111-113] A map of closely spaced SNPs may offer equal or superior power to detect linkage, compared with low-density microsatellite maps even though they are individually less informative (only bi-allelic) than microsatellites.[114,115] Though there are several types of polymorphisms in the human genome (SNPs, repeat polymorphisms, and deletions or insertions, ranging in size from a single base pair to thousands of base pairs), DNA sequence variation in the human genome is predominantly in the form of SNPs.[116] SNPs can have very different consequences at the phenotypic level depending on the location. For example, SNPs residing in the coding regions of genes that alter the function or structure of the encoded proteins are the cause of classic single nucleotide mutations in most of the known recessively or dominantly inherited monogenic disorders. Other groups of SNPs alter the primary structure of a protein involved in drug metabolism and are, thus, obvious targets for pharmacogenetic intervention.[117] Missense SNPs that reside in the coding regions of genes also contribute to common diseases. Examples of missense SNPs include SNPs found in the apolipoprotein E (APOE) gene[118] and the factor V Leiden mutation.[119] An assessment of the risk of a particular disease can be accomplished by analyzing the SNP in question. Lastly, SNPs that reside in the regulatory regions of genes might influence the risk of common disease. Because most SNPs are found in noncoding regions of the human genome, they have no known direct impact on the phenotype of the individual. Thus, these types of SNPs are useful genetic markers to localize nearby disease-causing events through classic linkage analysis, association analysis, and linkage disequilibrium studies in both founder and outbred populations.[111-113,120] The use of SNPs in highdensity genotyping offers an alternative to microsatellites because of the sheer number of markers, which provide higher overall information that is more evenly distributed.[121] One of the first high-density whole-genome studies demonstrated that a carefully selected set of 2988 SNPs is more informative than the Marshfield Clinic screening microsatellite version.[122] 3.1 Whole-Genome Linkage Scanning in Complex Traits
The successful use of high-density, whole-genome SNP genotyping for linkage analysis has been demonstrated in several studies in the literature. For example, Puffenberger et al.[123] identified the causative mutation in the TSPYL gene (encoding © 2005 Adis Data Information BV. All rights reserved.
239
testis-specific protein-like 1) from a 3.6 Mb autozygous region in a study of 21 individuals with sudden infant death with dysgenesis of the testes (SIDDT) syndrome using the GeneChip® Mapping 10K Array and Assay Kit containing >10 000 SNPs on a single silicon array. In 36 individuals, Shrimpton et al.[124] identified a single missense mutation in the HOXD10 gene from a 6 Mb region in an autosomal dominant form of Charcot-Marie-Tooth disease also using the Affymetrix GeneChip® Mapping 10K Array. In addition, Sellick et al.[125] were able to identify three loci for three separate diseases (two of which were novel), using the GeneChip® Mapping 10K Array. Moreover, there have been several examples of linkage studies in complex diseases where the informativeness of the 10K SNP panel was compared with the common microsatellite panels. For example, Schaid et al.[126] identified in 467 men with prostate cancer from 167 families several genetic loci including chromosomes 8, 2, 6, and 12 that were missed using 402 microsatellite markers with the ABI PRISM® Linkage Mapping Set (Applied Biosystems, Foster City, CA, USA). This study also provided a direct comparison between traditional microsatellite markers and SNP markers for genotyping, revealing that the information content increased from 41% for microsatellite markers to 61% for SNPs. The finding that the ~10 000 SNPs have a higher and more evenly distributed total information content than ~400 microsatellite markers has been corroborated by other groups using simulation approaches, as calculated by the program Merlin.[114,127] Other examples of linkage studies that compare SNPs to a microsatellite panel include studies on rheumatoid arthritis and bipolar disorder.[115,128] It was found that the SNP panel consistently gave better coverage and a higher information content (IC) compared with the microsatellite assays and, in addition, that linkage peaks identified would have gone undetected in previous whole-genome scans. Moreover, Murray et al.[129] were able to demonstrate that the information content for the Illumina® Linkage III SNP panel (Illumina, Inc., San Diego, CA, USA) was higher than both a 10cM and 5cM genome scan using conventional microsatellite markers. Additionally, there has been one study whereby 811 microsatellites have been compared with both the GeneChip® Mapping 10K Array platform, and the BeadArray™ platform (Illumina, Inc.). It was found that the GeneChip® system extracts marginally more information, that the BeadArray™ system exhibits slightly higher call rates, and that both high-density SNP platforms illustrate a higher information content than the 811 Applied Biosystems high-density linkage mapping microsatellite marker set (HDLMS) [GeneChip® 91.5%, BeadArray™ 90.7%, HDLMS 82.8%].[130] The sheer density of coverage in the genome that SNPs provide has been shown to allow genes to be honed in on at a much quicker Am J Pharmacogenomics 2005; 5 (4)
240
Hu-Lince et al.
rate. This not only allows loci to be defined more precisely, but also provides the framework for conducting whole-genome casecontrol association studies. This type of study seeks to identify SNPs that statistically associate to complex, multi-genic diseases due to their close proximity to other, possibly unmapped, polymorphisms.[131,132] However, the number of SNPs required to realistically conduct a whole-genome association study is believed to be in the order of hundreds of thousands of SNPs spaced evenly across the genetic blueprint.[28] The AGP is a large-scale, collaborative genetics research project aimed at mapping the human genome in the search for autismsusceptibility genes. Together with over 40 academic and research institutions across North America and Europe, approximately 7000 samples from around 1500 multiplex families (a family with two or more children affected with autistic spectrum disorder according to the international classification of disease criteria[133]) will be combined to form the largest genome-wide linkage scan for autism to date. Phase I of the NAAR Genome Project will consist of genome-wide scans utilizing both SNP array and microsatellite technologies. Linkage analysis will subsequently be performed on the approximately 1500 pedigrees. Ultimately, the vision will be to identify the exact nucleotide variants within genes which give rise to predisposition. 3.2 Single Nucleotide Polymorphisms (SNP) Genotyping and Microsatellite Marker Genotyping
As stated previously, while linkage studies have historically relied on multi-allelic microsatellite markers, SNP markers are emerging as an alternative approach. High-density SNP genotyping offers an alternative to microsatellites in that while they are biallelic markers, the sheer number of markers typed provides higher overall information which is more evenly distributed.[134] The AGP will be comprehensive not only in that it is powered well for identification of predisposing alleles, but also because each DNA sample will be genotyped on both the Affymetrix GeneChip® Mapping 10K SNP array and the ~400 microsatellite marker panel. The utility of having every sample in the cohort genotyped on the identical set of markers will allow full integration of the assembled pedigrees – something that is never fully possible with meta-analyses post hoc. The Affymetrix GeneChip® DNA array technology is a direct extension of the more commonly used expression profiling GeneChip® assays.[135] In the 10K SNP genotyping assay, fluorescently labeled fragments of DNA containing specific SNPs are genotyped by whether or not they hybridize to a sequencespecific oligomer probe set tiled on a silicon wafer.[135] The GeneChip® Mapping Array assay uses whole-genome PCR ampli© 2005 Adis Data Information BV. All rights reserved.
fication of digested genomic DNA ligated to universal adaptors.[136-138] DNA is purified, biotin labeled, fragmented, and hybridized to a microarray. For each SNP, 40 tiled oligomer probes with 10 mismatched calls and 10 perfect matched calls interrogate each of 2 possible SNP alleles.[138] At the time of publication, three mapping panels were available using this platform: Affymetrix GeneChip® Mapping 10K Array, Affymetrix GeneChip® Mapping 100K Array, and an Affymetrix GeneChip® Mapping 500K Array in a two chip set. Array calls for each SNP are made by comparing the relative allele signal for each SNP to a trained database[139] (10K SNP panel) or by using a model-based algorithm[140] (100K SNP panel). The trained database model determines if the two relative allele signal (RAS) discriminators from both the sense and anti-sense direction fall into one of three silhouettes (homozygous AA, heterozygous AB, homozygous BB) derived from a training set. In comparison, the model-based algorithm calculates a log likelihood of all possible genotype models, homozygous, heterozygous, or no call, according to the hybridization signals for the 40 probes. A Wilcoxon Signed Rank test is used to compute confidence intervals. The model-based algorithm is necessary at higher densities since a training set is not always available for SNPs only found at low frequencies. Through these conservative calling algorithms, call accuracy is reported to be 99.9% and call rates (the number of SNPs successfully genotyped on each array) average 95%.[134] In our experience there are a set of approximately 400 SNPs out of the >10 000 which are consistently not called and account for the vast majority of the failed genotypes in a systematic fashion. 3.2.1 Accuracy and Reproducibility Accuracy
To assess the accuracy of the Affymetrix GeneChip® Mapping 10K SNP array platform, we have used three different methods. First, the genotypes generated by the Mapping 10K array were compared with genotypes generated by a single base extension (SBE or mini-sequencing) method across 40 Caucasian samples used by The SNP Consortium (TSC) allele frequency project. There was an average of 546 SNPs compared for each sample, for a total of 21 858 genotypes generated for this accuracy measure. 108 discordant genotypes were identified yielding a concordance rate of 99.5%. This represents a lower bound of accuracy, as we do not know if discordant genotypes were due to SBE errors or Mapping 10K errors. In the second method, sixty SNPs were genotyped in six individuals by di-deoxy sequencing analysis. There was only one discordant genotype among 341 compared, giving an overall concordance of 99.7%. The third method used PEDCHECK[141] to look at Mendel errors in five Centre d’Etude du Polymorphisme Humain (CEPH) trios.[134] Here, 104 errors Am J Pharmacogenomics 2005; 5 (4)
The Autism Genome Project
were seen in 162 404 genotypes, giving an estimated accuracy of 99.9%. This number is likely to be an overestimate of accuracy, as some genotyping errors will go undetected due to uninformative meioses. Reproducibility
To assess reproducibility, 16 individuals were each genotyped 9 times for a total of 1 623 443 genotypes. 192 did not match the consensus genotypes, giving a reproducibility of 99.99%. In another set of experiments, 24 samples were tested in triplicate. Ten of 24 samples had 100% reproducibility across three replicates. Thus, the SNP arrays have an accuracy of between 99.5 and 99.9% by several independent estimates.[115,134] 3.2.2 Informativeness of SNPs versus Microsatellites
Information content is a function of marker heterozygosity, distance between markers, and pedigree structure. While each individual SNP is less informative than a microsatellite marker in most cases (e.g. on the Mapping 10K array the average heterozygosity is 0.37, whereas on the CIDR panel the average heterozygosity is 0.72), the greater number of SNPs in aggregate leads to higher information content at any particular point in the genome.[134] The benefits of higher information content are higher logarithm of odds (LOD) scores, smaller linkage intervals, and the ability to find linkage intervals that were missed by lower information content approaches. Matsuzaki et al.[138] were able to replicate a linkage for chronic mucocutaneous candidiasis and thyroid disease with a higher LOD score, along with similar p-values and a decrease in the linkage interval from 35.28cM to 17.73cM. Sellick et al.[125] were able to identify novel regions linked to neonatal diabetes, and separately, craniosynostosis, that were missed by previous microsatellite scans on the same samples. Kennedy et al.[134] made several direct comparisons of information content on a group of 10 423 SNPs compared with 365 microsatellite markers in a set of 550 samples. The SNPs showed a much higher entropy score (a measurement of information content) compared with the microsatellite markers (mean IC = 0.54 for 365 microsatellites vs mean IC = 0.74 for 11 555 SNPs).[138] 3.2.3 Linkage Disequilibrium as a Confounding Factor in High Density SNP Linkage Analysis
There exists the potential for linkage disequilibrium between markers to skew multipoint LOD calculations because the intermarker density averages 210kb with the >10 000 SNPs on the Affymetrix GeneChip® 10K SNP array. This does not affect the accuracy of 2-point LOD score calculations, but will skew multipoint LOD calculations because alleles of adjacent markers do not segregate independently. The ability to define haplotypes as ‘virtual markers’ and use these in the multipoint LOD calculation will © 2005 Adis Data Information BV. All rights reserved.
241
eventually provide a solution to this problem. Haplotyping will be performed by taking advantage of informative meioses within pedigrees as well as any information that can be gleaned from the Haplotype Mapping (HapMap) project.[142] The Varia software (Agilent Technologies, Palo Alto, CA, USA) [described below] does have this function built in, and we are currently using this software in conjunction with the standard packages discussed in the next section. 3.2.4 Availability of Analysis Tools
A number of freely available analysis tools have been utilized by the genetic analysis community for a number of years. All of these tools work with the Affymetrix GeneChip® 10K SNP genotype data. There are a number of options for 10K array analysis which include conversion tools to import Affymetrix files into common linkage packages (GeneChip® DNA Analysis Software [GDAS] Port), standard analysis tools which handle microsatellite data and also handle SNP array genotype data (Merlin,[127] GeneHunter,[143] LINKAGE suite,[144] SimWalk;[145] Mega2 converts between all file formats), new tools have been created specifically for GeneChip® 10K SNP data that have graphical user interfaces and seamless connection to genomic annotations (Varia and dChipSNP), and finally new tools, algorithms, and approaches are sure to be developed as a result of the Genetics Analysis Workshop 2004 which focused on comparing the Affymetrix GeneChip® Mapping 10K array to the standard ABI PRISM® microsatellite panel. 3.2.5 The GeneChip® DNA Analysis Software (GDAS) Port
This module formats the Affymetrix output file (‘GDAS analysis file’ containing SNP ID, TSC ID, dbSNP-assigned reference SNP [rs] ID, genetic map location, relative allele signals, SNP calls, and call zones [www.affymetrix.com]) for import into common analysis packages. This module is available from Affymetrix at no cost. The two input files are: • Pedigree file (.txt) family ID, individual ID, father, mother, sex, affection status (1 unaffected, 2 is affected). The standard pedfile is modified by making an additional column before the family ID column, which contains the individual ID from the GDAS output column header for the calls ending in the prefix “*_call”. • GDAS analysis file (.txt) should be updated with respect to genetic marker information from NetAffx at least every three months. The deCODE marker map (deCODE Genetics, Reykjavik, Iceland) is used because it has the highest resolution. The deCODE marker order map is really a linkage map derived from the latest decode microsatellite genetic map with the SNPs plotted relative to the microsatellite markers using genome sequence data. The NetAffx™ tool (Affymetrix) also enables Am J Pharmacogenomics 2005; 5 (4)
242
Hu-Lince et al.
downloading Caucasian, African American, and Asian allele frequencies for the analysis packages. Files in the correct format can be exported, and Mendelian inheritance of each marker is checked. If a marker does not pass this error check, the specific genotype call data is deleted and not used. 3.2.6 Common Existing Analysis Tools
The existing analysis tools include Merlin, GeneHunter, and Simwalk. Using these tools, we have obtained results using the 10K SNP array and compared those results to linkage data obtained using microsatellite marker panels. Multi-tool analysis of data generated from the GeneChip® 10K SNP array has been done on multiple pedigrees to illustrate that all common analysis tools work with the SNP data and produce LOD scores often in excess of those produced using microsatellite markers. Both parametric analysis (Varia and M-Link) and nonparametric analysis (Merlin and SimWalk2) can be done. A script for GeneHunter is being written to reduce the marker files to <400 markers each. 3.2.7 New Analysis Tools
There are currently two new analysis tools which are mature and which we have utilized. These tools not only perform standard linkage using standard analysis algorithms or modified algorithms, but also provide a host of visualization tools to query the genotypes, haplotypes, haplotype breakpoints, etc. These are: • Varia from Silicon Genetics (now part of Agilent Technologies). We have worked with Silicon Genetics to develop a new software package entitled Varia. This analysis solution can import Affymetrix GDAS files directly. • dChipSNP. This is an implementation of the Lander-Green algorithm for doing parametric linkage analysis. Data can be imported from GDAS analysis files directly into dChipSNP, and the software allows elegant visualization of linkage results, SNP genotypes for haplotype breakpoint mapping, and gene content in the interval. The basic dChipSNP was described in a recent publication by Leiberfarb et al.[146] 3.2.8 Workflow
Workflow is dramatically different between a microsatellite genome scan and a SNP array genome scan. Overall, with similar infrastructures available for each, SNP 10K arrays generate two orders of magnitude increase in throughput, with complete elimination of the need to check allele calling manually. • Microsatellites are generally genotyped using PCR amplification of a single marker in 96 or 384 well format. Individual marker amplicons are fluorescently tagged and size selected, so that up to 16 markers can be combined into a single well after amplification. Thus, 16 plates of PCR are robotically combined © 2005 Adis Data Information BV. All rights reserved.
•
into a single plate which can be run on an automatic sequencer. Current sequencers can handle 96 wells at a time, so 4 runs of an ABI 3730xl are needed to genotype 16 markers in 384 individuals. The robotics, PCR machine infrastructure, and DNA sequencer capacity allow us to genotype 48 markers per day on 384 individuals. To do all ~400 microsatellites on 384 individuals would take about 2 weeks. Thus, the yearly maximum (working 45 weeks a year) that can be achieved using microsatellite markers at a mid-sized genome institute would be 8640 individuals genotyped with the microsatellite marker panel, with almost 3 500 000 genotypes generated. There would also have to be a significant team of scientists checking alleles and Mendelian inheritance of alleles in order to achieve this throughput. SNP arrays take 3 days each to run. One DNA sample is labeled with one primer and hybridized to one array which allows (on the 10K SNP array) >10 000 SNP genotypes to be called in that time-period. Because they can be run in parallel, data can be generated at a rate of 1000 arrays per week with 6 technicians and 3 scanners and 12 daisy-chained fluidics stations. The major bottleneck of verifying the integrity of each call is not necessary with the Affymetrix genotyping software. Calls are made automatically and have an accuracy of >99.9%. Additionally, a >95% call rate is the average seen. The genotype calling algorithm is extremely conservative so that if no unambiguous call can be made, the software assigns an ‘absent’ call for that genotype. There is a user interface available within Affymetrix GDAS program that allows any genotype to be queried with respect to signal strength and call confidence. 4. The Autism Genome Project: Phase II
The second phase of the AGP is focused on identifying the exact nucleotide changes that predispose to autism and reside within the linkage intervals derived from phase I. The general strategy for phase II will be to sequence each linkage interval to completion in probands from families which contribute the majority of the LOD score to each genetic interval. These will be different families for each positive linkage interval. Sequencing technologies have recently become available and costs have plummeted, enabling large-scale sequencing of intervals. If a functional nucleotide variant is found in a gene, and other significantly functional variants (i.e. ‘mutations’) are found in the same gene in different pedigrees from that linkage interval, it may become obvious as to what is the disease-causing gene. If no functional nucleotide variants are identified, variants will be tested through informatics to assess whether there is a probability of that variant being functionally relevant (nonsynonymous, evolutionariAm J Pharmacogenomics 2005; 5 (4)
The Autism Genome Project
ly conserved nucleotide, etc.). The subset of variants that survive must reside on the at-risk allele, which will be tested by genotyping in the original pedigree structure. Those 50% of variants which reside on the correct allele will be prioritized with respect to functional significance again, and then gene-specific clustering will be performed across the pedigrees. If the genetic risk is imparted through a common nucleotide variant which is present in all probands on the correct allele, a larger case-control association study will be needed to solidify the association of that SNP with the phenotype. Phase II of the AGP is also ultimately interested in developing murine models for autism based on the genetic findings. Murine models will be essential for testing therapeutic interventions. Ultimately, we hope that the work performed in both phase I and II will move the entire field of autism clinical management forward in the next decade to the point where early and accurate newborn screening can be performed, and effective interventions can be initiated to ensure that the disorder does not strike. 5. Summary Autism is a major public health concern, conferring a deep burden on patients and their families. While some with autism are only mildly affected, most people with the condition will require lifelong supervision and care and have significant impairments. This poses a tremendous challenge to the health professionals who treat, and the families who live with and care for, these individuals. Early diagnosis and intervention will significantly improve a child’s long-term outcome and can reduce the challenges associated with the disorder, lessen disruptive behavior, provide for some degree of independence and increase a child’s ability to grow and develop new skills. Because autism requires lifelong treatment, any insight provided by the AGP study in identifying susceptibility genes responsible for the autistic phenotype could significantly alter a person’s lifestyle. The AGP will move the entire field of autism clinical management forward in that we hope the findings from the study will begin to answer questions about the causal mechanisms underlying the pathophysiology of autism. As well, more robust knockout mouse models using the candidate gene findings will aid in targeting new pathways and mechanisms. From here we can develop therapeutic targets for drug treatments; the ultimate goal is to develop a newborn screening diagnostic that would allow for early intervention. Interestingly, no drugs are currently available for use directly in autism. Defining key regulatory points in the disease pathogenesis that would potentially help in the development of new therapies targeting these regulatory events would provide families with much needed hope. © 2005 Adis Data Information BV. All rights reserved.
243
Moreover, autism is a disorder with symptoms that are also associated with other brain-based conditions. Identifying the genetic causes of autism has the potential to yield significant advances for other disorders such as attention deficit-hyperactivity disorder, fragile X, and some forms of mental retardation. Over the past two decades 1200 human disease genes have been identified,[147] together with what we learn from the outcome of the AGP, a paradigm for other complex genetic traits may result. Of course, even after susceptibility genes are identified, many questions will still need to be answered. What is the relative importance of each gene on the disease process? How will we understand and treat the disease based on the genetic underpinnings that are identified? Key regulatory points in the disease pathogenesis will need to be defined and new therapies targeting these regulatory events will certainly need to be developed. Acknowledgments Dr Hu-Lince received a National Research Service Award Fellowship (P32 NS43932).
References 1. Stokstad E. Development: new hints into the biological basis of autism. Science 2001; 294: 34-7 2. Filipek PA, Accardo PJ, Baranek GT, et al. The screening and diagnosis of autistic spectrum disorders. J Autism Dev Disord 1999; 29: 439-84 3. Fombonne E. The epidemiology of autism: a review. Psychol Med 1999; 29: 769-86 4. Santangelo SL, Tsatsanis K. What is known about autism: genes, brain, and behavior. Am J Pharmacogenomics 2005; 5 (2): 71-92 5. Muhle R, Trentacoste SV, Rapin I. The genetics of autism. Pediatrics 2004; 113: e472-86 6. Lotspeich LJ, Ciaranello RD. The neurobiology and genetics of infantile autism. Int Rev Neurobiol 1993; 35: 87-129 7. Spence MA. The genetics of autism. Curr Opin Pediatr 2001; 13: 561-5 8. Risch N, Spiker D, Lotspeich L, et al. A genomic screen of autism: evidence for a multilocus etiology. Am J Hum Genet 1999; 65: 493-507 9. Bailey A, Phillips W, Rutter M. Autism: towards an integration of clinical, genetic, neuropsychological, and neurobiological perspectives. J Child Psychol Psychiatry 1996; 37: 89-126 10. Lord C, Rutter M, Le Couteur A. Autism diagnostic interview-revised: a revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J Autism Dev Disord 1994; 24: 659-85 11. Rutter M. Incidence of autism spectrum disorders: changes over time and their meaning. Acta Paediatr 2005; 94: 2-15 12. Chakrabarti S, Fombonne E. Pervasive developmental disorders in preschool children. JAMA 2001; 285: 3093-9 13. Folstein S, Rutter M. Infantile autism: a genetic study of 21 twin pairs. J Child Psychol Psychiatry 1977; 18: 297-321 14. Ritvo ER, Freeman BJ, Mason-Brothers A, et al. Concordance for the syndrome of autism in 40 pairs of afflicted twins. Am J Psychiatry 1985; 142: 74-7 15. Steffenburg S, Gillberg C, Hellgren L, et al. A twin study of autism in Denmark, Finland, Iceland, Norway and Sweden. J Child Psychol Psychiatry 1989; 30: 405-16 16. Bailey A, Le Couteur A, Gottesman I, et al. Autism as a strongly genetic disorder: evidence from a British twin study. Psychol Med 1995; 25: 63-77 17. Brown WT, Jenkins EC, Cohen IL, et al. Fragile X and autism: a multicenter survey. Am J Med Genet 1986; 23: 341-52 Am J Pharmacogenomics 2005; 5 (4)
244
18. Folstein SE, Piven J. Etiology of autism: genetic influences. Pediatrics 1991; 87: 767-73 19. Smalley SL. Autism and tuberous sclerosis. J Autism Dev Disord 1998; 28: 407-14 20. Steffenburg S, Gillberg CL, Steffenburg U, et al. Autism in Angelman syndrome: a population-based study. Pediatr Neurol 1996; 14: 131-6 21. Folstein SE, Rutter ML. Autism: familial aggregation and genetic implications. J Autism Dev Disord 1988; 18: 3-30 22. Gillberg C. Chromosomal disorders and autism. J Autism Dev Disord 1998; 28: 415-25 23. Lamb JA, Moore J, Bailey A, et al. Autism: recent molecular genetic advances. Hum Mol Genet 2000; 9: 861-8 24. Rutter M. Genetic studies of autism: from the 1970s into the millennium. J Abnorm Child Psychol 2000; 28: 3-14 25. Pickles A, Bolton P, Macdonald H, et al. Latent-class analysis of recurrence risks for complex phenotypes with selection and measurement error: a twin and family history study of autism. Am J Hum Genet 1995; 57: 717-26 26. Ferrari M, Antonelli M, Bellini F, et al. Genetic differences in cystic fibrosis patients with and without pancreatic insufficiency: an Italian collaborative study. Hum Genet 1990; 84 (5): 435-8 27. Weeks DE, Lathrop GM. Polygenic disease: methods for mapping complex disease traits. Trends Genet 1995; 11: 513-9 28. Risch NJ. Searching for genetic determinants in the new millennium. Nature 2000; 405: 847-56 29. Glazier AM, Nadeau JH, Aitman TJ. Finding genes that underlie complex traits. Science 2002; 298: 2345-9 30. Lander ES, Schork NJ. Genetic dissection of complex traits. Science 1994; 265: 2037-48 31. Todd JA, Bell JI, McDevitt HO. HLA-DQ beta gene contributes to susceptibility and resistance to insulin-dependent diabetes mellitus. Nature 1987; 329: 599-604 32. Todd JA, Acha-Orbea H, Bell JI, et al. A molecular basis for MHC class II: associated autoimmunity. Science 1988; 240: 1003-9 33. Todd JA, Wicker LS. Genetic protection from the inflammatory disease type 1 diabetes in humans and animal models. Immunity 2001; 15: 387-95 34. Pericak-Vance MA, Bebout JL, Gaskell PC, et al. Linkage studies in familial Alzheimer disease: evidence for chromosome 19 linkage. Am J Hum Genet 1991; 48: 1034-50 35. Corder EH, Saunders AM, Strittmatter WJ, et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer’s disease in late onset families. Science 1993; 261: 921-3 36. Bales KR, Verina T, Cummins DJ, et al. Apolipoprotein E is essential for amyloid deposition in the APP(V717F) transgenic mouse model of Alzheimer’s disease. Proc Natl Acad Sci U S A 1999; 96: 15233-8 37. Ramoz N, Reichert JG, Smith CJ, et al. Linkage and association of the mitochondrial aspartate/glutamate carrier SLC25A12 gene with autism. Am J Psychiatry 2004; 161: 662-9 38. Warren RP, Odell JD, Warren WL, et al. Strong association of the third hypervariable region of HLA-DR beta 1 with autism. J Neuroimmunol 1996; 67: 97-102 39. Jamain S, Betancur C, Quach H, et al. Linkage and association of the glutamate receptor 6 gene with autism. Mol Psychiatry 2002; 7: 302-10 40. Bonora E, Lamb JA, Barnby G, et al. Mutation screening and association analysis of six candidate genes for autism on chromosome 7q. Eur J Hum Genet 2005; 13: 198-207 41. Hutcheson HB, Olson LM, Bradford Y, et al. Examination of NRCAM, LRRN3, KIAA0716, and LAMB1 as autism candidate genes. BMC Med Genet 2004; 5: 12 42. Zhang H, Liu X, Zhang C, et al. Reelin gene alleles and susceptibility to autism spectrum disorders. Mol Psychiatry 2002; 7: 1012-7 43. Persico AM, D’Agruma L, Maiorano N, et al. Reelin gene alleles and haplotypes as a factor predisposing to autistic disorder. Mol Psychiatry 2001; 6: 150-9 44. Serajee FJ, Zhong H, Nabi R, et al. The metabotropic glutamate receptor 8 gene at 7q31: partial duplication and possible association with autism. J Med Genet 2003; 40: e42 45. Wassink TH, Piven J, Vieland VJ, et al. Evidence supporting WNT2 as an autism susceptibility gene. Am J Med Genet 2001; 105: 406-13 © 2005 Adis Data Information BV. All rights reserved.
Hu-Lince et al.
46. International Molecular Genetic Study of Autism Consortium (IMGSAC). Further characterization of the autism susceptibility locus AUTS1 on chromosome 7q. Hum Mol Genet 2001 Apr 15; 10 (4): 973-82 47. Gharani N, Benayed R, Mancuso V, et al. Association of the homeobox transcription factor, ENGRAILED 2, 3, with autism spectrum disorder. Mol Psychiatry 2004; 9: 474-84 48. Herault J, Petit E, Martineau J, et al. Autism and genetics: clinical approach and association study with two markers of HRAS gene. Am J Med Genet 1995; 60: 276-81 49. Cook EH, Courchesne RY, Cox NJ, et al. Linkage-disequilibrium mapping of autistic disorder, with 15q11-13 markers. Am J Hum Genet 1998; 62: 1077-83 50. Buxbaum JD, Silverman JM, Smith CJ, et al. Association between a GABRB3 polymorphism and autism. Mol Psychiatry 2002; 7: 311-6 51. McCauley JL, Olson LM, Delahanty R, et al. A linkage disequilibrium map of the 1-Mb 15q12 GABA(A) receptor subunit cluster and association to autism. Am J Med Genet 2004; 131B: 51-9 52. Menold MM, Shao Y, Wolpert CM, et al. Association analysis of chromosome 15 gabaa receptor subunit genes in autistic disorder. J Neurogenet 2001; 15: 245-59 53. Nurmi EL, Bradford Y, Chen Y, et al. Linkage disequilibrium at the Angelman syndrome gene UBE3A in autism families. Genomics 2001; 77: 105-13 54. Vourc’h P, Martin I, Marouillat S, et al. Molecular analysis of the oligodendrocyte myelin glycoprotein gene in autistic disorder. Neurosci Lett 2003; 338: 115-8 55. Kim SJ, Herzing LB, Veenstra-Vander Weele J, et al. Mutation screening and transmission disequilibrium study of ATP10C in autism. Am J Med Genet 2002; 114: 137-43 56. Cook Jr EH, Courchesne R, Lord C, et al. Evidence of linkage between the serotonin transporter and autistic disorder. Mol Psychiatry 1997; 2: 247-50 57. Klauck SM, Poustka F, Benner A, et al. Serotonin transporter (5-HTT; gene variants associated with autism? Hum Mol Genet 1997; 6: 2233-8 58. Yirmiya N, Pilowsky T, Nemanov L, et al. Evidence for an association with the serotonin transporter promoter region polymorphism and autism. Am J Med Genet 2001; 105: 381-6 59. Tordjman S, Gutknecht L, Carlier M, et al. Role of the serotonin transporter gene in the behavioral expression of autism. Mol Psychiatry 2001; 6: 434-9 60. Bottini N, De Luca D, Saccucci P, et al. Autism: evidence of association with adenosine deaminase genetic polymorphism. Neurogenetics 2001; 3: 111-3 61. Petit E, Herault J, Raynaud M, et al. X chromosome and infantile autism. Biol Psychiatry 1996; 40: 457-64 62. Folstein SE, Rosen-Sheidley B. Genetics of autism: complex aetiology for a heterogeneous disorder. Nat Rev Genet 2001; 2: 943-55 63. Schroer RJ, Phelan MC, Michaelis RC, et al. Autism and maternally derived aberrations of chromosome 15q. Am J Med Genet 1998; 76: 327-36 64. Herzing LB, Kim SJ, Cook EH, et al. The human aminophospholipid-transporting ATPase gene ATP10C maps adjacent to UBE3A and exhibits similar imprinted expression. Am J Hum Genet 2001; 68: 1501-5 65. Gurrieri F, Battaglia A, Torrisi L, et al. Pervasive developmental disorder and epilepsy due to maternally derived duplication of 15q11-q13. Neurology 1999; 52: 1694-7 66. Herzing LB, Cook EH, Ledbetter DH. Allele-specific expression analysis by RNAFISH demonstrates preferential maternal expression of UBE3A and imprint maintenance within 15q11-q13 duplications. Hum Mol Genet 2002; 11: 1707-18 67. Bolton PF, Dennis NR, Browne CE, et al. The phenotypic manifestations of interstitial duplications of proximal 15q with special reference to the autistic spectrum disorders. Am J Med Genet 2001; 105: 675-85 68. Wolpert CM, Donnelly SL, Cuccaro ML, et al. De novo partial duplication of chromosome 7p in a male with autistic disorder. Am J Med Genet 2001; 105: 222-5 69. Borgatti R, Piccinelli P, Passoni D, et al. Relationship between clinical and genetic features in “inverted duplicated chromosome 15” patients. Pediatr Neurol 2001; 24: 111-6 70. Owens DF, Kriegstein AR. Is there more to GABA than synaptic inhibition? Nat Rev Neurosci 2002; 3: 715-27 71. Maestrini E, Lai C, Marlow A, et al. Serotonin transporter (5-HTT; and gammaaminobutyric acid receptor subunit beta3 (GABRB3; gene polymorphisms are Am J Pharmacogenomics 2005; 5 (4)
The Autism Genome Project
72.
73.
74. 75. 76.
77. 78.
79.
80. 81.
82.
83.
84.
85.
86. 87. 88.
89. 90. 91.
92. 93.
94.
95. 96.
not associated with autism in the IMGSA families. The International Molecular Genetic Study of Autism Consortium. Am J Med Genet 1999; 88: 492-6 Martin ER, Menold MM, Wolpert CM, et al. Analysis of linkage disequilibrium in gamma-aminobutyric acid receptor subunit genes in autistic disorder. Am J Med Genet 2000; 96: 43-8 Salmon B, Hallmayer J, Rogers T, et al. Absence of linkage and linkage disequilibrium to chromosome 15q11-q13 markers in 139 multiplex families with autism. Am J Med Genet 1999; 88: 551-6 Rougeulle C, Cardoso C, Fontes M, et al. An imprinted antisense RNA overlaps UBE3A and a second maternally expressed transcript. Nat Genet 1998; 19: 15-6 Ashley-Koch A, Wolpert CM, Menold MM, et al. Genetic studies of autistic disorder and chromosome 7. Genomics 1999; 61: 227-36 Yan WL, Guan XY, Green ED, et al. Childhood-onset schizophrenia/autistic disorder and t(1;7) reciprocal translocation: identification of a BAC contig spanning the translocation breakpoint at 7q21. Am J Med Genet 2000; 96: 749-53 Scherer SW, Cheung J, MacDonald JR, et al. Human chromosome 7: DNA sequence and biology. Science 2003; 300: 767-72 Hong SE, Shugart YY, Huang DT, et al. Autosomal recessive lissencephaly with cerebellar hypoplasia is associated with human RELN mutations. Nat Genet 2000; 26: 93-6 Krebs MO, Betancur C, Leroy S, et al. Absence of association between a polymorphic GGC repeat in the 5′ untranslated region of the reelin gene and autism. Mol Psychiatry 2002; 7: 801-4 Bonora E, Beyer KS, Lamb JA, et al. Analysis of reelin as a candidate gene for autism. Mol Psychiatry 2003; 8: 885-92 Devlin B, Bennett P, Dawson G, et al. Alleles of a reelin CGG repeat do not convey liability to autism in a sample from the CPEA network. Am J Med Genet 2004; 126B: 46-50 Newbury DF, Bonora E, Lamb JA, et al. FOXP2 is not a major susceptibility gene for autism or specific language impairment. Am J Hum Genet 2002; 70: 1318-27 Petek E, Windpassinger C, Vincent JB, et al. Disruption of a novel gene (IMMP2L) by a breakpoint in 7q31 associated with Tourette syndrome. Am J Hum Genet 2001; 68: 848-58 Vincent JB, Herbrick JA, Gurling HM, et al. Identification of a novel gene on chromosome 7q31 that is interrupted by a translocation breakpoint in an autistic individual. Am J Hum Genet 2000; 67: 510-4 McDougle CJ, Posey D. Genetics of childhood disorders: XLIV. Autism: Part 3. Psychopharmacology of autism. J Am Acad Child Adolesc Psychiatry 2002; 41: 1380-3 Chugani DC. Role of altered brain serotonin mechanisms in autism. Mol Psychiatry 2002; 7 Suppl. 2: S16-7 Ernst M, Zametkin AJ, Matochik JA, et al. Low medial prefrontal dopaminergic activity in autistic children [letter]. Lancet 1997; 350: 638 Gillberg C, Svennerholm L. CSF monoamines in autistic syndromes and other pervasive developmental disorders of early childhood. Br J Psychiatry 1987; 151: 89-94 Martineau J, Herault J, Petit E, et al. Catecholaminergic metabolism and autism. Dev Med Child Neurol 1994; 36: 688-97 Philippe A, Guilloud-Bataille M, Martinez M, et al. Analysis of ten candidate genes in autism by association and linkage. Am J Med Genet 2002; 114: 125-8 Ingram JL, Stodgell CJ, Hyman SL, et al. Discovery of allelic variants of HOXA1 and HOXB1: genetic susceptibility to autism spectrum disorders. Teratology 2000; 62: 393-405 Goodman FR, Scambler PJ. Human HOX gene mutations. Clin Genet 2001; 59: 1-11 Nilsson M, Waters S, Waters N, et al. A behavioural pattern analysis of hypoglutamatergic mice: effects of four different antipsychotic agents. J Neural Transm 2001; 108: 1181-96 Carlsson ML. Hypothesis: is infantile autism a hypoglutamatergic disorder? Relevance of glutamate-serotonin interactions for pharmacotherapy. J Neural Transm 1998; 105: 525-35 Wassink TH, Piven J, Vieland VJ, et al. Examination of AVPR1a as an autism susceptibility gene. Mol Psychiatry 2004; 9: 968-72 Badner JA, Gershon ES, Goldin LR. Optimal ascertainment strategies to detect linkage to common disease alleles. Am J Hum Genet 1998; 63: 880-8
© 2005 Adis Data Information BV. All rights reserved.
245
97. International Molecular Genetic Study of Autism Consortium. A full genome screen for autism with evidence for linkage to a region on chromosome 7q. Hum Mol Genet 1998; 7: 571-8 98. Collaborative Linkage Study of Autism. An autosomal genomic screen for autism. Am J Med Genet 2001; 105: 609-15 99. Philippe A, Martinez M, Guilloud-Bataille M, et al. Genome-wide scan for autism susceptibility genes. Paris Autism Research International Sibpair Study. Hum Mol Genet 1999; 8: 805-12 100. Liu J, Nyholt DR, Magnussen P, et al. A genomewide screen for autism susceptibility loci. Am J Hum Genet 2001; 69: 327-40 101. International Molecular Genetic Study of Autism Consortium (IMGSAC). A genomewide screen for autism: strong evidence for linkage to chromosomes 2q, 7q, and 16p. Am J Hum Genet 2001; 69: 570-81 102. Buxbaum JD, Silverman JM, Smith CJ, et al. Evidence for a susceptibility gene for autism on chromosome 2 and for genetic heterogeneity. Am J Hum Genet 2001; 68: 1514-20 103. Shao Y, Wolpert CM, Raiford KL, et al. Genomic screen and follow-up analysis for autistic disorder. Am J Med Genet 2002; 114: 99-105 104. Alarcon M, Cantor RM, Liu J, et al. Evidence for a language quantitative trait locus on chromosome 7q in multiplex autism families. Am J Hum Genet 2002; 70: 60-71 105. Auranen M, Vanhala R, Varilo T, et al. A genomewide screen for autism-spectrum disorders: evidence for a major susceptibility locus on chromosome 3q25-27. Am J Hum Genet 2002; 71: 777-90 106. Yonan AL, Alarcon M, Cheng R, et al. A genomewide screen of 345 families for autism-susceptibility loci. Am J Hum Genet 2003; 73: 886-97 107. Buxbaum JD, Silverman J, Keddache M, et al. Linkage analysis for autism in a subset families with obsessive-compulsive behaviors: evidence for an autism susceptibility gene on chromosome 1 and further support for susceptibility genes on chromosome 6 and 19. Mol Psychiatry 2004; 9: 144-50 108. Auranen M, Nieminen T, Majuri S, et al. Analysis of autism susceptibility gene loci on chromosomes 1p, 4p, 6q, 7q, 13q, 15q, 16p, 17q, 19q and 22q in Finnish multiplex families. Mol Psychiatry 2000; 5: 320-2 109. Newschaffer CJ, Fallin D, Lee NL. Heritable and nonheritable risk factors for autism spectrum disorders. Epidemiol Rev 2002; 24: 137-53 110. Ellegren H. Microsatellites: simple sequences with complex evolution. Nat Rev Genet 2004; 5: 435-45 111. Ardlie KG, Kruglyak L, Seielstad M. Patterns of linkage disequilibrium in the human genome. Nat Rev Genet 2002; 3: 299-309 112. Carlson CS, Eberle MA, Rieder MJ, et al. Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans. Nat Genet 2003; 33: 518-21 113. Kruglyak L, Nickerson DA. Variation is the spice of life. Nat Genet 2001; 27: 234-6 114. Evans DM, Cardon LR. Guidelines for genotyping in genomewide linkage studies: single-nucleotide-polymorphism maps versus microsatellite maps. Am J Hum Genet 2004; 75: 687-92 115. John S, Shephard N, Liu G, et al. Whole-genome scan, in a complex disease, using 11,245 single-nucleotide polymorphisms: comparison with microsatellites. Am J Hum Genet 2004; 75: 54-64 116. Sachidanandam R, Weissman D, Schmidt SC, et al. A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 2001; 409: 928-33 117. Evans WE, Relling MV. Pharmacogenomics: translating functional genomics into rational therapeutics. Science 1999; 286: 487-91 118. Davignon J, Gregg RE, Sing CF. Apolipoprotein E polymorphism and atherosclerosis. Arteriosclerosis 1988; 8: 1-21 119. Bertina RM, Koeleman BP, Koster T, et al. Mutation in blood coagulation factor V associated with resistance to activated protein C. Nature 1994; 369: 64-7 120. Gabriel SB, Schaffner SF, Nguyen H, et al. The structure of haplotype blocks in the human genome. Science 2002; 296: 2225-9 121. Rosenberg NA, Pritchard JK, Weber JL, et al. Genetic structure of human populations. Science 2002; 298: 2381-5 122. Matise TC, Sachidanandam R, Clark AG, et al. A 3.9-centimorgan-resolution human single-nucleotide polymorphism linkage map and screening set. Am J Hum Genet 2003; 73: 271-84 Am J Pharmacogenomics 2005; 5 (4)
246
123. Puffenberger EG, Hu-Lince D, Parod JM, et al. Mapping of sudden infant death with dysgenesis of the testes syndrome (SIDDT) by a SNP genome scan and identification of TSPYL loss of function. Proc Natl Acad Sci U S A 2004; 101: 11689-94 124. Shrimpton AE, Levinsohn EM, Yozawitz JM, et al. A HOX gene mutation in a family with isolated congenital vertical talus and Charcot-Marie-Tooth disease. Am J Hum Genet 2004; 75: 92-6 125. Sellick GS, Longman C, Tolmie J, et al. Genomewide linkage searches for Mendelian disease loci can be efficiently conducted using high-density SNP genotyping arrays. Nucleic Acids Res 2004; 32: e164
Hu-Lince et al.
136. Gentalen E, Chee M. A novel method for determining linkage between DNA sequences: hybridization to paired probe arrays. Nucleic Acids Res 1999; 27: 1485-91 137. Fan JB, Chen X, Halushka MK, et al. Parallel genotyping of human SNPs using generic high-density oligonucleotide tag arrays. Genome Res 2000; 10: 853-60 138. Matsuzaki H, Loi H, Dong S, et al. Parallel genotyping of over 10,000 SNPs using a one-primer assay on a high-density oligonucleotide array. Genome Res 2004; 14: 414-25 139. Liu WM, Di X, Yang G, et al. Algorithms for large-scale genotyping microarrays. Bioinformatics 2003; 19: 2397-403
126. Schaid DJ, Guenther JC, Christensen GB, et al. Comparison of microsatellites versus single-nucleotide polymorphisms in a genome linkage screen for prostate cancer-susceptibility loci. Am J Hum Genet 2004; 75: 948-65
140. Cutler DJ, Zwick ME, Carrasquillo MM, et al. High-throughput variation detection and genotyping using microarrays. Genome Res 2001; 11: 1913-25
127. Abecasis GR, Cherny SS, Cookson WO, et al. Merlin: rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 2002; 30: 97-101
141. O’Connell JR, Weeks DE. PedCheck: a program for identification of genotype incompatibilities in linkage analysis. Am J Hum Genet 1998; 63: 259-66
128. Middleton FA, Pato MT, Gentile KL, et al. Genomewide linkage analysis of bipolar disorder by use of a high-density single-nucleotide-polymorphism (SNP) genotyping assay: a comparison with microsatellite marker assays and finding of significant linkage to chromosome 6q22. Am J Hum Genet 2004; 74: 886-97
142. International HapMap Consortium. Integrating ethics and science in the International HapMap Project. Nat Rev Genet 2004; 5: 467-75
129. Murray SS, Oliphant A, Shen R, et al. A highly informative SNP linkage panel for human genetic studies. Nat Methods 2004; 1: 113-7 130. Sawcer SJ, Maranian M, Singlehurst S, et al. Enhancing linkage analysis of complex disorders: an evaluation of high-density genotyping. Hum Mol Genet 2004; 13: 1943-9 131. Risch N, Merikangas K. The future of genetic studies of complex human diseases. Science 1996; 273: 1516-7 132. Carlson CS, Eberle MA, Kruglyak L, et al. Mapping complex disease loci in whole-genome association studies. Nature 2004; 429: 446-52 133. Sponheim E. Changing criteria of autistic disorders: a comparison of the ICD-10 research criteria and DSM-IV with DSM-III-R, CARS, and ABC. J Autism Dev Disord 1996; 26: 513-25 134. Kennedy GC, Matsuzaki H, Dong S, et al. Large-scale genotyping of complex DNA. Nat Biotechnol 2003; 21: 1233-7 135. Chee M, Yang R, Hubbell E, et al. Accessing genetic information with high-density DNA arrays. Science 1996; 274: 610-4
© 2005 Adis Data Information BV. All rights reserved.
143. Lander ES, Green P. Construction of multilocus genetic linkage maps in humans. Proc Natl Acad Sci U S A 1987; 84: 2363-7 144. Lathrop GM, Lalouel JM, Julier C, et al. Strategies for multilocus linkage analysis in humans. Proc Natl Acad Sci U S A 1984; 81: 3443-6 145. Sobel E, Lange K. Descent graphs in pedigree analysis: applications to halotyping, location scores, and marker-sharing statistics. Am J Hum Genet 1996; 58: 1323-37 146. Lieberfarb ME, Lin M, Lechpammer M, et al. Genome-wide loss of heterozygosity analysis from laser capture microdissected prostate cancer using single nucleotide polymorphic allele (SNP) arrays and a novel bioinformatics platform dChipSNP. Cancer Res 2003; 63: 4781-5 147. Botstein D, Risch N. Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease. Nat Genet 2003; 33 Suppl.: 228-37
Correspondence and offprints: Dr Dietrich A. Stephan, Neurogenomics Division, The Translational Genomics Research Institute, 445 North Fifth Street, 5th Floor, Phoenix, AZ 85004, USA.
Am J Pharmacogenomics 2005; 5 (4)