Neurogenetics (2007) 8:111–120 DOI 10.1007/s10048-007-0081-5
ORIGINAL ARTICLE
A novel gene derived from a segmental duplication shows perturbed expression in Alzheimer’s disease Dimitrios Avramopoulos & Ruihua Wang & David Valle & M. Daniele Fallin & Susan S. Bassett
Received: 10 November 2006 / Accepted: 29 January 2007 / Published online: 16 February 2007 # Springer-Verlag 2007
Abstract Alzheimer’s disease (AD) is a disabling neurodegenerative disorder with onset commonly in late life. Three genes have been identified causing earlier onset AD, and a fourth has been shown to be a risk factor for late onset AD (LOAD), while many more yet unrecognized genes are thought to contribute to susceptibility. Many studies have reported linkage to LOAD on human chromosome 10, where we have identified a parent of origin effect [Bassett SS, Avramopoulos D, Perry RT, Wiener H, Watson B Jr, Go RC, Fallin MD. Am J Med Genet B Neuropsychiatr Genet 141:537–540, (2006), Bassett SS, Avramopoulos D, Fallin D. Am J Med Genet 114:679–686, (2002)]. In this paper, we report on a gene in this region that shows reduced expression with increasing age, reduced expression in females across ages, and further reduction in LOAD patients. In concordance with the observed parent of origin effect on the D. Avramopoulos : R. Wang : S. S. Bassett Department of Psychiatry, School of Medicine, Johns Hopkins University, Baltimore, MD, USA D. Avramopoulos (*) : D. Valle McKusick–Nathans Institute of Genetic Medicine, School of Medicine, Johns Hopkins University, 733 N. Broadway, BRB-509, Baltimore, MD 21205, USA e-mail:
[email protected] M. D. Fallin Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA M. D. Fallin Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
linkage, this reduction is more pronounced in patients with an affected mother. We discovered this gene while studying the alkaline ceramidase gene (ASAH2); it is a partial paralog of ASAH2, and we call it ASAH2L. It is the result of a partial duplication of ASAH2 on chromosome 10q11.23, just downstream from the sequence with promoter activity. ASAH2L has a polymorphic start codon with a single nucleotide change of the original ASAH2 sequence plus other putative translation start sites that might produce novel proteins. It is expressed in all the tissues we tested including the brain and is an interesting example of the generation of a new gene. Comparison of primate and other mammal genomes suggests that ASAH2L is human specific. Further research would be necessary to determine the function of the ASAH2L transcript and explore any possible involvement in neurodegeneration. Keywords Brain . Alzheimer disease . Gene . Gene duplication . Gene expression
Introduction Alzheimer’s disease (AD) is characterized by progressive cognitive decline over a course of about 8 years from diagnosis to incapacitating dementia and death. Risk factors for the disease include increased age, positive family history, and female sex [3, 4]. With the exception of a few cases presenting before age 65 and inherited in an autosomal dominant manner, the vast majority of AD cases are of later onset. Despite a high heritability estimated at about 70% [5, 6], late onset AD (LOAD) does not show a clear mode of transmission, and it is believed to follow complex genetics with several genes influencing the risk.
112
Neurogenetics (2007) 8:111–120
Initial successes in the field of AD genetics lead to the identification of three genes for the early onset forms (APP [7], PSEN1 [8], and PSEN2 [9]) and one gene, APOE [10], for the common later onset form. Subsequently reported genes have shown weaker associations and inconsistent replications [11]. Several genome scans have been performed implicating many regions that might harbor susceptibility genes, but none has provided strong evidence or has been consistently replicated in independent scans. Nevertheless, few regions show more promise, as they have provided positive results in more than one study including chromosomes 6, 9, 10, and 12 [12]. In 2002, we published strong linkage for LOAD on chromosome 10q11–q21 with an observed parent of origin effect, where families with an affected mother showed significantly stronger linkage [2]. We later increased our sample size, gained further support for the parent of origin effect, and reported a logarithm of the odds score of 3.7 in 68 maternal pedigrees [1]. Chromosome 10q has shown positive results in several linkage studies, and it is one of the most promising locations for LOAD susceptibility genes today [12, 13]. Our region of interest is complicated by the presence of numerous complex duplications that cover a total of ∼3 Mb. Such duplications often include entire genes, and they have been associated with the creation of new genes as the function of paralogous genes diverges [14–16]. They also provide a substrate for non-allelic homologous recombination [17] leading to chromosomal rearrangements. Variations in the gross structure of the region, including possible copy number differences between individuals, could be of importance for disease susceptibility [18]. At the same time, the repetitive nature of these sequences renders genotyping for association studies difficult. In our ongoing search for susceptibility genes for LOAD, we screened 12 functional candidate genes in 10q11–q21 (ACF, ALOX5, ASAH2, CREM, CUL2, CXCL12, GDF2, MAPK8, PARG, SLC18A3, TIMM23, and UBE2D1) for differences in their expression in the temporal lobe of the brain of LOAD cases and healthy controls. One gene showed lower expression in females and an inverse correlation between RNA levels and age. LOAD patients showed an even further reduction in age- and sexadjusted expression. Consistent with our observation of
increased linkage in families with affected mothers, the expression reduction was more pronounced in patients from those maternal families. Unexpectedly, we found that the transcript was not the product of the originally selected candidate gene (ASAH2) but rather originated from a partial paralog located 520 Kb telomerically, which we call ASAH2L. We show that this novel and possibly human specific transcript is abundantly expressed in all tissues, including the brain, in contrast with ASAH2 that is not expressed at significant levels in the human brain. Our data suggest a possible involvement of ASAH2L in AD.
Materials and methods The AD DNA samples used in this study were collected for the National Institute of Mental Health genetics initiative and have been described elsewhere [19]. In some instances, brain tissue was collected postmortem from the same individuals. The right brain hemisphere was flash-frozen and maintained at −80°C at the Johns Hopkins University Brain Resource Center (BRC). We used tissue punched out from the superior temporal cortex, a region that shows volume reduction and synapse loss in AD patients [20]. Samples from controls without detectable brain pathology and obtained in the same manner were provided by the BRC. The initial screen of 12 candidate genes was performed on 12 unrelated AD cases with an affected mother and 9 healthy controls (unpublished data). The results we discuss in this paper also include 16 cases with no reported affected parents and a total of 25 controls. The details of the subsets in terms of age at death, sex, race, and postmortem delay (PMD) until harvesting the tissue are given in Table 1. All patients and controls were reported as white except for four African Americans, one case and three controls. We extracted DNA from brain tissue using a Puregene Tissue DNA extraction kit (Gentra Systems, Minneapolis) and RNA using two methods: a commercial kit based on filter retention (Qiagen cat. no. 75842) and a method utilizing the TRIzol Reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s instructions. Reverse transcriptions (RT) were performed using the TaqMan
Table 1 Sample description
Controls Cases maternal Cases other
Number of cases
Average age (±SD)
Sex N of males
Race N of AA
PMD (±SD)
25 12 16
74.6 (±16.3) 84.2 (±5.6) 83.1 (±4.0)
11 1 4
3 1 0
10.5 (±4.8) 7.4 (±4.2) 8.0 (±4.1)
Descriptive information on samples used for analysis of gene expression separated in three categories: controls, cases with an affected mother, and cases without a parent reported as affected by dementia. SD, Standard deviation; N, number; AA, African Americans (all others are whites); PMD, postmortem delay in harvesting the brain
Neurogenetics (2007) 8:111–120
113
reverse transcription kit (Applied Biosystems cat. no. N808 0234) and according to the manufacturer’s instructions. We input 300 ng of total RNA in a 20-μl reaction and used random primers for the generation of complementary DNA (cDNA). Quantification of the transcripts of the initial candidates was performed by real time PCR on the RT products using pre-designed TaqMan® expression assays from Applied Biosystems (Foster City, CA, USA; assay IDs ACF, Hs01002352_m1; ALOX5, Hs00386528_m1; ASAH2, Hs00184096_m1; CREM, Hs01590456_m1; CUL2, Hs00180203_m1; CXCL12, Hs00171022_m1; GDF2, Hs00211913_m1; MAPK8, Hs00177083_m1; PARG, Hs00608254_m1; SLC18A3, Hs00268179_s1; TIMM23, Hs00197056_m1; and UBE2D1, Hs00234280_m1) on RNA extracted with a Qiagen kit. Follow up in the extended sample was performed by measuring the expression three times (see Fig. 1) in three independent experiments: (1) a pre-designed TaqMan® expression assay from Applied Biosystems (Hs00184096_m1) and the company recommended reaction conditions on RNA extracted using the Qiagen kit; (2) a SYBR-green based real time PCR using a different set of primers (ASAH-QUANT-F and ASAH-QUANT-F; see Table 2) on RNA from a new extraction, again using the Qiagen kit; (3) a SYBR-green based real time PCR using the same set of primers but RNA extracted using TRIzol. We performed three independent RT for these experiments (Fig. 1) and measured fluorescence in real time on a 9700 sequence detection system (SDS). For each measurement, the transcript levels were normalized against the housekeeping gene actin B (ACTB) transcript using the Applied Biosystems assay Hs99999903_m1 for
QIAGEN®-1
RT
TaqMan® (ASAH2L & ASAH2)
RNAs: R= 0.75
28 cases
QIAGEN®-2
RT
24 controls
SYBR-green (ASAH2L & ASAH2) R= 0.74
TRIZOL®
RT
SYBR-green (ASAH2L & ASAH2)
R= 0.87
SYBR-green (ASAH2L specific)
R= 0.74
NORMALIZED LOG AVERAGE
Fig. 1 The three experimental procedures for RNA quantification of ASAH2/ASAH2L and the one specific to ASAH2L are shown along with the correlation coefficients between them. The normalized average measurement of three experiments (as noted) was used for the original statistical analyses
TaqMan quantification or the primers ACTB-QUANT-A and ACTB-QUANT-B when quantifying by SYBR green. All assays were designed to amplify across intron–exon junctions, ensuring that the product represented messenger RNA (mRNA) rather than possibly contaminating genomic DNA. Standard curves were generated by five twofold dilutions of a control sample starting with an input about twice that of the measured samples. A reaction with no template cDNA was used to exclude errors due to amplicon contaminations. One microliter of the RT product was used for each reaction, and all reactions were performed in triplicate. For each experiment, the same panel of RT products and standard dilutions was used to quantify both the transcript of interest and the ACTB transcript. ACTB is abundantly expressed in all cell types, and it was used for normalization to eliminate differences stemming from variation in RNA input, PCR efficiency, and other variables not related to the tested transcript abundance. We analyzed the raw data using the SDS 2.1 software (Applied Biosystems) to obtain quantities relative to the standard dilution curve. When outliers were detected in the triplicates (one replica showing > twofold difference from the other two, observed in <10% of triplicates), we deleted the outlier data and averaged the remaining replicate values. These values represent expression ratios; therefore, they are not expected to have a normal distribution. For further analysis, we used the base 2 logarithm of these values, which was normally distributed (Kolmogorov–Smirnov test p>0.2 for all measurements). To reduce random noise due to sampling and/or other experimental variables, we combined the three independent measurements by averaging after centering each on the corresponding mean. We
114
Neurogenetics (2007) 8:111–120
Table 2 Primers used for the PCR amplifications Primer name
Sequence
ASAH-QUANT-F ASAH-QUANT-R ACTB-QUANT-A ACTB-QUANT-B Exon1-F Exon2-F Exon3-F Exon3-R Exon4-F Exon4-R Exon5-F
5′-GACCCATCAGACCTTCCTCA-3′ 5′-CCCTTGTGCCAATAAAAACG-3′ 5′-CGAGAAGATGACCCAGATCA-3′ 5′-AGAGGCGTACAGGGATAGCA-3′ 5′-GTGGGACCATTGAAAACCAC-3′ 5′-CAGTGCCTTTAACCCCAGAG-3′ 5′-CCTTCATCATGGCAGAACCT-3′ 5′-GTGGGACCATTGAAAACCAC-3′ 5′-TGCCAGTGAAGGATTTAGCA-3′ 5′-CTGCAGGACCTGAATGAGTG-3′ 5′-GCATTGACATAGCACACACA AA-3′ 5′-CTGCACACCATCCACATTTC-3′ 5′-TTGAAAATGGTAGATTTGAAT GGA-3′ 5′-CGGTCAGCATGAACAACAGT-3′ 5′-CACCAATGGGACAAGTGCTA-3′ 5′-ATTGCTAAGGGACCTGGACA-3′ 5′-ATTGAGGCCTCCAACTCCA-3′ 5′-GATCCATTTTGGGACACCAT-3′ 5′-AGATGGCTTTCCCAGGATCT-3′ 5′-GGATGCAGAACATGACTGTTG-3′ 5′-AGACGTTGCATAGACCTGAAA-3′ 5′-GATATGAGGCAGCATCGACA-3′ 5′-GGGGAAGTTGCTGAAGTTAT ATT-3′ 5′-TTCTTCGGGTTAGCACCTACA-3′ 5′-TGACAGCAGGCTTCAGAATG-3′ 5′-GAAGCGGACCTGAGGAGTC-3′
Exon5-R Exon6-F Exon7-F Exon8-R Exon9-F Exon11-R Exon12-F Exon12-R Exon15-F Exon15-R Exon16-F Exon18-F Exon18-R Exon20-R Exon1-52.17 Mb specific-F Exon5-52.17 Mb-R ASAH-52QUANT-F ASAH-52QUANT-R
5′-TGCCACGATGTTGAAGTAGC-3′ 5′-GGAGTCGCAATTGTGAACAG-3′ 5′-GCAAGGTTTCTGAAGAGCTGA-3′
The names indicate the corresponding gene, otherwise they indicate the corresponding exon of the ASAH2 gene. The suffix “-F” denotes a forward and “-R” a reverse primer. Multiple combinations of primers were used as described in the text.
further analyzed the data using the statistical software package, Statistica, employing a generalized linear model to measure the independent effect of each variable on the expression levels. The log(2) of the normalized expression of the transcript was the dependent variable, while age and PMD were continuous predictors, and sex and affection status were categorical predictors along with the corresponding two-way interaction term. The primers used in various combinations to amplify the ASAH2 cDNA in the experiments described below are shown in Table 2. PCR amplifications were performed on cDNA using 0.8 units of Taq DNA polymerase from Promega (cat. no. M166F) and the manufacturers buffer, 100 μM deoxyribonucleotide triphosphate and 200 nM of each primer in 25 μl reactions and 35 cycles of denaturing
at 94°C for 30 s, annealing at 60°C for 30 s, and a varying time (from 30 s to 3 min) of extension depending on the expected product length. Sequencing reactions on PCRamplified DNA were performed using the Sanger method with fluorescently labeled di-deoxy terminators and reagents from Applied Biosystems (cat. no. 4336919). The products were treated with exonuclease and shrimp alkaline phosphatase and after ethanol precipitation was subjected to capillary electrophoresis and visualized using a 3100 automated sequencing apparatus. Northern blot analyses were performed on a commercially available multi-tissue blot (Ambion, item no. 3140) using Rapidhyb™ hybridization buffer (GE Healthcare, cat. no. RPN1635) and the manufacturer’s protocol. We generated probes by PCR using primer pairs Exon16-forward and Exon20-reverse as well as Exon1-forward and Exon4reverse on cDNA templates. The products were inserted into the pCR2.1-TOPO vector and transformed into Escherichia coli One Shot Top10 chemically competent cells using reagents and protocols provided by Invitrogen. The probe identity was verified by sequencing, and probes were labeled using the Rediprime™ II DNA labeling system (GE Healthcare cat. no. RPN1633) according to the manufacturer’s protocol.
Results Expression levels analyses In our initial screen of 12 genes in 12 LOAD cases and 9 controls, we observed no difference in expression for ALOX5, CREM, CUL2, CXCL12, MAPK8, PARG, TIMM23, and UBE2D1 (all p>0.1), while ALOX5, GDF2, and SLC18A3 failed to amplify. ASAH2, however, showed a difference in expression levels (p=0.008), which prompted us to acquire an additional 16 cases and 16 controls. The complete set of cases and controls showed modest differences in terms of age at death (cases, 83.5±4.7; controls, 74.6±16.3; p= 0.01), postmortem delay (cases, 7.8±4.1; controls, 10.6± 4.8; p=0.03), and male–female composition (5/28 males in cases vs 11/25 in controls, Fisher’s exact test p=0.03). We examined the effect of these variables in a control only analysis, and subsequently, we corrected for their possible effect by using a generalized linear model that regressed the measured expression levels on the age at death and the PMD and included sex and affection status as categorical predictors. We did not include race in the model, as there was only one African American case and three controls. Analyses on the white samples alone gave similar results to those reported below both in terms of effect sizes and significance. The three different quantifications that we performed with assays expected to recognize the ASAH2 transcript
Neurogenetics (2007) 8:111–120
115
Table 3 Statistical analysis results
Controls only Controls—all AD Controls—maternal Controls—not maternal
Number of samples
Affection (unaff.)
Sex (male)
Age of death
PMD
Control
AD
Beta
p value
Beta
p value
Beta
p value
Beta
p value
25 25 25 25
0 28 12 16
n/a 0.38 0.63 0.29
n/a 0.0008 0.0007 0.0107
0.38 0.21 0.03 0.22
0.003 0.038 0.881 0.031
−0.016 −0.017 −0.016 −0.017
0.043 0.030 0.050 0.018
−0.048 −0.043 −0.46 −0.048
0.09 0.04 0.06 0.03
Effects of affection status, sex, age, and postmortem delay on the expression levels of the ASAH2L transcript. Beta indicates the difference in log (2) expression levels for each unit of change of the variable. For binary variables, beta corresponds to the group shown in parenthesis. Significant effects are marked in red. In addition to the entire sample (second row) the control-only analysis and the analyses of the samples with maternal disease origin or with unknown disease origin are shown.
were strongly correlated (R≈0.75 for each of the three possible pairs; see Fig. 1). The results were similar for each analysis individually and for the average of their centered value, which we used to minimize the effects of experimental noise in our estimates. We first analyzed healthy controls only to determine the effects of variables other than the disease status. Variables that showed significant effects on RNA levels included age at death, PMD, and sex (Table 3). Female sex was accompanied by a 0.77-fold decrease in the ASHA2 transcript; for every year of age, there was a 1.1% reduction. PMD also correlated with lower transcript measurements possibly indicating a systematic difference in degradation rate between this transcript and our internal control ACTB, and thus, PMD kept in the model in further analyses. We subsequently performed analyses including the patients and adding the affection status variable. This analysis reiterated age and sex as significant determinants of RNA levels and showed that patient status is associated with an additional 0.77-fold lower expression (Table 3). To investigate the possibility that this gene expression difference is related to the linkage in this genomic region in families with an affected mother, we analyzed the data for the 12 maternal LOAD cases separately from the 16 cases with no reported affected parents, comparing each against the controls. We found that the association between disease status and low gene expression was significant in both sets but appeared much stronger in the maternal cases (0.63-fold lower expression in maternal cases vs 0.82 in other cases; Table 3). These results prompted us to characterize the ASAH2 transcript in more detail. ASAH2 is located close to the centromere of human chromosome 10 (51.65 Mb on the UCSC May 2004 assembly) in a region that contains numerous low copy repeats (LCRs). The gene comprises of 20 exons with the 5′ end located telomerically. Only the first three exons of ASAH2 are at a single copy, while the remaining 17 exons are duplicated at 47.41 Mb on the same chromosome, and the last 5 exons are present in yet another copy at 52.17 Mb. Figure 2 shows the ASAH2 gene with shading over the part that is duplicated at 52.17 Mb, while
the copy at 52.17 Mb is aligned below it. The LCRs are located well within our linkage peak, which extends roughly from 40–60 Mb. The targets of our TaqMan and SYBR green ASAH2 expression assays were all positioned in the last three exons as noted on Fig. 2, in the duplicated part of the gene. To sort out potential interference caused by the duplicated sequence of ASAH2, we designed primers complementary to many different exons (see “Materials and methods”) and attempted to amplify different regions of the transcript from temporal lobe cDNA. We were consistently successful in amplifying the last five exons of the gene but failed to obtain a product with any pair of primers that included one in the first 15 exons. To test the functionality of our primers, we performed two RT-PCR assays on a panel of RNAs from multiple tissues using primers annealing to exons 1 and 4 and exons 12 and 15 (exons 12 to 15 shown in Fig. 3a). No product was observed with whole brain cDNA; yet, strong products of the expected size for ASAH2 were obtained with cDNA from colon and small intestine, as well as weaker products from kidney and testes, confirming that our primers and PCR conditions were appropriate. After increasing the number of PCR amplification cycles to 37 and using primers complimentary at exons 1 and 4, we did achieve amplification using RNA from whole adult brain, but the product was barely detectable, suggesting low abundance and possibly expression in only specific parts of the brain, most likely not the temporal lobe. With this result in mind, we reviewed the genome databases and observed that 16 of 18 expressed sequence tags (ESTs) from the ASAH2 gene extended from exons 16 to 20. Additionally, 12 out of a total of 16 ESTs including all three brain-derived ESTs (all from the hippocampus) aligned better at 52.17 Mb than with ASAH2, as they contained leading sequences from regions adjacent to the partial copy of ASAH2 (see Fig. 2). We used one of the reported hippocampus ESTs (BI553338) and bioinformatics tools to examine the possibility of transcription from this location. We aligned BI553338 to the genome and extended
116
Neurogenetics (2007) 8:111–120
Fig. 2 Genomic structure of ASAH2 and ASAH2L. The top panel shows the full-length alkaline ceramidase (ASAH2) gene on chromosome 10 q at 51.65 Mb (genome build 35). Vertical bars denote the 20 ASAH2 exons while arrows indicate the direction of transcription. Three representative ESTs with the corresponding dbEST accession numbers are shown aligned to the genomic sequence. The 3′ end of the gene is located in an LCR indicated by the gray shaded region. The
bottom panel shows another copy of the LCR located at 52.17 Mb on the same chromosome (note the opposite orientation). The same three ESTs are shown. An asterisk denotes sequences on the ESTs that are present only at 52.17 Mb suggesting transcription from that location. Pairs of arrows indicate the location of the three primer pairs (a, b, and c) used for quantitative PCR
the sequence 5 Kb in the 5′ direction to include possible promoter sequences. We used this extended sequence and the web-based software Dragon Promoter Finder [21] (http://
www.research.i2r.a-star.edu.sg/promoter/promoter1_5/ DPF.htm) and found a CpG island with a putative promoter and a predicted transcription initiation site 100 bp upstream
Fig. 3 RT-PCR for ASAH2 and ASAH2L on RNA from multiple tissues. a Amplification using primers in exons 12 and 15 of the ASAH2 transcript. These primers are not expected to amplify any transcripts from the 52.17 Mb location and are most likely specific to the ASAH2 transcript. Strong bands are observed in colon and small intestine and weak bands in kidney and testes. b Amplification with a
forward primer complementary to sequences specific to transcripts from the 52.17 Mb location (primer Exon1-52.17 Mb specific-F; first exon of BI553338) and a reverse primer in exon 19 of the ASAH2 gene (primer Exon5-52.17 Mb-R; fifth exon of BI553338). Amplification of the expected fragment is observed in all tissues tested
Neurogenetics (2007) 8:111–120
from the beginning of BI553338. This, together with the failure to amplify the upstream exons, increased our suspicion that the transcript we were investigating for expression might be transcribed from the partial copy of the ASAH2 gene at 52.17 Mb, rather than the complete ASAH2 at 51.65 Mb. The Vega Human Genome database (http:// www.vega.sanger.ac.uk) based on the bioinformatics and sequence annotates at that location a putative transcript (OTTHUMG00000018239), which corresponds to our prediction. To experimentally confirm the presence of the transcript, from this putative new gene that we refer to as ASAH2L, we designed specific primers complementary to sequences present only in ASAH2L (primers exon152.17 Mb specific-forward and exon5-52.17 Mb-reverse). We were able to amplify cDNA from all tissues tested (Fig. 3b), and nucleotide sequencing showed that the amplified fragment corresponded to ASAH2L. A multi-tissue Northern blot, using a probe spanning the exons 16 to 20 that are common to both ASAH2 and ASAH2L (Fig. 4a), revealed an abundant 5.3 Kb transcript in colon with less abundant hybridizing transcripts of 6.5 and 2.5 Kb. All the other tissues included on the blot showed a major transcript of 3.2 Kb and a less abundant transcript at 5.3 Kb. After stripping the membrane and obtaining a clear 72-h exposure, we re-hybridized using a probe from the first four exons of ASAH2, expected to be specific to ASAH2. After a 72-h exposure, we observed again a strong transcript in colon at 5.3 Kb, now accompanied by less abundant transcripts at 2.5 and 1.5 Kb. There were no detectable transcripts in other tissues. These results led us to suspect that the ASAH2 transcript is expressed primarily in the intestine and is not abundant in adult brain. Most importantly, this result is consistent with our hypothesis that our RT-PCR expression assays were measuring the ASAH2L transcript in the brain. Fig. 4 Northern blot analysis of RNA from multiple tissues. a Hybridization with a probe complementary to the 3′ end of the ASAH2 gene (exons 16 to 20), which is also present in the novel transcript. b The same Northern blot hybridized with a probe that is complementary only to the ASAH2 transcript (exons 1–4). With the exception of the colon, which shows very high expression of ASAH2, all other tissues do not show hybridization to the ASAH2-specific probe but only to the probe that also recognizes ASAH2L
117
To confirm this, we designed primers for quantitative PCR that would specifically amplify from ASAH2L and performed an additional expression experiment (primers ASAH-52QUANT-F and ASAH-52QUANT-R). The data for this pair of primers correlated strongly with the previous data (R=0.87 with the experiment using the same RNA extraction and RT and R=0.53 and 0.69 with the other two experiments). The lower expression in patients (p=0.0016) as well as the effects of sex and age in controls (p=0.04 and p=0.08) remained. These results support that ASAH2L not ASAH2 mRNA is reduced in the LOAD brain. To assess the possible functionality of ASAH2L, we searched for open reading frames (ORFs) beginning with putative translation start codons. We used the sequence of the Vega-annotated transcript ASAH2B-002 and found four ORFs starting with a methionine codon and extending more than 50 codons. The first, starting at position 291, was 123codons long but had a putative initiation methionine codon that did not conform well to the Kozak consensus [22, 23], as it lacked a purine at position −3 relative to the ATG codon, a residue shown to be of critical importance for start codon recognition. The putative protein product of this ORF shows no matches or homologies with any human or non-human protein in GenBank according to our basic local alignment search tool (BLAST) [24] searches, except for a hypothetical transcript derived from the chimpanzee DNA sequence, which does not come from the region syntenic to the human ASAH2 gene locus and its paralogs. A second ORF starts at position 785 at a methionine codon with a strong Kozak consensus sequence. This ORF encodes a 160-amino-acid peptide corresponding to the carboxyterminus region of alkaline ceramidase. One of the residues of this start codon, however, was different in the sequence of the EST BI553338. We used specific primers for the 52.17 Mb repeat location and found by nucleotide
118
sequencing that this nucleotide is polymorphic (C/T) in ASAH2L. The derived allele (T) creates a translation start codon (ATG), and it is present in ∼17% of the population (allele frequency ∼8.5%). We found no evidence of differences in the frequency of its occurrence between cases and controls (5/28 controls and 4/25 cases had the T variant). Additionally, by sequencing both genomic and cDNA for heterozygotes for this variant, we observed expression of both alleles, thus no evidence of imprinting. Although the balanced distribution of the polymorphic start site argues against a possible involvement of the corresponding transcript in AD, its presence might prevent the use of a downstream transcript, thus reducing its levels in heterozygotes. Consistent with this, we found that cases with this start site had 15% higher transcript levels than the rest of the cases (not a statistically significant difference), and exclusion of cases and controls carrying this start site slightly increased the observed effects (from 0.38 on Table 3 to 0.41). A third ORF at position 1,960 is the only one present in all individuals and has a putative initiation methionine codon that conforms to the Kozak consensus. It encodes a novel polypeptide of 109 amino acids that shows no homology to known proteins.
Discussion We report the identification of a novel transcript, which we refer to as ASAH2L, a transcript derived from a 5′ truncated paralog of the ASAH2 gene. The expression of ASAH2L is lower in females, decreases with age, and appears further reduced in Alzheimer’s patients. Female sex and increasing age are both recognized risk factors for LOAD [3, 4]. Our data are consistent with a model where low expression of ASAH2L is a risk factor for LOAD; however, further studies are needed to test this model. ASAH2L is a novel gene of unknown function resulting from a duplication of a fragment of ASAH2 and insertion at a new location next to promoter sequences. Human alkaline ceramidase was first cloned by El Bawab et al. [25]. Their data supported strong expression of ASAH2 in the brain; however, they used a probe from the part of the 3′-end gene that we now know to be duplicated. The major transcript they detected was ∼3.5 Kb, and we now show that it most likely corresponded to the ASAH2L transcript. We do not observe the 3.5-Kb fragment when using a probe specific to ASAH2. The predicted Vega transcript ASAH2B-002 (ID, OTTHUMT00000048083) that corresponds ASAH2L at 52.17 Mb is 3.7 Kb long consistent with our observations. El Bawab et al. [25] also observed a larger transcript, which they sized at ∼7 Kb, as
Neurogenetics (2007) 8:111–120
opposed to our sizing of the larger transcript we observed at 5.3 Kb. If one disregards the sizing difference, their blot (Fig. 4 in their paper) looks very similar to ours (Fig. 4a). In our data, the larger transcript is the same size with that seen strongly in colon with the ASAH2 specific probe (Fig. 4b). This larger transcript was barely detectable in human brain RNA in the El Bawab et al. [25] study while it was observed in RNA from kidney and pancreas. In our Northern blot, the 5.3-Kb transcript is strong in the intestine and very weak, but detectable, in all other tissues tested including kidney and pancreas. Our RT-PCR results are consistent with Northern blot results on mouse tissues published by Choi et al. [26] who detected the presence of an ASAH2 transcript in roughly the same set of tissues as we did without strong expression in the brain. Our results suggest strong ASAH2 expression only in the intestine. This is consistent with recent literature on a major role of the ASAH2 gene product, alkaline ceramidase, in the intestine for the degradation of sphingolipids [27, 28]. Nevertheless, alkaline ceramidase activity has been reported in the brain of rats and zebrafish [28, 29]. In contrast with the results of Choi et al. [26], Tani et al. [30] also observed it in mouse brain. The later discrepancies may be explained by possible differences in the included brain regions, while differences across species could also exist. Our RT-PCR experiments failed to amplify an ASAH2-specific transcript in the temporal lobe, while after increasing the PCR cycles, we weakly amplified the transcript in cDNA from whole human brain, suggesting low expression limited to brain regions other than the temporal lobe. Alkaline ceramidase is involved in the hydrolysis of ceramide in the neutral alkaline pH range, and it is involved in programmed cell death [26]. It has been suggested that it localizes in the mitochondria [25], although recently, this has been challenged [31]. As ceramide and ceramidase are strongly involved in apoptosis signaling and the mitochondria are maternally inherited and also known to be important in apoptosis [32], involvement of ASAH2 in AD would fit the maternal effect observation. We showed, however, that the downregulated transcript is not the product of ASAH2 but rather ASAH2L, a novel gene that includes sequences from the 3′ end of ASAH2 corresponding to the carboxy-terminus of ceramidase. This transcript contains many potential translation start sites including one that is polymorphic and would produce a fragment of alkaline ceramidase. Further study is needed to show if any of these putative proteins are actually produced and have a function. The possibility that the RNA itself has some function influencing the alkaline ceramidase gene can also not be excluded. Our observation of an AD-related expression difference for ASAH2L has been robust to increases in sample size and
Neurogenetics (2007) 8:111–120
use of different quantification approaches. The downregulation with age and the lower expression in females, both of which are risk factors for AD, are consistent with the hypothesis that reduced ASAH2L transcription might increase susceptibility to AD. The localization of ASAH2L under a strong linkage peak also argues for this possibility, especially as the reduced expression is more pronounced in the maternal cases, consistent with the observed increased linkage in maternal families [2]. Nevertheless the expression data alone cannot establish an involvement of this gene in AD, and further investigation is necessary. There are a number of limitations that need to be taken into account in reference to our conclusions. One could argue that the observed differences might be due to the reference gene, ACTB, rather than the target. This, however, is very unlikely, as none of the other genes tested using the same reference showed expression differences. The examination of 12 genes, on the other hand, raises a multiple comparisons question. The observed significance levels can withstand correction at least regarding the disease effect (corrected p=0.0096), which was the only effect studied across all genes. Additionally, 11 genes were not tested on the extended sample, and three of them did not provide data; thus, even this correction is conservative. The observed gene expression differences are based on statistical analyses and, as with any such study, type I errors cannot be excluded with certainty. A replication study would be very valuable to this end. It also remains to be investigated whether the reduction in expression constitutes a risk factor or is a consequence of the disease process. The localization of the gene under a linkage peak and the parent of origin effect are more consistent with a causative role; however, showing causation would require further experimental evidence including identification of the relevant DNA variant(s) and the biological mechanism involved. Beyond the possible implications for AD, we find the emergence of a novel transcript from a partial duplication of a previously existing gene intriguing. The rearrangement that took place in this region is possibly human specific, as according to our genome database searches, the corresponding sequences in the chimpanzee, rhesus monkey, and other mammals are not known to be duplicated. Consistently with this, Choi et al. [26] and Tani et al. [30] did not observe the new transcript in their work on mouse ceramidase. Another interesting observation regarding this new transcript is the existence of a derived polymorphic allele that creates the strongest translation start site, likely leading to variation regarding the translation of the corresponding protein and proteins corresponding to downstream translation start sites. We find this new gene to be a very attractive target of additional study, as it might teach us more about gene evolution and the birth of new genes,
119
while it will also likely promote our knowledge on neurodegeneration and Alzheimer’s disease. Acknowledgment This work was supported by grants RO1AG022099 and RO1AG021804 from the National Institute of Aging to DA and SSB, respectively. All described experiments comply with the laws of the United States of America.
References 1. Bassett SS, Avramopoulos D, Perry RT, Wiener H, Watson B Jr, Go RC, Fallin MD (2006) Further evidence of a maternal parent-of-origin effect on chromosome 10 in late-onset Alzheimer’s disease. Am J Med Genet B Neuropsychiatr Genet 141:537–540 2. Bassett SS, Avramopoulos D, Fallin D (2002) Evidence for parent of origin effect in late-onset Alzheimer disease. Am J Med Genet 114:679–686 3. Miech RA, Breitner JC, Zandi PP, Khachaturian AS, Anthony JC, Mayer L (2002) Incidence of AD may decline in the early 90s for men, later for women: the cache county study. Neurology 58:209– 218 4. Devi G, Ottman R, Tang MX, Marder K, Stern Y, Mayeux R (2000) Familial aggregation of Alzheimer disease among whites, African Americans, and Caribbean Hispanics in northern Manhattan. Arch Neurol 57:72–77 5. Bergem AL, Engedal K, Kringlen E (1997) The role of heredity in late-onset Alzheimer disease and vascular dementia. A twin study. Arch Gen Psychiatry 54:264–270 6. Gatz M, Pedersen NL, Berg S, Johansson B, Johansson K, Mortimer JA, Posner SF, Viitanen M, Winblad B, Ahlbom A (1997) Heritability for Alzheimer's disease: the study of dementia in Swedish twins. J Gerontol Ser A Biol Sci Med Sci 52:M117– M125 7. Goate A, Chartier-Harlin MC, Mullan M, Brown J, Crawford F, Fidani L, Giuffra L, Haynes A, Irving N, James L et al (1991) Segregation of a missense mutation in the amyloid precursor protein gene with familial Alzheimer’s disease. Nature 349:704–706 8. Sherrington R, Rogaev EI, Liang Y, Rogaeva EA, Levesque G, Ikeda M, Chi H, Lin C, Li G, Holman K et al (1995) Cloning of a gene bearing missense mutations in early-onset familial Alzheimer’s disease. Nature 375:754–760 9. Levy-Lahad E, Wijsman EM, Nemens E, Anderson L, Goddard KA, Weber JL, Bird TD, Schellenberg GD (1995) A familial Alzheimer’s disease locus on chromosome 1. Science 269:970–973 10. Strittmatter WJ, Saunders AM, Schmechel D, Pericak-Vance M, Enghild J, Salvesen GS, Roses AD (1993) Apolipoprotein E: high-avidity binding to beta-amyloid and increased frequency of type 4 allele in late-onset familial Alzheimer disease. Proc Natl Acad Sci USA 90:1977–1981 11. Brookes AJ, Prince JA (2005) Genetic association analysis: lessons from the study of Alzheimers disease. Mutat Res 573:152–159 12. Kamboh MI (2004) Molecular genetics of late-onset Alzheimer’s disease. Ann Hum Genet 68:381–404 13. Lendon C, Craddock N (2001) Susceptibility gene(s) for Alzheimer’s disease on chromosome 10. Trends Neurosci 24:557–559 14. Walsh JB (1995) How often do duplicated genes evolve new functions? Genetics 139:421–428 15. Long M, Langley CH (1993) Natural selection and the origin of jingwei, a chimeric processed functional gene in Drosophila. Science 260:91–95
120 16. Samonte RV, Eichler EE (2002) Segmental duplications and the evolution of the primate genome. Nat Rev Genet 3:65–72 17. Lupski JR, Stankiewicz P (2005) Genomic disorders: molecular mechanisms for rearrangements and conveyed phenotypes. PLoS Genet 1:e49 18. Sharp AJ, Locke DP, McGrath SD, Cheng Z, Bailey JA, Vallente RU, Pertz LM, Clark RA, Schwartz S, Segraves R et al (2005) Segmental duplications and copy-number variation in the human genome. Am J Hum Genet 77:78–88 19. Blacker D, Bertram L, Saunders AJ, Moscarillo TJ, Albert MS, Wiener H, Perry RT, Collins JS, Harrell LE, Go RC et al (2003) Results of a high-resolution genome screen of 437 Alzheimer’s disease families. Hum Mol Genet 12:23–32 20. Busatto GF, Garrido GE, Almeida OP, Castro CC, Camargo CH, Cid CG, Buchpiguel CA, Furuie S, Bottino CM (2003) A voxelbased morphometry study of temporal lobe gray matter reductions in Alzheimer’s disease. Neurobiol Aging 24:221–231 21. Bajic VB, Tan SL, Chong A, Tang S, Strom A, Gustafsson JA, Lin CY, Liu ET (2003) Dragon ERE Finder version 2: a tool for accurate detection and analysis of estrogen response elements in vertebrate genomes. Nucleic Acids Res 31:3605–3607 22. Kozak M (1984) Compilation and analysis of sequences upstream from the translational start site in eukaryotic mRNAs. Nucleic Acids Res 12:857–872 23. Kozak M (1986) Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell 44:283–292
Neurogenetics (2007) 8:111–120 24. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410 25. El Bawab S, Roddy P, Qian T, Bielawska A, Lemasters JJ, Hannun YA (2000) Molecular cloning and characterization of a human mitochondrial ceramidase. J Biol Chem 275:21508–21513 26. Choi MS, Anderson MA, Zhang Z, Zimonjic DB, Popescu N, Mukherjee AB (2003) Neutral ceramidase gene: role in regulating ceramide-induced apoptosis. Gene 315:113–122 27. Kono M, Dreier JL, Ellis JM, Allende ML, Kalkofen DN, Sanders KM, Bielawski J, Bielawska A, Hannun YA, Proia RL (2006) Neutral ceramidase encoded by the asah2 gene is essential for the intestinal degradation of sphingolipids. J Biol Chem 281:7324–7331 28. Yoshimura Y, Tani M, Okino N, Iida H, Ito M (2004) Molecular cloning and functional analysis of zebrafish neutral ceramidase. J Biol Chem 279:44012–44022 29. El Bawab S, Bielawska A, Hannun YA (1999) Purification and characterization of a membrane-bound nonlysosomal ceramidase from rat brain. J Biol Chem 274:27948–27955 30. Tani M, Okino N, Mori K, Tanigawa T, Izu H, Ito M (2000) Molecular cloning of the full-length cDNA encoding mouse neutral ceramidase. A novel but highly conserved gene family of neutral/alkaline ceramidases. J Biol Chem 275:11229–11234 31. Hwang YH, Tani M, Nakagawa T, Okino N, Ito M (2005) Subcellular localization of human neutral ceramidase expressed in HEK293 cells. Biochem Biophys Res Commun 331:37–42 32. Green DR, Reed JC (1998) Mitochondria and apoptosis. Science 281:1309–1312