Jacoby Transfer - The Jacoby Transfer convention is a popular bidding treatment to enable the Notrump opener to become declarer when the responder holds five or more cards in a suit. The Jacoby Transfer bid allows the stronger hand to remain unexposed. Further, leading up to the strong declarer's hand (the Notrump opener) may cause the leader to inadvertently finesse the defender's side. Finally, not seeing the strong Notrump bidders assets make it more challenging to defend the hand. Also see Retransfers and Texas Transfers.
Associated Data
Abstract
Despite its biological importance, transfer RNA (tRNA) could not be adequately sequenced by standard methods due to abundant post-transcriptional modifications and stable structure, which interfere with cDNA synthesis. We achieve efficient and quantitative tRNA sequencing using engineered demethylases to remove base methylations and a highly processive thermostable group II intron reverse transcriptase to overcome these obstacles (DM-TGIRT-seq). Our method should be applicable to investigations of tRNA in all organisms.
High-throughput RNA sequencing (RNA-seq) has revolutionized our understanding of gene expression. Widely used RNA-seq methods start with adapter ligation and cDNA synthesis of biological RNA samples followed by PCR amplification to generate sequencing libraries. These standard methods work well for most cellular RNAs, such as mRNA, long ncRNA, miRNA, or fragments derived from rRNA, snRNA, and snoRNA. tRNA is the only class of cellular RNA for which the standard sequencing methods cannot yet be applied efficiently and quantitatively, although attempts have been made (e.g. ref ). Significant obstacles for the sequencing of tRNA include the presence of numerous post-transcriptional modifications and its stable and extensive secondary structure, which interfere with cDNA synthesis and adapter ligation. tRNAs are essential for cells and their synthesis is under stringent cellular control. Accumulating evidence shows that tRNA expression and mutations are associated with various diseases such as neurological pathologies and cancer development,. The lack of efficient and quantitative tRNA-seq methods has hindered biological studies of tRNA.
We applied two strategies to eliminate or substantially reduce the obstacles of tRNA modification and structure for efficient and quantitative tRNA sequencing (Fig. 1a). A DNA and RNA repair protein AlkB-derived enzyme mixture was first used to remove methylations at the Watson-Crick face. Three specific modifications, N1-methyladenosine (m1A), N3-methylcytosine (m3C), and N1-methylguanosine (m1G), are abundant in eukaryotic tRNAs and are particularly problematic for reverse transcriptases (RTs) by causing cDNA synthesis to stop or misincorporate. In mammals, known modifications of these types include m1A present in all tRNAs at position 58, m3C present in five tRNAs at position 32 and the variable loop and m1G present in about half of all tRNAs at positions 37 or 9. We applied two recombinant enzymes as a mixture to remove these three methylations in human tRNAs. The first was the wild-type enzyme AlkB (wtAlkB) from E. coli, which is known to very efficiently demethylate m1A and m3C in single-stranded nucleic acids as its DNA and RNA repair function,. Wild-type AlkB, however, works very poorly on m1G modification. Based on its known three-dimensional structure complexed with nucleic acids, we engineered AlkB to generate a specific mutant, D135S that efficiently converted m1G to G (Supplementary Figs. 1a,b). We optimized the reaction conditions and enzyme to RNA ratios using a mixture of the wild-type and the D135S AlkB proteins (Supplementary Figs. 1c,d). We were able to remove >80% of total m1A, m3C, and ~70% m1G without a noticeable change in tRNA quality (Fig. 1b). The remaining m1A or m1G may be more buried in the tRNA tertiary structure (m1A or m1G at position 9 of tRNAs) and thus not easily accessible to demethylase treatment without causing tRNA degradation.
Demethylase-thermostable group II intron RT tRNA sequencing (DM-TGIRT-seq)(a) Our method differs from standard RNA-seq in two ways. First, a demethylase mixture was used to remove m1A, m1G and m3C modifications located at the Watson-Crick face. Second, a thermostable group II intron RT (TGIRT) was used that is less sensitive to tRNA structure and adds RNA-seq adaptors by template-switching without RNA ligation. (b) Demethylation efficiency for total tRNA as measured by QQQ LC-MS. (c) RT reaction for both purified tRNA and total RNA as template with (+) or without (−) demethylase treatment. Blue line shows the gel region excised for library construction.
We used a thermostable group II intron reverse transcriptase (TGIRT) with high processivity to generate cDNA from highly structured tRNA. The TGIRT reaction does not require adapter ligation; it synthesizes cDNA by template-switching from the adaptor to the 3’ end of the target RNA-. Importantly, demethylase treatment markedly reduced the amount of RT stops at the m1A58 and m1G37 positions. At the same time, the amount of longer and full-length cDNAs was substantially increased after demethylase treatment of tRNA (Fig. 1c). Similar results regarding demethylase treatment were obtained when the RNA template was either total cellular RNA or gel-purified total tRNA (Fig. 1c). These results show that the demethylase treatment is effective in producing longer reads; this property will be crucial for the ability to adequately map mammalian tRNA expression where many tRNAs have just one or a handful of differences in their sequences,.
We performed Illumina sequencing using the libraries generated from tRNAs with and without the demethylase treatment. To facilitate our focus on tRNA at this time, the sequencing reads were mapped only to the genomic tRNA database, which contains 515 predicted tRNA genes distributed over 330 unique sequences and 110 predicted tRNA pseudogenes. In all, 6.6–15.7 million and 2.3–4.7 million reads were mapped to the genomic tRNA database when using gel-purified tRNA or total RNA as template, respectively (Supplementary Fig. 2a). These variations in read numbers were derived from sample handling, as similar proportions of reads were obtained for the added internal tRNA standards (Supplementary Fig. 2b). For the biological replicate from HEK293T cells, both untreated and treated samples showed similarly high reproducibility with r2-values from 0.985–0.991 (Supplementary Figs. 3a-d). As already indicated by the cDNA bands (Fig. 1c), the read length and in particular the proportion of reads corresponding to full-length tRNA is increased substantially upon demethylase treatment, as shown by the position plots along individual tRNA genes (Figs. 2a,b). The highest fraction of reads in a LeuAAG tRNA in the untreated samples were RT stops due to the m1G37 modification, whereas all stops at this position were removed with the corresponding increase of full-length tRNA reads upon demethylase treatment (Fig. 2a). Similarly, the substantial RT stop at m1G9 of a GlnCTG tRNA was markedly reduced upon demethylase treatment, together with a substantially increased generation of full-length reads (Fig. 2b). We still detected a strong stop at the m22G26 (N2,N2-dimethylguanosine) residue in LeuAAG tRNA that remained unchanged upon demethylase treatment. This result indicates that our demethylase mixture is not effective in removing m22G modifications, which are present in ~20% of tRNAs. Nevertheless, our results indicate that the demethylase treatment is very effective in producing longer reads; this property is crucial for the ability to adequately map the mammalian tRNAome at single base resolution.
Sequencing results(a) 5’ position sequencing plot of a LeuAAG tRNA containing an m1G37 modification. (b) 5’ position sequencing plot of a GlnTTG tRNA containing an m1G9 modification. For panels (a) and (b), m1A58 stops correspond to very short DNA fragments that are outside the range of our sequenced cDNA, and hence cannot be visualized here. (c) As expected, tRNA isoacceptor expression in HEK293T cells does not correlate with the gene copy number, since human tRNA expression is tissue specific. (d) Comparison of the array fluorescent signals and sequencing reads for the Arg-tRNA family; error bar, n = 4 ± SD. (e) Mismatch and stops with and without demethylase treatment. Shown are modification positions of m1A58 in ValCAC, m3C32 in ThrAGT, m1G37 in ProTGG, and m1G9 in GlnCTG. Star indicates stops for ValCAC corresponding to very short reads that were not determined in this experiment. n = 4 ± SD. (f) Sequence logo at the same modified positions as in (e), centered at the modified residue. Untreated (−) is at the top and the demethylases treated (+) is at the bottom.
We performed additional analysis to further demonstrate the usefulness of our sequencing method. Plotting each tRNA isoacceptor against their genomic gene copy number showed a poor correlation (Fig. 2c), consistent with the known tissue specific tRNA expression in humans13,. We compared the fraction read of the tRNAArg isoacceptors to the fluorescence hybridization signals of the Arg-tRNA probes from tRNA microarrays which were obtained through hybridization without the need for cDNA synthesis13,. The sequencing and array results showed the same trend of isoacceptor abundance, thus validating the quantitative nature of tRNA abundance obtained independently through sequencing and hybridization based approaches (Fig. 2d). We also compared RT stops and misincorporations at known modification positions with and without demethylase treatment. In the case of m1A58 in ValCAC and m3C32 in ThrAGT, the demethylases completely removed these modifications as demonstrated by the reversion to cognate sequence at these positions (Figs. 2e-f). In the case of m1G37 in ProTGG and m1G9 in GlnCTG, the demethylase treatment removed a large majority of the modification so the mismatch and stops were substantially reduced (Figs. 2e-f). Therefore, our DM-TGIRT-seq method can determine differences in the modification dynamics of m1A, m1G and m3C at single base resolution, as well as potentially infer positions of non-demethylated modifications.
We also examined the expression of unique tRNA genes from chromosome 6. Human chromosome 6 contains approximately one third of all tRNA genes, and over 150 genes are clustered within a 2.7 Mbp region near the class I major histocompatibility (MHC) genes. We found that the tRNA expression levels within the tRNA gene cluster were higher compared to the tRNA genes outside of the cluster (Supplementary Fig. 4). The expression level of tRNA genes in the cluster was uneven, suggesting that the expression of tRNA genes was not coordinated throughout the entire cluster in HEK293T cells.
In summary, despite its biological importance, tRNA has not been adequately studied at the transcriptome level due to its unique characteristics in modification and structure. Our approach described here makes efficient and quantitative tRNA-seq feasible, and has an additional advantage of being able to study modifications in a high throughput manner. Our method is accessible to any lab and offers broad potentials in basic research and diagnostic applications.
METHODS
Design of the D135S AlkB mutant
The substrates for the wild-type AlkB (wtAlkB), m1A and m3C, are positively charged. A close examination of the structures of substrate-bound AlkB revealed that these positively charged substrates are favorably positioned in the active site by interacting with the negatively charged carboxylic group of Asp135 (D135) (Supplementary Fig. 1a). We reasoned that single mutants at the D135 position might allow better accommodation of m1G in the active site (Supplementary Fig. 1a), thereby enhancing the demethylation efficiency of m1G. Truncated but catalytically active AlkB-ΔN11 was used and mutants with single amino acid replacement at D135 were screened. Of several mutations tested, AlkB D135S gave the best demethylation yields towards m1G (Supplementary Fig. 1b). In the AlkB-m1G structure model constructed by computationally mutating the AlkB structure (Supplementary Fig. 1a), the shorter side chain of S135 seems to allow more room to accommodate m1G while sustaining the crucial hydrogen bond with m1G, which may explain the improvement in activity.
We also performed pH-activity profiling for the demethylation of m1A and m1G in tRNA. Decreasing demethylation activity was observed with increasing pH for both m1A and m1G substrates (Supplementary Fig. 1c). Therefore, we chose pH 5.0 for detailed kinetic analysis and subsequent experiments.
Cloning, expression and purification of AlkB and mutant
A truncated AlkB with deletion of the amino (N)-terminal 11 amino acids was cloned into a pET30a vector (Novagen) and overexpressed in E. coli BL21(DE3). The proteins were purified following procedures published previously. Briefly, cells were grown at 37 °C in the presence of 50 μM kanamycin until the OD600 reached 0.6–0.8. After the addition of IPTG (1 mM) and FeSO4 (5 μM), the cells were incubated for an additional 4 hr at 30 °C. Cells were collected, pelleted and then resuspended in lysis buffer (10 mM Tris pH 7.4, 300 mM NaCl, 5% glycerol, 2 mM CaCl2, 10 mM MgCl2, 10 mM 2-mercaptoethanol). The cells were lysed by sonication and then centrifuged at 17,418 rcf for 20 min. The soluble proteins were first purified using a Ni-NTA superflow cartridge (Qiagen), and then further purified by ion-exchange (Mono S GL, GE Healthcare) and gel-filtration (Superdex-200, Pharmacia) chromatography. All protein purification steps were performed at 4 °C. The Asp135 to Ser mutation was introduced using the QuikChange Site-Directed Mutagenesis Kit (Agilent). The mutant protein was expressed and purified following the same procedure as that of the wild-type protein.
Mammalian cell culture and RNA preparations
Human embryonic kidney cell line HEK293T (CRL-11268) were obtained from American Type Culture Collection (ATCC) and used without further validation. Cells were cultured in DMEM (Thermo) media supplemented with 10% FBS and 1% 100 × Pen Strep (Gibco). Cells were routinely checked for mycoplasma contamination every 3–6 months, using Universal Mycoplasma Detection Kit (ATCC).
Total RNA was isolated by using a mirVana miRNA Isolation Kit (Life Technologies). Purified total RNA was premixed with the T7 RNA polymerase transcripts of three tRNA standards13 (0.01 pmol each standard per μg of total RNA) and deacylated by incubating in 0.1 M Tris-HCl, pH 9 at 37°C for 45 min. Although not necessary for studies of mature tRNAs, which all end with 3’CCA, deacylated RNAs with or without demethylation treatment could be treated with T4 polynucleotide kinase (Epicentre) at 37C° for 30 min to further warrant a free 3’ hydroxyl group for template-switching.
When necessary, total tRNA was subsequently isolated using a denaturing 10% polyacrylamide gel followed by passive gel elution and ethanol precipitation.
Demethylation reactions
Demethylation activity assay was performed for either gel purified total tRNA or total cellular RNA. For total tRNA, 100 μl of reaction mixture containing 1 μg of tRNA (~40 pmol) was treated with 2× molar ratio of wtAlkB (80 pmol) and 4× molar ratio of D135S mutant (160 pmol). For total cellular RNA, 5 μg of total RNA (estimated to contain ~40 pmol tRNA) was treated with 4× molar ratio of wtAlkB (160 pmol) and 4× molar ratio of D135S (200 pmol). More demethylases were used for total RNA to alleviate potential interference by rRNA and mRNA in the reaction. The reaction buffer contained 300 mM KCl, 2 mM MgCl2, 50 μM of (NH4)2Fe(SO4)2·6H2O, 300 μM 2-ketoglutarate (2-KG), 2 mM L-ascorbic acid, 50 μg/mL BSA, 50 mM MES buffer (pH 5.0). In both cases, the reaction was incubated for 2 hr at room temperature, and quenched by the addition of 5 mM EDTA. After phenol–chloroform extraction, tRNA was recovered by ethanol precipitation.
Thermostable group II intron RT template-switching
Template-switching reactions were performed as described-. Briefly, we used an initial template-primer substrate consisting of a 41-nt RNA oligonucleotide (5’-AGA UCG GAA GAG CAC ACG UCU AGU UCU ACA GUC CGA CGA UC/3SpC3/-3’) that contains Illumina Read1 and Read2 primer-binding sites and a 3’ blocking group (three carbon spacer; Integrated DNA technologies, inc.) annealed to a complementary 32P-labeled DNA primer with a single-nucleotide 3’ overhang, T, which facilitates the template switch to full-length tRNAs that mostly contain a 3’ CCA end. For TGIRT template-switching reactions, typically 100 ng of demethylated tRNAs or 1 μg of demethylated total RNA were mixed with the initial template-primer substrate (100 nM) and 500 nM TGIRT (GsI-IIC MalE rigid fusion RT) in reaction medium containing 450 mM NaCl, 5 mM MgCl2, 20 mM Tris-HCl, pH 7.5, and 5 mM DTT. The reactions were pre-incubated at room temperature for 30 min, initiated by adding 25 mM dNTPs (an equimolar mix of 25 mM dATP, dCTP, dGTP and dTTP) to a final concentration of 1 mM and incubating at 60°C for 30 min. The reactions were terminated by adding 5 M NaOH to a final concentration of 0.25 M, incubating at 95°C for 3 min, and neutralizing with 5 M HCl. The cDNAs resulting from template-switching were analyzed in a denaturing 6% polyacrylamide gel, electroeluted using a D-tube Dialyzer Maxi with MWCO of 6–8 kDa (EMD Millipore), and ethanol precipitated with 0.3 M sodium acetate, pH 5.2, in the presence of 25 μg of linear acrylamide (Life Technologies) carrier. The purified cDNAs were then circularized with CircLigase II (Epicentre) using the manufacturer's protocol with an extended incubation time of 5 hr at 60°C, extracted with phenol-chloroform-isoamyl alcohol (25:24:1), ethanol precipitated and amplified with Phusion-HF (Thermo Scientific) using Illumina multiplex (5’- AAT GAT ACG GCG ACC ACC GAG ATC TAC ACG TTC AGA GTT CTA CAG TCC GAC GAT C -3’) and barcode (5’- CAA GCA GAA GAC GGC ATA CGA GAT BARCODE GTG ACT GGA GTT CAG ACG TGT GCT CTT CCG ATC T -3’) primers for 12 cycles of 98°C for 5 sec, 60°C for 10 sec and 72°C for 10 sec. The PCR products were sequenced on an Illumina HiSeq system.
Quantitative analysis of modification levels using LC-MS/MS
Quantitative analysis of modified nucleotides was done as previously described. Briefly, 100 ng of tRNA was digested by nuclease P1 (2 U) in 30 μl of buffer containing 25 mM NaCl, and 2.5 mM ZnCl2 at 37°C for 1 hr, followed by the addition of NH4HCO3 (100 mM) and alkaline phosphatase (0.5 U). After an additional incubation at 37°C for 1 hr, the solution was diluted to 60 μl, and 10 μl of the solution was injected for LCMS/MS. Nucleosides were separated by reverse phase ultra-performance liquid chromatography on a C18 column with online mass spectrometry detection using an Agilent 6410 QQQ triple-quadruple LC mass spectrometer in positive electrospray ionization mode. The nucleosides were quantified using the nucleoside to base ion mass transitions of 282 to 150 (m1A), 268 to 166 (A), 298 to (m1G), 284 to 152 (G), 258 to 126 (m3C) and 244 to 112 (C). Quantification was performed by comparison with the standard curve obtained from pure nucleoside standards running in the same batch of samples. Modification levels were compared by the ratios of methylated base (m1A, m1G, m3C) over regular base (A, G, C).
tRNA microarrays
The tRNA microarray assay consists of four steps starting from purified tRNA or total RNA without the need of cDNA synthesis: (i) deacylation to remove all 3’ attached amino acids, (ii) selective fluorophore labeling of tRNA using oligonucleotide ligation with T4 DNA ligase to the 3’CCA of all tRNAs, (iii) hybridization, and (iv) data analysis. The reproducibility of the tRNA microarray method and validation of the results have been extensively described previously13,.
Sequencing read mapping
Sequencing reads were aligned using Bowtie to a modified hg19 genomic tRNA database. A single mismatch was allowed in order to identify potential modification misincorporations at a modification site. Briefly, a tRNA library was adapted from the tRNAScan-SE library by appending CCA to tRNAs from the genomic tRNA database (http://gtrnadb.ucsc.edu/Hsapi19/). Isodecoders with identical scores were consolidated for ease of identity assignment, decreasing the number of reference genes and pseudogenes from 625 to 462. Prior to mapping, reads were processed using Trimmomatic v0.32. Sequences greater than 15 bp were then aligned to the aforementioned culled tRNA library using Bowtie2 with sensitive options. Reads mapping to multiple isodecoders due to length of fragment were discarded. Modification fractions were determined by analyzing at a putative position ‘n’ the number of correct reads, the number of misincorporations, as well as the number of reads stopped at the n+1 position.
Supplementary Material
1
ACKNOWLEDGEMENTS
This work was supported by US National Institutes of Health (NIH) grant DP1GM105386 to T.P, NIH grants R01GM37949 and GM37951 to A.M.L, K01HG006699 to Q.D., NIH MCB Training Grant (T32GM007183) to W.C.C, and a Chicago Biomedical Consortium Postdoctoral Research Grant Award to G.Z. We thank L.Zhang, W.J.Chen, and I.A.Gagnon for technical assistance. C.H. is supported by US Howard Hughes Medical Institute as an investigator.
Footnotes
ACCESSION NUMBERS
The GEO accession number for the sequencing data reported in this paper is GSE66550.
AUTHOR CONTRIBUTIONS
G.Z., Y.Q., W.C.C., Q.D, A.L, and T.P. designed and performed experiments, and analyzed data. G.Z. and T.P. conceived the project. G.Z., C.Y. and C.H. designed the demethylase constructs. G.Z., Y.Q., W.C.C., A.L. and T.P. wrote the paper.
COMPETING FINANCIAL INTERESTS
Thermostable group II intron reverse transcriptase (TGIRT™) enzymes and methods for their use are the subject of patents and patent applications that have been licensed by the University of Texas at Austin and East Tennessee State University to InGex, LLC. A.M.L. and the University of Texas are minority equity holders in InGex, LLC, and A.M.L. and other present and former members of the Lambowitz laboratory receive royalty payments from sales of TGIRT™ enzymes and licensing of intellectual property.
REFERENCES
Associated Data
Abstract
Despite its biological importance, transfer RNA (tRNA) could not be adequately sequenced by standard methods due to abundant post-transcriptional modifications and stable structure, which interfere with cDNA synthesis. We achieve efficient and quantitative tRNA sequencing using engineered demethylases to remove base methylations and a highly processive thermostable group II intron reverse transcriptase to overcome these obstacles (DM-TGIRT-seq). Our method should be applicable to investigations of tRNA in all organisms.
High-throughput RNA sequencing (RNA-seq) has revolutionized our understanding of gene expression. Widely used RNA-seq methods start with adapter ligation and cDNA synthesis of biological RNA samples followed by PCR amplification to generate sequencing libraries. These standard methods work well for most cellular RNAs, such as mRNA, long ncRNA, miRNA, or fragments derived from rRNA, snRNA, and snoRNA. tRNA is the only class of cellular RNA for which the standard sequencing methods cannot yet be applied efficiently and quantitatively, although attempts have been made (e.g. ref ). Significant obstacles for the sequencing of tRNA include the presence of numerous post-transcriptional modifications and its stable and extensive secondary structure, which interfere with cDNA synthesis and adapter ligation. tRNAs are essential for cells and their synthesis is under stringent cellular control. Accumulating evidence shows that tRNA expression and mutations are associated with various diseases such as neurological pathologies and cancer development,. The lack of efficient and quantitative tRNA-seq methods has hindered biological studies of tRNA.
We applied two strategies to eliminate or substantially reduce the obstacles of tRNA modification and structure for efficient and quantitative tRNA sequencing (Fig. 1a). A DNA and RNA repair protein AlkB-derived enzyme mixture was first used to remove methylations at the Watson-Crick face. Three specific modifications, N1-methyladenosine (m1A), N3-methylcytosine (m3C), and N1-methylguanosine (m1G), are abundant in eukaryotic tRNAs and are particularly problematic for reverse transcriptases (RTs) by causing cDNA synthesis to stop or misincorporate. In mammals, known modifications of these types include m1A present in all tRNAs at position 58, m3C present in five tRNAs at position 32 and the variable loop and m1G present in about half of all tRNAs at positions 37 or 9. We applied two recombinant enzymes as a mixture to remove these three methylations in human tRNAs. The first was the wild-type enzyme AlkB (wtAlkB) from E. coli, which is known to very efficiently demethylate m1A and m3C in single-stranded nucleic acids as its DNA and RNA repair function,. Wild-type AlkB, however, works very poorly on m1G modification. Based on its known three-dimensional structure complexed with nucleic acids, we engineered AlkB to generate a specific mutant, D135S that efficiently converted m1G to G (Supplementary Figs. 1a,b). We optimized the reaction conditions and enzyme to RNA ratios using a mixture of the wild-type and the D135S AlkB proteins (Supplementary Figs. 1c,d). We were able to remove >80% of total m1A, m3C, and ~70% m1G without a noticeable change in tRNA quality (Fig. 1b). The remaining m1A or m1G may be more buried in the tRNA tertiary structure (m1A or m1G at position 9 of tRNAs) and thus not easily accessible to demethylase treatment without causing tRNA degradation.
Demethylase-thermostable group II intron RT tRNA sequencing (DM-TGIRT-seq)(a) Our method differs from standard RNA-seq in two ways. First, a demethylase mixture was used to remove m1A, m1G and m3C modifications located at the Watson-Crick face. Second, a thermostable group II intron RT (TGIRT) was used that is less sensitive to tRNA structure and adds RNA-seq adaptors by template-switching without RNA ligation. (b) Demethylation efficiency for total tRNA as measured by QQQ LC-MS. (c) RT reaction for both purified tRNA and total RNA as template with (+) or without (−) demethylase treatment. Blue line shows the gel region excised for library construction.
We used a thermostable group II intron reverse transcriptase (TGIRT) with high processivity to generate cDNA from highly structured tRNA. The TGIRT reaction does not require adapter ligation; it synthesizes cDNA by template-switching from the adaptor to the 3’ end of the target RNA-. Importantly, demethylase treatment markedly reduced the amount of RT stops at the m1A58 and m1G37 positions. At the same time, the amount of longer and full-length cDNAs was substantially increased after demethylase treatment of tRNA (Fig. 1c). Similar results regarding demethylase treatment were obtained when the RNA template was either total cellular RNA or gel-purified total tRNA (Fig. 1c). These results show that the demethylase treatment is effective in producing longer reads; this property will be crucial for the ability to adequately map mammalian tRNA expression where many tRNAs have just one or a handful of differences in their sequences,.
We performed Illumina sequencing using the libraries generated from tRNAs with and without the demethylase treatment. To facilitate our focus on tRNA at this time, the sequencing reads were mapped only to the genomic tRNA database, which contains 515 predicted tRNA genes distributed over 330 unique sequences and 110 predicted tRNA pseudogenes. In all, 6.6–15.7 million and 2.3–4.7 million reads were mapped to the genomic tRNA database when using gel-purified tRNA or total RNA as template, respectively (Supplementary Fig. 2a). These variations in read numbers were derived from sample handling, as similar proportions of reads were obtained for the added internal tRNA standards (Supplementary Fig. 2b). For the biological replicate from HEK293T cells, both untreated and treated samples showed similarly high reproducibility with r2-values from 0.985–0.991 (Supplementary Figs. 3a-d). As already indicated by the cDNA bands (Fig. 1c), the read length and in particular the proportion of reads corresponding to full-length tRNA is increased substantially upon demethylase treatment, as shown by the position plots along individual tRNA genes (Figs. 2a,b). The highest fraction of reads in a LeuAAG tRNA in the untreated samples were RT stops due to the m1G37 modification, whereas all stops at this position were removed with the corresponding increase of full-length tRNA reads upon demethylase treatment (Fig. 2a). Similarly, the substantial RT stop at m1G9 of a GlnCTG tRNA was markedly reduced upon demethylase treatment, together with a substantially increased generation of full-length reads (Fig. 2b). We still detected a strong stop at the m22G26 (N2,N2-dimethylguanosine) residue in LeuAAG tRNA that remained unchanged upon demethylase treatment. This result indicates that our demethylase mixture is not effective in removing m22G modifications, which are present in ~20% of tRNAs. Nevertheless, our results indicate that the demethylase treatment is very effective in producing longer reads; this property is crucial for the ability to adequately map the mammalian tRNAome at single base resolution.
Sequencing results(a) 5’ position sequencing plot of a LeuAAG tRNA containing an m1G37 modification. (b) 5’ position sequencing plot of a GlnTTG tRNA containing an m1G9 modification. For panels (a) and (b), m1A58 stops correspond to very short DNA fragments that are outside the range of our sequenced cDNA, and hence cannot be visualized here. (c) As expected, tRNA isoacceptor expression in HEK293T cells does not correlate with the gene copy number, since human tRNA expression is tissue specific. (d) Comparison of the array fluorescent signals and sequencing reads for the Arg-tRNA family; error bar, n = 4 ± SD. (e) Mismatch and stops with and without demethylase treatment. Shown are modification positions of m1A58 in ValCAC, m3C32 in ThrAGT, m1G37 in ProTGG, and m1G9 in GlnCTG. Star indicates stops for ValCAC corresponding to very short reads that were not determined in this experiment. n = 4 ± SD. (f) Sequence logo at the same modified positions as in (e), centered at the modified residue. Untreated (−) is at the top and the demethylases treated (+) is at the bottom.
We performed additional analysis to further demonstrate the usefulness of our sequencing method. Plotting each tRNA isoacceptor against their genomic gene copy number showed a poor correlation (Fig. 2c), consistent with the known tissue specific tRNA expression in humans13,. We compared the fraction read of the tRNAArg isoacceptors to the fluorescence hybridization signals of the Arg-tRNA probes from tRNA microarrays which were obtained through hybridization without the need for cDNA synthesis13,. The sequencing and array results showed the same trend of isoacceptor abundance, thus validating the quantitative nature of tRNA abundance obtained independently through sequencing and hybridization based approaches (Fig. 2d). We also compared RT stops and misincorporations at known modification positions with and without demethylase treatment. In the case of m1A58 in ValCAC and m3C32 in ThrAGT, the demethylases completely removed these modifications as demonstrated by the reversion to cognate sequence at these positions (Figs. 2e-f). In the case of m1G37 in ProTGG and m1G9 in GlnCTG, the demethylase treatment removed a large majority of the modification so the mismatch and stops were substantially reduced (Figs. 2e-f). Therefore, our DM-TGIRT-seq method can determine differences in the modification dynamics of m1A, m1G and m3C at single base resolution, as well as potentially infer positions of non-demethylated modifications.
We also examined the expression of unique tRNA genes from chromosome 6. Human chromosome 6 contains approximately one third of all tRNA genes, and over 150 genes are clustered within a 2.7 Mbp region near the class I major histocompatibility (MHC) genes. We found that the tRNA expression levels within the tRNA gene cluster were higher compared to the tRNA genes outside of the cluster (Supplementary Fig. 4). The expression level of tRNA genes in the cluster was uneven, suggesting that the expression of tRNA genes was not coordinated throughout the entire cluster in HEK293T cells.
In summary, despite its biological importance, tRNA has not been adequately studied at the transcriptome level due to its unique characteristics in modification and structure. Our approach described here makes efficient and quantitative tRNA-seq feasible, and has an additional advantage of being able to study modifications in a high throughput manner. Our method is accessible to any lab and offers broad potentials in basic research and diagnostic applications.
METHODS
Design of the D135S AlkB mutant
The substrates for the wild-type AlkB (wtAlkB), m1A and m3C, are positively charged. A close examination of the structures of substrate-bound AlkB revealed that these positively charged substrates are favorably positioned in the active site by interacting with the negatively charged carboxylic group of Asp135 (D135) (Supplementary Fig. 1a). We reasoned that single mutants at the D135 position might allow better accommodation of m1G in the active site (Supplementary Fig. 1a), thereby enhancing the demethylation efficiency of m1G. Truncated but catalytically active AlkB-ΔN11 was used and mutants with single amino acid replacement at D135 were screened. Of several mutations tested, AlkB D135S gave the best demethylation yields towards m1G (Supplementary Fig. 1b). In the AlkB-m1G structure model constructed by computationally mutating the AlkB structure (Supplementary Fig. 1a), the shorter side chain of S135 seems to allow more room to accommodate m1G while sustaining the crucial hydrogen bond with m1G, which may explain the improvement in activity.
We also performed pH-activity profiling for the demethylation of m1A and m1G in tRNA. Decreasing demethylation activity was observed with increasing pH for both m1A and m1G substrates (Supplementary Fig. 1c). Therefore, we chose pH 5.0 for detailed kinetic analysis and subsequent experiments.
Cloning, expression and purification of AlkB and mutant
A truncated AlkB with deletion of the amino (N)-terminal 11 amino acids was cloned into a pET30a vector (Novagen) and overexpressed in E. coli BL21(DE3). The proteins were purified following procedures published previously. Briefly, cells were grown at 37 °C in the presence of 50 μM kanamycin until the OD600 reached 0.6–0.8. After the addition of IPTG (1 mM) and FeSO4 (5 μM), the cells were incubated for an additional 4 hr at 30 °C. Cells were collected, pelleted and then resuspended in lysis buffer (10 mM Tris pH 7.4, 300 mM NaCl, 5% glycerol, 2 mM CaCl2, 10 mM MgCl2, 10 mM 2-mercaptoethanol). The cells were lysed by sonication and then centrifuged at 17,418 rcf for 20 min. The soluble proteins were first purified using a Ni-NTA superflow cartridge (Qiagen), and then further purified by ion-exchange (Mono S GL, GE Healthcare) and gel-filtration (Superdex-200, Pharmacia) chromatography. All protein purification steps were performed at 4 °C. The Asp135 to Ser mutation was introduced using the QuikChange Site-Directed Mutagenesis Kit (Agilent). The mutant protein was expressed and purified following the same procedure as that of the wild-type protein.
Mammalian cell culture and RNA preparations
Human embryonic kidney cell line HEK293T (CRL-11268) were obtained from American Type Culture Collection (ATCC) and used without further validation. Cells were cultured in DMEM (Thermo) media supplemented with 10% FBS and 1% 100 × Pen Strep (Gibco). Cells were routinely checked for mycoplasma contamination every 3–6 months, using Universal Mycoplasma Detection Kit (ATCC).
Total RNA was isolated by using a mirVana miRNA Isolation Kit (Life Technologies). Purified total RNA was premixed with the T7 RNA polymerase transcripts of three tRNA standards13 (0.01 pmol each standard per μg of total RNA) and deacylated by incubating in 0.1 M Tris-HCl, pH 9 at 37°C for 45 min. Although not necessary for studies of mature tRNAs, which all end with 3’CCA, deacylated RNAs with or without demethylation treatment could be treated with T4 polynucleotide kinase (Epicentre) at 37C° for 30 min to further warrant a free 3’ hydroxyl group for template-switching.
When necessary, total tRNA was subsequently isolated using a denaturing 10% polyacrylamide gel followed by passive gel elution and ethanol precipitation.
Demethylation reactions
Demethylation activity assay was performed for either gel purified total tRNA or total cellular RNA. For total tRNA, 100 μl of reaction mixture containing 1 μg of tRNA (~40 pmol) was treated with 2× molar ratio of wtAlkB (80 pmol) and 4× molar ratio of D135S mutant (160 pmol). For total cellular RNA, 5 μg of total RNA (estimated to contain ~40 pmol tRNA) was treated with 4× molar ratio of wtAlkB (160 pmol) and 4× molar ratio of D135S (200 pmol). More demethylases were used for total RNA to alleviate potential interference by rRNA and mRNA in the reaction. The reaction buffer contained 300 mM KCl, 2 mM MgCl2, 50 μM of (NH4)2Fe(SO4)2·6H2O, 300 μM 2-ketoglutarate (2-KG), 2 mM L-ascorbic acid, 50 μg/mL BSA, 50 mM MES buffer (pH 5.0). In both cases, the reaction was incubated for 2 hr at room temperature, and quenched by the addition of 5 mM EDTA. After phenol–chloroform extraction, tRNA was recovered by ethanol precipitation.
Thermostable group II intron RT template-switching
Template-switching reactions were performed as described-. Briefly, we used an initial template-primer substrate consisting of a 41-nt RNA oligonucleotide (5’-AGA UCG GAA GAG CAC ACG UCU AGU UCU ACA GUC CGA CGA UC/3SpC3/-3’) that contains Illumina Read1 and Read2 primer-binding sites and a 3’ blocking group (three carbon spacer; Integrated DNA technologies, inc.) annealed to a complementary 32P-labeled DNA primer with a single-nucleotide 3’ overhang, T, which facilitates the template switch to full-length tRNAs that mostly contain a 3’ CCA end. For TGIRT template-switching reactions, typically 100 ng of demethylated tRNAs or 1 μg of demethylated total RNA were mixed with the initial template-primer substrate (100 nM) and 500 nM TGIRT (GsI-IIC MalE rigid fusion RT) in reaction medium containing 450 mM NaCl, 5 mM MgCl2, 20 mM Tris-HCl, pH 7.5, and 5 mM DTT. The reactions were pre-incubated at room temperature for 30 min, initiated by adding 25 mM dNTPs (an equimolar mix of 25 mM dATP, dCTP, dGTP and dTTP) to a final concentration of 1 mM and incubating at 60°C for 30 min. The reactions were terminated by adding 5 M NaOH to a final concentration of 0.25 M, incubating at 95°C for 3 min, and neutralizing with 5 M HCl. The cDNAs resulting from template-switching were analyzed in a denaturing 6% polyacrylamide gel, electroeluted using a D-tube Dialyzer Maxi with MWCO of 6–8 kDa (EMD Millipore), and ethanol precipitated with 0.3 M sodium acetate, pH 5.2, in the presence of 25 μg of linear acrylamide (Life Technologies) carrier. The purified cDNAs were then circularized with CircLigase II (Epicentre) using the manufacturer's protocol with an extended incubation time of 5 hr at 60°C, extracted with phenol-chloroform-isoamyl alcohol (25:24:1), ethanol precipitated and amplified with Phusion-HF (Thermo Scientific) using Illumina multiplex (5’- AAT GAT ACG GCG ACC ACC GAG ATC TAC ACG TTC AGA GTT CTA CAG TCC GAC GAT C -3’) and barcode (5’- CAA GCA GAA GAC GGC ATA CGA GAT BARCODE GTG ACT GGA GTT CAG ACG TGT GCT CTT CCG ATC T -3’) primers for 12 cycles of 98°C for 5 sec, 60°C for 10 sec and 72°C for 10 sec. The PCR products were sequenced on an Illumina HiSeq system.
Quantitative analysis of modification levels using LC-MS/MS
Quantitative analysis of modified nucleotides was done as previously described. Briefly, 100 ng of tRNA was digested by nuclease P1 (2 U) in 30 μl of buffer containing 25 mM NaCl, and 2.5 mM ZnCl2 at 37°C for 1 hr, followed by the addition of NH4HCO3 (100 mM) and alkaline phosphatase (0.5 U). After an additional incubation at 37°C for 1 hr, the solution was diluted to 60 μl, and 10 μl of the solution was injected for LCMS/MS. Nucleosides were separated by reverse phase ultra-performance liquid chromatography on a C18 column with online mass spectrometry detection using an Agilent 6410 QQQ triple-quadruple LC mass spectrometer in positive electrospray ionization mode. The nucleosides were quantified using the nucleoside to base ion mass transitions of 282 to 150 (m1A), 268 to 166 (A), 298 to (m1G), 284 to 152 (G), 258 to 126 (m3C) and 244 to 112 (C). Quantification was performed by comparison with the standard curve obtained from pure nucleoside standards running in the same batch of samples. Modification levels were compared by the ratios of methylated base (m1A, m1G, m3C) over regular base (A, G, C).
tRNA microarrays
The tRNA microarray assay consists of four steps starting from purified tRNA or total RNA without the need of cDNA synthesis: (i) deacylation to remove all 3’ attached amino acids, (ii) selective fluorophore labeling of tRNA using oligonucleotide ligation with T4 DNA ligase to the 3’CCA of all tRNAs, (iii) hybridization, and (iv) data analysis. The reproducibility of the tRNA microarray method and validation of the results have been extensively described previously13,.
Sequencing read mapping
Sequencing reads were aligned using Bowtie to a modified hg19 genomic tRNA database. A single mismatch was allowed in order to identify potential modification misincorporations at a modification site. Briefly, a tRNA library was adapted from the tRNAScan-SE library by appending CCA to tRNAs from the genomic tRNA database (http://gtrnadb.ucsc.edu/Hsapi19/). Isodecoders with identical scores were consolidated for ease of identity assignment, decreasing the number of reference genes and pseudogenes from 625 to 462. Prior to mapping, reads were processed using Trimmomatic v0.32. Sequences greater than 15 bp were then aligned to the aforementioned culled tRNA library using Bowtie2 with sensitive options. Reads mapping to multiple isodecoders due to length of fragment were discarded. Modification fractions were determined by analyzing at a putative position ‘n’ the number of correct reads, the number of misincorporations, as well as the number of reads stopped at the n+1 position.
Supplementary Material
1
ACKNOWLEDGEMENTS
This work was supported by US National Institutes of Health (NIH) grant DP1GM105386 to T.P, NIH grants R01GM37949 and GM37951 to A.M.L, K01HG006699 to Q.D., NIH MCB Training Grant (T32GM007183) to W.C.C, and a Chicago Biomedical Consortium Postdoctoral Research Grant Award to G.Z. We thank L.Zhang, W.J.Chen, and I.A.Gagnon for technical assistance. C.H. is supported by US Howard Hughes Medical Institute as an investigator.
Footnotes
ACCESSION NUMBERS
The GEO accession number for the sequencing data reported in this paper is GSE66550.
AUTHOR CONTRIBUTIONS
G.Z., Y.Q., W.C.C., Q.D, A.L, and T.P. designed and performed experiments, and analyzed data. G.Z. and T.P. conceived the project. G.Z., C.Y. and C.H. designed the demethylase constructs. G.Z., Y.Q., W.C.C., A.L. and T.P. wrote the paper.
COMPETING FINANCIAL INTERESTS
Thermostable group II intron reverse transcriptase (TGIRT™) enzymes and methods for their use are the subject of patents and patent applications that have been licensed by the University of Texas at Austin and East Tennessee State University to InGex, LLC. A.M.L. and the University of Texas are minority equity holders in InGex, LLC, and A.M.L. and other present and former members of the Lambowitz laboratory receive royalty payments from sales of TGIRT™ enzymes and licensing of intellectual property.