Browsing by Subject "Nucleic Acid Conformation"
Now showing 1 - 16 of 16
- Results Per Page
- Sort Options
Item Open Access A discrete chromatin loop in the mouse Tcra-Tcrd locus shapes the TCRδ and TCRα repertoires.(Nat Immunol, 2015-10) Chen, Liang; Carico, Zachary; Shih, Han-Yu; Krangel, Michael SThe locus encoding the T cell antigen receptor (TCR) α-chain and δ-chain (Tcra-Tcrd) undergoes recombination of its variable-diversity-joining (V(D)J) segments in CD4(-)CD8(-) double-negative thymocytes and CD4(+)CD8(+) double-positive thymocytes to generate diverse TCRδ repertoires and TCRα repertoires, respectively. Here we identified a chromatin-interaction network in the Tcra-Tcrd locus in double-negative thymocytes that was formed by interactions between binding elements for the transcription factor CTCF. Disruption of a discrete chromatin loop encompassing the D, J and constant (C) segments of Tcrd allowed a single V segment to frequently contact and rearrange to D and J segments and dominate the adult TCRδ repertoire. Disruption of this loop also narrowed the TCRα repertoire, which, we believe, followed as a consequence of the restricted TCRδ repertoire. Hence, a single CTCF-mediated chromatin loop directly regulated TCRδ diversity and indirectly regulated TCRα diversity.Item Open Access A Novel Genetic Variant in Long Non-coding RNA Gene NEXN-AS1 is Associated with Risk of Lung Cancer.(Scientific reports, 2016-10-07) Yuan, Hua; Liu, Hongliang; Liu, Zhensheng; Owzar, Kouros; Han, Younghun; Su, Li; Wei, Yongyue; Hung, Rayjean J; McLaughlin, John; Brhane, Yonathan; Brennan, Paul; Bickeboeller, Heike; Rosenberger, Albert; Houlston, Richard S; Caporaso, Neil; Landi, Maria Teresa; Heinrich, Joachim; Risch, Angela; Christiani, David C; Gümüş, Zeynep H; Klein, Robert J; Amos, Christopher I; Wei, QingyiLung cancer etiology is multifactorial, and growing evidence has indicated that long non-coding RNAs (lncRNAs) are important players in lung carcinogenesis. We performed a large-scale meta-analysis of 690,564 SNPs in 15,531 autosomal lncRNAs by using datasets from six previously published genome-wide association studies (GWASs) from the Transdisciplinary Research in Cancer of the Lung (TRICL) consortium in populations of European ancestry. Previously unreported significant SNPs (P value < 1 × 10-7) were further validated in two additional independent lung cancer GWAS datasets from Harvard University and deCODE. In the final meta-analysis of all eight GWAS datasets with 17,153 cases and 239,337 controls, a novel risk SNP rs114020893 in the lncRNA NEXN-AS1 region at 1p31.1 remained statistically significant (odds ratio = 1.17; 95% confidence interval = 1.11-1.24; P = 8.31 × 10-9). In further in silico analysis, rs114020893 was predicted to change the secondary structure of the lncRNA. Our finding indicates that SNP rs114020893 of NEXN-AS1 at 1p31.1 may contribute to lung cancer susceptibility.Item Open Access Computational Methods for RNA Structure Validation and Improvement.(Methods Enzymol, 2015) Jain, Swati; Richardson, David C; Richardson, Jane SWith increasing recognition of the roles RNA molecules and RNA/protein complexes play in an unexpected variety of biological processes, understanding of RNA structure-function relationships is of high current importance. To make clean biological interpretations from three-dimensional structures, it is imperative to have high-quality, accurate RNA crystal structures available, and the community has thoroughly embraced that goal. However, due to the many degrees of freedom inherent in RNA structure (especially for the backbone), it is a significant challenge to succeed in building accurate experimental models for RNA structures. This chapter describes the tools and techniques our research group and our collaborators have developed over the years to help RNA structural biologists both evaluate and achieve better accuracy. Expert analysis of large, high-resolution, quality-conscious RNA datasets provides the fundamental information that enables automated methods for robust and efficient error diagnosis in validating RNA structures at all resolutions. The even more crucial goal of correcting the diagnosed outliers has steadily developed toward highly effective, computationally based techniques. Automation enables solving complex issues in large RNA structures, but cannot circumvent the need for thoughtful examination of local details, and so we also provide some guidance for interpreting and acting on the results of current structure validation for RNA.Item Open Access Cyanobacterial ribosomal RNA genes with multiple, endonuclease-encoding group I introns.(BMC evolutionary biology, 2007-09-08) Haugen, P; Bhattacharya, D; Palmer, JD; Turner, S; Lewis, LA; Pryer, KMGroup I introns are one of the four major classes of introns as defined by their distinct splicing mechanisms. Because they catalyze their own removal from precursor transcripts, group I introns are referred to as autocatalytic introns. Group I introns are common in fungal and protist nuclear ribosomal RNA genes and in organellar genomes. In contrast, they are rare in all other organisms and genomes, including bacteria.Here we report five group I introns, each containing a LAGLIDADG homing endonuclease gene (HEG), in large subunit (LSU) rRNA genes of cyanobacteria. Three of the introns are located in the LSU gene of Synechococcus sp. C9, and the other two are in the LSU gene of Synechococcus lividus strain C1. Phylogenetic analyses show that these introns and their HEGs are closely related to introns and HEGs located at homologous insertion sites in organellar and bacterial rDNA genes. We also present a compilation of group I introns with homing endonuclease genes in bacteria.We have discovered multiple HEG-containing group I introns in a single bacterial gene. To our knowledge, these are the first cases of multiple group I introns in the same bacterial gene (multiple group I introns have been reported in at least one phage gene and one prophage gene). The HEGs each contain one copy of the LAGLIDADG motif and presumably function as homodimers. Phylogenetic analysis, in conjunction with their patchy taxonomic distribution, suggests that these intron-HEG elements have been transferred horizontally among organelles and bacteria. However, the mode of transfer and the nature of the biological connections among the intron-containing organisms are unknown.Item Open Access Development of universal antidotes to control aptamer activity.(Nat Med, 2009-10) Oney, S; Lam, RTS; Bompiani, KM; Blake, CM; Quick, G; Heidel, JD; Liu, JYC; Mack, BC; Davis, ME; Leong, KW; Sullenger, BAWith an ever increasing number of people taking numerous medications, the need to safely administer drugs and limit unintended side effects has never been greater. Antidote control remains the most direct means to counteract acute side effects of drugs, but, unfortunately, it has been challenging and cost prohibitive to generate antidotes for most therapeutic agents. Here we describe the development of a set of antidote molecules that are capable of counteracting the effects of an entire class of therapeutic agents based upon aptamers. These universal antidotes exploit the fact that, when systemically administered, aptamers are the only free extracellular oligonucleotides found in circulation. We show that protein- and polymer-based molecules that capture oligonucleotides can reverse the activity of several aptamers in vitro and counteract aptamer activity in vivo. The availability of universal antidotes to control the activity of any aptamer suggests that aptamers may be a particularly safe class of therapeutics.Item Open Access Identification and utilization of arbitrary correlations in models of recombination signal sequences.(Genome Biol, 2002) Cowell, Lindsay G; Davila, Marco; Kepler, Thomas B; Kelsoe, GarnettBACKGROUND: A significant challenge in bioinformatics is to develop methods for detecting and modeling patterns in variable DNA sequence sites, such as protein-binding sites in regulatory DNA. Current approaches sometimes perform poorly when positions in the site do not independently affect protein binding. We developed a statistical technique for modeling the correlation structure in variable DNA sequence sites. The method places no restrictions on the number of correlated positions or on their spatial relationship within the site. No prior empirical evidence for the correlation structure is necessary. RESULTS: We applied our method to the recombination signal sequences (RSS) that direct assembly of B-cell and T-cell antigen-receptor genes via V(D)J recombination. The technique is based on model selection by cross-validation and produces models that allow computation of an information score for any signal-length sequence. We also modeled RSS using order zero and order one Markov chains. The scores from all models are highly correlated with measured recombination efficiencies, but the models arising from our technique are better than the Markov models at discriminating RSS from non-RSS. CONCLUSIONS: Our model-development procedure produces models that estimate well the recombinogenic potential of RSS and are better at RSS recognition than the order zero and order one Markov models. Our models are, therefore, valuable for studying the regulation of both physiologic and aberrant V(D)J recombination. The approach could be equally powerful for the study of promoter and enhancer elements, splice sites, and other DNA regulatory sites that are highly variable at the level of individual nucleotide positions.Item Open Access In vivo architecture of the telomerase RNA catalytic core in Trypanosoma brucei.(Nucleic acids research, 2021-12) Dey, Abhishek; Monroy-Eklund, Anais; Klotz, Kaitlin; Saha, Arpita; Davis, Justin; Li, Bibo; Laederach, Alain; Chakrabarti, KausikTelomerase is a unique ribonucleoprotein (RNP) reverse transcriptase that utilizes its cognate RNA molecule as a template for telomere DNA repeat synthesis. Telomerase contains the reverse transcriptase protein, TERT and the template RNA, TR, as its core components. The 5'-half of TR forms a highly conserved catalytic core comprising of the template region and adjacent domains necessary for telomere synthesis. However, how telomerase RNA folding takes place in vivo has not been fully understood due to low abundance of the native RNP. Here, using unicellular pathogen Trypanosoma brucei as a model, we reveal important regional folding information of the native telomerase RNA core domains, i.e. TR template, template boundary element, template proximal helix and Helix IV (eCR4-CR5) domain. For this purpose, we uniquely combined in-cell probing with targeted high-throughput RNA sequencing and mutational mapping under three conditions: in vivo (in WT and TERT-/- cells), in an immunopurified catalytically active telomerase RNP complex and ex vivo (deproteinized). We discover that TR forms at least two different conformers with distinct folding topologies in the insect and mammalian developmental stages of T. brucei. Also, TERT does not significantly affect the RNA folding in vivo, suggesting that the telomerase RNA in T. brucei exists in a conformationally preorganized stable structure. Our observed differences in RNA (TR) folding at two distinct developmental stages of T. brucei suggest that important conformational changes are a key component of T. brucei development.Item Open Access Infrared Spectroscopic Observation of a G-C+ Hoogsteen Base Pair in the DNA:TATA-Box Binding Protein Complex Under Solution Conditions.(Angewandte Chemie (International ed. in English), 2019-08) Stelling, Allison L; Liu, Amy Y; Zeng, Wenjie; Salinas, Raul; Schumacher, Maria A; Al-Hashimi, Hashim MHoogsteen DNA base pairs (bps) are an alternative base pairing to canonical Watson-Crick bps and are thought to play important biochemical roles. Hoogsteen bps have been reported in a handful of X-ray structures of protein-DNA complexes. However, there are several examples of Hoogsteen bps in crystal structures that form Watson-Crick bps when examined under solution conditions. Furthermore, Hoogsteen bps can sometimes be difficult to resolve in DNA:protein complexes by X-ray crystallography due to ambiguous electron density and by solution-state NMR spectroscopy due to size limitations. Here, using infrared spectroscopy, we report the first direct solution-state observation of a Hoogsteen (G-C+ ) bp in a DNA:protein complex under solution conditions with specific application to DNA-bound TATA-box binding protein. These results support a previous assignment of a G-C+ Hoogsteen bp in the complex, and indicate that Hoogsteen bps do indeed exist under solution conditions in DNA:protein complexes.Item Open Access Insights into Watson-Crick/Hoogsteen breathing dynamics and damage repair from the solution structure and dynamic ensemble of DNA duplexes containing m1A.(Nucleic acids research, 2017-05) Sathyamoorthy, Bharathwaj; Shi, Honglue; Zhou, Huiqing; Xue, Yi; Rangadurai, Atul; Merriman, Dawn K; Al-Hashimi, Hashim MIn the canonical DNA double helix, Watson-Crick (WC) base pairs (bps) exist in dynamic equilibrium with sparsely populated (∼0.02-0.4%) and short-lived (lifetimes ∼0.2-2.5 ms) Hoogsteen (HG) bps. To gain insights into transient HG bps, we used solution-state nuclear magnetic resonance spectroscopy, including measurements of residual dipolar couplings and molecular dynamics simulations, to examine how a single HG bp trapped using the N1-methylated adenine (m1A) lesion affects the structural and dynamic properties of two duplexes. The solution structure and dynamic ensembles of the duplexes reveals that in both cases, m1A forms a m1A•T HG bp, which is accompanied by local and global structural and dynamic perturbations in the double helix. These include a bias toward the BI backbone conformation; sugar repuckering, major-groove directed kinking (∼9°); and local melting of neighboring WC bps. These results provide atomic insights into WC/HG breathing dynamics in unmodified DNA duplexes as well as identify structural and dynamic signatures that could play roles in m1A recognition and repair.Item Open Access Mitochondrial mutations in adenoid cystic carcinoma of the salivary glands.(PLoS One, 2009-12-30) Mithani, Suhail K; Shao, Chunbo; Tan, Marietta; Smith, Ian M; Califano, Joseph A; El-Naggar, Adel K; Ha, Patrick KBACKGROUND: The MitoChip v2.0 resequencing array is an array-based technique allowing for accurate and complete sequencing of the mitochondrial genome. No studies have investigated mitochondrial mutation in salivary gland adenoid cystic carcinomas. METHODOLOGY: The entire mitochondrial genome of 22 salivary gland adenoid cystic carcinomas (ACC) of salivary glands and matched leukocyte DNA was sequenced to determine the frequency and distribution of mitochondrial mutations in ACC tumors. PRINCIPAL FINDINGS: Seventeen of 22 ACCs (77%) carried mitochondrial mutations, ranging in number from 1 to 37 mutations. A disproportionate number of mutations occurred in the D-loop. Twelve of 17 tumors (70.6%) carried mutations resulting in amino acid changes of translated proteins. Nine of 17 tumors (52.9%) with a mutation carried an amino acid changing mutation in the nicotinamide adenine dinucleotide dehydrogenase (NADH) complex. CONCLUSIONS/SIGNIFICANCE: Mitochondrial mutation is frequent in salivary ACCs. The high incidence of amino acid changing mutations implicates alterations in aerobic respiration in ACC carcinogenesis. D-loop mutations are of unclear significance, but may be associated with alterations in transcription or replication.Item Open Access New insights into Hoogsteen base pairs in DNA duplexes from a structure-based survey.(Nucleic Acids Res, 2015-04-20) Zhou, H; Hintze, BJ; Kimsey, IJ; Sathyamoorthy, B; Yang, S; Richardson, JS; Al-Hashimi, HMHoogsteen (HG) base pairs (bps) provide an alternative pairing geometry to Watson-Crick (WC) bps and can play unique functional roles in duplex DNA. Here, we use structural features unique to HG bps (syn purine base, HG hydrogen bonds and constricted C1'-C1' distance across the bp) to search for HG bps in X-ray structures of DNA duplexes in the Protein Data Bank. The survey identifies 106 A•T and 34 G•C HG bps in DNA duplexes, many of which are undocumented in the literature. It also uncovers HG-like bps with syn purines lacking HG hydrogen bonds or constricted C1'-C1' distances that are analogous to conformations that have been proposed to populate the WC-to-HG transition pathway. The survey reveals HG preferences similar to those observed for transient HG bps in solution by nuclear magnetic resonance, including stronger preferences for A•T versus G•C bps, TA versus GG steps, and also suggests enrichment at terminal ends with a preference for 5'-purine. HG bps induce small local perturbations in neighboring bps and, surprisingly, a small but significant degree of DNA bending (∼14°) directed toward the major groove. The survey provides insights into the preferences and structural consequences of HG bps in duplex DNA.Item Open Access New tools provide a second look at HDV ribozyme structure, dynamics and cleavage.(Nucleic Acids Res, 2014-11-10) Kapral, Gary J; Jain, Swati; Noeske, Jonas; Doudna, Jennifer A; Richardson, David C; Richardson, Jane SThe hepatitis delta virus (HDV) ribozyme is a self-cleaving RNA enzyme essential for processing viral transcripts during rolling circle viral replication. The first crystal structure of the cleaved ribozyme was solved in 1998, followed by structures of uncleaved, mutant-inhibited and ion-complexed forms. Recently, methods have been developed that make the task of modeling RNA structure and dynamics significantly easier and more reliable. We have used ERRASER and PHENIX to rebuild and re-refine the cleaved and cis-acting C75U-inhibited structures of the HDV ribozyme. The results correct local conformations and identify alternates for RNA residues, many in functionally important regions, leading to improved R values and model validation statistics for both structures. We compare the rebuilt structures to a higher resolution, trans-acting deoxy-inhibited structure of the ribozyme, and conclude that although both inhibited structures are consistent with the currently accepted hammerhead-like mechanism of cleavage, they do not add direct structural evidence to the biochemical and modeling data. However, the rebuilt structures (PDBs: 4PR6, 4PRF) provide a more robust starting point for research on the dynamics and catalytic mechanism of the HDV ribozyme and demonstrate the power of new techniques to make significant improvements in RNA structures that impact biologically relevant conclusions.Item Open Access Quantification of DNA cleavage specificity in Hi-C experiments.(Nucleic Acids Res, 2016-01-08) Meluzzi, Dario; Arya, GauravHi-C experiments produce large numbers of DNA sequence read pairs that are typically analyzed to deduce genomewide interactions between arbitrary loci. A key step in these experiments is the cleavage of cross-linked chromatin with a restriction endonuclease. Although this cleavage should happen specifically at the enzyme's recognition sequence, an unknown proportion of cleavage events may involve other sequences, owing to the enzyme's star activity or to random DNA breakage. A quantitative estimation of these non-specific cleavages may enable simulating realistic Hi-C read pairs for validation of downstream analyses, monitoring the reproducibility of experimental conditions and investigating biophysical properties that correlate with DNA cleavage patterns. Here we describe a computational method for analyzing Hi-C read pairs to estimate the fractions of cleavages at different possible targets. The method relies on expressing an observed local target distribution downstream of aligned reads as a linear combination of known conditional local target distributions. We validated this method using Hi-C read pairs obtained by computer simulation. Application of the method to experimental Hi-C datasets from murine cells revealed interesting similarities and differences in patterns of cleavage across the various experiments considered.Item Open Access Sensitive and precise quantification of insulin-like mRNA expression in Caenorhabditis elegans.(PLoS One, 2011-03-22) Baugh, L Ryan; Kurhanewicz, Nicole; Sternberg, Paul WInsulin-like signaling regulates developmental arrest, stress resistance and lifespan in the nematode Caenorhabditis elegans. However, the genome encodes 40 insulin-like peptides, and the regulation and function of individual peptides is largely uncharacterized. We used the nCounter platform to measure mRNA expression of all 40 insulin-like peptides as well as the insulin-like receptor daf-2, its transcriptional effector daf-16, and the daf-16 target gene sod-3. We validated the platform using 53 RNA samples previously characterized by high density oligonucleotide microarray analysis. For this set of genes and the standard nCounter protocol, sensitivity and precision were comparable between the two platforms. We optimized conditions of the nCounter assay by varying the mass of total RNA used for hybridization, thereby increasing sensitivity up to 50-fold and reducing the median coefficient of variation as much as 4-fold. We used deletion mutants to demonstrate specificity of the assay, and we used optimized conditions to assay insulin-like gene expression throughout the C. elegans life cycle. We detected expression for nearly all insulin-like genes and find that they are expressed in a variety of distinct patterns suggesting complexity of regulation and specificity of function. We identified insulin-like genes that are specifically expressed during developmental arrest, larval development, adulthood and embryogenesis. These results demonstrate that the nCounter platform provides a powerful approach to analyzing insulin-like gene expression dynamics, and they suggest hypotheses about the function of individual insulin-like genes.Item Open Access Surface-enhanced Raman scattering plasmonic enhancement using DNA origami-based complex metallic nanostructures.(Nano letters, 2014-01) Pilo-Pais, M; Watson, A; Demers, S; LaBean, TH; Finkelstein, GDNA origami is a novel self-assembly technique allowing one to form various two-dimensional shapes and position matter with nanometer accuracy. We use DNA origami templates to engineer surface-enhanced Raman scattering substrates. Specifically, gold nanoparticles were selectively placed on the corners of rectangular origami and subsequently enlarged via solution-based metal deposition. The resulting assemblies exhibit "hot spots" of enhanced electromagnetic field between the nanoparticles. We observed a significant Raman signal enhancement from molecules covalently attached to the assemblies, as compared to control nanoparticle samples that lack interparticle hot spots. Furthermore, Raman molecules are used to map out the hot spots' distribution, as they are burned when experiencing a threshold electric field. Our method opens up the prospects of using DNA origami to rationally engineer and assemble plasmonic structures for molecular spectroscopy.Item Open Access Sustained-input switches for transcription factors and microRNAs are central building blocks of eukaryotic gene circuits.(Genome Biol, 2013-08-23) Megraw, Molly; Mukherjee, Sayan; Ohler, UweWaRSwap is a randomization algorithm that for the first time provides a practical network motif discovery method for large multi-layer networks, for example those that include transcription factors, microRNAs, and non-regulatory protein coding genes. The algorithm is applicable to systems with tens of thousands of genes, while accounting for critical aspects of biological networks, including self-loops, large hubs, and target rearrangements. We validate WaRSwap on a newly inferred regulatory network from Arabidopsis thaliana, and compare outcomes on published Drosophila and human networks. Specifically, sustained input switches are among the few over-represented circuits across this diverse set of eukaryotes.