Browsing by Subject "DNA, Intergenic"
Now showing 1 - 7 of 7
Results Per Page
Sort Options
Item Open Access A functional analysis of the spacer of V(D)J recombination signal sequences.(PLoS Biol, 2003-10) Lee, Alfred Ian; Fugmann, Sebastian D; Cowell, Lindsay G; Ptaszek, Leon M; Kelsoe, Garnett; Schatz, David GDuring lymphocyte development, V(D)J recombination assembles antigen receptor genes from component V, D, and J gene segments. These gene segments are flanked by a recombination signal sequence (RSS), which serves as the binding site for the recombination machinery. The murine Jbeta2.6 gene segment is a recombinationally inactive pseudogene, but examination of its RSS reveals no obvious reason for its failure to recombine. Mutagenesis of the Jbeta2.6 RSS demonstrates that the sequences of the heptamer, nonamer, and spacer are all important. Strikingly, changes solely in the spacer sequence can result in dramatic differences in the level of recombination. The subsequent analysis of a library of more than 4,000 spacer variants revealed that spacer residues of particular functional importance are correlated with their degree of conservation. Biochemical assays indicate distinct cooperation between the spacer and heptamer/nonamer along each step of the reaction pathway. The results suggest that the spacer serves not only to ensure the appropriate distance between the heptamer and nonamer but also regulates RSS activity by providing additional RAG:RSS interaction surfaces. We conclude that while RSSs are defined by a "digital" requirement for absolutely conserved nucleotides, the quality of RSS function is determined in an "analog" manner by numerous complex interactions between the RAG proteins and the less-well conserved nucleotides in the heptamer, the nonamer, and, importantly, the spacer. Those modulatory effects are accurately predicted by a new computational algorithm for "RSS information content." The interplay between such binary and multiplicative modes of interactions provides a general model for analyzing protein-DNA interactions in various biological systems.Item Open Access Identification and utilization of arbitrary correlations in models of recombination signal sequences.(Genome Biol, 2002) Cowell, Lindsay G; Davila, Marco; Kepler, Thomas B; Kelsoe, GarnettBACKGROUND: A significant challenge in bioinformatics is to develop methods for detecting and modeling patterns in variable DNA sequence sites, such as protein-binding sites in regulatory DNA. Current approaches sometimes perform poorly when positions in the site do not independently affect protein binding. We developed a statistical technique for modeling the correlation structure in variable DNA sequence sites. The method places no restrictions on the number of correlated positions or on their spatial relationship within the site. No prior empirical evidence for the correlation structure is necessary. RESULTS: We applied our method to the recombination signal sequences (RSS) that direct assembly of B-cell and T-cell antigen-receptor genes via V(D)J recombination. The technique is based on model selection by cross-validation and produces models that allow computation of an information score for any signal-length sequence. We also modeled RSS using order zero and order one Markov chains. The scores from all models are highly correlated with measured recombination efficiencies, but the models arising from our technique are better than the Markov models at discriminating RSS from non-RSS. CONCLUSIONS: Our model-development procedure produces models that estimate well the recombinogenic potential of RSS and are better at RSS recognition than the order zero and order one Markov models. Our models are, therefore, valuable for studying the regulation of both physiologic and aberrant V(D)J recombination. The approach could be equally powerful for the study of promoter and enhancer elements, splice sites, and other DNA regulatory sites that are highly variable at the level of individual nucleotide positions.Item Open Access Identifying multiple origins of polyploid taxa: a multilocus study of the hybrid cloak fern (Astrolepis integerrima; Pteridaceae).(American journal of botany, 2012-11) Beck, James B; Allison, James R; Pryer, Kathleen M; Windham, Michael DPREMISE OF THE STUDY: Molecular studies have shown that multiple origins of polyploid taxa are the rule rather than the exception. To understand the distribution and ecology of polyploid species and the evolutionary significance of polyploidy in general, it is important to delineate these independently derived lineages as accurately as possible. Although gene flow among polyploid lineages and backcrossing to their diploid parents often confound this process, such post origin gene flow is very infrequent in asexual polyploids. In this study, we estimate the number of independent origins of the apomictic allopolyploid fern Astrolepis integerrima, a morphologically heterogeneous species most common in the southwestern United States and Mexico, with outlying populations in the southeastern United States and the Caribbean. METHODS: Plastid DNA sequence and AFLP data were obtained from 33 A. integerrima individuals. Phylogenetic analysis of the sequence data and multidimensional clustering of the AFLP data were used to identify independently derived lineages. KEY RESULTS: Analysis of the two datasets identified 10 genetic groups within the 33 analyzed samples. These groups suggest a minimum of 10 origins of A. integerrima in the northern portion of its range, with both putative parents functioning as maternal donors, both supplying unreduced gametes, and both contributing a significant portion of their genetic diversity to the hybrids. CONCLUSIONS: Our results highlight the extreme cryptic genetic diversity and systematic complexity that can underlie a single polyploid taxon.Item Open Access Intergenic and genic sequence lengths have opposite relationships with respect to gene expression.(PLoS One, 2008) Colinas, Juliette; Schmidler, Scott C; Bohrer, Gil; Iordanov, Borislav; Benfey, Philip NEukaryotic genomes are mostly composed of noncoding DNA whose role is still poorly understood. Studies in several organisms have shown correlations between the length of the intergenic and genic sequences of a gene and the expression of its corresponding mRNA transcript. Some studies have found a positive relationship between intergenic sequence length and expression diversity between tissues, and concluded that genes under greater regulatory control require more regulatory information in their intergenic sequences. Other reports found a negative relationship between expression level and gene length and the interpretation was that there is selection pressure for highly expressed genes to remain small. However, a correlation between gene sequence length and expression diversity, opposite to that observed for intergenic sequences, has also been reported, and to date there is no testable explanation for this observation. To shed light on these varied and sometimes conflicting results, we performed a thorough study of the relationships between sequence length and gene expression using cell-type (tissue) specific microarray data in Arabidopsis thaliana. We measured median gene expression across tissues (expression level), expression variability between tissues (expression pattern uniformity), and expression variability between replicates (expression noise). We found that intergenic (upstream and downstream) and genic (coding and noncoding) sequences have generally opposite relationships with respect to expression, whether it is tissue variability, median, or expression noise. To explain these results we propose a model, in which the lengths of the intergenic and genic sequences have opposite effects on the ability of the transcribed region of the gene to be epigenetically regulated for differential expression. These findings could shed light on the role and influence of noncoding sequences on gene expression.Item Open Access Three periods of regulatory innovation during vertebrate evolution.(Science (New York, N.Y.), 2011-08) Lowe, Craig B; Kellis, Manolis; Siepel, Adam; Raney, Brian J; Clamp, Michele; Salama, Sofie R; Kingsley, David M; Lindblad-Toh, Kerstin; Haussler, DavidThe gain, loss, and modification of gene regulatory elements may underlie a substantial proportion of phenotypic changes on animal lineages. To investigate the gain of regulatory elements throughout vertebrate evolution, we identified genome-wide sets of putative regulatory regions for five vertebrates, including humans. These putative regulatory regions are conserved nonexonic elements (CNEEs), which are evolutionarily conserved yet do not overlap any coding or noncoding mature transcript. We then inferred the branch on which each CNEE came under selective constraint. Our analysis identified three extended periods in the evolution of gene regulatory elements. Early vertebrate evolution was characterized by regulatory gains near transcription factors and developmental genes, but this trend was replaced by innovations near extracellular signaling genes, and then innovations near posttranslational protein modifiers.Item Open Access Unique expression of a sporophytic character on the gametophytes of notholaenid ferns (Pteridaceae).(American journal of botany, 2012-06) Johnson, Anne K; Rothfels, Carl J; Windham, Michael D; Pryer, Kathleen MPREMISE OF THE STUDY: Not all ferns grow in moist, shaded habitats; some lineages thrive in exposed, seasonally dry environments. Notholaenids are a clade of xeric-adapted ferns commonly characterized by the presence of a waxy exudate, called farina, on the undersides of their leaves. Although some other lineages of cheilanthoid ferns also have farinose sporophytes, previous studies suggested that notholaenids are unique in also producing farina on their gametophytes. For this reason, consistent farina expression across life cycle phases has been proposed as a potential synapomorphy for the genus Notholaena. Recent phylogenetic studies have shown two species with nonfarinose sporophytes to be nested within Notholaena, with a third nonfarinose species well supported as sister to all other notholaenids. This finding raises the question: are the gametophytes of these three species farinose like those of their close relatives, or are they glabrous, consistent with their sporophytes? METHODS: We sowed spores of a diversity of cheilanthoid ferns onto culture media to observe and document whether their gametophytes produced farina. To place these species within a phylogenetic context, we extracted genomic DNA, then amplified and sequenced three plastid loci. The aligned data were analyzed using maximum likelihood to generate a phylogenetic tree. KEY RESULTS: Here we show that notholaenids lacking sporophytic farina also lack farina in the gametophytic phase, and notholaenids with sporophytic farina always display gametophytic farina (with a single exception). Outgroup taxa never displayed gametophytic farina, regardless of whether they displayed farina on their sporophytes. CONCLUSIONS: Notholaenids are unique among ferns in consistently expressing farina across both phases of the life cycle.Item Open Access Unprecedented loss of ammonia assimilation capability in a urease-encoding bacterial mutualist.(BMC Genomics, 2010-12-02) Williams, Laura E; Wernegreen, Jennifer JBACKGROUND: Blochmannia are obligately intracellular bacterial mutualists of ants of the tribe Camponotini. Blochmannia perform key nutritional functions for the host, including synthesis of several essential amino acids. We used Illumina technology to sequence the genome of Blochmannia associated with Camponotus vafer. RESULTS: Although Blochmannia vafer retains many nutritional functions, it is missing glutamine synthetase (glnA), a component of the nitrogen recycling pathway encoded by the previously sequenced B. floridanus and B. pennsylvanicus. With the exception of Ureaplasma, B. vafer is the only sequenced bacterium to date that encodes urease but lacks the ability to assimilate ammonia into glutamine or glutamate. Loss of glnA occurred in a deletion hotspot near the putative replication origin. Overall, compared to the likely gene set of their common ancestor, 31 genes are missing or eroded in B. vafer, compared to 28 in B. floridanus and four in B. pennsylvanicus. Three genes (queA, visC and yggS) show convergent loss or erosion, suggesting relaxed selection for their functions. Eight B. vafer genes contain frameshifts in homopolymeric tracts that may be corrected by transcriptional slippage. Two of these encode DNA replication proteins: dnaX, which we infer is also frameshifted in B. floridanus, and dnaG. CONCLUSIONS: Comparing the B. vafer genome with B. pennsylvanicus and B. floridanus refines the core genes shared within the mutualist group, thereby clarifying functions required across ant host species. This third genome also allows us to track gene loss and erosion in a phylogenetic context to more fully understand processes of genome reduction.