Browsing by Author "Goldberg, Amy"
- Results Per Page
- Sort Options
Item Open Access Ancestry-based Methods for Characterizing the Evolutionary History of Admixed Populations(2022) Hamid, ImanAdmixture occurs when previously isolated populations come together to form a new population with genetic ancestry from those sources. Admixture is ubiquitous across the tree of life, including humans, and is often associated with migration and exposure to new environments and selective pressures. Admixed populations provide a unique opportunity to study adaptation on short timescales by introducing beneficial alleles at high frequency. However, admixed populations are often excluded from genomic studies due to lack of applicable methodology. Instead of relying on classical methods confounded by the process of admixture itself, we can detect changes in patterns of genetics ancestry that are informative about selection in admixed populations and at the short timescales often relevant for post-admixture selection. However, we lack theoretical expectations and methods to detect and characterize ancestry-based genomic signals indicative of post-admixture selection and adaptation. Common ancestry outlier approaches discard information about the surrounding genomic context and are prone to false positives due to drift and demography. Here, I present three studies which leverage patterns of genetic ancestry to investigate the evolutionary history of admixed populations. First, I develop a suite of ancestry-based summary statistics and computational methods to detect post-admixture adaptation, and demonstrate their application in a case study of human adaptation to malaria. In particular, these summary statistics incorporate patterns of ancestry beyond the site under selection, such as the length of contiguous ancestry tracts surrounding the locus, and are informative about the strength and timing of selection in admixed populations. I observe one of the strongest signals of recent selection in humans at the malaria protective Duffy-null allele, and show that this mode of strong single-locus selection over 20 generations has impacted genome-wide patterns of ancestry. Next, I move beyond summary statistics to develop a deep learning strategy for localizing regions of the genome under selection. This method takes images of chromosomes painted by ancestry as input to avoid the loss of information and bias that can occur when relying on user-defined summary statistics. I demonstrate this approach on simulated admixture scenarios and find that the method successfully localizes variants under selection 95% percent of the time, outperforms the common ancestry outlier approach, and is robust to demographic misspecification. Lastly, I present the first Illyrian genome sequences available from the Iron Age in a study of the ancestry and genetic relationships of five neonates buried in Korčula, Croatia. I find genetic support for classifying these individuals as Illyrian, and show that patterns of ancestry and genetic variation are consistent with their geographic location between Italy and the mainland Balkans. In the combined work presented here, I advance our ability to study the evolutionary history of admixed populations, which has implications for our understanding of phenotypic variation, disease risk, and conservation genetics across many study systems. Further, these methods tailored to the mosaic ancestry of admixed populations is a step towards expanding the diversity of populations, especially humans, who benefit from discoveries and advancement in genomic research.
Item Open Access Post-invasion demography of prehistoric humans in South America(Nature, 2016-04) Goldberg, Amy; Mychajliw, Alexis M; Hadly, Elizabeth AItem Open Access Primate malarias as a model for cross-species parasite transmission.(eLife, 2022-01) Voinson, Marina; Nunn, Charles L; Goldberg, AmyParasites regularly switch into new host species, representing a disease burden and conservation risk to the hosts. The distribution of these parasites also gives insight into characteristics of ecological networks and genetic mechanisms of host-parasite interactions. Some parasites are shared across many species, whereas others tend to be restricted to hosts from a single species. Understanding the mechanisms producing this distribution of host specificity can enable more effective interventions and potentially identify genetic targets for vaccines or therapies. As ecological connections between human and local animal populations increase, the risk to human and wildlife health from novel parasites also increases. Which of these parasites will fizzle out and which have the potential to become widespread in humans? We consider the case of primate malarias, caused by Plasmodium parasites, to investigate the interacting ecological and evolutionary mechanisms that put human and nonhuman primates at risk for infection. Plasmodium host switching from nonhuman primates to humans led to ancient introductions of the most common malaria-causing agents in humans today, and new parasite switching is a growing threat, especially in Asia and South America. Based on a wild host-Plasmodium occurrence database, we highlight geographic areas of concern and potential areas to target further sampling. We also discuss methodological developments that will facilitate clinical and field-based interventions to improve human and wildlife health based on this eco-evolutionary perspective.Item Open Access Uncovering causal noncoding variants among nervous system disease linked variants with evolutionary analysis, epigenomic annotation, and in vivo scSTARR-seq(2023-04) Simpson, ShaeIntegrating information from multiple sources of genomic data, including inferred evolutionary history, epigenomic annotation, and high throughput in vivo assays, can help expand from the conclusions drawn from Genome-Wide Association Studies (GWAS) to provide causal insight into human genomic variation. I collected all nervous system disease-associated variation from the human GWAS catalog and identified variants linked to human ancestor quickly evolved regions (HAQERs), regions that rapidly evolved under positive selection and are enriched in neurodevelopmental functional elements. I identified variants located in putative enhancer elements based on open chromatin and 3-dimensional chromosome contacts with nearby promotors. To facilitate in vivo testing of these variants, I implemented haplotypeGenerator, an open-source program that infers haplotype sequences from individual variation data. I evaluated the efficiency of multiple methods of input library preparation and identified bulk transformation with maxiprep DNA isolation, as well as content validation by sequencing, as the most efficient method. The methods outlined in this work provide a framework for deeper interpretation of disease-linked variation and facilitate better understandings of the genetic determinants of human disease.