Browsing by Author "Zhang, Guojie"
Now showing 1 - 20 of 20
Results Per Page
Sort Options
Item Open Access Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.(Gigascience, 2013-07-22) Bradnam, Keith R; Fass, Joseph N; Alexandrov, Anton; Baranay, Paul; Bechner, Michael; Birol, Inanç; Boisvert, Sébastien; Chapman, Jarrod A; Chapuis, Guillaume; Chikhi, Rayan; Chitsaz, Hamidreza; Chou, Wen-Chi; Corbeil, Jacques; Del Fabbro, Cristian; Docking, T Roderick; Durbin, Richard; Earl, Dent; Emrich, Scott; Fedotov, Pavel; Fonseca, Nuno A; Ganapathy, Ganeshkumar; Gibbs, Richard A; Gnerre, Sante; Godzaridis, Elénie; Goldstein, Steve; Haimel, Matthias; Hall, Giles; Haussler, David; Hiatt, Joseph B; Ho, Isaac Y; Howard, Jason; Hunt, Martin; Jackman, Shaun D; Jaffe, David B; Jarvis, Erich D; Jiang, Huaiyang; Kazakov, Sergey; Kersey, Paul J; Kitzman, Jacob O; Knight, James R; Koren, Sergey; Lam, Tak-Wah; Lavenier, Dominique; Laviolette, François; Li, Yingrui; Li, Zhenyu; Liu, Binghang; Liu, Yue; Luo, Ruibang; Maccallum, Iain; Macmanes, Matthew D; Maillet, Nicolas; Melnikov, Sergey; Naquin, Delphine; Ning, Zemin; Otto, Thomas D; Paten, Benedict; Paulo, Octávio S; Phillippy, Adam M; Pina-Martins, Francisco; Place, Michael; Przybylski, Dariusz; Qin, Xiang; Qu, Carson; Ribeiro, Filipe J; Richards, Stephen; Rokhsar, Daniel S; Ruby, J Graham; Scalabrin, Simone; Schatz, Michael C; Schwartz, David C; Sergushichev, Alexey; Sharpe, Ted; Shaw, Timothy I; Shendure, Jay; Shi, Yujian; Simpson, Jared T; Song, Henry; Tsarev, Fedor; Vezzi, Francesco; Vicedomini, Riccardo; Vieira, Bruno M; Wang, Jun; Wang, Jun; Worley, Kim C; Yin, Shuangye; Yiu, Siu-Ming; Yuan, Jianying; Zhang, Guojie; Zhang, Hao; Zhou, Shiguo; Korf, Ian FBACKGROUND: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. RESULTS: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. CONCLUSIONS: Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.Item Open Access Avian genomes. A flock of genomes. Introduction.(Science, 2014-12-12) Zhang, Guojie; Jarvis, Erich D; Gilbert, M Thomas PItem Open Access Avianbase: a community resource for bird genomics.(Genome Biol, 2015-01-29) Eöry, Lél; Gilbert, M Thomas P; Li, Cai; Li, Bo; Archibald, Alan; Aken, Bronwen L; Zhang, Guojie; Jarvis, Erich; Flicek, Paul; Burt, David WGiving access to sequence and annotation data for genome assemblies is important because, while facilitating research, it places both assembly and annotation quality under scrutiny, resulting in improvements to both. Therefore we announce Avianbase, a resource for bird genomics, which provides access to data released by the Avian Phylogenomics Consortium.Item Open Access Comparative genomic data of the Avian Phylogenomics Project.(2014) Zhang, Guojie; Li, Bo; Li, Cai; Gilbert, M Thomas P; Jarvis, Erich D; Wang, Jun; Wang, Jun; Avian Genome ConsortiumBACKGROUND: The evolutionary relationships of modern birds are among the most challenging to understand in systematic biology and have been debated for centuries. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders, and used the genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomics analyses (Jarvis et al. in press; Zhang et al. in press). Here we release assemblies and datasets associated with the comparative genome analyses, which include 38 newly sequenced avian genomes plus previously released or simultaneously released genomes of Chicken, Zebra finch, Turkey, Pigeon, Peregrine falcon, Duck, Budgerigar, Adelie penguin, Emperor penguin and the Medium Ground Finch. We hope that this resource will serve future efforts in phylogenomics and comparative genomics. FINDINGS: The 38 bird genomes were sequenced using the Illumina HiSeq 2000 platform and assembled using a whole genome shotgun strategy. The 48 genomes were categorized into two groups according to the N50 scaffold size of the assemblies: a high depth group comprising 23 species sequenced at high coverage (>50X) with multiple insert size libraries resulting in N50 scaffold sizes greater than 1 Mb (except the White-throated Tinamou and Bald Eagle); and a low depth group comprising 25 species sequenced at a low coverage (~30X) with two insert size libraries resulting in an average N50 scaffold size of about 50 kb. Repetitive elements comprised 4%-22% of the bird genomes. The assembled scaffolds allowed the homology-based annotation of 13,000 ~ 17000 protein coding genes in each avian genome relative to chicken, zebra finch and human, as well as comparative and sequence conservation analyses. CONCLUSIONS: Here we release full genome assemblies of 38 newly sequenced avian species, link genome assembly downloads for the 7 of the remaining 10 species, and provide a guideline of genomic data that has been generated and used in our Avian Phylogenomics Project. To the best of our knowledge, the Avian Phylogenomics Project is the biggest vertebrate comparative genomics project to date. The genomic data presented here is expected to accelerate further analyses in many fields, including phylogenetics, comparative genomics, evolution, neurobiology, development biology, and other related areas.Item Open Access Comparative genomics reveals insights into avian genome evolution and adaptation.(Science, 2014-12-12) Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Avian Genome Consortium; Jarvis, Erich D; Gilbert, M Thomas P; Wang, JunBirds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.Item Open Access Complex evolutionary trajectories of sex chromosomes across bird taxa.(Science, 2014-12-12) Zhou, Qi; Zhang, Jilin; Bachtrog, Doris; An, Na; Huang, Quanfei; Jarvis, Erich D; Gilbert, M Thomas P; Zhang, GuojieSex-specific chromosomes, like the W of most female birds and the Y of male mammals, usually have lost most genes owing to a lack of recombination. We analyze newly available genomes of 17 bird species representing the avian phylogenetic range, and find that more than half of them do not have as fully degenerated W chromosomes as that of chicken. We show that avian sex chromosomes harbor tremendous diversity among species in their composition of pseudoautosomal regions and degree of Z/W differentiation. Punctuated events of shared or lineage-specific recombination suppression have produced a gradient of "evolutionary strata" along the Z chromosome, which initiates from the putative avian sex-determining gene DMRT1 and ends at the pseudoautosomal region. W-linked genes are subject to ongoing functional decay after recombination was suppressed, and the tempo of degeneration slows down in older strata. Overall, we unveil a complex history of avian sex chromosome evolution.Item Open Access Convergent transcriptional specializations in the brains of humans and song-learning birds.(Science, 2014-12-12) Pfenning, Andreas R; Hara, Erina; Whitney, Osceola; Rivas, Miriam V; Wang, Rui; Roulhac, Petra L; Howard, Jason T; Wirthlin, Morgan; Lovell, Peter V; Ganapathy, Ganeshkumar; Mouncastle, Jacquelyn; Moseley, M Arthur; Thompson, J Will; Soderblom, Erik J; Iriki, Atsushi; Kato, Masaki; Gilbert, M Thomas P; Zhang, Guojie; Bakken, Trygve; Bongaarts, Angie; Bernard, Amy; Lein, Ed; Mello, Claudio V; Hartemink, Alexander J; Jarvis, Erich DSong-learning birds and humans share independently evolved similarities in brain pathways for vocal learning that are essential for song and speech and are not found in most other species. Comparisons of brain transcriptomes of song-learning birds and humans relative to vocal nonlearners identified convergent gene expression specializations in specific song and speech brain regions of avian vocal learners and humans. The strongest shared profiles relate bird motor and striatal song-learning nuclei, respectively, with human laryngeal motor cortex and parts of the striatum that control speech production and learning. Most of the associated genes function in motor control and brain connectivity. Thus, convergent behavior and neural connectivity for a complex trait are associated with convergent specialized expression of multiple genes.Item Open Access Dynamic evolution of the alpha (α) and beta (β) keratins has accompanied integument diversification and the adaptation of birds into novel lifestyles.(BMC Evol Biol, 2014-12-12) Greenwold, Matthew J; Bao, Weier; Jarvis, Erich D; Hu, Haofu; Li, Cai; Gilbert, M Thomas P; Zhang, Guojie; Sawyer, Roger HBACKGROUND: Vertebrate skin appendages are constructed of keratins produced by multigene families. Alpha (α) keratins are found in all vertebrates, while beta (β) keratins are found exclusively in reptiles and birds. We have studied the molecular evolution of these gene families in the genomes of 48 phylogenetically diverse birds and their expression in the scales and feathers of the chicken. RESULTS: We found that the total number of α-keratins is lower in birds than mammals and non-avian reptiles, yet two α-keratin genes (KRT42 and KRT75) have expanded in birds. The β-keratins, however, demonstrate a dynamic evolution associated with avian lifestyle. The avian specific feather β-keratins comprise a large majority of the total number of β-keratins, but independently derived lineages of aquatic and predatory birds have smaller proportions of feather β-keratin genes and larger proportions of keratinocyte β-keratin genes. Additionally, birds of prey have a larger proportion of claw β-keratins. Analysis of α- and β-keratin expression during development of chicken scales and feathers demonstrates that while α-keratins are expressed in these tissues, the number and magnitude of expressed β-keratin genes far exceeds that of α-keratins. CONCLUSIONS: These results support the view that the number of α- and β-keratin genes expressed, the proportion of the β-keratin subfamily genes expressed and the diversification of the β-keratin genes have been important for the evolution of the feather and the adaptation of birds into multiple ecological niches.Item Open Access Evidence for a single loss of mineralized teeth in the common avian ancestor.(Science, 2014-12-12) Meredith, Robert W; Zhang, Guojie; Gilbert, M Thomas P; Jarvis, Erich D; Springer, Mark SEdentulism, the absence of teeth, has evolved convergently among vertebrates, including birds, turtles, and several lineages of mammals. Instead of teeth, modern birds (Neornithes) use a horny beak (rhamphotheca) and a muscular gizzard to acquire and process food. We performed comparative genomic analyses representing lineages of nearly all extant bird orders and recovered shared, inactivating mutations within genes expressed in both the enamel and dentin of teeth of other vertebrate species, indicating that the common ancestor of modern birds lacked mineralized teeth. We estimate that tooth loss, or at least the loss of enamel caps that provide the outer layer of mineralized teeth, occurred about 116 million years ago.Item Open Access Evolutionary genomics and adaptive evolution of the Hedgehog gene family (Shh, Ihh and Dhh) in vertebrates.(PLoS One, 2014) Pereira, Joana; Johnson, Warren E; O'Brien, Stephen J; Jarvis, Erich D; Zhang, Guojie; Gilbert, M Thomas P; Vasconcelos, Vitor; Antunes, AgostinhoThe Hedgehog (Hh) gene family codes for a class of secreted proteins composed of two active domains that act as signalling molecules during embryo development, namely for the development of the nervous and skeletal systems and the formation of the testis cord. While only one Hh gene is found typically in invertebrate genomes, most vertebrates species have three (Sonic hedgehog--Shh; Indian hedgehog--Ihh; and Desert hedgehog--Dhh), each with different expression patterns and functions, which likely helped promote the increasing complexity of vertebrates and their successful diversification. In this study, we used comparative genomic and adaptive evolutionary analyses to characterize the evolution of the Hh genes in vertebrates following the two major whole genome duplication (WGD) events. To overcome the lack of Hh-coding sequences on avian publicly available databases, we used an extensive dataset of 45 avian and three non-avian reptilian genomes to show that birds have all three Hh paralogs. We find suggestions that following the WGD events, vertebrate Hh paralogous genes evolved independently within similar linkage groups and under different evolutionary rates, especially within the catalytic domain. The structural regions around the ion-binding site were identified to be under positive selection in the signaling domain. These findings contrast with those observed in invertebrates, where different lineages that experienced gene duplication retained similar selective constraints in the Hh orthologs. Our results provide new insights on the evolutionary history of the Hh gene family, the functional roles of these paralogs in vertebrate species, and on the location of mutational hotspots.Item Open Access Gene loss, adaptive evolution and the co-evolution of plumage coloration genes with opsins in birds.(BMC Genomics, 2015-10-06) Borges, Rui; Borges, Rui; Khan, Imran; Johnson, Warren E; Gilbert, M Thomas P; Zhang, Guojie; Jarvis, Erich D; O'Brien, Stephen J; Antunes, AgostinhoBACKGROUND: The wide range of complex photic systems observed in birds exemplifies one of their key evolutionary adaptions, a well-developed visual system. However, genomic approaches have yet to be used to disentangle the evolutionary mechanisms that govern evolution of avian visual systems. RESULTS: We performed comparative genomic analyses across 48 avian genomes that span extant bird phylogenetic diversity to assess evolutionary changes in the 17 representatives of the opsin gene family and five plumage coloration genes. Our analyses suggest modern birds have maintained a repertoire of up to 15 opsins. Synteny analyses indicate that PARA and PARIE pineal opsins were lost, probably in conjunction with the degeneration of the parietal organ. Eleven of the 15 avian opsins evolved in a non-neutral pattern, confirming the adaptive importance of vision in birds. Visual conopsins sw1, sw2 and lw evolved under negative selection, while the dim-light RH1 photopigment diversified. The evolutionary patterns of sw1 and of violet/ultraviolet sensitivity in birds suggest that avian ancestors had violet-sensitive vision. Additionally, we demonstrate an adaptive association between the RH2 opsin and the MC1R plumage color gene, suggesting that plumage coloration has been photic mediated. At the intra-avian level we observed some unique adaptive patterns. For example, barn owl showed early signs of pseudogenization in RH2, perhaps in response to nocturnal behavior, and penguins had amino acid deletions in RH2 sites responsible for the red shift and retinal binding. These patterns in the barn owl and penguins were convergent with adaptive strategies in nocturnal and aquatic mammals, respectively. CONCLUSIONS: We conclude that birds have evolved diverse opsin adaptations through gene loss, adaptive selection and coevolution with plumage coloration, and that differentiated selective patterns at the species level suggest novel photic pressures to influence evolutionary patterns of more-recent lineages.Item Restricted Genomic signatures of near-extinction and rebirth of the crested ibis and other endangered bird species(GENOME BIOLOGY, 2014) Li, Shengbin; Li, Bo; Cheng, Cheng; Xiong, Zijun; Liu, Qingbo; Lai, Jianghua; Carey, Hannah V; Zhang, Qiong; Zheng, Haibo; Wei, Shuguang; Zhang, Hongbo; Chang, Liao; Liu, Shiping; Zhang, Shanxin; Yu, Bing; Zeng, Xiaofan; Hou, Yong; Nie, Wenhui; Guo, Youmin; Chen, Teng; Han, Jiuqiang; Wang, Jian; Wang, Jun; Chen, Chen; Liu, Jiankang; Stambrook, Peter J; Xu, Ming; Zhang, Guojie; Gilbert, M Thomas P; Yang, Huanming; Jarvis, Erich D; Yu, Jun; Yan, JianqunBACKGROUND: Nearly one-quarter of all avian species is either threatened or nearly threatened. Of these, 73 species are currently being rescued from going extinct in wildlife sanctuaries. One of the previously most critically-endangered is the crested ibis, Nipponia nippon. Once widespread across North-East Asia, by 1981 only seven individuals from two breeding pairs remained in the wild. The recovering crested ibis populations thus provide an excellent example for conservation genomics since every individual bird has been recruited for genomic and demographic studies. RESULTS: Using high-quality genome sequences of multiple crested ibis individuals, its thriving co-habitant, the little egret, Egretta garzetta, and the recently sequenced genomes of 41 other avian species that are under various degrees of survival threats, including the bald eagle, we carry out comparative analyses for genomic signatures of near extinction events in association with environmental and behavioral attributes of species. We confirm that both loss of genetic diversity and enrichment of deleterious mutations of protein-coding genes contribute to the major genetic defects of the endangered species. We further identify that genetic inbreeding and loss-of-function genes in the crested ibis may all constitute genetic susceptibility to other factors including long-term climate change, over-hunting, and agrochemical overuse. We also establish a genome-wide DNA identification platform for molecular breeding and conservation practices, to facilitate sustainable recovery of endangered species. CONCLUSIONS: These findings demonstrate common genomic signatures of population decline across avian species and pave a way for further effort in saving endangered species and enhancing conservation genomic efforts.Item Open Access High-coverage sequencing and annotated assemblies of the budgerigar genome.(Gigascience, 2014) Ganapathy, Ganeshkumar; Howard, Jason T; Ward, James M; Li, Jianwen; Li, Bo; Li, Yingrui; Xiong, Yingqi; Zhang, Yong; Zhou, Shiguo; Schwartz, David C; Schatz, Michael; Aboukhalil, Robert; Fedrigo, Olivier; Bukovnik, Lisa; Wang, Ty; Wray, Greg; Rasolonjatovo, Isabelle; Winer, Roger; Knight, James R; Koren, Sergey; Warren, Wesley C; Zhang, Guojie; Phillippy, Adam M; Jarvis, Erich DBACKGROUND: Parrots belong to a group of behaviorally advanced vertebrates and have an advanced ability of vocal learning relative to other vocal-learning birds. They can imitate human speech, synchronize their body movements to a rhythmic beat, and understand complex concepts of referential meaning to sounds. However, little is known about the genetics of these traits. Elucidating the genetic bases would require whole genome sequencing and a robust assembly of a parrot genome. FINDINGS: We present a genomic resource for the budgerigar, an Australian Parakeet (Melopsittacus undulatus) -- the most widely studied parrot species in neuroscience and behavior. We present genomic sequence data that includes over 300× raw read coverage from multiple sequencing technologies and chromosome optical maps from a single male animal. The reads and optical maps were used to create three hybrid assemblies representing some of the largest genomic scaffolds to date for a bird; two of which were annotated based on similarities to reference sets of non-redundant human, zebra finch and chicken proteins, and budgerigar transcriptome sequence assemblies. The sequence reads for this project were in part generated and used for both the Assemblathon 2 competition and the first de novo assembly of a giga-scale vertebrate genome utilizing PacBio single-molecule sequencing. CONCLUSIONS: Across several quality metrics, these budgerigar assemblies are comparable to or better than the chicken and zebra finch genome assemblies built from traditional Sanger sequencing reads, and are sufficient to analyze regions that are difficult to sequence and assemble, including those not yet assembled in prior bird genomes, and promoter regions of genes differentially regulated in vocal learning brain regions. This work provides valuable data and material for genome technology development and for investigating the genomics of complex behavioral traits.Item Open Access Low frequency of paleoviral infiltration across the avian phylogeny.(Genome Biol, 2014) Cui, Jie; Zhao, Wei; Huang, Zhiyong; Jarvis, Erich D; Gilbert, M Thomas P; Walker, Peter J; Holmes, Edward C; Zhang, GuojieBACKGROUND: Mammalian genomes commonly harbor endogenous viral elements. Due to a lack of comparable genome-scale sequence data, far less is known about endogenous viral elements in avian species, even though their small genomes may enable important insights into the patterns and processes of endogenous viral element evolution. RESULTS: Through a systematic screening of the genomes of 48 species sampled across the avian phylogeny we reveal that birds harbor a limited number of endogenous viral elements compared to mammals, with only five viral families observed: Retroviridae, Hepadnaviridae, Bornaviridae, Circoviridae, and Parvoviridae. All nonretroviral endogenous viral elements are present at low copy numbers and in few species, with only endogenous hepadnaviruses widely distributed, although these have been purged in some cases. We also provide the first evidence for endogenous bornaviruses and circoviruses in avian genomes, although at very low copy numbers. A comparative analysis of vertebrate genomes revealed a simple linear relationship between endogenous viral element abundance and host genome size, such that the occurrence of endogenous viral elements in bird genomes is 6- to 13-fold less frequent than in mammals. CONCLUSIONS: These results reveal that avian genomes harbor relatively small numbers of endogenous viruses, particularly those derived from RNA viruses, and hence are either less susceptible to viral invasions or purge them more effectively.Item Open Access Olfactory Receptor Subgenomes Linked with Broad Ecological Adaptations in Sauropsida.(Mol Biol Evol, 2015-11) Khan, Imran; Yang, Zhikai; Maldonado, Emanuel; Li, Cai; Zhang, Guojie; Gilbert, M Thomas P; Jarvis, Erich D; O'Brien, Stephen J; Johnson, Warren E; Antunes, AgostinhoOlfactory receptors (ORs) govern a prime sensory function. Extant birds have distinct olfactory abilities, but the molecular mechanisms underlining diversification and specialization remain mostly unknown. We explored OR diversity in 48 phylogenetic and ecologically diverse birds and 2 reptiles (alligator and green sea turtle). OR subgenomes showed species- and lineage-specific variation related with ecological requirements. Overall 1,953 OR genes were identified in reptiles and 16,503 in birds. The two reptiles had larger OR gene repertoires (989 and 964 genes, respectively) than birds (182-688 genes). Overall, birds had more pseudogenes (7,855) than intact genes (1,944). The alligator had significantly more functional genes than sea turtle, likely because of distinct foraging habits. We found rapid species-specific expansion and positive selection in OR14 (detects hydrophobic compounds) in birds and in OR51 and OR52 (detect hydrophilic compounds) in sea turtle, suggestive of terrestrial and aquatic adaptations, respectively. Ecological partitioning among birds of prey, water birds, land birds, and vocal learners showed that diverse ecological factors determined olfactory ability and influenced corresponding olfactory-receptor subgenome. OR5/8/9 was expanded in predatory birds and alligator, suggesting adaptive specialization for carnivory. OR families 2/13, 51, and 52 were correlated with aquatic adaptations (water birds), OR families 6 and 10 were more pronounced in vocal-learning birds, whereas most specialized land birds had an expanded OR family 14. Olfactory bulb ratio (OBR) and OR gene repertoire were correlated. Birds that forage for prey (carnivores/piscivores) had relatively complex OBR and OR gene repertoires compared with modern birds, including passerines, perhaps due to highly developed cognitive capacities facilitating foraging innovations.Item Open Access Phylogenomic analyses data of the avian phylogenomics project.(Gigascience, 2015) Jarvis, Erich D; Mirarab, Siavash; Aberer, Andre J; Li, Bo; Houde, Peter; Li, Cai; Ho, Simon YW; Faircloth, Brant C; Nabholz, Benoit; Howard, Jason T; Suh, Alexander; Weber, Claudia C; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Narula, Nitish; Liu, Liang; Burt, Dave; Ellegren, Hans; Edwards, Scott V; Stamatakis, Alexandros; Mindell, David P; Cracraft, Joel; Braun, Edward L; Warnow, Tandy; Jun, Wang; Gilbert, M Thomas Pius; Zhang, Guojie; Avian Phylogenomics ConsortiumBACKGROUND: Determining the evolutionary relationships among the major lineages of extant birds has been one of the biggest challenges in systematic biology. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders. We used these genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomic analyses. FINDINGS: Here we present the datasets associated with the phylogenomic analyses, which include sequence alignment files consisting of nucleotides, amino acids, indels, and transposable elements, as well as tree files containing gene trees and species trees. Inferring an accurate phylogeny required generating: 1) A well annotated data set across species based on genome synteny; 2) Alignments with unaligned or incorrectly overaligned sequences filtered out; and 3) Diverse data sets, including genes and their inferred trees, indels, and transposable elements. Our total evidence nucleotide tree (TENT) data set (consisting of exons, introns, and UCEs) gave what we consider our most reliable species tree when using the concatenation-based ExaML algorithm or when using statistical binning with the coalescence-based MP-EST algorithm (which we refer to as MP-EST*). Other data sets, such as the coding sequence of some exons, revealed other properties of genome evolution, namely convergence. CONCLUSIONS: The Avian Phylogenomics Project is the largest vertebrate phylogenomics project to date that we are aware of. The sequence, alignment, and tree data are expected to accelerate analyses in phylogenomics and other related areas.Item Open Access Response to Comment on "Whole-genome analyses resolve early branches in the tree of life of modern birds".(Science, 2015-09-25) Cracraft, Joel; Houde, Peter; Ho, Simon YW; Mindell, David P; Fjeldså, Jon; Lindow, Bent; Edwards, Scott V; Rahbek, Carsten; Mirarab, Siavash; Warnow, Tandy; Gilbert, M Thomas P; Zhang, Guojie; Braun, Edward L; Jarvis, Erich DMitchell et al. argue that divergence-time estimates for our avian phylogeny were too young because of an "inappropriate" maximum age constraint for the most recent common ancestor of modern birds and that, as a result, most modern bird orders diverged before the Cretaceous-Paleogene mass extinction event 66 million years ago instead of after. However, their interpretations of the fossil record and timetrees are incorrect.Item Open Access Three crocodilian genomes reveal ancestral patterns of evolution among archosaurs.(Science, 2014-12-12) Green, Richard E; Braun, Edward L; Armstrong, Joel; Earl, Dent; Nguyen, Ngan; Hickey, Glenn; Vandewege, Michael W; St John, John A; Capella-Gutiérrez, Salvador; Castoe, Todd A; Kern, Colin; Fujita, Matthew K; Opazo, Juan C; Jurka, Jerzy; Kojima, Kenji K; Caballero, Juan; Hubley, Robert M; Smit, Arian F; Platt, Roy N; Lavoie, Christine A; Ramakodi, Meganathan P; Finger, John W; Suh, Alexander; Isberg, Sally R; Miles, Lee; Chong, Amanda Y; Jaratlerdsiri, Weerachai; Gongora, Jaime; Moran, Christopher; Iriarte, Andrés; McCormack, John; Burgess, Shane C; Edwards, Scott V; Lyons, Eric; Williams, Christina; Breen, Matthew; Howard, Jason T; Gresham, Cathy R; Peterson, Daniel G; Schmitz, Jürgen; Pollock, David D; Haussler, David; Triplett, Eric W; Zhang, Guojie; Irie, Naoki; Jarvis, Erich D; Brochu, Christopher A; Schmidt, Carl J; McCarthy, Fiona M; Faircloth, Brant C; Hoffmann, Federico G; Glenn, Travis C; Gabaldón, Toni; Paten, Benedict; Ray, David ATo provide context for the diversification of archosaurs--the group that includes crocodilians, dinosaurs, and birds--we generated draft genomes of three crocodilians: Alligator mississippiensis (the American alligator), Crocodylus porosus (the saltwater crocodile), and Gavialis gangeticus (the Indian gharial). We observed an exceptionally slow rate of genome evolution within crocodilians at all levels, including nucleotide substitutions, indels, transposable element content and movement, gene family evolution, and chromosomal synteny. When placed within the context of related taxa including birds and turtles, this suggests that the common ancestor of all of these taxa also exhibited slow genome evolution and that the comparatively rapid evolution is derived in birds. The data also provided the opportunity to analyze heterozygosity in crocodilians, which indicates a likely reduction in population size for all three taxa through the Pleistocene. Finally, these data combined with newly published bird genomes allowed us to reconstruct the partial genome of the common ancestor of archosaurs, thereby providing a tool to investigate the genetic starting material of crocodilians, birds, and dinosaurs.Item Open Access Two Antarctic penguin genomes reveal insights into their evolutionary history and molecular changes related to the Antarctic environment.(Gigascience, 2014) Li, Cai; Zhang, Yong; Li, Jianwen; Kong, Lesheng; Hu, Haofu; Pan, Hailin; Xu, Luohao; Deng, Yuan; Li, Qiye; Jin, Lijun; Yu, Hao; Chen, Yan; Liu, Binghang; Yang, Linfeng; Liu, Shiping; Zhang, Yan; Lang, Yongshan; Xia, Jinquan; He, Weiming; Shi, Qiong; Subramanian, Sankar; Millar, Craig D; Meader, Stephen; Rands, Chris M; Fujita, Matthew K; Greenwold, Matthew J; Castoe, Todd A; Pollock, David D; Gu, Wanjun; Nam, Kiwoong; Ellegren, Hans; Ho, Simon Yw; Burt, David W; Ponting, Chris P; Jarvis, Erich D; Gilbert, M Thomas P; Yang, Huanming; Wang, Jian; Lambert, David M; Wang, Jun; Zhang, GuojieBACKGROUND: Penguins are flightless aquatic birds widely distributed in the Southern Hemisphere. The distinctive morphological and physiological features of penguins allow them to live an aquatic life, and some of them have successfully adapted to the hostile environments in Antarctica. To study the phylogenetic and population history of penguins and the molecular basis of their adaptations to Antarctica, we sequenced the genomes of the two Antarctic dwelling penguin species, the Adélie penguin [Pygoscelis adeliae] and emperor penguin [Aptenodytes forsteri]. RESULTS: Phylogenetic dating suggests that early penguins arose ~60 million years ago, coinciding with a period of global warming. Analysis of effective population sizes reveals that the two penguin species experienced population expansions from ~1 million years ago to ~100 thousand years ago, but responded differently to the climatic cooling of the last glacial period. Comparative genomic analyses with other available avian genomes identified molecular changes in genes related to epidermal structure, phototransduction, lipid metabolism, and forelimb morphology. CONCLUSIONS: Our sequencing and initial analyses of the first two penguin genomes provide insights into the timing of penguin origin, fluctuations in effective population sizes of the two penguin species over the past 10 million years, and the potential associations between these biological patterns and global climate change. The molecular changes compared with other avian genomes reflect both shared and diverse adaptations of the two penguin species to the Antarctic environment.Item Open Access Whole-genome analyses resolve early branches in the tree of life of modern birds.(Science, 2014-12-12) Jarvis, Erich D; Mirarab, Siavash; Aberer, Andre J; Li, Bo; Houde, Peter; Li, Cai; Ho, Simon YW; Faircloth, Brant C; Nabholz, Benoit; Howard, Jason T; Suh, Alexander; Weber, Claudia C; da Fonseca, Rute R; Li, Jianwen; Zhang, Fang; Li, Hui; Zhou, Long; Narula, Nitish; Liu, Liang; Ganapathy, Ganesh; Boussau, Bastien; Bayzid, Md Shamsuzzoha; Zavidovych, Volodymyr; Subramanian, Sankar; Gabaldón, Toni; Capella-Gutiérrez, Salvador; Huerta-Cepas, Jaime; Rekepalli, Bhanu; Munch, Kasper; Schierup, Mikkel; Lindow, Bent; Warren, Wesley C; Ray, David; Green, Richard E; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Li, Shengbin; Li, Ning; Huang, Yinhua; Derryberry, Elizabeth P; Bertelsen, Mads Frost; Sheldon, Frederick H; Brumfield, Robb T; Mello, Claudio V; Lovell, Peter V; Wirthlin, Morgan; Schneider, Maria Paula Cruz; Prosdocimi, Francisco; Samaniego, José Alfredo; Vargas Velazquez, Amhed Missael; Alfaro-Núñez, Alonzo; Campos, Paula F; Petersen, Bent; Sicheritz-Ponten, Thomas; Pas, An; Bailey, Tom; Scofield, Paul; Bunce, Michael; Lambert, David M; Zhou, Qi; Perelman, Polina; Driskell, Amy C; Shapiro, Beth; Xiong, Zijun; Zeng, Yongli; Liu, Shiping; Li, Zhenyu; Liu, Binghang; Wu, Kui; Xiao, Jin; Yinqi, Xiong; Zheng, Qiuemei; Zhang, Yong; Yang, Huanming; Wang, Jian; Wang, Jian; Smeds, Linnea; Rheindt, Frank E; Braun, Michael; Fjeldsa, Jon; Orlando, Ludovic; Barker, F Keith; Jønsson, Knud Andreas; Johnson, Warren; Koepfli, Klaus-Peter; O'Brien, Stephen; Haussler, David; Ryder, Oliver A; Rahbek, Carsten; Willerslev, Eske; Graves, Gary R; Glenn, Travis C; McCormack, John; Burt, Dave; Ellegren, Hans; Alström, Per; Edwards, Scott V; Stamatakis, Alexandros; Mindell, David P; Cracraft, Joel; Braun, Edward L; Warnow, Tandy; Jun, Wang; Gilbert, M Thomas P; Zhang, GuojieTo better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or close relationships. We identified the first divergence in Neoaves, two groups we named Passerea and Columbea, representing independent lineages of diverse and convergently evolved land and water bird species. Among Passerea, we infer the common ancestor of core landbirds to have been an apex predator and confirm independent gains of vocal learning. Among Columbea, we identify pigeons and flamingoes as belonging to sister clades. Even with whole genomes, some of the earliest branches in Neoaves proved challenging to resolve, which was best explained by massive protein-coding sequence convergence and high levels of incomplete lineage sorting that occurred during a rapid radiation after the Cretaceous-Paleogene mass extinction event about 66 million years ago.