Menu
April 21, 2020  |  

Global-level population genomics reveals differential effects of geography and phylogeny on horizontal gene transfer in soil bacteria.

Although microorganisms are known to dominate Earth’s biospheres and drive biogeochemical cycling, little is known about the geographic distributions of microbial populations or the environmental factors that pattern those distributions. We used a global-level hierarchical sampling scheme to comprehensively characterize the evolutionary relationships and distributional limitations of the nitrogen-fixing bacterial symbionts of the crop chickpea, generating 1,027 draft whole-genome sequences at the level of bacterial populations, including 14 high-quality PacBio genomes from a phylogenetically representative subset. We find that diverse Mesorhizobium taxa perform symbiosis with chickpea and have largely overlapping global distributions. However, sampled locations cluster based on the phylogenetic diversity of Mesorhizobium populations, and diversity clusters correspond to edaphic and environmental factors, primarily soil type and latitude. Despite long-standing evolutionary divergence and geographic isolation, the diverse taxa observed to nodulate chickpea share a set of integrative conjugative elements (ICEs) that encode the major functions of the symbiosis. This symbiosis ICE takes 2 forms in the bacterial chromosome-tripartite and monopartite-with tripartite ICEs confined to a broadly distributed superspecies clade. The pairwise evolutionary relatedness of these elements is controlled as much by geographic distance as by the evolutionary relatedness of the background genome. In contrast, diversity in the broader gene content of Mesorhizobium genomes follows a tight linear relationship with core genome phylogenetic distance, with little detectable effect of geography. These results illustrate how geography and demography can operate differentially on the evolution of bacterial genomes and offer useful insights for the development of improved technologies for sustainable agriculture.


April 21, 2020  |  

Characterization of Mauritian cynomolgus macaque Fc?R alleles using long-read sequencing.

The Fc?Rs are immune cell surface proteins that bind IgG and facilitate cytokine production, phagocytosis, and Ab-dependent, cell-mediated cytotoxicity. Fc?Rs play a critical role in immunity; variation in these genes is implicated in autoimmunity and other diseases. Cynomolgus macaques are an excellent animal model for many human diseases, and Mauritian cynomolgus macaques (MCMs) are particularly useful because of their restricted genetic diversity. Previous studies of MCM immune gene diversity have focused on the MHC and killer cell Ig-like receptor. In this study, we characterize Fc?R diversity in 48 MCMs using PacBio long-read sequencing to identify novel alleles of each of the four expressed MCM Fc?R genes. We also developed a high-throughput Fc?R genotyping assay, which we used to determine allele frequencies and identify Fc?R haplotypes in more than 500 additional MCMs. We found three alleles for Fc?R1A, seven each for Fc?R2A and Fc?R2B, and four for Fc?R3A; these segregate into eight haplotypes. We also assessed whether different Fc?R alleles confer different Ab-binding affinities by surface plasmon resonance and found minimal difference in binding affinities across alleles for a panel of wild type and Fc-engineered human IgG. This work suggests that although MCMs may not fully represent the diversity of Fc?R responses in humans, they may offer highly reproducible results for mAb therapy and toxicity studies. Copyright © 2018 by The American Association of Immunologists, Inc.


April 21, 2020  |  

Maleness-on-the-Y (MoY) orchestrates male sex determination in major agricultural fruit fly pests.

In insects, rapidly evolving primary sex-determining signals are transduced by a conserved regulatory module controlling sexual differentiation. In the agricultural pest Ceratitis capitata (Mediterranean fruit fly, or Medfly), we identified a Y-linked gene, Maleness-on-the-Y (MoY), encoding a small protein that is necessary and sufficient for male development. Silencing or disruption of MoY in XY embryos causes feminization, whereas overexpression of MoY in XX embryos induces masculinization. Crosses between transformed XY females and XX males give rise to males and females, indicating that a Y chromosome can be transmitted by XY females. MoY is Y-linked and functionally conserved in other species of the Tephritidae family, highlighting its potential to serve as a tool for developing more effective control strategies against these major agricultural insect pests.Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


April 21, 2020  |  

Rapid and Focused Maturation of a VRC01-Class HIV Broadly Neutralizing Antibody Lineage Involves Both Binding and Accommodation of the N276-Glycan.

The VH1-2 restricted VRC01-class of antibodies targeting the HIV envelope CD4 binding site are a major focus of HIV vaccine strategies. However, a detailed analysis of VRC01-class antibody development has been limited by the rare nature of these responses during natural infection and the lack of longitudinal sampling of such responses. To inform vaccine strategies, we mapped the development of a VRC01-class antibody lineage (PCIN63) in the subtype C infected IAVI Protocol C neutralizer PC063. PCIN63 monoclonal antibodies had the hallmark VRC01-class features and demonstrated neutralization breadth similar to the prototype VRC01 antibody, but were 2- to 3-fold less mutated. Maturation occurred rapidly within ~24 months of emergence of the lineage and somatic hypermutations accumulated at key contact residues. This longitudinal study of broadly neutralizing VRC01-class antibody lineage reveals early binding to the N276-glycan during affinity maturation, which may have implications for vaccine design.Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.


April 21, 2020  |  

TSD: A Computational Tool To Study the Complex Structural Variants Using PacBio Targeted Sequencing Data.

PacBio sequencing is a powerful approach to study DNA or RNA sequences in a longer scope. It is especially useful in exploring the complex structural variants generated by random integration or multiple rearrangement of endogenous or exogenous sequences. Here, we present a tool, TSD, for complex structural variant discovery using PacBio targeted sequencing data. It allows researchers to identify and visualize the genomic structures of targeted sequences by unlimited splitting, alignment and assembly of long PacBio reads. Application to the sequencing data derived from an HBV integrated human cell line(PLC/PRF/5) indicated that TSD could recover the full profile of HBV integration events, especially for the regions with the complex human-HBV genome integrations and multiple HBV rearrangements. Compared to other long read analysis tools, TSD showed a better performance for detecting complex genomic structural variants. TSD is publicly available at: https://github.com/menggf/tsd. Copyright © 2019 Meng et al.


April 21, 2020  |  

Social genes are selection hotspots in kin groups of a soil microbe.

The composition of cooperative systems, including animal societies, organismal bodies, and microbial groups, reflects their past and shapes their future evolution. However, genomic diversity within many multiunit systems remains uncharacterized, limiting our ability to understand and compare their evolutionary character. We have analyzed genomic and social-phenotype variation among 120 natural isolates of the cooperative bacterium Myxococcus xanthus derived from six multicellular fruiting bodies. Each fruiting body was composed of multiple lineages radiating from a unique recent ancestor. Genomic evolution was concentrated in selection hotspots associated with evolutionary change in social phenotypes. Synonymous mutations indicated that kin lineages within the same fruiting body often first diverged from a common ancestor more than 100 generations ago. Thus, selection appears to promote endemic diversification of kin lineages that remain together over long histories of local interaction, thereby potentiating social coevolution. Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


April 21, 2020  |  

A novel plasmid carrying carbapenem-resistant gene blaKPC-2 in Pseudomonas aeruginosa.

A carbapenem-resistant Pseudomonas aeruginosa strain PA1011 (ST463) was isolated from a patient in a surgical intensive care unit. PCR detection showed that PA1011 carried the blaKPC-2 gene. A plasmid was isolated and sequenced using the Illumina NextSeq 500 and PacBio RSII sequencing platforms. The plasmid was named pPA1011 and carried the carbapenem-resistant gene blaKPC-2. pPA1011 was a 62,793 bp in length with an average G+C content of 58.8%. It was identified as a novel plasmid and encoded a novel genetic environment of blaKPC-2 gene (?IS6-Tn3-ISKpn8-blaKPC-2-ISKpn6-IS26).


April 21, 2020  |  

Genome of the Komodo dragon reveals adaptations in the cardiovascular and chemosensory systems of monitor lizards.

Monitor lizards are unique among ectothermic reptiles in that they have high aerobic capacity and distinctive cardiovascular physiology resembling that of endothermic mammals. Here, we sequence the genome of the Komodo dragon Varanus komodoensis, the largest extant monitor lizard, and generate a high-resolution de novo chromosome-assigned genome assembly for V. komodoensis using a hybrid approach of long-range sequencing and single-molecule optical mapping. Comparing the genome of V. komodoensis with those of related species, we find evidence of positive selection in pathways related to energy metabolism, cardiovascular homoeostasis, and haemostasis. We also show species-specific expansions of a chemoreceptor gene family related to pheromone and kairomone sensing in V. komodoensis and other lizard lineages. Together, these evolutionary signatures of adaptation reveal the genetic underpinnings of the unique Komodo dragon sensory and cardiovascular systems, and suggest that selective pressure altered haemostasis genes to help Komodo dragons evade the anticoagulant effects of their own saliva. The Komodo dragon genome is an important resource for understanding the biology of monitor lizards and reptiles worldwide.


April 21, 2020  |  

PacBio sequencing reveals bacterial community diversity in cheeses collected from different regions.

Cheese is a fermented dairy product that is popular for its unique flavor and nutritional value. Recent studies have shown that microorganisms in cheese play an important role in the fermentation process and determine the quality of the cheese. We collected 12 cheese samples from different regions and studied the composition of their bacterial communities using PacBio small-molecule real-time sequencing (Pacific Biosciences, Menlo Park, CA). Our data revealed 144 bacterial genera (including Lactobacillus, Streptococcus, Lactococcus, and Staphylococcus) and 217 bacterial species (including Lactococcus lactis, Streptococcus thermophilus, Staphylococcus equorum, and Streptococcus uberis). We investigated the flavor quality of the cheese samples using an electronic nose system and we found differences in flavor-quality indices among samples from different regions. We found a clustering tendency based on flavor quality using principal component analysis. We found correlations between lactic acid bacteria and the flavor quality of the cheese samples. Biodegradation and metabolism of xenobiotics, and lipid-metabolism-related pathways, were predicted to contribute to differences in cheese flavor using Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt). This preliminary study explored the bacterial communities in cheeses collected from different regions and their potential genome functions from the perspective of flavor quality.Copyright © 2020 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.


April 21, 2020  |  

Long-read sequencing identifies GGC repeat expansions in NOTCH2NLC associated with neuronal intranuclear inclusion disease.

Neuronal intranuclear inclusion disease (NIID) is a progressive neurodegenerative disease that is characterized by eosinophilic hyaline intranuclear inclusions in neuronal and somatic cells. The wide range of clinical manifestations in NIID makes ante-mortem diagnosis difficult1-8, but skin biopsy enables its ante-mortem diagnosis9-12. The average onset age is 59.7 years among approximately 140 NIID cases consisting of mostly sporadic and several familial cases. By linkage mapping of a large NIID family with several affected members (Family 1), we identified a 58.1 Mb linked region at 1p22.1-q21.3 with a maximum logarithm of the odds score of 4.21. By long-read sequencing, we identified a GGC repeat expansion in the 5′ region of NOTCH2NLC (Notch 2 N-terminal like C) in all affected family members. Furthermore, we found similar expansions in 8 unrelated families with NIID and 40 sporadic NIID cases. We observed abnormal anti-sense transcripts in fibroblasts specifically from patients but not unaffected individuals. This work shows that repeat expansion in human-specific NOTCH2NLC, a gene that evolved by segmental duplication, causes a human disease.


April 21, 2020  |  

Blast Fungal Genomes Show Frequent Chromosomal Changes, Gene Gains and Losses, and Effector Gene Turnover.

Pyricularia is a fungal genus comprising several pathogenic species causing the blast disease in monocots. Pyricularia oryzae, the best-known species, infects rice, wheat, finger millet, and other crops. As past comparative and population genomics studies mainly focused on isolates of P. oryzae, the genomes of the other Pyricularia species have not been well explored. In this study, we obtained a chromosomal-level genome assembly of the finger millet isolate P. oryzae MZ5-1-6 and also highly contiguous assemblies of Pyricularia sp. LS, P. grisea, and P. pennisetigena. The differences in the genomic content of repetitive DNA sequences could largely explain the variation in genome size among these new genomes. Moreover, we found extensive gene gains and losses and structural changes among Pyricularia genomes, including a large interchromosomal translocation. We searched for homologs of known blast effectors across fungal taxa and found that most avirulence effectors are specific to Pyricularia, whereas many other effectors share homologs with distant fungal taxa. In particular, we discovered a novel effector family with metalloprotease activity, distinct from the well-known AVR-Pita family. We predicted 751 gene families containing putative effectors in 7 Pyricularia genomes and found that 60 of them showed differential expression in the P. oryzae MZ5-1-6 transcriptomes obtained under experimental conditions mimicking the pathogen infection process. In summary, this study increased our understanding of the structural, functional, and evolutionary genomics of the blast pathogen and identified new potential effector genes, providing useful data for developing crops with durable resistance. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


April 21, 2020  |  

Copy-number variants in clinical genome sequencing: deployment and interpretation for rare and undiagnosed disease.

Current diagnostic testing for genetic disorders involves serial use of specialized assays spanning multiple technologies. In principle, genome sequencing (GS) can detect all genomic pathogenic variant types on a single platform. Here we evaluate copy-number variant (CNV) calling as part of a clinically accredited GS test.We performed analytical validation of CNV calling on 17 reference samples, compared the sensitivity of GS-based variants with those from a clinical microarray, and set a bound on precision using orthogonal technologies. We developed a protocol for family-based analysis of GS-based CNV calls, and deployed this across a clinical cohort of 79 rare and undiagnosed cases.We found that CNV calls from GS are at least as sensitive as those from microarrays, while only creating a modest increase in the number of variants interpreted (~10 CNVs per case). We identified clinically significant CNVs in 15% of the first 79 cases analyzed, all of which were confirmed by an orthogonal approach. The pipeline also enabled discovery of a uniparental disomy (UPD) and a 50% mosaic trisomy 14. Directed analysis of select CNVs enabled breakpoint level resolution of genomic rearrangements and phasing of de novo CNVs.Robust identification of CNVs by GS is possible within a clinical testing environment.


April 21, 2020  |  

Mutation of a bHLH transcription factor allowed almond domestication.

Wild almond species accumulate the bitter and toxic cyanogenic diglucoside amygdalin. Almond domestication was enabled by the selection of genotypes harboring sweet kernels. We report the completion of the almond reference genome. Map-based cloning using an F1 population segregating for kernel taste led to the identification of a 46-kilobase gene cluster encoding five basic helix-loop-helix transcription factors, bHLH1 to bHLH5. Functional characterization demonstrated that bHLH2 controls transcription of the P450 monooxygenase-encoding genes PdCYP79D16 and PdCYP71AN24, which are involved in the amygdalin biosynthetic pathway. A nonsynonymous point mutation (Leu to Phe) in the dimerization domain of bHLH2 prevents transcription of the two cytochrome P450 genes, resulting in the sweet kernel trait. Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


April 21, 2020  |  

Chromosome-level genome assembly of Triplophysa tibetana, a fish adapted to the harsh high-altitude environment of the Tibetan Plateau.

Triplophysa is an endemic fish genus of the Tibetan Plateau in China. Triplophysa tibetana, which lives at a recorded altitude of ~4,000 m and plays an important role in the highland aquatic ecosystem, serves as an excellent model for investigating high-altitude environmental adaptation. However, evolutionary and conservation studies of T. tibetana have been limited by scarce genomic resources for the genus Triplophysa. In the present study, we applied PacBio sequencing and the Hi-C technique to assemble the T. tibetana genome. A 652-Mb genome with 1,325 contigs with an N50 length of 3.1 Mb was obtained. The 1,137 contigs were further assembled into 25 chromosomes, representing 98.7% and 80.47% of all contigs at the base and sequence number level, respectively. Approximately 260 Mb of sequence, accounting for ~39.8% of the genome, was identified as repetitive elements. DNA transposons (16.3%), long interspersed nuclear elements (12.4%) and long terminal repeats (11.0%) were the most repetitive types. In total, 24,372 protein-coding genes were predicted in the genome, and ~95% of the genes were functionally annotated via a search in public databases. Using whole genome sequence information, we found that T. tibetana diverged from its common ancestor with Danio rerio ~121.4 million years ago. The high-quality genome assembled in this work not only provides a valuable genomic resource for future population and conservation studies of T. tibetana, but it also lays a solid foundation for further investigation into the mechanisms of environmental adaptation of endemic fishes in the Tibetan Plateau. © 2019 John Wiley & Sons Ltd.


April 21, 2020  |  

Adaptation and Phenotypic Diversification in Arabidopsis through Loss-of-Function Mutations in Protein-Coding Genes.

According to the less-is-more hypothesis, gene loss is an engine for evolutionary change. Loss-of-function (LoF) mutations resulting in the natural knockout of protein-coding genes not only provide information about gene function but also play important roles in adaptation and phenotypic diversification. Although the less-is-more hypothesis was proposed two decades ago, it remains to be explored on a large scale. In this study, we identified 60,819 LoF variants in 1071 Arabidopsis (Arabidopsis thaliana) genomes and found that 34% of Arabidopsis protein-coding genes annotated in the Columbia-0 genome do not have any LoF variants. We found that nucleotide diversity, transposable element density, and gene family size are strongly correlated with the presence of LoF variants. Intriguingly, 0.9% of LoF variants with minor allele frequency larger than 0.5% are associated with climate change. In addition, in the Yangtze River basin population, 1% of genes with LoF mutations were under positive selection, providing important insights into the contribution of LoF mutations to adaptation. In particular, our results demonstrate that LoF mutations shape diverse phenotypic traits. Overall, our results highlight the importance of the LoF variants for the adaptation and phenotypic diversification of plants. © 2019 American Society of Plant Biologists. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.