April 21, 2020

Genome assembly and annotation of the Trichoplusia ni Tni-FNL insect cell line enabled by long-read technologies.

Trichoplusia ni-derived cell lines are commonly used to enable recombinant protein expression via baculovirus infection to generate materials approved for clinical use and in clinical trials. In order to develop systems biology and genome engineering tools to improve protein expression in this host, we performed de novo genome assembly of the Trichoplusia ni-derived cell line Tni-FNL. By integration of PacBio single-molecule sequencing, Bionano optical mapping, and 10X Genomics linked-reads data, we have produced a draft genome assembly of Tni-FNL. Our assembly contains 280 scaffolds, with an N50 scaffold size of 2.3 Mb and a total length of 359 Mb. Annotation of the Tni-FNL genome resulted in 14,101 predicted genes, and 93.2% of the predicted proteome contained recognizable protein domains. Ortholog searches within the superorder Holometabola provided further evidence of high accuracy and completeness of the Tni-FNL genome assembly. This first draft Tni-FNL genome assembly was enabled by complementary long-read technologies and represents a high-quality, well-annotated genome that provides novel insight into the complexity of this insect cell line and can serve as a reference for future large-scale genome engineering work in this and other similar recombinant protein production hosts.
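The scaffold N50 quoted above is a standard contiguity statistic: the length L such that scaffolds of length L or longer together contain at least half of the assembled bases. The sketch below illustrates that definition only; the function name `n50` and the toy lengths are assumptions for illustration and are not taken from the Tni-FNL study.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence downwards, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy example (hypothetical lengths, not real assembly data).
example_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
example_n50 = n50(example_lengths)
```

With real data, the lengths would typically be read from the assembly FASTA or its index file before being passed to the function.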


April 21, 2020

Into the Thermus Mobilome: Presence, Diversity and Recent Activities of Insertion Sequences Across Thermus spp.

A high level of transposon-mediated genome rearrangement is a common trait among microorganisms isolated from thermal environments, probably contributing to the extraordinary genomic plasticity and horizontal gene transfer (HGT) observed in these habitats. In this work, active and inactive insertion sequences (ISs) spanning the sequenced members of the genus Thermus were characterized, with special emphasis on three T. thermophilus strains: HB27, HB8, and NAR1. A large number of full ISs and fragments derived from different IS families were found, concentrating within megaplasmids present in most isolates. Potentially active ISs were identified through analysis of transposase integrity, and domestication-related transposition events of ISTth7 were identified in laboratory-adapted HB27 derivatives. Many partial copies of ISs appeared throughout the genome, which may serve as specific targets for homologous recombination contributing to genome rearrangement. Moreover, recruitment of IS1000 32 bp segments as spacers for CRISPR sequence was identified, pointing to the adaptability of these elements in the biology of these thermophiles. Further knowledge about the activity and functional diversity of ISs in this genus may contribute to the generation of engineered transposons as new genetic tools, and enrich our understanding of the outstanding plasticity shown by these thermophiles.


April 21, 2020

Adaptive archaic introgression of copy number variants and the discovery of previously unknown human genes

As they migrated out of Africa and into Europe and Asia, anatomically modern humans interbred with archaic hominins, such as Neanderthals and Denisovans. The result of this genetic introgression on the recipient populations has been of considerable interest, especially in cases of selection for specific archaic genetic variants. Hsieh et al. characterized adaptive structural variants and copy number variants that are likely targets of positive selection in Melanesians. Focusing on population-specific regions of the genome that carry duplicated genes and show an excess of amino acid replacements provides evidence for one of the mechanisms by which genetic novelty can arise and result in differentiation between human genomes. Science, this issue p. eaax2083. INTRODUCTION Characterizing genetic variants underlying local adaptations in human populations is one of the central goals of evolutionary research. Most studies have focused on adaptive single-nucleotide variants that either arose as new beneficial mutations or were introduced after interbreeding with our now-extinct relatives, including Neanderthals and Denisovans. The adaptive role of copy number variants (CNVs), another well-known form of genomic variation generated through deletions or duplications that affect more base pairs in the genome, is less well understood, despite evidence that such mutations are subject to stronger selective pressures. RATIONALE This study focuses on the discovery of introgressed and adaptive CNVs that have become enriched in specific human populations. We combine whole-genome CNV calling and population genetic inference methods to discover CNVs and then assess signals of selection after controlling for demographic history. We examine 266 publicly available modern human genomes from the Simons Genome Diversity Project and genomes of three ancient hominins: a Denisovan, a Neanderthal from the Altai Mountains in Siberia, and a Neanderthal from Croatia. 
We apply long-read sequencing methods to sequence-resolve complex CNVs of interest specifically in the Melanesians, an Oceanian population distributed from Papua New Guinea to as far east as the islands of Fiji and known to harbor some of the greatest amounts of Neanderthal and Denisovan ancestry. RESULTS Consistent with the hypothesis of archaic introgression outside Africa, we find a significant excess of CNV sharing between modern non-African populations and archaic hominins (P = 0.039). Among Melanesians, we observe an enrichment of CNVs with potential signals of positive selection (n = 37 CNVs), of which 19 CNVs likely introgressed from archaic hominins. We show that Melanesian-stratified CNVs are significantly associated with signals of positive selection (P = 0.0323). Many map near or within genes associated with metabolism (e.g., ACOT1 and ACOT2), development and cell cycle or signaling (e.g., TNFRSF10D and CDK11A and CDK11B), or immune response (e.g., IFNLR1). We characterize two of the largest and most complex CNVs on chromosomes 16p11.2 and 8p21.3 that introgressed from Denisovans and Neanderthals, respectively, and are absent from most other human populations. At chromosome 16p11.2, we sequence-resolve a large duplication of >383 thousand base pairs (kbp) that originated from Denisovans and introgressed into the ancestral Melanesian population 60,000 to 170,000 years ago. This large duplication occurs at high frequency (>79%) in diverse Melanesian groups, shows signatures of positive selection, and maps adjacent to Homo sapiens-specific duplications that predispose to rearrangements associated with autism. On chromosome 8p21.3, we identify a Melanesian haplotype that carries two CNVs, a ~6-kbp deletion and a ~38-kbp duplication, with a Neanderthal origin and that introgressed into non-Africans 40,000 to 120,000 years ago. 
This CNV haplotype occurs at high frequency (44%) and shows signals consistent with a partial selective sweep in Melanesians. Using long-read sequencing genomic and transcriptomic data, we reconstruct the structure and complex evolutionary history for these two CNVs and discover previously undescribed duplicated genes (TNFRSF10D1, TNFRSF10D2, and NPIPB16) that show an excess of amino acid replacements consistent with the action of positive selection. CONCLUSION Our results suggest that large CNVs originating in archaic hominins and introgressed into modern humans have played an important role in local population adaptation and represent an insufficiently studied source of large-scale genetic variation that is absent from current reference genomes. Large adaptive-introgressed CNVs at chromosomes 8p21.3 and 16p11.2 in Melanesians. The magnifying glasses highlight structural differences between the archaic (top) and reference (bottom) genomes. Neanderthal (red) and Denisovan (blue) haplotypes encompassing large CNVs occur at high frequencies in Melanesians (44 and 79%, respectively) but are absent (black) in all non-Melanesians. These CNVs create positively selected genes (TNFRSF10D1, TNFRSF10D2, and NPIPB16) that are absent from the reference genome. Copy number variants (CNVs) are subject to stronger selective pressure than single-nucleotide variants, but their roles in archaic introgression and adaptation have not been systematically investigated. We show that stratified CNVs are significantly associated with signatures of positive selection in Melanesians and provide evidence for adaptive introgression of large CNVs at chromosomes 16p11.2 and 8p21.3 from Denisovans and Neanderthals, respectively. Using long-read sequence data, we reconstruct the structure and complex evolutionary history of these polymorphisms and show that both encode positively selected genes absent from most human populations. 
Our results collectively suggest that large CNVs originating in archaic hominins and introgressed into modern humans have played an important role in local population adaptation and represent an insufficiently studied source of large-scale genetic variation.


April 21, 2020

The comparative genomics and complex population history of Papio baboons.

Recent studies suggest that closely related species can accumulate substantial genetic and phenotypic differences despite ongoing gene flow, thus challenging traditional ideas regarding the genetics of speciation. Baboons (genus Papio) are Old World monkeys consisting of six readily distinguishable species. Baboon species hybridize in the wild, and prior data imply a complex history of differentiation and introgression. We produced a reference genome assembly for the olive baboon (Papio anubis) and whole-genome sequence data for all six extant species. We document multiple episodes of admixture and introgression during the radiation of Papio baboons, thus demonstrating their value as a model of complex evolutionary divergence, hybridization, and reticulation. These results help inform our understanding of similar cases, including modern humans, Neanderthals, Denisovans, and other ancient hominins.


April 21, 2020

Aquella oligotrophica gen. nov. sp. nov.: A new member of the family Neisseriaceae isolated from laboratory tap water.

A bacterial strain designated as P08T was isolated from laboratory tap water during a water quality assessment at the University of Malaya, Malaysia. The strain was a Gram-negative, rod-shaped, nonmotile, and aerobic bacterium. The complete genome of P08T comprised a 2,820,660 bp chromosome with a G + C content of 36.43%. Both the 16S rRNA phylogeny and the phylogenetic tree inferred from the core gene matrix demonstrated that P08T formed a hitherto unknown subline within the family Neisseriaceae. Ortho average nucleotide identity (OrthoANI) values and the percentage of conserved proteins (POCP) calculated from the complete genome sequence indicated low relatedness between P08T and its phylogenetic neighbors. Respiratory quinone analysis revealed Q-8 as the only detectable quinone. The predominant cellular fatty acids were identified as C14:0, iso-C15:0, and summed feature 3 (C16:1 ω7c/C16:1 ω6c). The polar lipids consisted of an uncharacterized aminolipid, phosphatidylglycerol, and phosphatidylethanolamine. All aspects of the phenotypic and phylogenetic data suggested that strain P08T represents a novel genus within the family Neisseriaceae, for which the name Aquella gen. nov. is proposed. The type species of the genus is Aquella oligotrophica sp. nov., and the type strain is P08T (=LMG 29629T =DSM 100970T). © 2019 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
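The G + C content reported above is the percentage of guanine and cytosine bases among the unambiguous bases of a sequence. A minimal sketch of that arithmetic follows; the function name `gc_content` and the short test string are hypothetical, and the choice to ignore ambiguous bases such as N is an assumption rather than something stated in the abstract.

```python
def gc_content(sequence):
    """Return the G + C percentage of a nucleotide sequence.

    Only unambiguous bases (A, C, G, T) are counted in the denominator;
    ambiguity codes such as N are ignored (an assumption of this sketch).
    """
    seq = sequence.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / counted


# Hypothetical toy sequence: 2 of 4 counted bases are G or C.
toy_gc = gc_content("ATGC")
```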


April 21, 2020

Whole genome sequencing of a novel, dichloromethane-fermenting Peptococcaceae from an enrichment culture

Bacteria capable of dechlorinating the toxic environmental contaminant dichloromethane (DCM, CH2Cl2) are of great interest for potential bioremediation applications. A novel, strictly anaerobic, DCM-fermenting bacterium, “DCMF”, was enriched from organochlorine-contaminated groundwater near Botany Bay, Australia. The enrichment culture was maintained in minimal, mineral salt medium amended with dichloromethane as the sole energy source. PacBio whole genome SMRT™ sequencing of DCMF allowed de novo, gap-free assembly despite the presence of cohabiting organisms in the culture. Illumina sequencing reads were utilised to correct minor indels. The single, circularised 6.44 Mb chromosome was annotated with the IMG pipeline and contains 5,773 predicted protein-coding genes. Based on 16S rRNA gene and predicted proteome phylogeny, the organism appears to be a novel member of the Peptococcaceae family. The DCMF genome is large in comparison to known DCM-fermenting bacteria and includes 96 predicted methylamine methyltransferases, which may provide clues to the basis of its DCM metabolism. Full annotation has been provided in a custom genome browser and search tool, in addition to multiple sequence alignments and phylogenetic trees for every predicted protein, available at http://www.slimsuite.unsw.edu.au/research/dcmf/.


April 21, 2020

Long-read amplicon denoising.

Long-read next-generation amplicon sequencing shows promise for studying complete genes or genomes from complex and diverse populations. Current long-read sequencing technologies have challenging error profiles, hindering data processing and incorporation into downstream analyses. Here we consider the problem of how to reconstruct, free of sequencing error, the true sequence variants and their associated frequencies from PacBio reads. Called ‘amplicon denoising’, this problem has been extensively studied for short-read sequencing technologies, but current solutions do not always successfully generalize to long reads with high indel error rates. We introduce two methods: one that runs nearly instantly and is very accurate for medium-length reads and high template coverage, and another, slower method that is more robust when reads are very long or coverage is lower. On two Mock Virus Community datasets with ground truth, each sequenced on a different PacBio instrument, and on a number of simulated datasets, we compare our two approaches to each other and to existing algorithms. We outperform all tested methods in accuracy, with competitive run times even for our slower method, successfully discriminating templates that differ by just a single nucleotide. Julia implementations of Fast Amplicon Denoising (FAD) and Robust Amplicon Denoising (RAD), and a webserver interface, are freely available. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020

Effector gene reshuffling involves dispensable mini-chromosomes in the wheat blast fungus.

Newly emerged wheat blast disease is a serious threat to global wheat production. Wheat blast is caused by a distinct, exceptionally diverse lineage of the fungus causing rice blast disease. Through sequencing a recent field isolate, we report a reference genome that includes seven core chromosomes and mini-chromosome sequences that harbor effector genes normally found on ends of core chromosomes in other strains. No mini-chromosomes were observed in an early field strain, and at least two from another isolate each contain different effector genes and core chromosome end sequences. The mini-chromosome is enriched in transposons occurring most frequently at core chromosome ends. Additionally, transposons in mini-chromosomes lack the characteristic signature for inactivation by repeat-induced point (RIP) mutation genome defenses. Our results, collectively, indicate that dispensable mini-chromosomes and core chromosomes undergo divergent evolutionary trajectories, and mini-chromosomes and core chromosome ends are coupled as a mobile, fast-evolving effector compartment in the wheat pathogen genome.


April 21, 2020

Chromulinavorax destructans, a pathogen of microzooplankton that provides a window into the enigmatic candidate phylum Dependentiae.

Members of the major candidate phylum Dependentiae (a.k.a. TM6) are widespread across diverse environments from showerheads to peat bogs; yet, with the exception of two isolates infecting amoebae, they are only known from metagenomic data. The limited knowledge of their biology indicates that they have a long evolutionary history of parasitism. Here, we present Chromulinavorax destructans (Strain SeV1), the first isolate of this phylum to infect a representative from a widespread and ecologically significant group of heterotrophic flagellates, the microzooplankter Spumella elongata (Strain CCAP 955/1). Chromulinavorax destructans has a reduced 1.2 Mb genome that is so specialized for infection that it shows no evidence of complete metabolic pathways, but encodes an extensive transporter system for importing nutrients and energy in the form of ATP from the host. Its replication causes extensive reorganization and expansion of the mitochondrion, effectively surrounding the pathogen, consistent with its dependency on the host for energy. Nearly half (44%) of the inferred proteins contain signal sequences for secretion, including many without recognizable similarity to proteins of known function, as well as 98 copies of proteins with an ankyrin-repeat domain; ankyrin-repeats are known effectors of host modulation, suggesting the presence of an extensive host-manipulation apparatus. These observations help to cement members of this phylum as widespread and diverse parasites infecting a broad range of eukaryotic microbes.


April 21, 2020

Potentially mobile denitrification genes identified in Azospirillum sp. strain TSH58.

Denitrification ability is sporadically distributed among diverse bacteria, archaea, and fungi. In addition, disagreement has been found between denitrification gene phylogenies and the 16S rRNA gene phylogeny. These facts have suggested potential occurrences of horizontal gene transfer (HGT) for the denitrification genes. However, evidence of HGT has not been clearly presented thus far. In this study, we identified the sequences and the localization of the nitrite reductase genes in the genomes of 41 denitrifying Azospirillum sp. strains and searched for mobile genetic elements that contain denitrification genes. All Azospirillum sp. strains examined in this study possessed multiple replicons (4 to 11 replicons), with their sizes ranging from 7 to 1,031 kbp. Among those, the nitrite reductase gene nirK was located on large replicons (549 to 941 kbp). Genome sequencing showed that Azospirillum strains that had similar nirK sequences also shared similar nir-nor gene arrangements, especially between the TSH58, Sp7T, and Sp245 strains. In addition to the high similarity between nir-nor gene clusters among the three Azospirillum strains, a composite transposon structure was identified in the genome of strain TSH58, which contains the nir-nor gene cluster and the novel IS6 family insertion sequences (ISAz581 and ISAz582). The nirK gene within the composite transposon system was actively transcribed under denitrification-inducing conditions. Although not experimentally verified in this study, the composite transposon system containing the nir-nor gene cluster could be transferred to other cells if it is moved to a prophage region and the phage becomes activated and released outside the cells. Taken together, strain TSH58 most likely acquired its denitrification ability by HGT from closely related Azospirillum sp. denitrifiers. IMPORTANCE The evolutionary history of denitrification is complex. 
While the occurrence of horizontal gene transfer has been suggested for denitrification genes, most studies report circumstantial evidence, such as disagreement between denitrification gene phylogenies and the 16S rRNA gene phylogeny. Based on the comparative genome analyses of Azospirillum sp. denitrifiers, we identified denitrification genes, including nirK and norCBQD, located on a mobile genetic element in the genome of Azospirillum sp. strain TSH58. The nirK gene was actively transcribed under denitrification-inducing conditions. Since this gene was the sole nitrite reductase gene in strain TSH58, this strain most likely benefitted from acquiring denitrification genes via horizontal gene transfer. This finding will significantly advance our scientific knowledge regarding the ecology and evolution of denitrification. Copyright © 2019 American Society for Microbiology.


April 21, 2020

A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set.

In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short-read assemblies of the plant model organism Arabidopsis thaliana have been published in recent years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short read-based NGS assembly (AthNd-1_v1), a 75-fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independently of the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions contributing to differences in the gene sets that distinguish the genotypes. One of the differences detected identified a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome.


April 21, 2020

A whole genome scan of SNP data suggests a lack of abundant hard selective sweeps in the genome of the broad host range plant pathogenic fungus Sclerotinia sclerotiorum.

The pathogenic fungus Sclerotinia sclerotiorum infects over 600 species of plant. It is present in numerous environments throughout the world and causes significant damage to many agricultural crops. Fragmentation and lack of gene flow between populations may lead to population sub-structure. Within discrete recombining populations, positive selection may lead to a ‘selective sweep’. This is characterised by an increase in frequency of a favourable allele leading to reduction in genotypic diversity in a localised genomic region due to the phenomenon of genetic hitchhiking. We aimed to assess whether isolates of S. sclerotiorum from around the world formed genotypic clusters associated with geographical origin and to determine whether signatures of population-specific positive selection could be detected. To do this, we sequenced the genomes of 25 isolates of S. sclerotiorum collected from four different continents: Australia, Africa (north and south), Europe, and North America (Canada and the northern United States), and conducted SNP-based analyses of population structure and selective sweeps. Among the 25 isolates, there was evidence for two major population clusters. One of these consisted of 11 isolates from Canada, the USA and France (population 1), and the other consisted of nine isolates from Australia and one from Morocco (population 2). The rest of the isolates were genotypic outliers. We found that there was evidence of outcrossing in these two populations based on linkage disequilibrium decay. However, only a single candidate selective sweep was observed, and it was present in population 2. This sweep was close to a Major Facilitator Superfamily transporter gene, and we speculate that this gene may have a role in nutrient uptake from the host. The low abundance of selective sweeps in the S. sclerotiorum genome contrasts with the numerous examples in the genomes of other fungal pathogens. 
This may be a result of its slow rate of evolution and low effective recombination rate due to self-fertilisation and vegetative reproduction.


April 21, 2020

Comparative genomic analysis of eight novel haloalkaliphilic bacteriophages from Lake Elmenteita, Kenya.

We report complete genome sequences of eight bacteriophages isolated from the haloalkaline Lake Elmenteita, found on the floor of the Kenyan Rift Valley. The bacteriophages were sequenced and annotated, and a comparative genomic analysis using various bioinformatics tools was carried out to determine the relatedness of the bacteriophages to each other and to those in public databases. Basic genome properties such as genome size, percentage coding density, number of open reading frames, percentage GC content, and gene organization revealed that the bacteriophages had no relationship to each other. Comparison to other nucleotide sequences in the GenBank database showed no significant similarities, indicating that the phages are novel. At the amino acid level, the phages of our study revealed mosaicism, with genes containing conserved domains shared with already described phages. Phylogenetic analyses of the large terminase gene, responsible for DNA packaging, and the DNA polymerase gene, responsible for replication, further showed diversity among the bacteriophages. Our results give insight into the diversity of bacteriophages in Lake Elmenteita and provide information on their evolution. By providing primary sequence information, this study not only provides novel sequences for biotechnological exploitation, but also sets the stage for future studies aimed at a better understanding of virus diversity and genomes from haloalkaline lakes in the Rift Valley.


April 21, 2020

Genome Sequence of Rhizobium jaguaris CCGE525T, a Strain Isolated from Calliandra grandiflora Nodules from a Rain Forest in Mexico.

We present the genome sequence of Rhizobium jaguaris CCGE525T, a nitrogen-fixing bacterium isolated from nodules of Calliandra grandiflora. CCGE525T belongs to Rhizobium tropici group A, represents the symbiovar calliandrae, and forms nitrogen-fixing nodules in Phaseolus vulgaris. Genome-based metrics and phylogenomic approaches support Rhizobium jaguaris as a novel species.


April 21, 2020

A Controlled Human Infection Model of Group A Streptococcus Pharyngitis: Which Strain and Why?

Group A Streptococcus (GAS) is a major cause of global infection-related morbidity and mortality. A modern controlled human infection model (CHIM) of GAS pharyngitis can accelerate vaccine development and pathogenesis research. A robust rationale for strain selection is central to meeting ethical, scientific, and regulatory requirements. Multifaceted characterization studies were done to compare a preferred candidate emm75 (M75) GAS strain to three other strains: an alternative candidate emm12 (M12) strain, an M1 strain used in 1970s pharyngitis CHIM studies (SS-496), and a representative (5448) of the globally disseminated M1T1 clone. A range of approaches were used to explore strain growth, adherence, invasion, delivery characteristics, short- and long-term viability, phylogeny, virulence factors, vaccine antigens, resistance to killing by human neutrophils, and lethality in a murine invasive model. The strains grew reliably in a medium without animal-derived components, were consistently transferred using a swab method simulating the CHIM protocol, remained viable at -80°C, and carried genes for most candidate vaccine antigens. Considering GAS molecular epidemiology, virulence factors, in vitro assays, and results from the murine model, the contemporary strains show a spectrum of virulence, with M75 appearing the least virulent and 5448 the most. The virulence profile of SS-496, used safely in 1970s CHIM studies, was similar to that of 5448 in the animal model and virulence gene carriage. The results of this multifaceted characterization confirm the M75 strain as an appropriate choice for initial deployment in the CHIM, with the aim of safely and successfully causing pharyngitis in healthy adult volunteers. IMPORTANCE GAS (Streptococcus pyogenes) is a leading global cause of infection-related morbidity and mortality. A modern CHIM of GAS pharyngitis could help to accelerate vaccine development and drive pathogenesis research. 
Challenge strain selection is critical to the safety and success of any CHIM and especially so for an organism such as GAS, with its wide strain diversity and potential to cause severe life-threatening acute infections (e.g., toxic shock syndrome and necrotizing fasciitis) and postinfectious complications (e.g., acute rheumatic fever, rheumatic heart disease, and acute poststreptococcal glomerulonephritis). In this paper, we outline the rationale for selecting an emm75 strain for initial use in a GAS pharyngitis CHIM in healthy adult volunteers, drawing on the findings of a broad characterization effort spanning molecular epidemiology, in vitro assays, whole-genome sequencing, and animal model studies. Copyright © 2019 Osowicki et al.

