Menu
April 21, 2020

Featherweight long read alignment using partitioned reference indexes.

The advent of Nanopore sequencing has realised portable genomic research and applications. However, state of the art long read aligners and large reference genomes are not compatible with most mobile computing devices due to their high memory requirements. We show how memory requirements can be reduced through parameter optimisation and reference genome partitioning, but highlight the associated limitations and caveats of these approaches. We then demonstrate how these issues can be overcome through an appropriate merging technique. We incorporated multi-index merging into the Minimap2 aligner and demonstrate that long read alignment to the human genome can be performed on a system with 2?GB RAM with negligible impact on accuracy.


April 21, 2020

An African Salmonella Typhimurium ST313 sublineage with extensive drug-resistance and signatures of host adaptation.

Bloodstream infections by Salmonella enterica serovar Typhimurium constitute a major health burden in sub-Saharan Africa (SSA). These invasive non-typhoidal (iNTS) infections are dominated by isolates of the antibiotic resistance-associated sequence type (ST) 313. Here, we report emergence of ST313 sublineage II.1 in the Democratic Republic of the Congo. Sublineage II.1 exhibits extensive drug resistance, involving a combination of multidrug resistance, extended spectrum ß-lactamase production and azithromycin resistance. ST313 lineage II.1 isolates harbour an IncHI2 plasmid we name pSTm-ST313-II.1, with one isolate also exhibiting decreased ciprofloxacin susceptibility. Whole genome sequencing reveals that ST313 II.1 isolates have accumulated genetic signatures potentially associated with altered pathogenicity and host adaptation, related to changes observed in biofilm formation and metabolic capacity. Sublineage II.1 emerged at the beginning of the 21st century and is involved in on-going outbreaks. Our data provide evidence of further evolution within the ST313 clade associated with iNTS in SSA.


April 21, 2020

A chromosome-level genome assembly of Cydia pomonella provides insights into chemical ecology and insecticide resistance.

The codling moth Cydia pomonella, a major invasive pest of pome fruit, has spread around the globe in the last half century. We generated a chromosome-level scaffold assembly including the Z chromosome and a portion of the W chromosome. This assembly reveals the duplication of an olfactory receptor gene (OR3), which we demonstrate enhances the ability of C. pomonella to exploit kairomones and pheromones in locating both host plants and mates. Genome-wide association studies contrasting insecticide-resistant and susceptible strains identify hundreds of single nucleotide polymorphisms (SNPs) potentially associated with insecticide resistance, including three SNPs found in the promoter of CYP6B2. RNAi knockdown of CYP6B2 increases C. pomonella sensitivity to two insecticides, deltamethrin and azinphos methyl. The high-quality genome assembly of C. pomonella informs the genetic basis of its invasiveness, suggesting the codling moth has distinctive capabilities and adaptive potential that may explain its worldwide expansion.


April 21, 2020

Long-read assembly of the Chinese rhesus macaque genome and identification of ape-specific structural variants.

We present a high-quality de novo genome assembly (rheMacS) of the Chinese rhesus macaque (Macaca mulatta) using long-read sequencing and multiplatform scaffolding approaches. Compared to the current Indian rhesus macaque reference genome (rheMac8), rheMacS increases sequence contiguity 75-fold, closing 21,940 of the remaining assembly gaps (60.8 Mbp). We improve gene annotation by generating more than two million full-length transcripts from ten different tissues by long-read RNA sequencing. We sequence resolve 53,916 structural variants (96% novel) and identify 17,000 ape-specific structural variants (ASSVs) based on comparison to ape genomes. Many ASSVs map within ChIP-seq predicted enhancer regions where apes and macaque show diverged enhancer activity and gene expression. We further characterize a subset that may contribute to ape- or great-ape-specific phenotypic traits, including taillessness, brain volume expansion, improved manual dexterity, and large body size. The rheMacS genome assembly serves as an ideal reference for future biomedical and evolutionary studies.


April 21, 2020

Metatranscriptomic evidence for classical and RuBisCO-mediated CO2 reduction to methane facilitated by direct interspecies electron transfer in a methanogenic system.

In a staged anaerobic fluidized-bed ceramic membrane bioreactor, metagenomic and metatranscriptomic analyses were performed to decipher the microbial interactions on the granular activated carbon. Metagenome bins, representing the predominating microbes in the bioreactor: syntrophic propionate-oxidizing bacteria (SPOB), acetoclastic Methanothrix concilii, and exoelectrogenic Geobacter lovleyi, were successfully recovered for the reconstruction and analysis of metabolic pathways involved in the transformation of fatty acids to methane. In particular, SPOB degraded propionate into acetate, which was further converted into methane and CO2 by M. concilii via the acetoclastic methanogenesis. Concurrently, G. lovleyi oxidized acetate into CO2, releasing electrons into the extracellular environment. By accepting these electrons through direct interspecies electron transfer (DIET), M. concilii was capable of performing CO2 reduction for further methane formation. Most notably, an alternative RuBisCO-mediated CO2 reduction (the reductive hexulose-phosphate (RHP) pathway) is transcriptionally-active in M. concilii. This RHP pathway enables M. concilii dominance and energy gain by carbon fixation and methanogenesis, respectively via a methyl-H4MPT intermediate, constituting the third methanogenesis route. The complete acetate reduction (2 mole methane formation/1 mole acetate consumption), coupling of acetoclastic methanogenesis and two CO2 reduction pathways, are thermodynamically favorable even under very low substrate condition (down to to 10-5?M level). Such tight interactions via both mediated and direct interspecies electron transfer (MIET and DIET), induced by the conductive GAC promote the overall efficiency of bioenergy processes.


April 21, 2020

Genome-wide mutational biases fuel transcriptional diversity in the Mycobacterium tuberculosis complex.

The Mycobacterium tuberculosis complex (MTBC) members display different host-specificities and virulence phenotypes. Here, we have performed a comprehensive RNAseq and methylome analysis of the main clades of the MTBC and discovered unique transcriptional profiles. The majority of genes differentially expressed between the clades encode proteins involved in host interaction and metabolic functions. A significant fraction of changes in gene expression can be explained by positive selection on single mutations that either create or disrupt transcriptional start sites (TSS). Furthermore, we show that clinical strains have different methyltransferases inactivated and thus different methylation patterns. Under the tested conditions, differential methylation has a minor direct role on transcriptomic differences between strains. However, disruption of a methyltransferase in one clinical strain revealed important expression differences suggesting indirect mechanisms of expression regulation. Our study demonstrates that variation in transcriptional profiles are mainly due to TSS mutations and have likely evolved due to differences in host characteristics.


April 21, 2020

Meiotic sex in Chagas disease parasite Trypanosoma cruzi.

Genetic exchange enables parasites to rapidly transform disease phenotypes and exploit new host populations. Trypanosoma cruzi, the parasitic agent of Chagas disease and a public health concern throughout Latin America, has for decades been presumed to exchange genetic material rarely and without classic meiotic sex. We present compelling evidence from 45 genomes sequenced from southern Ecuador that T. cruzi in fact maintains truly sexual, panmictic groups that can occur alongside others that remain highly clonal after past hybridization events. These groups with divergent reproductive strategies appear genetically isolated despite possible co-occurrence in vectors and hosts. We propose biological explanations for the fine-scale disconnectivity we observe and discuss the epidemiological consequences of flexible reproductive modes. Our study reinvigorates the hunt for the site of genetic exchange in the T. cruzi life cycle, provides tools to define the genetic determinants of parasite virulence, and reforms longstanding theory on clonality in trypanosomatid parasites.


April 21, 2020

Uncovering the biosynthetic potential of rare metagenomic DNA using co-occurrence network analysis of targeted sequences.

Sequencing of DNA extracted from environmental samples can provide key insights into the biosynthetic potential of uncultured bacteria. However, the high complexity of soil metagenomes, which can contain thousands of bacterial species per gram of soil, imposes significant challenges to explore secondary metabolites potentially produced by rare members of the soil microbiome. Here, we develop a targeted sequencing workflow termed CONKAT-seq (co-occurrence network analysis of targeted sequences) that detects physically clustered biosynthetic domains, a hallmark of bacterial secondary metabolism. Following targeted amplification of conserved biosynthetic domains in a highly partitioned metagenomic library, CONKAT-seq evaluates amplicon co-occurrence patterns across library subpools to identify chromosomally clustered domains. We show that a single soil sample can contain more than a thousand uncharacterized biosynthetic gene clusters, most of which originate from low frequency genomes which are practically inaccessible through untargeted sequencing. CONKAT-seq allows scalable exploration of largely untapped biosynthetic diversity across multiple soils, and can guide the discovery of novel secondary metabolites from rare members of the soil microbiome.


April 21, 2020

Population dynamics of an Escherichia coli ST131 lineage during recurrent urinary tract infection.

Recurrent urinary tract infections (rUTIs) are extremely common, with ~?25% of all women experiencing a recurrence within 1 year of their original infection. Escherichia coli ST131 is a globally dominant multidrug resistant clone associated with high rates of rUTI. Here, we show the dynamics of an ST131 population over a 5-year period from one elderly woman with rUTI since the 1970s. Using whole genome sequencing, we identify an indigenous clonal lineage (P1A) linked to rUTI and persistence in the fecal flora, providing compelling evidence of an intestinal reservoir of rUTI. We also show that the P1A lineage possesses substantial plasmid diversity, resulting in the coexistence of antibiotic resistant and sensitive intestinal isolates despite frequent treatment. Our longitudinal study provides a unique comprehensive genomic analysis of a clonal lineage within a single individual and suggests a population-wide resistance mechanism enabling rapid adaptation to fluctuating antibiotic exposure.


April 21, 2020

CRISPR/CAS9 targeted CAPTURE of mammalian genomic regions for characterization by NGS.

The robust detection of structural variants in mammalian genomes remains a challenge. It is particularly difficult in the case of genetically unstable Chinese hamster ovary (CHO) cell lines with only draft genome assemblies available. We explore the potential of the CRISPR/Cas9 system for the targeted capture of genomic loci containing integrated vectors in CHO-K1-based cell lines followed by next generation sequencing (NGS), and compare it to popular target-enrichment sequencing methods and to whole genome sequencing (WGS). Three different CRISPR/Cas9-based techniques were evaluated; all of them allow for amplification-free enrichment of target genomic regions in the range from 5 to 60 fold, and for recovery of ~15 kb-long sequences with no sequencing artifacts introduced. The utility of these protocols has been proven by the identification of transgene integration sites and flanking sequences in three CHO cell lines. The long enriched fragments helped to identify Escherichia coli genome sequences co-integrated with vectors, and were further characterized by Whole Genome Sequencing (WGS). Other advantages of CRISPR/Cas9-based methods are the ease of bioinformatics analysis, potential for multiplexing, and the production of long target templates for real-time sequencing.


April 21, 2020

Sequencing and Genomic Diversity Analysis of IncHI5 Plasmids.

IncHI plasmids could be divided into five different subgroups IncHI1-5. In this study, the complete nucleotide sequences of seven blaIMP- or blaVIM-carrying IncHI5 plasmids from Klebsiella pneumoniae, K. quasipneumoniae, and K. variicola were determined and compared in detail with all the other four available sequenced IncHI5 plasmids. These plasmids carried conserved IncHI5 backbones composed of repHI5B and a repFIB-like gene (replication), parABC (partition), and tra1 (conjugal transfer). Integration of a number of accessory modules, through horizontal gene transfer, at various sites of IncHI5 backbones resulted in various deletions of surrounding backbone regions and thus considerable diversification of IncHI5 backbones. Among the accessory modules were three kinds of resistance accessory modules, namely Tn10 and two antibiotic resistance islands designated ARI-A and ARI-B. These two islands, inserted at two different fixed sites (one island was at one site and the other was at a different site) of IncHI5 backbones, were derived from the prototype Tn3-family transposons Tn1696 and Tn6535, respectively, and could be further discriminated as various intact transposons and transposon-like structures. The ARI-A or ARI-B islands from different IncHI5 plasmids carried distinct profiles of antimicrobial resistance markers and associated mobile elements, and complex events of transposition and homologous recombination accounted for assembly of these islands. The carbapenemase genes blaIMP-4, blaIMP-38 and blaVIM-1 were identified within various class 1 integrons from ARI-A or ARI-B of the seven plasmids sequenced in this study. Data presented here would provide a deeper insight into diversification and evolution history of IncHI5 plasmids.


April 21, 2020

FDA-ARGOS is a database with public quality-controlled reference genomes for diagnostic use and regulatory science.

FDA proactively invests in tools to support innovation of emerging technologies, such as infectious disease next generation sequencing (ID-NGS). Here, we introduce FDA-ARGOS quality-controlled reference genomes as a public database for diagnostic purposes and demonstrate its utility on the example of two use cases. We provide quality control metrics for the FDA-ARGOS genomic database resource and outline the need for genome quality gap filling in the public domain. In the first use case, we show more accurate microbial identification of Enterococcus avium from metagenomic samples with FDA-ARGOS reference genomes compared to non-curated GenBank genomes. In the second use case, we demonstrate the utility of FDA-ARGOS reference genomes for Ebola virus target sequence comparison as part of a composite validation strategy for ID-NGS diagnostic tests. The use of FDA-ARGOS as an in silico target sequence comparator tool combined with representative clinical testing could reduce the burden for completing ID-NGS clinical trials.


April 21, 2020

Genome-wide systematic identification of methyltransferase recognition and modification patterns.

Genome-wide analysis of DNA methylation patterns using single molecule real-time DNA sequencing has boosted the number of publicly available methylomes. However, there is a lack of tools coupling methylation patterns and the corresponding methyltransferase genes. Here we demonstrate a high-throughput method for coupling methyltransferases with their respective motifs, using automated cloning and analysing the methyltransferases in vectors carrying a strain-specific cassette containing all potential target sites. To validate the method, we analyse the genomes of the thermophile Moorella thermoacetica and the mesophile Acetobacterium woodii, two acetogenic bacteria having substantially modified genomes with 12 methylation motifs and a total of 23 methyltransferase genes. Using our method, we characterize the 23 methyltransferases, assign motifs to the respective enzymes and verify activity for 11 of the 12 motifs.


April 21, 2020

Non-coding variability at the APOE locus contributes to the Alzheimer’s risk.

Alzheimer’s disease (AD) is a leading cause of mortality in the elderly. While the coding change of APOE-e4 is a key risk factor for late-onset AD and has been believed to be the only risk factor in the APOE locus, it does not fully explain the risk effect conferred by the locus. Here, we report the identification of AD causal variants in PVRL2 and APOC1 regions in proximity to APOE and define common risk haplotypes independent of APOE-e4 coding change. These risk haplotypes are associated with changes of AD-related endophenotypes including cognitive performance, and altered expression of APOE and its nearby genes in the human brain and blood. High-throughput genome-wide chromosome conformation capture analysis further supports the roles of these risk haplotypes in modulating chromatin states and gene expression in the brain. Our findings provide compelling evidence for additional risk factors in the APOE locus that contribute to AD pathogenesis.


April 21, 2020

Modulation of metabolome and bacterial community in whole crop corn silage by inoculating homofermentative Lactobacillus plantarum and heterofermentative Lactobacillus buchneri.

The present study investigated the species level based microbial community and metabolome in corn silage inoculated with or without homofermentative Lactobacillus plantarum and heterofermentative Lactobacillus buchneri using the PacBio SMRT Sequencing and time-of-flight mass spectrometry (GC-TOF/MS). Chopped whole crop corn was treated with (1) deionized water (control), (2) Lactobacillus plantarum, or (3) Lactobacillus buchneri. The chopped whole crop corn was ensiled in vacuum-sealed polyethylene bags containing 300 g of fresh forge for 90 days, with three replicates for each treatment. The results showed that a total of 979 substances were detected, and 316 different metabolites were identified. Some metabolites with antimicrobial activity were detected in whole crop corn silage, such as catechol, 3-phenyllactic acid, 4-hydroxybenzoic acid, azelaic acid, 3,4-dihydroxybenzoic acid and 4-hydroxycinnamic acid. Catechol, pyrogallol and ferulic acid with antioxidant property, 4-hydroxybutyrate with nervine activity, and linoleic acid with cholesterol lowering effects, were detected in present study. In addition, a flavoring agent of myristic acid and a depression mitigation substance of phenylethylamine were also found in this study. Samples treated with inoculants presented more biofunctional metabolites of organic acids, amino acids and phenolic acids than untreated samples. The Lactobacillus species covered over 98% after ensiling, and were mainly comprised by the L. acetotolerans, L. silagei, L. parafarraginis, L. buchneri and L. odoratitofui. As compared to the control silage, inoculation of L. plantarum increased the relative abundances of L. acetotolerans, L. buchneri and L. parafarraginis, and a considerable decline in the proportion of L. silagei was observed; whereas an obvious decrease in L. acetotolerans and increases in L. odoratitofui and L. farciminis were observed in the L. buchneri inoculated silage. Therefore, inoculation of L. plantarum and L. buchneri regulated the microbial composition and metabolome of the corn silage with different behaviors. The present results indicated that profiling of silage microbiome and metabolome might improve our current understanding of the biological process underlying silage formation.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.