Menu
April 21, 2020

Meiotic sex in Chagas disease parasite Trypanosoma cruzi.

Genetic exchange enables parasites to rapidly transform disease phenotypes and exploit new host populations. Trypanosoma cruzi, the parasitic agent of Chagas disease and a public health concern throughout Latin America, has for decades been presumed to exchange genetic material rarely and without classic meiotic sex. We present compelling evidence from 45 genomes sequenced from southern Ecuador that T. cruzi in fact maintains truly sexual, panmictic groups that can occur alongside others that remain highly clonal after past hybridization events. These groups with divergent reproductive strategies appear genetically isolated despite possible co-occurrence in vectors and hosts. We propose biological explanations for the fine-scale disconnectivity we observe and discuss the epidemiological consequences of flexible reproductive modes. Our study reinvigorates the hunt for the site of genetic exchange in the T. cruzi life cycle, provides tools to define the genetic determinants of parasite virulence, and reforms longstanding theory on clonality in trypanosomatid parasites.


April 21, 2020

CRISPR/CAS9 targeted CAPTURE of mammalian genomic regions for characterization by NGS.

The robust detection of structural variants in mammalian genomes remains a challenge. It is particularly difficult in the case of genetically unstable Chinese hamster ovary (CHO) cell lines with only draft genome assemblies available. We explore the potential of the CRISPR/Cas9 system for the targeted capture of genomic loci containing integrated vectors in CHO-K1-based cell lines followed by next generation sequencing (NGS), and compare it to popular target-enrichment sequencing methods and to whole genome sequencing (WGS). Three different CRISPR/Cas9-based techniques were evaluated; all of them allow for amplification-free enrichment of target genomic regions in the range from 5 to 60 fold, and for recovery of ~15 kb-long sequences with no sequencing artifacts introduced. The utility of these protocols has been proven by the identification of transgene integration sites and flanking sequences in three CHO cell lines. The long enriched fragments helped to identify Escherichia coli genome sequences co-integrated with vectors, and were further characterized by Whole Genome Sequencing (WGS). Other advantages of CRISPR/Cas9-based methods are the ease of bioinformatics analysis, potential for multiplexing, and the production of long target templates for real-time sequencing.


April 21, 2020

Urinary tract colonization is enhanced by a plasmid that regulates uropathogenic Acinetobacter baumannii chromosomal genes.

Multidrug resistant (MDR) Acinetobacter baumannii poses a growing threat to global health. Research on Acinetobacter pathogenesis has primarily focused on pneumonia and bloodstream infections, even though one in five A. baumannii strains are isolated from urinary sites. In this study, we highlight the role of A. baumannii as a uropathogen. We develop the first A. baumannii catheter-associated urinary tract infection (CAUTI) murine model using UPAB1, a recent MDR urinary isolate. UPAB1 carries the plasmid pAB5, a member of the family of large conjugative plasmids that represses the type VI secretion system (T6SS) in multiple Acinetobacter strains. pAB5 confers niche specificity, as its carriage improves UPAB1 survival in a CAUTI model and decreases virulence in a pneumonia model. Comparative proteomic and transcriptomic analyses show that pAB5 regulates the expression of multiple chromosomally-encoded virulence factors besides T6SS. Our results demonstrate that plasmids can impact bacterial infections by controlling the expression of chromosomal genes.


April 21, 2020

Programmable mutually exclusive alternative splicing for generating RNA and protein diversity.

Alternative splicing performs a central role in expanding genomic coding capacity and proteomic diversity. However, programming of splicing patterns in engineered biological systems remains underused. Synthetic approaches thus far have predominantly focused on controlling expression of a single protein through alternative splicing. Here, we describe a modular and extensible platform for regulating four programmable exons that undergo a mutually exclusive alternative splicing event to generate multiple functionally-distinct proteins. We present an intron framework that enforces the mutual exclusivity of two internal exons and demonstrate a graded series of consensus sequence elements of varying strengths that set the ratio of two mutually exclusive isoforms. We apply this framework to program the DNA-binding domains of modular transcription factors to differentially control downstream gene activation. This splicing platform advances an approach for generating diverse isoforms and can ultimately be applied to program modular proteins and increase coding capacity of synthetic biological systems.


April 21, 2020

Complete Genome Sequence of Sequevar 14M Ralstonia solanacearum Strain HA4-1 Reveals Novel Type III Effectors Acquired Through Horizontal Gene Transfer.

Ralstonia solanacearum, which causes bacterial wilt in a broad range of plants, is considered a “species complex” due to its significant genetic diversity. Recently, we have isolated a new R. solanacearum strain HA4-1 from Hong’an county in Hubei province of China and identified it being phylotype I, sequevar 14M (phylotype I-14M). Interestingly, we found that it can cause various disease symptoms among different potato genotypes and display different pathogenic behavior compared to a phylogenetically related strain, GMI1000. To dissect the pathogenic mechanisms of HA4-1, we sequenced its whole genome by combined sequencing technologies including Illumina HiSeq2000, PacBio RS II, and BAC-end sequencing. Genome assembly results revealed the presence of a conventional chromosome, a megaplasmid as well as a 143 kb plasmid in HA4-1. Comparative genome analysis between HA4-1 and GMI1000 shows high conservation of the general virulence factors such as secretion systems, motility, exopolysaccharides (EPS), and key regulatory factors, but significant variation in the repertoire and structure of type III effectors, which could be the determinants of their differential pathogenesis in certain potato species or genotypes. We have identified two novel type III effectors that were probably acquired through horizontal gene transfer (HGT). These novel R. solanacearum effectors display homology to several YopJ and XopAC family members. We named them as RipBR and RipBS. Notably, the copy of RipBR on the plasmid is a pseudogene, while the other on the megaplasmid is normal. For RipBS, there are three copies located in the megaplasmid and plasmid, respectively. Our results have not only enriched the genome information on R. solanacearum species complex by sequencing the first sequevar 14M strain and the largest plasmid reported in R. solanacearum to date but also revealed the variation in the repertoire of type III effectors. This will greatly contribute to the future studies on the pathogenic evolution, host adaptation, and interaction between R. solanacearum and potato.


April 21, 2020

Multi-platform discovery of haplotype-resolved structural variation in human genomes.

The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50?bp) and 27,622 SVs (=50?bp) per genome. We also discover 156 inversions per genome and 58 of the inversions intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community allowing us to make recommendations for maximizing structural variation sensitivity for future genome sequencing studies.


April 21, 2020

Occurrence and Characterization of mcr-1-Positive Escherichia coli Isolated From Food-Producing Animals in Poland, 2011-2016.

The emergence of plasmid-mediated colistin resistance (mcr genes) threatens the effectiveness of polymyxins, which are last-resort drugs to treat infections by multidrug- and carbapenem-resistant Gram-negative bacteria. Based on the occurrence of colistin resistance the aims of the study were to determine possible resistance mechanisms and then characterize the mcr-positive Escherichia coli. The research used material from the Polish national and EU harmonized antimicrobial resistance (AMR) monitoring programs. A total of 5,878 commensal E. coli from fecal samples of turkeys, chickens, pigs, and cattle collected in 2011-2016 were screened by minimum inhibitory concentration (MIC) determination for the presence of resistance to colistin (R) defined as R > 2 mg/L. Strains with MIC = 2 mg/L isolated in 2014-2016 were also included. A total of 128 isolates were obtained, and most (66.3%) had colistin MIC of 2 mg/L. PCR revealed mcr-1 in 80 (62.5%) isolates recovered from 61 turkeys, 11 broilers, 2 laying hens, 1 pig, and 1 bovine. No other mcr-type genes (including mcr-2 to -5) were detected. Whole-genome sequencing (WGS) of the mcr-1-positive isolates showed high diversity in the multi-locus sequence types (MLST) of E. coli, plasmid replicons, and AMR and virulence genes. Generally mcr-1.1 was detected on the same contig as the IncX4 (76.3%) and IncHI2 (6.3%) replicons. One isolate harbored mcr-1.1 on the chromosome. Various extended-spectrum beta-lactamase (blaSHV-12, blaCTX-M-1, blaCTX-M-15, blaTEM-30, blaTEM-52, and blaTEM-135) and quinolone resistance genes (qnrS1, qnrB19, and chromosomal gyrA, parC, and parE mutations) were present in the mcr-1.1-positive E. coli. A total of 49 sequence types (ST) were identified, ST354, ST359, ST48, and ST617 predominating. One isolate, identified as ST189, belonged to atypical enteropathogenic E. coli. Our findings show that mcr-1.1 has spread widely among production animals in Poland, particularly in turkeys and appears to be transferable mainly by IncX4 and IncHI2 plasmids spread across diverse E. coli lineages. Interestingly, most of these mcr-1-positive E. coli would remain undetected using phenotypic methods with the current epidemiological cut-off value (ECOFF). The appearance and spread of mcr-1 among various animals, but notably in turkeys, might be considered a food chain, and public health hazard.


April 21, 2020

Platanus-allee is a de novo haplotype assembler enabling a comprehensive access to divergent heterozygous regions.

The ultimate goal for diploid genome determination is to completely decode homologous chromosomes independently, and several phasing programs from consensus sequences have been developed. These methods work well for lowly heterozygous genomes, but the manifold species have high heterozygosity. Additionally, there are highly divergent regions (HDRs), where the haplotype sequences differ considerably. Because HDRs are likely to direct various interesting biological phenomena, many genomic analysis targets fall within these regions. However, they cannot be accessed by existing phasing methods, and we have to adopt costly traditional methods. Here, we develop a de novo haplotype assembler, Platanus-allee ( http://platanus.bio.titech.ac.jp/platanus2 ), which initially constructs each haplotype sequence and then untangles the assembly graphs utilizing sequence links and synteny information. A comprehensive benchmark analysis reveals that Platanus-allee exhibits high recall and precision, particularly for HDRs. Using this approach, previously unknown HDRs are detected in the human genome, which may uncover novel aspects of genome variability.


April 21, 2020

Deep convolutional neural networks for accurate somatic mutation detection.

Accurate detection of somatic mutations is still a challenge in cancer analysis. Here we present NeuSomatic, the first convolutional neural network approach for somatic mutation detection, which significantly outperforms previous methods on different sequencing platforms, sequencing strategies, and tumor purities. NeuSomatic summarizes sequence alignments into small matrices and incorporates more than a hundred features to capture mutation signals effectively. It can be used universally as a stand-alone somatic mutation detection method or with an ensemble of existing methods to achieve the highest accuracy.


April 21, 2020

A multi-task convolutional deep neural network for variant calling in single molecule sequencing.

The accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a per-nucleotide error rate of ~5-15%. Meeting this demand, we developed Clairvoyante, a multi-task five-layer convolutional neural network model for predicting variant type (SNP or indel), zygosity, alternative allele and indel length from aligned reads. For the well-characterized NA12878 human sample, Clairvoyante achieves 99.67, 95.78, 90.53% F1-score on 1KP common variants, and 98.65, 92.57, 87.26% F1-score for whole-genome analysis, using Illumina, PacBio, and Oxford Nanopore data, respectively. Training on a second human sample shows Clairvoyante is sample agnostic and finds variants in less than 2?h on a standard server. Furthermore, we present 3,135 variants that are missed using Illumina but supported independently by both PacBio and Oxford Nanopore reads. Clairvoyante is available open-source ( https://github.com/aquaskyline/Clairvoyante ), with modules to train, utilize and visualize the model.


April 21, 2020

Complete genome sequence analysis of the thermoacidophilic verrucomicrobial methanotroph “Candidatus Methylacidiphilum kamchatkense” strain Kam1 and comparison with its closest relatives.

The candidate genus “Methylacidiphilum” comprises thermoacidophilic aerobic methane oxidizers belonging to the Verrucomicrobia phylum. These are the first described non-proteobacterial aerobic methane oxidizers. The genes pmoCAB, encoding the particulate methane monooxygenase do not originate from horizontal gene transfer from proteobacteria. Instead, the “Ca. Methylacidiphilum” and the sister genus “Ca. Methylacidimicrobium” represent a novel and hitherto understudied evolutionary lineage of aerobic methane oxidizers. Obtaining and comparing the full genome sequences is an important step towards understanding the evolution and physiology of this novel group of organisms.Here we present the closed genome of “Ca. Methylacidiphilum kamchatkense” strain Kam1 and a comparison with the genomes of its two closest relatives “Ca. Methylacidiphilum fumariolicum” strain SolV and “Ca. Methylacidiphilum infernorum” strain V4. The genome consists of a single 2,2 Mbp chromosome with 2119 predicted protein coding sequences. Genome analysis showed that the majority of the genes connected with metabolic traits described for one member of “Ca. Methylacidiphilum” is conserved between all three genomes. All three strains encode class I CRISPR-cas systems. The average nucleotide identity between “Ca. M. kamchatkense” strain Kam1 and strains SolV and V4 is =95% showing that they should be regarded as separate species. Whole genome comparison revealed a high degree of synteny between the genomes of strains Kam1 and SolV. In contrast, comparison of the genomes of strains Kam1 and V4 revealed a number of rearrangements. There are large differences in the numbers of transposable elements found in the genomes of the three strains with 12, 37 and 80 transposable elements in the genomes of strains Kam1, V4 and SolV respectively. Genomic rearrangements and the activity of transposable elements explain much of the genomic differences between strains. For example, a type 1h uptake hydrogenase is conserved between strains Kam1 and SolV but seems to have been lost from strain V4 due to genomic rearrangements.Comparing three closed genomes of “Ca. Methylacidiphilum” spp. has given new insights into the evolution of these organisms and revealed large differences in numbers of transposable elements between strains, the activity of these explains much of the genomic differences between strains.


April 21, 2020

Single-molecule sequencing detection of N6-methyladenine in microbial reference materials.

The DNA base modification N6-methyladenine (m6A) is involved in many pathways related to the survival of bacteria and their interactions with hosts. Nanopore sequencing offers a new, portable method to detect base modifications. Here, we show that a neural network can improve m6A detection at trained sequence contexts compared to previously published methods using deviations between measured and expected current values as each adenine travels through a pore. The model, implemented as the mCaller software package, can be extended to detect known or confirm suspected methyltransferase target motifs based on predictions of methylation at untrained contexts. We use PacBio, Oxford Nanopore, methylated DNA immunoprecipitation sequencing (MeDIP-seq), and whole-genome bisulfite sequencing data to generate and orthogonally validate methylomes for eight microbial reference species. These well-characterized microbial references can serve as controls in the development and evaluation of future methods for the identification of base modifications from single-molecule sequencing data.


April 21, 2020

A Pathovar of Xanthomonas oryzae Infecting Wild Grasses Provides Insight Into the Evolution of Pathogenicity in Rice Agroecosystems

Xanthomonas oryzae (Xo) are critical rice pathogens. Virulent lineages from Africa and Asia and less virulent strains from the US have been well characterized. X. campestris pv. leersiae (Xcl), first described in 1957, causes bacterial streak on the perennial grass, Leersia hexandra, and is a close relative of Xo. L. hexandra, a member of the Poaceae, is highly similar to rice phylogenetically, is globally ubiquitous around rice paddies, and is a reservoir of pathogenic Xo. We used long read, single molecule, real time (SMRT) genome sequences of five strains of Xcl from Burkina Faso, China, Mali and Uganda to determine the genetic relatedness of this organism with Xo. Novel Transcription Activator-Like Effectors (TALEs) were discovered in all five strains of Xcl. Predicted TALE target sequences were identified in the L. perrieri genome and compared to rice susceptibility gene homologs. Pathogenicity screening on L. hexandra and diverse rice cultivars confirmed that Xcl are able to colonize rice and produce weak but not progressive symptoms. Overall, based on average nucleotide identity, type III effector repertoires and disease phenotype, we propose to rename Xcl to X. oryzae pv. leersiae (Xol) and use this parallel system to improve understanding of the evolution of bacterial pathogenicity in rice agroecosystems.


April 21, 2020

Virulence characteristics and an action mode of antibiotic resistance in multidrug-resistant Pseudomonas aeruginosa.

Pseudomonas aeruginosa displays intrinsic resistance to many antibiotics and known to acquire actively genetic mutations for further resistance. In this study, we attempted to understand genomic and transcriptomic landscapes of P. aeruginosa clinical isolates that are highly resistant to multiple antibiotics. We also aimed to reveal a mode of antibiotic resistance by elucidating transcriptional response of genes conferring antibiotic resistance. To this end, we sequenced the whole genomes and profiled genome-wide RNA transcripts of three different multi-drug resistant (MDR) clinical isolates that are phylogenetically distant from one another. Multi-layered genome comparisons with genomes of antibiotic-susceptible P. aeruginosa strains and 70 other antibiotic-resistance strains revealed both well-characterized conserved gene mutations and distinct distribution of antibiotic-resistant genes (ARGs) among strains. Transcriptions of genes involved in quorum sensing and type VI secretion systems were invariably downregulated in the MDR strains. Virulence-associated phenotypes were further examined and results indicate that our MDR strains are clearly avirulent. Transcriptions of 64 genes, logically selected to be related with antibiotic resistance in MDR strains, were active under normal growth conditions and remained unchanged during antibiotic treatment. These results propose that antibiotic resistance is achieved by a “constitutive” response scheme, where ARGs are actively expressed even in the absence of antibiotic stress, rather than a “reactive” response. Bacterial responses explored at the transcriptomic level in conjunction with their genome repertoires provided novel insights into (i) the virulence-associated phenotypes and (ii) a mode of antibiotic resistance in MDR P. aeruginosa strains.


April 21, 2020

Long-Read Sequencing Emerging in Medical Genetics

The wide implementation of next-generation sequencing (NGS) technologies has revolutionized the field of medical genetics. However, the short read lengths of currently used sequencing approaches pose a limitation for identification of structural variants, sequencing repetitive regions, phasing alleles and distinguishing highly homologous genomic regions. These limitations may significantly contribute to the diagnostic gap in patients with genetic disorders who have undergone standard NGS, like whole exome or even genome sequencing. Now, the emerging long-read sequencing (LRS) technologies may offer improvements in the characterization of genetic variation and regions that are difficult to assess with the currently prevailing NGS approaches. LRS has so far mainly been used to investigate genetic disorders with previously known or strongly suspected disease loci. While these targeted approaches already show the potential of LRS, it remains to be seen whether LRS technologies can soon enable true whole genome sequencing routinely. Ultimately, this could allow the de novo assembly of individual whole genomes used as a generic test for genetic disorders. In this article, we summarize the current LRS-based research on human genetic disorders and discuss the potential of these technologies to facilitate the next major advancements in medical genetics.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.