Menu
September 22, 2019

A graph-based approach to diploid genome assembly.

Constructing high-quality haplotype-resolved de novo assemblies of diploid genomes is important for revealing the full extent of structural variation and its role in health and disease. Current assembly approaches often collapse the two sequences into one haploid consensus sequence and, therefore, fail to capture the diploid nature of the organism under study. Thus, building an assembler capable of producing accurate and complete diploid assemblies, while being resource-efficient with respect to sequencing costs, is a key challenge to be addressed by the bioinformatics community.We present a novel graph-based approach to diploid assembly, which combines accurate Illumina data and long-read Pacific Biosciences (PacBio) data. We demonstrate the effectiveness of our method on a pseudo-diploid yeast genome and show that we require as little as 50× coverage Illumina data and 10× PacBio data to generate accurate and complete assemblies. Additionally, we show that our approach has the ability to detect and phase structural variants.https://github.com/whatshap/whatshap.Supplementary data are available at Bioinformatics online.


September 22, 2019

MIRU-profiler: a rapid tool for determination of 24-loci MIRU-VNTR profiles from assembled genomes of Mycobacterium tuberculosis.

Tuberculosis (TB) resulted in an estimated 1.7 million deaths in the year 2016. The disease is caused by the members of Mycobacterium tuberculosis complex, which includes Mycobacterium tuberculosis, Mycobacterium bovis and other closely related TB causing organisms. In order to understand the epidemiological dynamics of TB, national TB control programs often conduct standardized genotyping at 24 Mycobacterial-Interspersed-Repetitive-Units (MIRU)-Variable-Number-of-Tandem-Repeats (VNTR) loci. With the advent of next generation sequencing technology, whole-genome sequencing (WGS) has been widely used for studying TB transmission. However, an open-source software that can connect WGS and MIRU-VNTR typing is currently unavailable, which hinders interlaboratory communication. In this manuscript, we introduce the MIRU-profiler program which could be used for prediction of MIRU-VNTR profile from WGS of M. tuberculosis.The MIRU-profiler is implemented in shell scripting language and depends on EMBOSS software. The in-silico workflow of MIRU-profiler is similar to those described in the laboratory manuals for genotyping M. tuberculosis. Given an input genome sequence, the MIRU-profiler computes alleles at the standard 24-loci based on in-silico PCR amplicon lengths. The final output is a tab-delimited text file detailing the 24-loci MIRU-VNTR pattern of the input sequence.The MIRU-profiler was validated on four datasets: complete genomes from NCBI-GenBank (n = 11), complete genomes for locally isolated strains sequenced using PacBio (n = 4), complete genomes for BCG vaccine strains (n = 2) and draft genomes based on 250 bp paired-end Illumina reads (n = 106).The digital MIRU-VNTR results were identical to the experimental genotyping results for complete genomes of locally isolated strains, BCG vaccine strains and five out of 11 genomes from the NCBI-GenBank. For draft genomes based on short Illumina reads, 21 out of 24 loci were inferred with a high accuracy, while a number of inaccuracies were recorded for three specific loci (ETRA, QUB11b and QUB26). One of the unique features of the MIRU-profiler was its ability to process multiple genomes in a batch. This feature was tested on all complete M. tuberculosis genome (n = 157), for which results were successfully obtained in approximately 14 min.The MIRU-profiler is a rapid tool for inference of digital MIRU-VNTR profile from the assembled genome sequences. The tool can accurately infer repeat numbers at the standard 24 or 21/24 MIRU-VNTR loci from the complete or draft genomes respectively. Thus, the tool is expected to bridge the communication gap between the laboratories using WGS and those using the conventional MIRU-VNTR typing.


September 22, 2019

Comparative analysis reveals unexpected genome features of newly isolated Thraustochytrids strains: on ecological function and PUFAs biosynthesis.

Thraustochytrids are unicellular fungal-like marine protists with ubiquitous existence in marine environments. They are well-known for their ability to produce high-valued omega-3 polyunsaturated fatty acids (?-3-PUFAs) (e.g., docosahexaenoic acid (DHA)) and hydrolytic enzymes. Thraustochytrid biomass has been estimated to surpass that of bacterioplankton in both coastal and oceanic waters indicating they have an important role in microbial food-web. Nevertheless, the molecular pathway and regulatory network for PUFAs production and the molecular mechanisms underlying ecological functions of thraustochytrids remain largely unknown.The genomes of two thraustochytrids strains (Mn4 and SW8) with ability to produce DHA were sequenced and assembled with a hybrid sequencing approach utilizing Illumina short paired-end reads and Pacific Biosciences long reads to generate a highly accurate genome assembly. Phylogenomic and comparative genomic analyses found that DHA-producing thraustochytrid strains were highly similar and possessed similar gene content. Analysis of the conventional fatty acid synthesis (FAS) and the polyketide synthase (PKS) systems for PUFAs production only detected incomplete and fragmentary pathways in the genome of these two strains. Surprisingly, secreted carbohydrate active enzymes (CAZymes) were found to be significantly depleted in the genomes of these 2 strains as compared to other sequenced relatives. Furthermore, these two strains possess an expanded gene repertoire for signal transduction and self-propelled movement, which could be important for their adaptations to dynamic marine environments.Our results demonstrate the possibility of a third PUFAs synthesis pathway besides previously described FAS and PKS pathways encoded in the genome of these two thraustochytrid strains. Moreover, lack of a complete set of hydrolytic enzymatic machinery for degrading plant-derived organic materials suggests that these two DHA-producing strains play an important role as a nutritional source rather than a nutrient-producer in marine microbial-food web. Results of this study suggest the existence of two types of saprobic thraustochytrids in the world’s ocean. The first group, which does not produce cellulosic enzymes and live as ‘left-over’ scavenger of bacterioplankton, serves as a dietary source for the plankton of higher trophic levels and the other possesses capacity to live on detrital organic matters in the marine ecosystems.


September 22, 2019

Large-scale gene losses underlie the genome evolution of parasitic plant Cuscuta australis.

Dodders (Cuscuta spp., Convolvulaceae) are root- and leafless parasitic plants. The physiology, ecology, and evolution of these obligate parasites are poorly understood. A high-quality reference genome of Cuscuta australis was assembled. Our analyses reveal that Cuscuta experienced accelerated molecular evolution, and Cuscuta and the convolvulaceous morning glory (Ipomoea) shared a common whole-genome triplication event before their divergence. C. australis genome harbors 19,671 protein-coding genes, and importantly, 11.7% of the conserved orthologs in autotrophic plants are lost in C. australis. Many of these gene loss events likely result from its parasitic lifestyle and the massive changes of its body plan. Moreover, comparison of the gene expression patterns in Cuscuta prehaustoria/haustoria and various tissues of closely related autotrophic plants suggests that Cuscuta haustorium formation requires mostly genes normally involved in root development. The C. australis genome provides important resources for studying the evolution of parasitism, regressive evolution, and evo-devo in plant parasites.


September 22, 2019

Evidence of non-tandemly repeated rDNAs and their intragenomic heterogeneity in Rhizophagus irregularis

Arbuscular mycorrhizal fungus (AMF) species are some of the most widespread symbionts of land plants. Our much improved reference genome assembly of a model AMF, Rhizophagus irregularis DAOM-181602 (total contigs?=?210), facilitated a discovery of repetitive elements with unusual characteristics. R. irregularis has only ten or 11 copies of complete 45S rDNAs, whereas the general eukaryotic genome has tens to thousands of rDNA copies. R. irregularis rDNAs are highly heterogeneous and lack a tandem repeat structure. These findings provide evidence for the hypothesis that rDNA heterogeneity depends on the lack of tandem repeat structures. RNA-Seq analysis confirmed that all rDNA variants are actively transcribed. Observed rDNA/rRNA polymorphisms may modulate translation by using different ribosomes depending on biotic and abiotic interactions. The non-tandem repeat structure and intragenomic heterogeneity of AMF rDNA/rRNA may facilitate successful adaptation to various environmental conditions, increasing host compatibility of these symbiotic fungi.


September 22, 2019

Emergence of a novel mobile colistin resistance gene, mcr-8, in NDM-producing Klebsiella pneumoniae.

The rapid increase in carbapenem resistance among gram-negative bacteria has renewed focus on the importance of polymyxin antibiotics (colistin or polymyxin E). However, the recent emergence of plasmid-mediated colistin resistance determinants (mcr-1, -2, -3, -4, -5, -6, and -7), especially mcr-1, in carbapenem-resistant Enterobacteriaceae is a serious threat to global health. Here, we characterized a novel mobile colistin resistance gene, mcr-8, located on a transferrable 95,983-bp IncFII-type plasmid in Klebsiella pneumoniae. The deduced amino-acid sequence of MCR-8 showed 31.08%, 30.26%, 39.96%, 37.85%, 33.51%, 30.43%, and 37.46% identity to MCR-1, MCR-2, MCR-3, MCR-4, MCR-5, MCR-6, and MCR-7, respectively. Functional cloning indicated that the acquisition of the single mcr-8 gene significantly increased resistance to colistin in both Escherichia coli and K. pneumoniae. Notably, the coexistence of mcr-8 and the carbapenemase-encoding gene blaNDM was confirmed in K. pneumoniae isolates of livestock origin. Moreover, BLASTn analysis of mcr-8 revealed that this gene was present in a colistin- and carbapenem-resistant K. pneumoniae strain isolated from the sputum of a patient with pneumonia syndrome in the respiratory intensive care unit of a Chinese hospital in 2016. These findings indicated that mcr-8 has existed for some time and has disseminated among K. pneumoniae of both animal and human origin, further increasing the public health burden of antimicrobial resistance.


September 22, 2019

Evolutionary trade-offs associated with loss of PmrB function in host-adapted Pseudomonas aeruginosa.

Pseudomonas aeruginosa colonises the upper airway of cystic fibrosis (CF) patients, providing a reservoir of host-adapted genotypes that subsequently establish chronic lung infection. We previously experimentally-evolved P. aeruginosa in a murine model of respiratory tract infection and observed early-acquired mutations in pmrB, encoding the sensor kinase of a two-component system that promoted establishment and persistence of infection. Here, using proteomics, we show downregulation of proteins involved in LPS biosynthesis, antimicrobial resistance and phenazine production in pmrB mutants, and upregulation of proteins involved in adherence, lysozyme resistance and inhibition of the chloride ion channel CFTR, relative to wild-type strain LESB65. Accordingly, pmrB mutants are susceptible to antibiotic treatment but show enhanced adherence to airway epithelial cells, resistance to lysozyme treatment, and downregulate host CFTR expression. We propose that P. aeruginosa pmrB mutations in CF patients are subject to an evolutionary trade-off, leading to enhanced colonisation potential, CFTR inhibition, and resistance to host defences, but also to increased susceptibility to antibiotics.


September 22, 2019

Genome analysis of the ancient tracheophyte Selaginella tamariscina reveals evolutionary features relevant to the acquisition of desiccation tolerance.

Resurrection plants, which are the “gifts” of natural evolution, are ideal models for studying the genetic basis of plant desiccation tolerance. Here, we report a high-quality genome assembly of 301 Mb for the diploid spike moss Selaginella tamariscina, a primitive vascular resurrection plant. We predicated 27 761 protein-coding genes from the assembled S. tamariscina genome, 11.38% (2363) of which showed significant expression changes in response to desiccation. Approximately 60.58% of the S. tamariscina genome was annotated as repetitive DNA, which is an almost 2-fold increase of that in the genome of desiccation-sensitive Selaginella moellendorffii. Genomic and transcriptomic analyses highlight the unique evolution and complex regulations of the desiccation response in S. tamariscina, including species-specific expansion of the oleosin and pentatricopeptide repeat gene families, unique genes and pathways for reactive oxygen species generation and scavenging, and enhanced abscisic acid (ABA) biosynthesis and potentially distinct regulation of ABA signaling and response. Comparative analysis of chloroplast genomes of several Selaginella species revealed a unique structural rearrangement and the complete loss of chloroplast NAD(P)H dehydrogenase (NDH) genes in S. tamariscina, suggesting a link between the absence of the NDH complex and desiccation tolerance. Taken together, our comparative genomic and transcriptomic analyses reveal common and species-specific desiccation tolerance strategies in S. tamariscina, providing significant insights into the desiccation tolerance mechanism and the evolution of resurrection plants. Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Heterogeneous and flexible transmission of mcr-1 in hospital-associated Escherichia coli.

The recent emergence of a transferable colistin resistance mechanism, MCR-1, has gained global attention because of its threat to clinical treatment of infections caused by multidrug-resistant Gram-negative bacteria. However, the possible transmission route of mcr-1 among Enterobacteriaceae species in clinical settings is largely unknown. Here, we present a comprehensive genomic analysis of Escherichia coli isolates collected in a hospital in Hangzhou, China. We found that mcr-1-carrying isolates from clinical infections and feces of inpatients and healthy volunteers were genetically diverse and were not closely related phylogenetically, suggesting that clonal expansion is not involved in the spread of mcr-1 The mcr-1 gene was found on either chromosomes or plasmids, but in most of the E. coli isolates, mcr-1 was carried on plasmids. The genetic context of the plasmids showed considerable diversity as evidenced by the different functional insertion sequence (IS) elements, toxin-antitoxin (TA) systems, heavy metal resistance determinants, and Rep proteins of broad-host-range plasmids. Additionally, the genomic analysis revealed nosocomial transmission of mcr-1 and the coexistence of mcr-1 with other genes encoding ß-lactamases and fluoroquinolone resistance in the E. coli isolates. These findings indicate that mcr-1 is heterogeneously disseminated in both commensal and pathogenic strains of E. coli, suggest the high flexibility of this gene in its association with diverse genetic backgrounds of the hosts, and provide new insights into the genome epidemiology of mcr-1 among hospital-associated E. coli strains. IMPORTANCE Colistin represents one of the very few available drugs for treating infections caused by extensively multidrug-resistant Gram-negative bacteria. The recently emergent mcr-1 colistin resistance gene threatens the clinical utility of colistin and has gained global attention. How mcr-1 spreads in hospital settings remains unknown and was investigated by whole-genome sequencing of mcr-1-carrying Escherichia coli in this study. The findings revealed extraordinary flexibility of mcr-1 in its spread among genetically diverse E. coli hosts and plasmids, nosocomial transmission of mcr-1-carrying E. coli, and the continuous emergence of novel Inc types of plasmids carrying mcr-1 and new mcr-1 variants. Additionally, mcr-1 was found to be frequently associated with other genes encoding ß-lactams and fluoroquinolone resistance. These findings provide important information on the transmission and epidemiology of mcr-1 and are of significant public health importance as the information is expected to facilitate the control of this significant antibiotic resistance threat. Copyright © 2018 Shen et al.


September 22, 2019

Genetic adaptation of a mevalonate pathway deficient mutant in Staphylococcus aureus.

In this study we addressed the question how a mevalonate (MVA)-auxotrophic Staphylococcus aureus?mvaS mutant can revert to prototrophy. This mutant couldn’t grow in the absence of MVA. However, after a long lag-phase of 4-6 days the mutant adapted from auxotrophic to prototrophic phenotype. During that time, it acquired two point mutations: One mutation in the coding region of the regulator gene spx, which resulted in an amino acid exchange that decreased Spx function. The other mutation in the upstream-element within the core-promoter of the mevalonolactone lactonase gene drp35. This mutation led to an increased expression of drp35. In repeated experiments the mutations always occurred in spx and drp35 and in the same order. The first detectable mutation appeared in spx and allowed slight growth; the second mutation, in drp35, increased growth further. Phenotypical characterizations of the mutant showed that small amounts of the lipid-carrier undecaprenol are synthesized, despite the lack of mvaS. The growth of the adapted clone, ?mvaSad, indicates that the mutations reawake a rescue bypass. We think that this bypass enters the MVA pathway at the stage of MVA, because blocking the pathway downstream of MVA led to growth arrest of the mutant. In addition, the lactonase Drp35 is able to convert mevalonolactone to MVA. Summarized, we describe here a mutation-based two-step adaptation process that allows resuscitation of growth of the ?mvaS mutant.


September 22, 2019

The genome assembly of the fungal pathogen Pyrenochaeta lycopersici from Single-Molecule Real-Time sequencing sheds new light on its biological complexity.

The first draft genome sequencing of the non-model fungal pathogen Pyrenochaeta lycopersici showed an expansion of gene families associated with heterokaryon incompatibility and lacking of mating-type genes, providing insights into the genetic basis of this “imperfect” fungus which lost the ability to produce the sexual stage. However, due to the Illumina short-read technology, the draft genome was too fragmented to allow a comprehensive characterization of the genome, especially of the repetitive sequence fraction. In this work, the sequencing of another P. lycopersici isolate using long-read Single Molecule Real-Time sequencing technology was performed with the aim of obtaining a gapless genome. Indeed, a gapless genome assembly of 62.7 Mb was obtained, with a fraction of repetitive sequences representing 30% of the total bases. The gene content of the two P. lycopersici isolates was very similar, and the large difference in genome size (about 8 Mb) might be attributable to the high fraction of repetitive sequences detected for the new sequenced isolate. The role of repetitive elements, including transposable elements, in modulating virulence effectors is well established in fungal plant pathogens. Moreover, transposable elements are of fundamental importance in creating and re-modelling genes, especially in imperfect fungi. Their abundance in P. lycopersici, together with the large expansion of heterokaryon incompatibility genes in both sequenced isolates, suggest the presence of possible mechanisms alternative to gene re-assorting mediated by sexual recombination. A quite large fraction (~9%) of repetitive elements in P. lycopersici, has no homology with known classes, strengthening this hypothesis. The availability of a gapless genome of P. lycopersici allowed the in-depth analysis of its genome content, by annotating functional genes and TEs. This goal will be an important resource for shedding light on the evolution of the reproductive and pathogenic behaviour of this soilborne pathogen and the onset of a possible speciation within this species.


September 22, 2019

Complete genome sequencing and comparative genomic analysis of Helicobacter apodemus isolated from the wild Korean striped field mouse (Apodemus agrarius) for potential pathogenicity

The Helicobacter bacterial genus comprises of spiral-shaped gram-negative bacteria with flagella that colonize the gastro-intestinal (GI) tract of humans and various mammals (Solnick and Schauer, 2001). In particular, Helicobacter pylori was classified as a group 1 carcinogen by the International Agency for Research on Cancer (IARC) in 1994, and has been shown to occur with a high prevalence in humans, although this varies between geographical regions, ethnic groups, and various populations (Kusters et al., 2006; Goh et al., 2011). To date, more than 37 Helicobacter species have been identified in addition to H. pylori (Péré-Védrenne et al., 2017). Furthermore, non-H. pylori Helicobacters (NHPH) have been shown to infect both humans and animals, and NHPH infections are associated with intestinal carcinoma, and mucinous adenocarcinoma (Swennes et al., 2016). Despite the demonstrated association between NHPH and disease, most studies to date have investigated H. pylori in humans; thus, it is necessary to characterize NHPH and elucidate its role in the GI tract of wild rodents which are potential Helicobacter carriers (Taylor et al., 2007; Mladenova-Hristova et al., 2017).


September 22, 2019

Comparing two Mycobacterium tuberculosis genomes from Chinese immigrants with native genomes using mauve alignments.

The number of immigrants with tuberculosis (TB) increases each year in South Korea. Determining the transmission dynamics based on whole genome sequencing (WGS) to cluster the strains has been challenging.WGS, annotation refinement, and orthology assignment for the GenBank accession number acquisition were performed on two clinical isolates from Chinese immigrants. In addition, the genomes of the two isolates were compared with the genomes of Mycobacterium tuberculosis isolates, from two native Korean and five native Chinese individuals using a phylogenetic topology tree based on the Multiple Alignment of Conserved Genomic Sequence with Rearrangements (Mauve) package.The newly assigned accession numbers for two clinical isolates were CP020381.2 (a Korean-Chinese from Yanbian Province) and CP022014.1 (a Chinese from Shandong Province), respectively. Mauve alignment classified all nine TB isolates into a discriminative collinear set with matched regions. The phylogenetic analysis revealed a rooted phylogenetic tree grouping the nine strains into two lineages: strains from Chinese individuals and strains from Korean individuals.Phylogenetic trees based on the Mauve alignments were supposed to be useful in revealing the dynamics of TB transmission from immigrants in South Korea, which can provide valuable information for scaling up the TB screening policy for immigrants. Copyright©2018. The Korean Academy of Tuberculosis and Respiratory Diseases.


September 22, 2019

Comparative genomic analysis of Bacillus thuringiensis reveals molecular adaptation to copper tolerance

Bacillus thuringiensis is a type of Gram positive and rod shaped bacterium that is found in a wide range of habitats. Despite the intensive studies conducted on this bacterium, most of the information available are related to its pathogenic characteristics, with only a limited number of publications mentioning its ability to survive in extreme environments. Recently, a B. thuringiensis MCMY1 strain was successfully isolated from a copper contaminated site in Mamut Copper Mine, Sabah. This study aimed to conduct a comparative genomic analysis by using the genome sequence of MCMY1 strain published in GenBank (PRJNA374601) as a target genome for comparison with other available B. thuringiensis genomes at the GenBank. Whole genome alignment, Fragment all-against-all comparison analysis, phylogenetic reconstruction and specific copper genes comparison were applied to all forty-five B. thuringiensis genomes to reveal the molecular adaptation to copper tolerance. The comparative results indicated that B. thuringiensis MCMY1 strain is closely related to strain Bt407 and strain IS5056. This strain harbors almost all available copper genes annotated from the forty-five B. thuringiensis genomes, except for the gene for Magnesium and cobalt efflux protein (CorC) which plays an indirect role in reducing the oxidative stress that caused by copper and other metal ions. Furthermore, the findings also showed that the Copper resistance gene family, CopABCDZ and its repressor (CsoR) are conserved in almost all sequenced genomes but the presence of the genes for Cytoplasmic copper homeostasis protein (CutC) and CorC across the sample genomes are highly inconsonant. The variation of these genes across the B. thuringiensis genomes suggests that each strain may have adapted to their specific ecological niche. However, further investigations will be need to support this preliminary hypothesis.


September 22, 2019

The Chara genome: Secondary complexity and implications for plant terrestrialization.

Land plants evolved from charophytic algae, among which Charophyceae possess the most complex body plans. We present the genome of Chara braunii; comparison of the genome to those of land plants identified evolutionary novelties for plant terrestrialization and land plant heritage genes. C. braunii employs unique xylan synthases for cell wall biosynthesis, a phragmoplast (cell separation) mechanism similar to that of land plants, and many phytohormones. C. braunii plastids are controlled via land-plant-like retrograde signaling, and transcriptional regulation is more elaborate than in other algae. The morphological complexity of this organism may result from expanded gene families, with three cases of particular note: genes effecting tolerance to reactive oxygen species (ROS), LysM receptor-like kinases, and transcription factors (TFs). Transcriptomic analysis of sexual reproductive structures reveals intricate control by TFs, activity of the ROS gene network, and the ancestral use of plant-like storage and stress protection proteins in the zygote. Copyright © 2018 Elsevier Inc. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.