Menu
September 22, 2019

A mosaic monoploid reference sequence for the highly complex genome of sugarcane.

Sugarcane (Saccharum spp.) is a major crop for sugar and bioenergy production. Its highly polyploid, aneuploid, heterozygous, and interspecific genome poses major challenges for producing a reference sequence. We exploited colinearity with sorghum to produce a BAC-based monoploid genome sequence of sugarcane. A minimum tiling path of 4660 sugarcane BAC that best covers the gene-rich part of the sorghum genome was selected based on whole-genome profiling, sequenced, and assembled in a 382-Mb single tiling path of a high-quality sequence. A total of 25,316 protein-coding gene models are predicted, 17% of which display no colinearity with their sorghum orthologs. We show that the two species, S. officinarum and S. spontaneum, involved in modern cultivars differ by their transposable elements and by a few large chromosomal rearrangements, explaining their distinct genome size and distinct basic chromosome numbers while also suggesting that polyploidization arose in both lineages after their divergence.


September 22, 2019

npInv: accurate detection and genotyping of inversions using long read sub-alignment.

Detection of genomic inversions remains challenging. Many existing methods primarily target inzversions with a non repetitive breakpoint, leaving inverted repeat (IR) mediated non-allelic homologous recombination (NAHR) inversions largely unexplored.We present npInv, a novel tool specifically for detecting and genotyping NAHR inversion using long read sub-alignment of long read sequencing data. We benchmark npInv with other tools in both simulation and real data. We use npInv to generate a whole-genome inversion map for NA12878 consisting of 30 NAHR inversions (of which 15 are novel), including all previously known NAHR mediated inversions in NA12878 with flanking IR less than 7kb. Our genotyping accuracy on this dataset was 94%. We used PCR to confirm the presence of two of these novel inversions. We show that there is a near linear relationship between the length of flanking IR and the minimum inversion size, without inverted repeats.The application of npInv shows high accuracy in both simulation and real data. The results give deeper insight into understanding inversion.


September 22, 2019

Molecular characterization of invasive meningococcal isolates in Burkina Faso as the relative importance of serogroups X and W increases, 2008-2012.

Neisseria meningitidis serogroup A disease in Burkina Faso has greatly decreased following introduction of a meningococcal A conjugate vaccine in 2010, yet other serogroups continue to pose a risk of life-threatening disease. Capsule switching among epidemic-associated serogroup A N. meningitidis strains could allow these lineages to persist despite vaccination. The introduction of new strains at the national or sub-national levels could affect the epidemiology of disease.Isolates collected from invasive meningococcal disease in Burkina Faso between 2008 and 2012 were characterized by serogrouping and molecular typing. Genome sequences from a subset of isolates were used to infer phylogenetic relationships.The ST-5 clonal complex (CC5) was identified only among serogroup A isolates, which were rare after 2010. CC181 and CC11 were the most common clonal complexes after 2010, having serogroup X and W isolates, respectively. Whole-genome phylogenetic analysis showed that the CC181 isolates collected during and after the epidemic of 2010 formed a single clade that was closely related to isolates collected in Niger during 2005 and Burkina Faso during 2007. Geographic population structure was identified among the CC181 isolates, where pairs of isolates collected from the same region of Burkina Faso within a single year had less phylogenetic diversity than the CC181 isolate collection as a whole. However, the reduction of phylogenetic diversity within a region did not extend across multiple years. Instead, CC181 isolates collected during the same year had lower than average diversity, even when collected from different regions, indicating geographic mixing of strains across years. The CC11 isolates were primarily collected during the epidemic of 2012, with sparse sampling during 2011. These isolates belong to a clade that includes previously described isolates collected in Burkina Faso, Mali, and Niger from 2011 to 2015. Similar to CC181, reduced phylogenetic diversity was observed among CC11 isolate pairs collected from the same regions during a single year.The population of disease-associated N. meningitidis strains within Burkina Faso was highly dynamic between 2008 and 2012, reflecting both vaccine-imposed selection against serogroup A strains and potentially complex clonal waves of serogroup X and serogroup W strains.


September 22, 2019

Heterogeneous and flexible transmission of mcr-1 in hospital-associated Escherichia coli.

The recent emergence of a transferable colistin resistance mechanism, MCR-1, has gained global attention because of its threat to clinical treatment of infections caused by multidrug-resistant Gram-negative bacteria. However, the possible transmission route of mcr-1 among Enterobacteriaceae species in clinical settings is largely unknown. Here, we present a comprehensive genomic analysis of Escherichia coli isolates collected in a hospital in Hangzhou, China. We found that mcr-1-carrying isolates from clinical infections and feces of inpatients and healthy volunteers were genetically diverse and were not closely related phylogenetically, suggesting that clonal expansion is not involved in the spread of mcr-1 The mcr-1 gene was found on either chromosomes or plasmids, but in most of the E. coli isolates, mcr-1 was carried on plasmids. The genetic context of the plasmids showed considerable diversity as evidenced by the different functional insertion sequence (IS) elements, toxin-antitoxin (TA) systems, heavy metal resistance determinants, and Rep proteins of broad-host-range plasmids. Additionally, the genomic analysis revealed nosocomial transmission of mcr-1 and the coexistence of mcr-1 with other genes encoding ß-lactamases and fluoroquinolone resistance in the E. coli isolates. These findings indicate that mcr-1 is heterogeneously disseminated in both commensal and pathogenic strains of E. coli, suggest the high flexibility of this gene in its association with diverse genetic backgrounds of the hosts, and provide new insights into the genome epidemiology of mcr-1 among hospital-associated E. coli strains. IMPORTANCE Colistin represents one of the very few available drugs for treating infections caused by extensively multidrug-resistant Gram-negative bacteria. The recently emergent mcr-1 colistin resistance gene threatens the clinical utility of colistin and has gained global attention. How mcr-1 spreads in hospital settings remains unknown and was investigated by whole-genome sequencing of mcr-1-carrying Escherichia coli in this study. The findings revealed extraordinary flexibility of mcr-1 in its spread among genetically diverse E. coli hosts and plasmids, nosocomial transmission of mcr-1-carrying E. coli, and the continuous emergence of novel Inc types of plasmids carrying mcr-1 and new mcr-1 variants. Additionally, mcr-1 was found to be frequently associated with other genes encoding ß-lactams and fluoroquinolone resistance. These findings provide important information on the transmission and epidemiology of mcr-1 and are of significant public health importance as the information is expected to facilitate the control of this significant antibiotic resistance threat. Copyright © 2018 Shen et al.


September 22, 2019

Strand-seq enables reliable separation of long reads by chromosome via expectation maximization.

Current sequencing technologies are able to produce reads orders of magnitude longer than ever possible before. Such long reads have sparked a new interest in de novo genome assembly, which removes reference biases inherent to re-sequencing approaches and allows for a direct characterization of complex genomic variants. However, even with latest algorithmic advances, assembling a mammalian genome from long error-prone reads incurs a significant computational burden and does not preclude occasional misassemblies. Both problems could potentially be mitigated if assembly could commence for each chromosome separately.To address this, we show how single-cell template strand sequencing (Strand-seq) data can be leveraged for this purpose. We introduce a novel latent variable model and a corresponding Expectation Maximization algorithm, termed SaaRclust, and demonstrates its ability to reliably cluster long reads by chromosome. For each long read, this approach produces a posterior probability distribution over all chromosomes of origin and read directionalities. In this way, it allows to assess the amount of uncertainty inherent to sparse Strand-seq data on the level of individual reads. Among the reads that our algorithm confidently assigns to a chromosome, we observed more than 99% correct assignments on a subset of Pacific Bioscience reads with 30.1×?coverage. To our knowledge, SaaRclust is the first approach for the in silico separation of long reads by chromosome prior to assembly.https://github.com/daewoooo/SaaRclust.


September 22, 2019

GC content elevates mutation and recombination rates in the yeast Saccharomyces cerevisiae.

The chromosomes of many eukaryotes have regions of high GC content interspersed with regions of low GC content. In the yeast Saccharomyces cerevisiae, high-GC regions are often associated with high levels of meiotic recombination. In this study, we constructed URA3 genes that differ substantially in their base composition [URA3-AT (31% GC), URA3-WT (43% GC), and URA3-GC (63% GC)] but encode proteins with the same amino acid sequence. The strain with URA3-GC had an approximately sevenfold elevated rate of ura3 mutations compared with the strains with URA3-WT or URA3-AT About half of these mutations were single-base substitutions and were dependent on the error-prone DNA polymerase ?. About 30% were deletions or duplications between short (5-10 base) direct repeats resulting from DNA polymerase slippage. The URA3-GC gene also had elevated rates of meiotic and mitotic recombination relative to the URA3-AT or URA3-WT genes. Thus, base composition has a substantial effect on the basic parameters of genome stability and evolution. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

Unrestrained markerless trait stacking in Nannochloropsis gaditana through combined genome editing and marker recycling technologies.

Robust molecular tool kits in model and industrial microalgae are key to efficient targeted manipulation of endogenous and foreign genes in the nuclear genome for basic research and, as importantly, for the development of algal strains to produce renewable products such as biofuels. While Cas9-mediated gene knockout has been demonstrated in a small number of algal species with varying efficiency, the ability to stack traits or generate knockout mutations in two or more loci are often severely limited by selectable agent availability. This poses a critical hurdle in developing production strains, which require stacking of multiple traits, or in probing functionally redundant gene families. Here, we combine Cas9 genome editing with an inducible Cre recombinase in the industrial alga Nannochloropsis gaditana to generate a strain, NgCas9+Cre+, in which the potentially unlimited stacking of knockouts and addition of new genes is readily achievable. Cre-mediated marker recycling is first demonstrated in the removal of the selectable marker and GFP reporter transgenes associated with the Cas9/Cre construct in NgCas9+Cre+ Next, we show the proof-of-concept generation of a markerless knockout in a gene encoding an acyl-CoA oxidase (Aco1), as well as the markerless recapitulation of a 2-kb insert in the ZnCys gene 5′-UTR, which results in a doubling of wild-type lipid productivity. Finally, through an industrially oriented process, we generate mutants that exhibit up to ~50% reduction in photosynthetic antennae size by markerless knockout of seven genes in the large light-harvesting complex gene family. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

Genomic comparison of highly virulent, moderately virulent, and avirulent strains from a genetically closely-related MRSA ST239 sub-lineage provides insights into pathogenesis.

The genomic comparison of virulent (TW20), moderately virulent (CMRSA6/CMRSA3), and avirulent (M92) strains from a genetically closely-related MRSA ST239 sub-lineage revealed striking similarities in their genomes and antibiotic resistance profiles, despite differences in virulence and pathogenicity. The main differences were in the spa gene (coding for staphylococcal protein A), lpl genes (coding for lipoprotein-like membrane proteins), cta genes (genes involved in heme synthesis), and the dfrG gene (coding for a trimethoprim-resistant dihydrofolate reductase), as well as variations in the presence or content of some prophages and plasmids, which could explain the virulence differences of these strains. TW20 was positive for all genetic traits tested, compared to CMRSA6, CMRSA3, and M92. The major components differing among these strains included spa and lpl with TW20 carrying both whereas CMRSA6/CMRSA3 carry spa identical to TW20 but have a disrupted lpl. M92 is devoid of both these traits. Considering the role played by these components in innate immunity and virulence, it is predicted that since TW20 has both the components intact and functional, these traits contribute to its pathogenesis. However, CMRSA6/CMRSA3 are missing one of these components, hence their intermediately virulent nature. On the contrary, M92 is completely devoid of both the spa and lpl genes and is avirulent. Mobile genetic elements play a potential role in virulence. TW20 carries three prophages (?Sa6, ?Sa3, and ?SPß-like), a pathogenicity island and two plasmids. CMRSA6, CMRSA3, and M92 contain variations in one or more of these components. The virulence associated genes in these components include staphylokinase, entertoxins, antibiotic/antiseptic/heavy metal resistance and bacterial persistence. Additionally, there are many hypothetical proteins (present with variations among strains) with unknown function in these mobile elements which could be making an important contribution in the virulence of these strains. The above mentioned repertoire of virulence components in TW20 likely contributes to its increased virulence, while the absence and/or modification of one or more of these components in CMRSA6/CMRSA3 and M92 likely affects the virulence of the strains.


September 22, 2019

The genome assembly of the fungal pathogen Pyrenochaeta lycopersici from Single-Molecule Real-Time sequencing sheds new light on its biological complexity.

The first draft genome sequencing of the non-model fungal pathogen Pyrenochaeta lycopersici showed an expansion of gene families associated with heterokaryon incompatibility and lacking of mating-type genes, providing insights into the genetic basis of this “imperfect” fungus which lost the ability to produce the sexual stage. However, due to the Illumina short-read technology, the draft genome was too fragmented to allow a comprehensive characterization of the genome, especially of the repetitive sequence fraction. In this work, the sequencing of another P. lycopersici isolate using long-read Single Molecule Real-Time sequencing technology was performed with the aim of obtaining a gapless genome. Indeed, a gapless genome assembly of 62.7 Mb was obtained, with a fraction of repetitive sequences representing 30% of the total bases. The gene content of the two P. lycopersici isolates was very similar, and the large difference in genome size (about 8 Mb) might be attributable to the high fraction of repetitive sequences detected for the new sequenced isolate. The role of repetitive elements, including transposable elements, in modulating virulence effectors is well established in fungal plant pathogens. Moreover, transposable elements are of fundamental importance in creating and re-modelling genes, especially in imperfect fungi. Their abundance in P. lycopersici, together with the large expansion of heterokaryon incompatibility genes in both sequenced isolates, suggest the presence of possible mechanisms alternative to gene re-assorting mediated by sexual recombination. A quite large fraction (~9%) of repetitive elements in P. lycopersici, has no homology with known classes, strengthening this hypothesis. The availability of a gapless genome of P. lycopersici allowed the in-depth analysis of its genome content, by annotating functional genes and TEs. This goal will be an important resource for shedding light on the evolution of the reproductive and pathogenic behaviour of this soilborne pathogen and the onset of a possible speciation within this species.


September 22, 2019

Complete genome sequencing and comparative genomic analysis of Helicobacter apodemus isolated from the wild Korean striped field mouse (Apodemus agrarius) for potential pathogenicity

The Helicobacter bacterial genus comprises of spiral-shaped gram-negative bacteria with flagella that colonize the gastro-intestinal (GI) tract of humans and various mammals (Solnick and Schauer, 2001). In particular, Helicobacter pylori was classified as a group 1 carcinogen by the International Agency for Research on Cancer (IARC) in 1994, and has been shown to occur with a high prevalence in humans, although this varies between geographical regions, ethnic groups, and various populations (Kusters et al., 2006; Goh et al., 2011). To date, more than 37 Helicobacter species have been identified in addition to H. pylori (Péré-Védrenne et al., 2017). Furthermore, non-H. pylori Helicobacters (NHPH) have been shown to infect both humans and animals, and NHPH infections are associated with intestinal carcinoma, and mucinous adenocarcinoma (Swennes et al., 2016). Despite the demonstrated association between NHPH and disease, most studies to date have investigated H. pylori in humans; thus, it is necessary to characterize NHPH and elucidate its role in the GI tract of wild rodents which are potential Helicobacter carriers (Taylor et al., 2007; Mladenova-Hristova et al., 2017).


September 22, 2019

Genomic characterization of extensively drug-resistant Acinetobacter baumannii strain, KAB03 belonging to ST451 from Korea.

Extensively drug-resistant (XDR) Acinetobacter baumannii strains have emerged rapidly worldwide. The antibiotic resistance characteristics of XDR A. baumannii strains show regional differences; therefore, it is necessary to analyze both genomic and proteomic characteristics of emerging XDR A. baumannii clinical strains isolated in Korea to elucidate their multidrug resistance. Here, we isolated new sequence type of XDR A. baumannii clinical strain (KAB03) from Korean hospitals and performed comprehensive genome analyses. The strain belongs to new sequence type, ST451. Single nucleotide polymorphism (SNP) analysis with other types of A. baumannii strains revealed that KAB03 has unique SNP pattern in the regions of gyrB and gpi of MLST profiles. A. baumannii KAB03 harbours three antibiotic resistance islands (AbGRI1, 2, and 3). AbGRI1 harbours two copies of Tn2006 containing blaOXA-23, which play an important role in antibiotic resistance. AbGRI2 possesses aminoglycoside resistant gene aph(3′)-Ic and class A ß-lactamase blaTEM. AbGIR3 has macrolide resistant genes and aminoglycoside resistant gene armA. A. baumannii KAB03 harbours mutations in pmrB and pmrC, which are believed to confer colistin resistance. In addition, proteomic and transcriptional analysis of KAB03 confirmed that ß-lactamases (ADC-73 and OXA-23), Ade efflux pumps (AdeIJK), outer membrane proteins (OmpA and OmpW), and colistin resistance genes (PmrCAB) were major proteins responsible for antibiotic resistance. Our proteogenomic results provide valuable information for multi-drug resistance in emerging XDR A. baumannii strains belonging to ST451. Copyright © 2018. Published by Elsevier B.V.


September 22, 2019

The Chara genome: Secondary complexity and implications for plant terrestrialization.

Land plants evolved from charophytic algae, among which Charophyceae possess the most complex body plans. We present the genome of Chara braunii; comparison of the genome to those of land plants identified evolutionary novelties for plant terrestrialization and land plant heritage genes. C. braunii employs unique xylan synthases for cell wall biosynthesis, a phragmoplast (cell separation) mechanism similar to that of land plants, and many phytohormones. C. braunii plastids are controlled via land-plant-like retrograde signaling, and transcriptional regulation is more elaborate than in other algae. The morphological complexity of this organism may result from expanded gene families, with three cases of particular note: genes effecting tolerance to reactive oxygen species (ROS), LysM receptor-like kinases, and transcription factors (TFs). Transcriptomic analysis of sexual reproductive structures reveals intricate control by TFs, activity of the ROS gene network, and the ancestral use of plant-like storage and stress protection proteins in the zygote. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

An outbreak of a rare Shiga-toxin-producing Escherichia coli serotype (O117:H7) among men who have sex with men.

Sexually transmissible enteric infections (STEIs) are commonly associated with transmission among men who have sex with men (MSM). In the past decade, the UK has experienced multiple parallel STEI emergences in MSM caused by a range of bacterial species of the genus Shigella, and an outbreak of an uncommon serotype (O117?:?H7) of Shiga-toxin-producing Escherichia coli (STEC). Here, we used microbial genomics on 6 outbreak and 30 sporadic STEC O117?:?H7 isolates to explore the origins and pathogenic drivers of the STEC O117?:?H7 emergence in MSM. Using genomic epidemiology, we found that the STEC O117?:?H7 outbreak lineage was potentially imported from Latin America and likely continues to circulate both in the UK MSM population and in Latin America. We found genomic relationships consistent with existing symptomatic evidence for chronic infection with this STEC serotype. Comparative genomic analysis indicated the existence of a novel Shiga toxin 1-encoding prophage in the outbreak isolates, and evidence of horizontal gene exchange among the STEC O117?:?H7 outbreak lineage and other enteric pathogens. There was no evidence of increased virulence in the outbreak strains relative to contextual isolates, but the outbreak lineage was associated with azithromycin resistance. Comparing these findings with similar genomic investigations of emerging MSM-associated Shigella in the UK highlighted many parallels, the most striking of which was the importance of the azithromycin phenotype for STEI emergence in this patient group.


September 22, 2019

Biosynthetic Baeyer-Villiger chemistry enables access to two anthracene scaffolds from a single gene cluster in Deep-Sea-derived Streptomyces olivaceus SCSIO T05.

Four known compounds, rishirilide B (1), rishirilide C (2), lupinacidin A (3), and galvaquinone B (4), representing two anthracene scaffolds typical of aromatic polyketides, were isolated from a culture of the deep-sea-derived Streptomyces olivaceus SCSIO T05. From the S. olivaceus producer was cloned and sequenced the rsd biosynthetic gene cluster (BGC) that drives rishirilide biosynthesis. The structural gene rsdK2 inactivation and heterologous expression of the rsd BGC confirmed the single rsd BGC encodes construction of 1-4 and, thus, accounts for two anthracene scaffolds. Precursor incubation experiments with 13C-labeled acetate revealed that a Baeyer-Villiger-type rearrangement plays a central role in construction of 1-4. Two luciferase monooxygenase components, along with a reductase component, are presumably involved in the Baeyer-Villiger-type rearrangement reaction enabling access to the two anthracene scaffold variants. Engineering of the rsd BGC unveiled three SARP family transcriptional regulators, enhancing anthracene production. Inactivation of rsdR4, a MarR family transcriptional regulator, failed to impact production of 1-4, although production of 3 was slightly improved; most importantly rsdR4 inactivation led to the new adduct 6 in high titer. Notably, inactivation of rsdH, a putative amidohydrolase, substantially improved the overall titers of 1-4 by more than 4-fold.


September 22, 2019

Pol V-mediated translesion synthesis elicits localized untargeted mutagenesis during post-replicative gap repair.

In vivo, replication forks proceed beyond replication-blocking lesions by way of downstream repriming, generating daughter strand gaps that are subsequently processed by post-replicative repair pathways such as homologous recombination and translesion synthesis (TLS). The way these gaps are filled during TLS is presently unknown. The structure of gap repair synthesis was assessed by sequencing large collections of single DNA molecules that underwent specific TLS events in vivo. The higher error frequency of specialized relative to replicative polymerases allowed us to visualize gap-filling events at high resolution. Unexpectedly, the data reveal that a specialized polymerase, Pol V, synthesizes stretches of DNA both upstream and downstream of a site-specific DNA lesion. Pol V-mediated untargeted mutations are thus spread over several hundred nucleotides, strongly eliciting genetic instability on either side of a given lesion. Consequently, post-replicative gap repair may be a source of untargeted mutations critical for gene diversification in adaptation and evolution. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.