Menu
September 22, 2019

Assembly of chromosome-scale contigs by efficiently resolving repetitive sequences with long reads

Due to the large number of repetitive sequences in complex eukaryotic genomes, fragmented and incompletely assembled genomes lose value as reference sequences, often due to short contigs that cannot be anchored or mispositioned onto chromosomes. Here we report a novel method Highly Efficient Repeat Assembly (HERA), which includes a new concept called a connection graph as well as algorithms for constructing the graph. HERA resolves repeats at high efficiency with single-molecule sequencing data, and enables the assembly of chromosome-scale contigs by further integrating genome maps and Hi-C data. We tested HERA with the genomes of rice R498, maize B73, human HX1 and Tartary buckwheat Pinku1. HERA can correctly assemble most of the tandemly repetitive sequences in rice using single-molecule sequencing data only. Using the same maize and human sequencing data published by Jiao et al. (2017) and Shi et al. (2016), respectively, we dramatically improved on the sequence contiguity compared with the published assemblies, increasing the contig N50 from 1.3 Mb to 61.2 Mb in maize B73 assembly and from 8.3 Mb to 54.4 Mb in human HX1 assembly with HERA. We provided a high-quality maize reference genome with 96.9% of the gaps filled (only 76 gaps left) and several incorrectly positioned sequences fixed compared with the B73 RefGen_v4 assembly. Comparisons between the HERA assembly of HX1 and the human GRCh38 reference genome showed that many gaps in GRCh38 could be filled, and that GRCh38 contained some potential errors that could be fixed. We assembled the Pinku1 genome into 12 scaffolds with a contig N50 size of 27.85 Mb. HERA serves as a new genome assembly/phasing method to generate high quality sequences for complex genomes and as a curation tool to improve the contiguity and completeness of existing reference genomes, including the correction of assembly errors in repetitive regions.


September 22, 2019

C-di-GMP turnover influences motility and biofilm formation in Bacillus amyloliquefaciens PG12.

Bis-(3′?5′) cyclic dimeric guanosine monophosphate (c-di-GMP) is defined as a highly versatile secondary messenger in bacteria, coordinating diverse aspects of bacterial growth and behavior, including motility and biofilm formation. Bacillus amyloliquefaciens PG12 is an effective biocontrol agent against apple ring rot caused by Botryosphaeria dothidea. In this study, we characterized the core regulators of c-di-GMP turnover in B. amyloliquefaciens PG12. Using bioinformatic analysis, heterologous expression and biochemical characterization of knockout and overexpression derivatives, we identified and characterized two active diguanylate cyclases (which catalyze c-di-GMP biosynthesis), YhcK and YtrP and one active c-di-GMP phosphodiesterase (which degrades c-di-GMP), YuxH. Furthermore, we showed that elevating c-di-GMP levels up to a certain threshold inhibited the swimming motility of B. amyloliquefaciens PG12. Although yhcK, ytrP and yuxH knockout mutants did not display defects in biofilm formation, significant increases in c-di-GMP levels induced by YtrP or YuxH overexpression stimulated biofilm formation in B. amyloliquefaciens PG12. Our results indicate that B. amyloliquefaciens possesses a functional c-di-GMP signaling system that influences the bacterium’s motility and ability to form biofilms. Since motility and biofilm formation influence the efficacy of biological control agent, our work provides a basis for engineering a more effective strain of B. amyloliquefaciens PG12. Copyright © 2018 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.


September 22, 2019

Saliniramus fredricksonii gen. nov., sp. nov., a heterotrophic halophile isolated from Hot Lake, Washington, a member of a novel lineage (Salinarimonadaceae fam. nov.) within the order Rhizobiales, and reclassification of the genus Salinarimonas Liu et al. 2010 into Salinarimonadaceae.

There was an error in the proposed genus name in the published article, in that the genus ‘Salinivirga’ was effectively published while this article was in review. Therefore, the genus ‘Salinivirga’ should be replaced with ‘Saliniramus’. For the convenience of future readers, we have included the complete corrected article below, in which all occurrences of the incorrect genus name have been amended: A halophilic bacterial strain, HL-109T, was isolated from the unicyanobacterial consortium UCC-O, which was obtained from the photosynthetic mat of Hot Lake (Washington, USA). A polyphasic approach using phenotypic, genotypic and chemotaxonomic data was used to classify the strain within the order Rhizobiales. The organism stained Gram-negative and was a moderate thermophile with a growth optimum of 45?°C. It was obligately aerobic, heterotrophic and halophilic, growing in both NaCl and MgSO4 brines. The novel isolate had a polymorphic cellular morphology of short rods with occasional branching, and cells were monotrichous. The major fatty acids detected were C18?:?1, C18?:?0, C16?:?0 and C18?:?cyc. Phylogenetic analysis of the 16S rRNA gene placed the strain in the order Rhizobiales and it shared 94?% identity with the type strain of its nearest relative, Salinarimonas ramus. Morphological, chemotaxonomic and phylogenetic results did not affiliate the novel organism with any of the families in the Rhizobiales; therefore, HL-109T is representative of a new lineage, for which the name Saliniramus fredricksonii gen. nov., sp. nov. is proposed, with the type strain HL-109T (=JCM 31876T=DSM 102886T). In addition, examination of the phylogenetics of strain HL-109T and its nearest relatives, Salinarimonas ramus and Salinarimonasrosea, demonstrates that these halophiles form a clade distinct from the described families of the Rhizobiales. We further propose the establishment of a new family, Salinarimonadaceae fam. nov., to accommodate the genera Saliniramus and Salinarimonas (the type genus of the family).


September 22, 2019

Genome-wide DNA methylation and transcriptome changes in Mycobacterium tuberculosis with rifampicin and isoniazid resistance

We investigated the genome-wide DNA methylation and transcriptome changes in M. tuberculosis with rifampicin or isoniazid resistance. Single-molecule real-time (SMRT) sequencing and microarray technology were performed to expound DNA methylation profiles and differentially expressed genes in rifampicin or isoniazid resis- tant M. tuberculosis. Kyoto Encyclopedia of Genes and Genomes (KEGG) biological pathway analysis and meth- ylated regulatory network analysis were conducted by online forecasting databases. Integrated analysis of DNA methylation and transcriptome revealed that 335 differentially methylated genes (175 hypermethylated and 160 hypomethylated) and 132 significant differentially expressed genes (68 up-regulated and 63 down-regulated) were found to be regulated by both rifampicin and isoniazid in M. tuberculosis H37Rv. Correlation analysis showed that differential methylated genes were negatively correlated with their transcriptional levels in rifampicin or isoniazid resistant strains. KEGG pathway analysis indicated that nitrogen metabolism pathway is closely related to differ- entially methylated genes induced by rifampicin and isoniazid. KEGG also suggested that differentially expressed genes in rifampicin or isoniazid-resistant strains may play different roles in regulating signal transduction events. Furthermore, five differentially methylated candidate genes (Rv0840c, Rv2243, Rv0644c, Rv2386c and Rv1130) in rifampicin resistant strains and three genes (Rv0405, Rv0252 and Rv0908) in isoniazid-resistant strains were verified the existence of protein-protein interaction in STRING database. Integrated DNA methylation and transcrip- tome analyses provide an epigenetic overview of rifampicin and isoniazid-induced antibiotic resistance in M. tuber- culosis H37Rv. Several interesting genes and regulatory pathways may provide valuable resources for epigenetic studies in M. tuberculosis antibiotic resistance.


September 22, 2019

Recurrent loss, horizontal transfer, and the obscure origins of mitochondrial introns in diatoms (Bacillariophyta).

We sequenced mitochondrial genomes from five diverse diatoms (Toxarium undulatum, Psammoneis japonica, Eunotia naegelii, Cylindrotheca closterium, and Nitzschia sp.), chosen to fill important phylogenetic gaps and help us characterize broadscale patterns of mitochondrial genome evolution in diatoms. Although gene content was strongly conserved, intron content varied widely across species. The vast majority of introns were of group II type and were located in the cox1 or rnl genes. Although recurrent intron loss appears to be the principal underlying cause of the sporadic distributions of mitochondrial introns across diatoms, phylogenetic analyses showed that intron distributions superficially consistent with a recurrent-loss model were sometimes more complicated, implicating horizontal transfer as a likely mechanism of intron acquisition as well. It was not clear, however, whether diatoms were the donors or recipients of horizontally transferred introns, highlighting a general challenge in resolving the evolutionary histories of many diatom mitochondrial introns. Although some of these histories may become clearer as more genomes are sampled, high rates of intron loss suggest that the origins of many diatom mitochondrial introns are likely to remain unclear.


September 22, 2019

Comparative genomics provides insights into the marine adaptation in sponge-derived Kocuriaflava S43.

Sponge-derived actinomycetes represent a significant component of marine actinomycetes. Members of the genus Kocuria are distributed in various habitats such as soil, rhizosphere, clinical specimens, marine sediments, and sponges, however, to date, little is known about the mechanism of their environmental adaptation. Kocuria flava S43 was isolated from a coastal sponge. Phylogenetic analysis revealed that it was closely related to the terrestrial airborne K. flava HO-9041. In this study, to gain insights into the marine adaptation in K. flava S43 we sequenced the draft genome for K. flava S43 by third generation sequencing (TGS) and compared it with those of K. flava HO-9041 and some other Kocuria relatives. Comparative genomics and phylogenetic analyses revealed that K. flava S43 might adapt to the marine environment mainly by increasing the number of the genes linked to potassium homeostasis, resistance to heavy metals and phosphate metabolism, and acquiring the genes associated with electron transport and the genes encoding ATP-binding cassette (ABC) transporter, aquaporin, and thiol/disulfide interchange protein. Notably, gene acquisition was probably a primary mechanism of environmental adaptation in K. flava S43. Furthermore, this study also indicated that the Kocuria isolates from various marine and hyperosmotic environments possessed common genetic basis for environmental adaptation.


September 22, 2019

Recurrent symbiont recruitment from fungal parasites in cicadas.

Diverse insects are associated with ancient bacterial symbionts, whose genomes have often suffered drastic reduction and degeneration. In extreme cases, such symbiont genomes seem almost unable to sustain the basic cellular functioning, which comprises an open question in the evolution of symbiosis. Here, we report an insect group wherein an ancient symbiont lineage suffering massive genome erosion has experienced recurrent extinction and replacement by host-associated pathogenic microbes. Cicadas are associated with the ancient bacterial co-obligate symbionts Sulcia and Hodgkinia, whose streamlined genomes are specialized for synthesizing essential amino acids, thereby enabling the host to live on plant sap. However, our inspection of 24 Japanese cicada species revealed that while all species possessed Sulcia, only nine species retained Hodgkinia, and their genomes exhibited substantial structural instability. The remaining 15 species lacked Hodgkinia and instead harbored yeast-like fungal symbionts. Detailed phylogenetic analyses uncovered repeated Hodgkinia-fungus and fungus-fungus replacements in cicadas. The fungal symbionts were phylogenetically intermingled with cicada-parasitizing Ophiocordyceps fungi, identifying entomopathogenic origins of the fungal symbionts. Most fungal symbionts of cicadas were uncultivable, but the fungal symbiont of Meimuna opalifera was cultivable, possibly because it is at an early stage of fungal symbiont replacement. Genome sequencing of the fungal symbiont revealed its metabolic versatility, presumably capable of synthesizing almost all amino acids, vitamins, and other metabolites, which is more than sufficient to compensate for the Hodgkinia loss. These findings highlight a straightforward ecological and evolutionary connection between parasitism and symbiosis, which may provide an evolutionary trajectory to renovate deteriorated ancient symbiosis via pathogen domestication. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

Whole genome sequencing of Klebsiella pneumoniae strain unravels a new model for the development of extensive drug resistance in Enterobacteriaceae.

Increased incidence of carbapenem-resistant Enterobacteriaceae (CRE) has been reported worldwide. The WHO warns about the imminent risk to global health if the spread of resistant bacteria is not contained.Here, single molecule real time sequencing was used to analyse the whole genome and resistome of SKGH01, a strain of Klebsiella pneumoniae .The data showed that SKGH01 was resistant to all commercially available antibiotics. A complete account of extensively drug-resistant (XDR) CRE at a genomic level and the entire location map of all antibiotic resistance components are here presented. Additionally, this work proposes a model of XDR acquisition in Enterobacteriaceae.


September 22, 2019

Mosaic structure as the main feature of Mycobacterium bovis BCG genomes

Background: The genome stability of attenuated live BCG vaccine preventing the acute forms of childhood tuberculosis is an important aspect of vaccine production. The pur- pose of our study was a whole genome comparative analysis of BCG sub-strains and identification of potential triggers of sub-strains’ transition. Results: Genomes of three BCG Russia seed lots (1963, 1982, 2006 years) have been sequenced, and the stability of vaccine sub-strain genomes has been confirmed. A com- parative genome analysis of nine Mycobacterium bovis BCG and three M. bovis strains revealed their specific genome features associated with prophage profiles. A number of prophage-coded homologs to Caudovirales ORFs were common to all BCG genomes. Prophage profiles of BCG Tice and BCG Montreal genomes were unique and coded homologs to herpes viruses ORFs. The data of phylogenetic analysis of BCG sub-strain groups based on whole genome sequences and genome restriction maps were in con- gruence with prophage profiles. The only fragmentary similarity of specific prophage sequences of BCG Tice, BCG Montreal, and BCG Russia 368 in pair-wise alignments was observed, suggesting the impact of prophages on mosaic structure of genomes. Conclusions: The whole genome sequencing approach is essential for genomes with mosaic structure, harboring numerous prophage sequences. Tools for prophage search are effective instruments in this analysis.


September 22, 2019

Genome sequencing of Streptomyces atratus SCSIOZH16 and activation production of nocardamine via metabolic engineering.

The Actinomycetes are metabolically flexible microorganisms capable of producing a wide range of interesting compounds, including but by no means limited to, siderophores which have high affinity for ferric iron. In this study, we report the complete genome sequence of marine-derived Streptomyces atratus ZH16 and the activation of an embedded siderophore gene cluster via the application of metabolic engineering methods. The S. atratus ZH16 genome reveals that this strain has the potential to produce 26 categories of natural products (NPs) barring the ilamycins. Our activation studies revealed S. atratus SCSIO ZH16 to be a promising source of the production of nocardamine-type (desferrioxamine) compounds which are important in treating acute iron intoxication and performing ecological remediation. We conclude that metabolic engineering provides a highly effective strategy by which to discover drug-like compounds and new NPs in the genomic era.


September 22, 2019

Genome Assembly.

Genome assembly uses sequence similarity to go from sequencing reads to longer contiguous sequences (contigs). Scaffolds are contigs linked together by gaps where the order and orientation of the contigs is known but the exact sequence connecting two contigs is unknown, represented by Ns which estimate the gap length. Here we describe recommendations for genome assembly for different sequencing technologies, describe organelle assembly, and review how to perform assembly quality control.


September 22, 2019

Computational Modeling of Multidrug-Resistant Bacteria

Understanding how complex phenotypes arise from individual molecules and their interactions is a primary challenge in biology, and computational approaches have been increasingly employed to tackle this task. In this chapter, we describe current efforts by FIOCRUZ and partners to develop integrated computational models of multidrug-resistant bacteria. The bacterium chosen as the main focus of this effort is Pseudomonas aeruginosa, an opportunistic pathogen associated with a broad spectrum of infections in humans. Nowadays, P. aeruginosa is one of the main problems of healthcare-associated infections (HAI) in the world, because of its great capacity of survival in hospital environments and its intrinsic resistance to many antibiotics. Our overall research objective is to use integrated computational models to accurately predict a wide range of observable cellular behaviors of multidrug-resistant P. aeruginosa CCBH4851, which is a strain belonging to the clone ST277, endemic in Brazil. In this chapter, after a brief introduction to P. aeruginosa biology, we discuss the construction of metabolic and gene regulatory networks of P. aeruginosa CCBH 4851 from its genome. We also illustrate how these networks can be integrated into a single model, and we discuss methods for identifying potential therapeutic targets through integrated models.


September 22, 2019

Long-read whole genome sequencing and comparative analysis of six strains of the human pathogen Orientia tsutsugamushi.

Orientia tsutsugamushi is a clinically important but neglected obligate intracellular bacterial pathogen of the Rickettsiaceae family that causes the potentially life-threatening human disease scrub typhus. In contrast to the genome reduction seen in many obligate intracellular bacteria, early genetic studies of Orientia have revealed one of the most repetitive bacterial genomes sequenced to date. The dramatic expansion of mobile elements has hampered efforts to generate complete genome sequences using short read sequencing methodologies, and consequently there have been few studies of the comparative genomics of this neglected species.We report new high-quality genomes of O. tsutsugamushi, generated using PacBio single molecule long read sequencing, for six strains: Karp, Kato, Gilliam, TA686, UT76 and UT176. In comparative genomics analyses of these strains together with existing reference genomes from Ikeda and Boryong strains, we identify a relatively small core genome of 657 genes, grouped into core gene islands and separated by repeat regions, and use the core genes to infer the first whole-genome phylogeny of Orientia.Complete assemblies of multiple Orientia genomes verify initial suggestions that these are remarkable organisms. They have larger genomes compared with most other Rickettsiaceae, with widespread amplification of repeat elements and massive chromosomal rearrangements between strains. At the gene level, Orientia has a relatively small set of universally conserved genes, similar to other obligate intracellular bacteria, and the relative expansion in genome size can be accounted for by gene duplication and repeat amplification. Our study demonstrates the utility of long read sequencing to investigate complex bacterial genomes and characterise genomic variation.


September 22, 2019

The mutation rate and the age of the sex chromosomes in Silene latifolia.

Many aspects of sex chromosome evolution are common to both plants and animals [1], but the process of Y chromosome degeneration, where genes on the Y become non-functional over time, may be much slower in plants due to purifying selection against deleterious mutations in the haploid gametophyte [2, 3]. Testing for differences in Y degeneration between the kingdoms has been hindered by the absence of accurate age estimates for plant sex chromosomes. Here, we used genome resequencing to estimate the spontaneous mutation rate and the age of the sex chromosomes in white campion (Silene latifolia). Screening of single nucleotide polymorphisms (SNPs) in parents and 10 F1 progeny identified 39 de novo mutations and yielded a rate of 7.31 × 10-9 (95% confidence interval: 5.20 × 10-9 – 8.00 × 10-9) mutations per site per haploid genome per generation. Applying this mutation rate to the synonymous divergence between homologous X- and Y-linked genes (gametologs) gave age estimates of 11.00 and 6.32 million years for the old and young strata, respectively. Based on SNP segregation patterns, we inferred which genes were Y-linked and found that at least 47% are already dysfunctional. Applying our new estimates for the age of the sex chromosomes indicates that the rate of Y degeneration in S. latifolia is nearly 2-fold slower when compared to animal sex chromosomes of a similar age. Our revised estimates support Y degeneration taking place more slowly in plants, a discrepancy that may be explained by differences in the life cycles of animals and plants. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

Characterization of a novel multidrug resistance plasmid pSGB23 isolated from Salmonella enterica subspecies enterica serovar Saintpaul.

Salmonella enterica subspecies enterica serovar Saintpaul (S. Saintpaul) is an important gut pathogen which causes salmonellosis worldwide. Although intestinal salmonellosis is usually self-limiting, it can be life-threatening in children, the elderlies and immunocompromised patients. Appropriate antibiotic treatment is therefore required for these patients. However, the efficacy of many antibiotics on S. enterica infections has been greatly compromised due to spreading of multidrug resistance (MDR) plasmids, which poses serious threats on public health and needs to be closely monitored. In this study, we sequenced and fully characterized an S. enterica MDR plasmid pSGB23 isolated from chicken.Complete genome sequence analysis revealed that S. Saintpaul strain SGB23 harbored a 254 kb megaplasmid pSGB23, which carries 11 antibiotic resistance genes responsible for resistance to 9 classes of antibiotics and quaternary ammonium compounds that are commonly used to disinfect food processing facilities. Furthermore, we found that pSGB23 carries multiple conjugative systems, which allow it to spread into other Enterobacteriaceae spp. by self-conjugation. It also harbors multiple types of replicons and plasmid maintenance and addictive systems, which explains its broad host range and stable inheritance.We report here a novel MDR plasmid pSGB23 harboured by S. enterica. To our knowledge, it carried the greatest number of antibiotic resistance genes with the broadest range of resistance spectrum among S. enterica MDR plasmids identified so far. The isolation of pSGB23 from food sources is worrisome, while surveillance on its further spreading will be carried out based on the findings reported in this study.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.