Menu
July 7, 2019

On the (im)possibility of reconstructing plasmids from whole-genome short-read sequencing data.

To benchmark algorithms for automated plasmid sequence reconstruction from short-read sequencing data, we selected 42 publicly available complete bacterial genome sequences spanning 12 genera, containing 148 plasmids. We predicted plasmids from short-read data with four programs (PlasmidSPAdes, Recycler, cBar and PlasmidFinder) and compared the outcome to the reference sequences. PlasmidSPAdes reconstructs plasmids based on coverage differences in the assembly graph. It reconstructed most of the reference plasmids (recall=0.82), but approximately a quarter of the predicted plasmid contigs were false positives (precision=0.75). PlasmidSPAdes merged 84?% of the predictions from genomes with multiple plasmids into a single bin. Recycler searches the assembly graph for sub-graphs corresponding to circular sequences and correctly predicted small plasmids, but failed with long plasmids (recall=0.12, precision=0.30). cBar, which applies pentamer frequency analysis to detect plasmid-derived contigs, showed a recall and precision of 0.76 and 0.62, respectively. However, cBar categorizes contigs as plasmid-derived and does not bin the different plasmids. PlasmidFinder, which searches for replicons, had the highest precision (1.0), but was restricted by the contents of its database and the contig length obtained fromde novoassembly (recall=0.36). PlasmidSPAdes and Recycler detected putative small plasmids (<10?kbp), which were also predicted as plasmids by cBar, but were absent in the original assembly. This study shows that it is possible to automatically predict small plasmids. Prediction of large plasmids (>50?kbp) containing repeated sequences remains challenging and limits the high-throughput analysis of plasmids from short-read whole-genome sequencing data.


July 7, 2019

A 3-way hybrid approach to generate a new high-quality chimpanzee reference genome (Pan_tro_3.0).

The chimpanzee is arguably the most important species for the study of human origins. A key resource for these studies is a high-quality reference genome assembly; however, as with most mammalian genomes, the current iteration of the chimpanzee reference genome assembly is highly fragmented. In the current iteration of the chimpanzee reference genome assembly (Pan_tro_2.1.4), the sequence is scattered across more then 183 000 contigs, incorporating more than 159 000 gaps, with a genome-wide contig N50 of 51 Kbp. In this work, we produce an extensive and diverse array of sequencing datasets to rapidly assemble a new chimpanzee reference that surpasses previous iterations in bases represented and organized in large scaffolds. To this end, we show substantial improvements over the current release of the chimpanzee genome (Pan_tro_2.1.4) by several metrics, such as increased contiguity by >750% and 300% on contigs and scaffolds, respectively, and closure of 77% of gaps in the Pan_tro_2.1.4 assembly gaps spanning >850 Kbp of the novel coding sequence based on RNASeq data. We further report more than 2700 genes that had putatively erroneous frame-shift predictions to human in Pan_tro_2.1.4 and show a substantial increase in the annotation of repetitive elements. We apply a simple 3-way hybrid approach to considerably improve the reference genome assembly for the chimpanzee, providing a valuable resource for the study of human origins. Furthermore, we produce extensive sequencing datasets that are all derived from the same cell line, generating a broad non-human benchmark dataset.© The Author 2017. Published by Oxford University Press.


July 7, 2019

Trajectories and drivers of genome evolution in surface-associated marine Phaeobacter.

The extent of genome divergence and the evolutionary events leading to speciation of marine bacteria have mostly been studied for (locally) abundant, free-living groups. The genus Phaeobacter is found on different marine surfaces, seems to occupy geographically disjunct habitats, and is involved in different biotic interactions, and was therefore targeted in the present study. The analysis of the chromosomes of 32 closely related but geographically spread Phaeobacter strains revealed an exceptionally large, highly syntenic core genome. The flexible gene pool is constantly but slightly expanding across all Phaeobacter lineages. The horizontally transferred genes mostly originated from bacteria of the Roseobacter group and horizontal transfer most likely was mediated by gene transfer agents. No evidence for geographic isolation and habitat specificity of the different phylogenomic Phaeobacter clades was detected based on the sources of isolation. In contrast, the functional gene repertoire and physiological traits of different phylogenomic Phaeobacter clades were sufficiently distinct to suggest an adaptation to an associated lifestyle with algae, to additional nutrient sources, or toxic heavy metals. Our study reveals that the evolutionary trajectories of surface-associated marine bacteria can differ significantly from free-living marine bacteria or marine generalists.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Complete genome sequence of Bacillus altitudinis P-10, a potential bioprotectant against Xanthomonas oryzae pv. oryzae, isolated from rice rhizosphere in Java, Indonesia.

Bacillus altitudinis P-10 was isolated from the rhizosphere of rice grown in an organic rice field and provides strong antagonism against the bacterial blight caused by Xanthomonas oryzae pv. oryzae in rice. Herein, we provide the complete genome sequence and a possible explanation of the antibiotic function of the P-10 strain.


July 7, 2019

High-quality draft genome sequence of Streptomyces agglomeratus 5-1-8 with strong anti-MRSA ability, isolated from the frozen soil of Tibet in China

Streptomyces agglomeratus 5-1-8 with strong anti methicillin-resistant Staphylococcus aureus (MRSA) ability, isolated from the frozen soil of Tibet in China, has a strong ability to kill the multi-drugs-resistant MRSA. To identify the second-ary metabolism ability of this strain, we describe here the phenotypic characteristics of this strain, along with its high-quality draft genome sequence, its annotation, and analysis. The 7.1M draft genome encodes 6,284 putative open reading frames (ORFs), of which 4,416 ORFs were assigned with clusters of orthologous genes (COG) categories. Also, 65 tRNA genes and 24 rRNA operons were identified. The genome contains 12 gene clusters involved in antibiotics production and 1 gene cluster involved in anticancer-compounds production; 4 gene clusters belong to polyketides and nonribosomal peptides, 1 gene cluster belong to the butyrolactone, 4 gene clusters belong to the bacteriocin or lantipeptide, and 3 gene clusters belong to the others. This genome-sequence data will facilitate efforts to probe the potential of new antibiotics to kill multi-drugs-resistant MRSA.


July 7, 2019

Disease onset in X-linked dystonia-parkinsonism correlates with expansion of a hexameric repeat within an SVA retrotransposon in TAF1.

X-linked dystonia-parkinsonism (XDP) is a neurodegenerative disease associated with an antisense insertion of a SINE-VNTR-Alu (SVA)-type retrotransposon within an intron ofTAF1This unique insertion coincides with six additional noncoding sequence changes inTAF1, the gene that encodes TATA-binding protein-associated factor-1, which appear to be inherited together as an identical haplotype in all reported cases. Here we examined the sequence of this SVA in XDP patients (n= 140) and detected polymorphic variation in the length of a hexanucleotide repeat domain, (CCCTCT)nThe number of repeats in these cases ranged from 35 to 52 and showed a highly significant inverse correlation with age at disease onset. Because other SVAs exhibit intrinsic promoter activity that depends in part on the hexameric domain, we assayed the transcriptional regulatory effects of varying hexameric lengths found in the unique XDP SVA retrotransposon using luciferase reporter constructs. When inserted sense or antisense to the luciferase reading frame, the XDP variants repressed or enhanced transcription, respectively, to an extent that appeared to vary with length of the hexamer. Further in silico analysis of this SVA sequence revealed multiple motifs predicted to form G-quadruplexes, with the greatest potential detected for the hexameric repeat domain. These data directly link sequence variation within the XDP-specific SVA sequence to phenotypic variability in clinical disease manifestation and provide insight into potential mechanisms by which this intronic retroelement may induce transcriptional interference inTAF1expression. Copyright © 2017 the Author(s). Published by PNAS.


July 7, 2019

Sex-specific influences of mtDNA mitotype and diet on mitochondrial functions and physiological traits in Drosophila melanogaster.

Here we determine the sex-specific influence of mtDNA type (mitotype) and diet on mitochondrial functions and physiology in two Drosophila melanogaster lines. In many species, males and females differ in aspects of their energy production. These sex-specific influences may be caused by differences in evolutionary history and physiological functions. We predicted the influence of mtDNA mutations should be stronger in males than females as a result of the organelle’s maternal mode of inheritance in the majority of metazoans. In contrast, we predicted the influence of diet would be greater in females due to higher metabolic flexibility. We included four diets that differed in their protein: carbohydrate (P:C) ratios as they are the two-major energy-yielding macronutrients in the fly diet. We assayed four mitochondrial function traits (Complex I oxidative phosphorylation, reactive oxygen species production, superoxide dismutase activity, and mtDNA copy number) and four physiological traits (fecundity, longevity, lipid content, and starvation resistance). Traits were assayed at 11 d and 25 d of age. Consistent with predictions we observe that the mitotype influenced males more than females supporting the hypothesis of a sex-specific selective sieve in the mitochondrial genome caused by the maternal inheritance of mitochondria. Also, consistent with predictions, we found that the diet influenced females more than males.


July 7, 2019

Comparative whole-genomic analysis of an ancient L2 lineage Mycobacterium novel phylogenetic clade and common genetic determinants of hypervirulent strains.

Background: Development of improved therapeutics against tuberculosis (TB) is hindered by an inadequate understanding of the relationship between disease severity and genetic diversity of its causative agent, Mycobacterium tuberculosis. We previously isolated a hypervirulent M. tuberculosis strain H112 from an HIV-negative patient with an aggressive disease progression from pulmonary TB to tuberculous meningitis—the most severe manifestation of tuberculosis. Human macrophage challenge experiment demonstrated that the strain H112 exhibited significantly better intracellular survivability and induced lower level of TNF-a than the reference virulent strain H37Rv and other 123 clinical isolates. Aim: The present study aimed to identify the potential genetic determinants of mycobacterial virulence that were common to strain H112 and hypervirulent M. tuberculosis strains of the same phylogenetic clade isolated in other global regions. Methods: A low-virulent M. tuberculosis strain H54 which belonged to the same phylogenetic lineage (L2) as strain H112 was selected from a collection of 115 clinical isolates. Both H112 and H54 were whole-genome-sequenced using PacBio sequencing technology. A comparative genomics approach was adopted to identify mutations present in strain H112 but absent in strain H54. Subsequently, an extensive phylogenetic analysis was conducted by including all publically available M. tuberculosis genomes. Single-nucleotide-polymorphisms (SNPs) and structural variations (SVs) common to hypervirulent strains in the global collection of genomes were considered as potential genetic determinants of hypervirulence. Results: Sequencing data revealed that both H112 and H54 were identified as members of the same sub-lineage L2.2.1. After excluding the lineage-related mutations shared between H112 and H54, we analyzed the phylogenetic relatedness of H112 with global collection of M. tuberculosis genomes (n = 4,338), and identified a novel phylogenetic clade in which four hypervirulent strains isolated from geographically diverse regions were clustered together. All hypervirulent strains in the clade shared 12 SNPs and 5 SVs with H112, including those affecting key virulence-associated loci, notably, a deleterious SNP (rv0178 p. D150E) within mce1 operon and an intergenic deletion (854259_ 854261delCC) in close-proximity to phoP. Conclusion: The present study identified common genetic factors in a novel phylogenetic clade of hypervirulent M. tuberculosis. The causative role of these mutations in mycobacterial virulence should be validated in future study.


July 7, 2019

De novo design and synthesis of a 30-cistron translation-factor module.

Two of the many goals of synthetic biology are synthesizing large biochemical systems and simplifying their assembly. While several genes have been assembled together by modular idempotent cloning, it is unclear if such simplified strategies scale to very large constructs for expression and purification of whole pathways. Here we synthesize from oligodeoxyribonucleotides a completely de-novo-designed, 58-kb multigene DNA. This BioBrick plasmid insert encodes 30 of the 31 translation factors of the PURE translation system, each His-tagged and in separate transcription cistrons. Dividing the insert between three high-copy expression plasmids enables the bulk purification of the aminoacyl-tRNA synthetases and translation factors necessary for affordable, scalable reconstitution of an in vitro transcription and translation system, PURE 3.0.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

Nitrogen fixation genes and nitrogenase activity of the non-heterocystous cyanobacterium Thermoleptolyngbya sp. O-77.

Cyanobacteria are widely distributed in marine, aquatic, and terrestrial ecosystems, and play an important role in the global nitrogen cycle. In the present study, we examined the genome sequence of the thermophilic non-heterocystous N2-fixing cyanobacterium, Thermoleptolyngbya sp. O-77 (formerly known as Leptolyngbya sp. O-77) and characterized its nitrogenase activity. The genome of this cyanobacterial strain O-77 consists of a single chromosome containing a nitrogen fixation gene cluster. A phylogenetic analysis indicated that the NifH amino acid sequence from strain O-77 was clustered with those from a group of mesophilic species: the highest identity was found in Leptolyngbya sp. KIOST-1 (97.9% sequence identity). The nitrogenase activity of O-77 cells was dependent on illumination, whereas a high intensity of light of 40 µmol m-2 s-1 suppressed the effects of illumination.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.