Menu
April 21, 2020

Complete genome sequence unveiled cellulose degradation enzymes and secondary metabolic potentials in Streptomyces sp. CC0208.

Marine Streptomyces sp. CC0208 isolated from the Bohai Bay showed high efficiency of cellulose degradation under optimized fermentation parameters. Also, as one of the bioinformatics-based approaches for the discovery of novel natural product and enzyme effectively, genome mining has been developed and applied widely. Herein, we reported the complete genome sequence of Streptomyces sp. CC0208.Whole-genome sequencing analysis revealed a genome size of 9,325,981?bp with a linear chromosome, GC content of 70.59% and 8487 protein-coding genes. Abundant genes have predicted functions in antibiotic metabolism and enzymes. A 20 enzymes closely associated with cellulose degradation were discovered. A total of 25 biosynthetic gene clusters (BGCs) of secondary metabolites were identified, including diverse classes of natural products. The availability of genome sequence of Streptomyces sp. CC0208 not only will assist in cracking the mechanism of cellulose degradation but also will provide the insights into the significant secondary metabolic potentials for the production of diverse compound classes based on rational strategies. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


April 21, 2020

Long-read sequencing identified intronic repeat expansions in SAMD12 from Chinese pedigrees affected with familial cortical myoclonic tremor with epilepsy.

The locus for familial cortical myoclonic tremor with epilepsy (FCMTE) has long been mapped to 8q24 in linkage studies, but the causative mutations remain unclear. Recently, expansions of intronic TTTCA and TTTTA repeat motifs within SAMD12 were found to be involved in the pathogenesis of FCMTE in Japanese pedigrees. We aim to identify the causative mutations of FCMTE in Chinese pedigrees.We performed genetic linkage analysis by microsatellite markers in a five-generation Chinese pedigree with 55 members. We also used array-comparative genomic hybridisation (CGH) and next-generation sequencing (NGS) technologies (whole-exome sequencing, capture region deep sequencing and whole-genome sequencing) to identify the causative mutations in the disease locus. Recently, we used low-coverage (~10×) long-read genome sequencing (LRS) on the PacBio Sequel and Oxford Nanopore platforms to identify the causative mutations, and used repeat-primed PCR for validation of the repeat expansions.Linkage analysis mapped the disease locus to 8q23.3-24.23. Array-CGH and NGS failed to identify causative mutations in this locus. LRS identified the intronic TTTCA and TTTTA repeat expansions in SAMD12 as the causative mutations, thus corroborating the recently published results in Japanese pedigrees.We identified the pentanucleotide repeat expansion in SAMD12 as the causative mutation in Chinese FCMTE pedigrees. Our study also suggested that LRS is an effective tool for molecular diagnosis of genetic disorders, especially for neurological diseases that cannot be positively diagnosed by conventional clinical microarray and NGS technologies. © Author(s) (or their employer(s)) 2019. No commercial re-use. See rights and permissions. Published by BMJ.


April 21, 2020

Carbohydrate catabolic capability of a Flavobacteriia bacterium isolated from hadal water.

Flavobacteriia are abundant in many marine environments including hadal waters, as demonstrated recently. However, it is unclear how this flavobacterial population adapts to hadal conditions. In this study, extensive comparative genomic analyses were performed for the flavobacterial strain Euzebyella marina RN62 isolated from the Mariana Trench hadal water in low abundance. The complete genome of RN62 possessed a considerable number of carbohydrate-active enzymes with a different composition. There was a predominance of GH family 13 proteins compared to closely related relatives, suggesting that RN62 has preserved a certain capacity for carbohydrate utilization and that the hadal ocean may hold an organic matter reservoir distinct from the surface ocean. Additionally, RN62 possessed potential intracellular cycling of the glycogen/starch pathway, which may serve as a strategy for carbon storage and consumption in response to nutrient pulse and starvation. Moreover, the discovery of higher glycoside hydrolase dissimilarities among Flavobacteriia, compared to peptidases and transporters, suggested variation in polysaccharide utilization related traits as an important ecophysiological factor in response to environmental alterations, such as decreased labile organic carbon in hadal waters. The presence of abundant toxin exporting, transcription and signal transduction related genes in RN62 may further help to survive in hadal conditions, including high pressure/low temperature.Copyright © 2019 Elsevier GmbH. All rights reserved.


April 21, 2020

Genetic characterization and potential molecular dissemination mechanism of tet(31) gene in Aeromonas caviae from an oxytetracycline wastewater treatment system.

Recently, the rarely reported tet(31) tetracycline resistance determinant was commonly found in Aeromonas salmonicida, Gallibacterium anatis, and Oblitimonas alkaliphila isolated from farming animals and related environment. However, its distribution in other bacteria and potential molecular dissemination mechanism in environment are still unknown. The purpose of this study was to investigate the potential mechanism underlying dissemination of tet(31) by analysing the tet(31)-carrying fragments in A. caviae strains isolated from an aerobic biofilm reactor treating oxytetracycline bearing wastewater. Twenty-three A. caviae strains were screened for the tet(31) gene by polymerase chain reaction (PCR). Three strains (two harbouring tet(31), one not) were subjected to whole genome sequencing using the PacBio RSII platform. Seventeen A. caviae strains carried the tet(31) gene and exhibited high resistance levels to oxytetracycline with minimum inhibitory concentrations (MICs) ranging from 256 to 512?mg/L. tet(31) was comprised of the transposon Tn6432 on the chromosome of A. caviae, and Tn6432 was also found in 15 additional tet(31)-positive A. caviae isolates by PCR. More important, Tn6432 was located on an integrative conjugative element (ICE)-like element, which could mediate the dissemination of the tet(31)-carrying transposon Tn6432 between bacteria. Comparative analysis demonstrated that Tn6432 homologs with the structure ISCR2-?phzF-tetR(31)-tet(31)-?glmM-sul2 were also carried by A. salmonicida, G. anatis, and O. alkaliphila, suggesting that this transposon can be transferred between species and even genera. This work provides the first report on the identification of the tet(31) gene in A. caviae, and will be helpful in exploring the dissemination mechanisms of tet(31) in water environment.Copyright © 2018. Published by Elsevier B.V.


April 21, 2020

Toxin and genome evolution in a Drosophila defensive symbiosis.

Defenses conferred by microbial symbionts play a vital role in the health and fitness of their animal hosts. An important outstanding question in the study of defensive symbiosis is what determines long term stability and effectiveness against diverse natural enemies. In this study, we combine genome and transcriptome sequencing, symbiont transfection and parasite protection experiments, and toxin activity assays to examine the evolution of the defensive symbiosis between Drosophila flies and their vertically transmitted Spiroplasma bacterial symbionts, focusing in particular on ribosome-inactivating proteins (RIPs), symbiont-encoded toxins that have been implicated in protection against both parasitic wasps and nematodes. Although many strains of Spiroplasma, including the male-killing symbiont (sMel) of Drosophila melanogaster, protect against parasitic wasps, only the strain (sNeo) that infects the mycophagous fly Drosophila neotestacea appears to protect against parasitic nematodes. We find that RIP repertoire is a major differentiating factor between strains that do and do not offer nematode protection, and that sMel RIPs do not show activity against nematode ribosomes in vivo. We also discovered a strain of Spiroplasma infecting a mycophagous phorid fly, Megaselia nigra. Although both the host and its Spiroplasma are distantly related to D. neotestacea and its symbiont, genome sequencing revealed that the M. nigra symbiont encodes abundant and diverse RIPs, including plasmid-encoded toxins that are closely related to the RIPs in sNeo. Our results suggest that distantly related Spiroplasma RIP toxins may perform specialized functions with regard to parasite specificity and suggest an important role for horizontal gene transfer in the emergence of novel defensive phenotypes.


April 21, 2020

Confident phylogenetic identification of uncultured prokaryotes through long read amplicon sequencing of the 16S-ITS-23S rRNA operon.

Amplicon sequencing of the 16S rRNA gene is the predominant method to quantify microbial compositions and to discover novel lineages. However, traditional short amplicons often do not contain enough information to confidently resolve their phylogeny. Here we present a cost-effective protocol that amplifies a large part of the rRNA operon and sequences the amplicons with PacBio technology. We tested our method on a mock community and developed a read-curation pipeline that reduces the overall read error rate to 0.18%. Applying our method on four environmental samples, we captured near full-length rRNA operon amplicons from a large diversity of prokaryotes. The method operated at moderately high-throughput (22286-37,850 raw ccs reads) and generated a large amount of putative novel archaeal 23S rRNA gene sequences compared to the archaeal SILVA database. These long amplicons allowed for higher resolution during taxonomic classification by means of long (~1000 bp) 16S rRNA gene fragments and for substantially more confident phylogenies by means of combined near full-length 16S and 23S rRNA gene sequences, compared to shorter traditional amplicons (250 bp of the 16S rRNA gene). We recommend our method to those who wish to cost-effectively and confidently estimate the phylogenetic diversity of prokaryotes in environmental samples at high throughput. © 2019 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.


April 21, 2020

Long-read sequence capture of the haemoglobin gene clusters across codfish species.

Combining high-throughput sequencing with targeted sequence capture has become an attractive tool to study specific genomic regions of interest. Most studies have so far focused on the exome using short-read technology. These approaches are not designed to capture intergenic regions needed to reconstruct genomic organization, including regulatory regions and gene synteny. Here, we demonstrate the power of combining targeted sequence capture with long-read sequencing technology for comparative genomic analyses of the haemoglobin (Hb) gene clusters across eight species separated by up to 70 million years. Guided by the reference genome assembly of the Atlantic cod (Gadus morhua) together with genome information from draft assemblies of selected codfishes, we designed probes covering the two Hb gene clusters. Use of custom-made barcodes combined with PacBio RSII sequencing led to highly continuous assemblies of the LA (~100 kb) and MN (~200 kb) clusters, which include syntenic regions of coding and intergenic sequences. Our results revealed an overall conserved genomic organization of the Hb genes within this lineage, yet with several, lineage-specific gene duplications. Moreover, for some of the species examined, we identified amino acid substitutions at two sites in the Hbb1 gene as well as length polymorphisms in its regulatory region, which has previously been linked to temperature adaptation in Atlantic cod populations. This study highlights the use of targeted long-read capture as a versatile approach for comparative genomic studies by generation of a cross-species genomic resource elucidating the evolutionary history of the Hb gene family across the highly divergent group of codfishes. © 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.


April 21, 2020

Comparative genomic analysis of Lactobacillus mucosae LM1 identifies potential niche-specific genes and pathways for gastrointestinal adaptation.

Lactobacillus mucosae is currently of interest as putative probiotics due to their metabolic capabilities and ability to colonize host mucosal niches. L. mucosae LM1 has been studied in its functions in cell adhesion and pathogen inhibition, etc. It demonstrated unique abilities to use energy from carbohydrate and non-carbohydrate sources. Due to these functions, we report the first complete genome sequence of an L. mucosae strain, L. mucosae LM1. Analysis of the pan-genome in comparison with closely-related Lactobacillus species identified a complete glycogen metabolism pathway, as well as folate biosynthesis, complementing previous proteomic data on the LM1 strain. It also revealed common and unique niche-adaptation genes among the various L. mucosae strains. The aim of this study was to derive genomic information that would reveal the probable mechanisms underlying the probiotic effect of L. mucosae LM1, and provide a better understanding of the nature of L. mucosae sp. Copyright © 2017 Elsevier Inc. All rights reserved.


April 21, 2020

Population Genome Sequencing of the Scab Fungal Species Venturia inaequalis, Venturia pirina, Venturia aucupariae and Venturia asperata.

The Venturia genus comprises fungal species that are pathogens on Rosaceae host plants, including V. inaequalis and V. asperata on apple, V. aucupariae on sorbus and V. pirina on pear. Although the genetic structure of V. inaequalis populations has been investigated in detail, genomic features underlying these subdivisions remain poorly understood. Here, we report whole genome sequencing of 87 Venturia strains that represent each species and each population within V. inaequalis We present a PacBio genome assembly for the V. inaequalis EU-B04 reference isolate. The size of selected genomes was determined by flow cytometry, and varied from 45 to 93 Mb. Genome assemblies of V. inaequalis and V. aucupariae contain a high content of transposable elements (TEs), most of which belong to the Gypsy or Copia LTR superfamilies and have been inactivated by Repeat-Induced Point mutations. The reference assembly of V. inaequalis presents a mosaic structure of GC-equilibrated regions that mainly contain predicted genes and AT-rich regions, mainly composed of TEs. Six pairs of strains were identified as clones. Single-Nucleotide Polymorphism (SNP) analysis between these clones revealed a high number of SNPs that are mostly located in AT-rich regions due to misalignments and allowed determining a false discovery rate. The availability of these genome sequences is expected to stimulate genetics and population genomics research of Venturia pathogens. Especially, it will help understanding the evolutionary history of Venturia species that are pathogenic on different hosts, a history that has probably been substantially influenced by TEs.Copyright © 2019 Le Cam et al.


April 21, 2020

High Quality Draft Genome of Arogyapacha (Trichopus zeylanicus), an Important Medicinal Plant Endemic to Western Ghats of India.

Arogyapacha, the local name of Trichopus zeylanicus, is a rare, indigenous medicinal plant of India. This plant is famous for its traditional use as an instant energy stimulant. So far, no genomic resource is available for this important plant and hence its metabolic pathways are poorly understood. Here, we report on a high-quality draft assembly of approximately 713.4 Mb genome of T. zeylanicus, first draft genome from the genus Trichopus The assembly was generated in a hybrid approach using Illumina short-reads and Pacbio longer-reads. The total assembly comprised of 22601 scaffolds with an N50 value of 433.3 Kb. We predicted 34452 protein coding genes in T. zeylanicus genome and found that a significant portion of these predicted genes were associated with various secondary metabolite biosynthetic pathways. Comparative genome analysis revealed extensive gene collinearity between T. zeylanicus and its closely related plant species. The present genome and annotation data provide an essential resource to speed-up the research on secondary metabolism, breeding and molecular evolution of T. zeylanicus. Copyright © 2019 Chellappan et al.


April 21, 2020

Complete Genome Sequence of Saccharospirillum mangrovi HK-33T Sheds Light on the Ecological Role of a Bacterium in Mangrove Sediment Environment.

We present the genome sequence of Saccharospirillum mangrovi HK-33T, isolated from a mangrove sediment sample in Haikou, China. The complete genome of S. mangrovi HK-33T consisted of a single-circular chromosome with the size of 3,686,911 bp as well as an average G?+?C content of 57.37%, and contained 3,383 protein-coding genes, 4 operons of 16S-23S-5S rRNA genes, and 52 tRNA genes. Genomic annotation indicated that the genome of S. mangrovi HK-33T had many genes related to oligosaccharide and polysaccharide degradation and utilization of polyhydroxyalkanoate. For nitrogen cycle, genes encoding nitrate and nitrite reductase, glutamate dehydrogenase, glutamate synthase, and glutamine synthetase could be found. For phosphorus cycle, genes related to polyphosphate kinases (ppk1 and ppk2), the high-affinity phosphate-specific transport (Pst) system, and the low-affinity inorganic phosphate transporter (pitA) were predicted. For sulfur cycle, cysteine synthase and type III acyl coenzyme A transferase (dddD) coding genes were searched out. This study provides evidence about carbon, nitrogen, phosphorus, and sulfur metabolic patterns of S. mangrovi HK-33T and broadens our understandings about ecological roles of this bacterium in the mangrove sediment environment.


April 21, 2020

Characterization of NDM-5- and CTX-M-55-coproducing Escherichia coli GSH8M-2 isolated from the effluent of a wastewater treatment plant in Tokyo Bay.

New Delhi metallo-ß-lactamase (NDM)-5-producing Enterobacteriaceae have been detected in rivers, sewage, and effluents from wastewater treatment plants (WWTPs). Environmental contamination due to discharged effluents is of particular concern as NDM variants may be released into waterways, thereby posing a risk to humans. In this study, we collected effluent samples from a WWTP discharged into a canal in Tokyo Bay, Japan.Testing included the complete genome sequencing of Escherichia coli GSH8M-2 isolated from the effluent as well as a gene network analysis.The complete genome sequencing of GSH8M-2 revealed that it was an NDM-5-producing E. coli strain sequence type ST542, which carries multiple antimicrobial resistance genes for ß-lactams, quinolone, tetracycline, trimethoprim-sulfamethoxazole, florfenicol/chloramphenicol, kanamycin, and fosfomycin. The blaNDM-5 gene was found in the IncX3 replicon plasmid pGSH8M-2-4. Gene network analysis using 142 IncX3 plasmid sequences suggested that pGSH8M-2-4 is related to both clinical isolates of  E. coli and Klebsiella species in Eastern Asia. GSH8M-2 also carries the blaCTX-M-55 gene in IncX1 plasmid pGSH8M-2-3.This is the first report of environmental NDM-5-producing E. coli isolated from a WWTP in Japan. NDM-5 detection is markedly increasing in veterinary and clinical settings, suggesting that dual ß-lactamases, such as NDM-5 and CTX-M-55, might be acquired through multiple steps in environment settings. Environmental contamination through WWTP effluents that contain producers of NDM variants could be an emerging potential health hazard. Thus, regular monitoring of WWTP effluents is important for the detection of antimicrobial-resistant bacteria that may be released into the waterways and nearby communities.


April 21, 2020

Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense.

Allotetraploid cotton species (Gossypium hirsutum and Gossypium barbadense) have long been cultivated worldwide for natural renewable textile fibers. The draft genome sequences of both species are available but they are highly fragmented and incomplete1-4. Here we report reference-grade genome assemblies and annotations for G. hirsutum accession Texas Marker-1 (TM-1) and G. barbadense accession 3-79 by integrating single-molecule real-time sequencing, BioNano optical mapping and high-throughput chromosome conformation capture techniques. Compared with previous assembled draft genomes1,3, these genome sequences show considerable improvements in contiguity and completeness for regions with high content of repeats such as centromeres. Comparative genomics analyses identify extensive structural variations that probably occurred after polyploidization, highlighted by large paracentric/pericentric inversions in 14 chromosomes. We constructed an introgression line population to introduce favorable chromosome segments from G. barbadense to G. hirsutum, allowing us to identify 13 quantitative trait loci associated with superior fiber quality. These resources will accelerate evolutionary and functional genomic studies in cotton and inform future breeding programs for fiber improvement.


April 21, 2020

Genomic analysis of three Clostridioides difficile isolates from urban water sources.

We investigated inflow of a wastewater treatment plant and sediment of an urban lake for the presence of Clostridioides difficile by cultivation and PCR. Among seven colonies we sequenced the complete genomes of three: two non-toxigenic isolates from wastewater and one toxigenic isolate from the urban lake. For all obtained isolates, a close genomic relationship with human-derived isolates was observed.Copyright © 2019 Elsevier Ltd. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.