September 22, 2019  |  

Single-molecule real-time transcript sequencing facilitates common wheat genome annotation and grain transcriptome research.

The large and complex hexaploid genome has greatly hindered genomics studies of common wheat (Triticum aestivum, AABBDD). Here, we investigated transcripts in common wheat developing caryopses using the emerging single-molecule real-time (SMRT) sequencing technology PacBio RSII, and assessed the resultant data for improving common wheat genome annotation and grain transcriptome research.We obtained 197,709 full-length non-chimeric (FLNC) reads, 74.6 % of which were estimated to carry complete open reading frame. A total of 91,881 high-quality FLNC reads were identified and mapped to 16,188 chromosomal loci, corresponding to 13,162 known genes and 3026 new genes not annotated previously. Although some FLNC reads could not be unambiguously mapped to the current draft genome sequence, many of them are likely useful for studying highly similar homoeologous or paralogous loci or for improving chromosomal contig assembly in further research. The 91,881 high-quality FLNC reads represented 22,768 unique transcripts, 9591 of which were newly discovered. We found 180 transcripts each spanning two or three previously annotated adjacent loci, suggesting that they should be merged to form correct gene models. Finally, our data facilitated the identification of 6030 genes differentially regulated during caryopsis development, and full-length transcripts for 72 transcribed gluten gene members that are important for the end-use quality control of common wheat.Our work demonstrated the value of PacBio transcript sequencing for improving common wheat genome annotation through uncovering the loci and full-length transcripts not discovered previously. The resource obtained may aid further structural genomics and grain transcriptome studies of common wheat.


September 22, 2019  |  

Occurrence, evolution, and functions of DNA phosphorothioate epigenetics in bacteria.

The chemical diversity of physiological DNA modifications has expanded with the identification of phosphorothioate (PT) modification in which the nonbridging oxygen in the sugar-phosphate backbone of DNA is replaced by sulfur. Together with DndFGH as cognate restriction enzymes, DNA PT modification, which is catalyzed by the DndABCDE proteins, functions as a bacterial restriction-modification (R-M) system that protects cells against invading foreign DNA. However, the occurrence of dnd systems across a large number of bacterial genomes and their functions other than R-M are poorly understood. Here, a genomic survey revealed the prevalence of bacterial dnd systems: 1,349 bacterial dnd systems were observed to occur sporadically across diverse phylogenetic groups, and nearly half of these occur in the form of a solitary dndBCDE gene cluster that lacks the dndFGH restriction counterparts. A phylogenetic analysis of 734 complete PT R-M pairs revealed the coevolution of M and R components, despite the observation that several PT R-M pairs appeared to be assembled from M and R parts acquired from distantly related organisms. Concurrent epigenomic analysis, transcriptome analysis, and metabolome characterization showed that a solitary PT modification contributed to the overall cellular redox state, the loss of which perturbed the cellular redox balance and induced Pseudomonas fluorescens to reconfigure its metabolism to fend off oxidative stress. An in vitro transcriptional assay revealed altered transcriptional efficiency in the presence of PT DNA modification, implicating its function in epigenetic regulation. These data suggest the versatility of PT in addition to its involvement in R-M protection.


September 22, 2019  |  

Genome-based evolutionary history of Pseudomonas spp.

Pseudomonas is a large and diverse genus of Gammaproteobacteria. To provide a framework for discovery of evolutionary and taxonomic relationships of these bacteria, we compared the genomes of type strains of 163 species and 3 additional subspecies of Pseudomonas, including 118 genomes sequenced herein. A maximum likelihood phylogeny of the 166 type strains based on protein sequences of 100 single-copy orthologous genes revealed thirteen groups of Pseudomonas, composed of two to sixty three species each. Pairwise average nucleotide identities and alignment fractions were calculated for the data set of the 166 type strains and 1224 genomes of Pseudomonas available in public databases. Results revealed that 394 of the 1224 genomes were distinct from any type strain, suggesting that the type strains represent only a fraction of the genomic diversity of the genus. The core genome of Pseudomonas was determined to contain 794 genes conferring primarily housekeeping functions. The results of this study provide a phylogenetic framework for future studies aiming to resolve the classification and phylogenetic relationships, identify new gene functions and phenotypes, and explore the ecological and metabolic potential of the Pseudomonas spp.© 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019  |  

Comparative genomics provides insights into the marine adaptation in sponge-derived Kocuriaflava S43.

Sponge-derived actinomycetes represent a significant component of marine actinomycetes. Members of the genus Kocuria are distributed in various habitats such as soil, rhizosphere, clinical specimens, marine sediments, and sponges, however, to date, little is known about the mechanism of their environmental adaptation. Kocuria flava S43 was isolated from a coastal sponge. Phylogenetic analysis revealed that it was closely related to the terrestrial airborne K. flava HO-9041. In this study, to gain insights into the marine adaptation in K. flava S43 we sequenced the draft genome for K. flava S43 by third generation sequencing (TGS) and compared it with those of K. flava HO-9041 and some other Kocuria relatives. Comparative genomics and phylogenetic analyses revealed that K. flava S43 might adapt to the marine environment mainly by increasing the number of the genes linked to potassium homeostasis, resistance to heavy metals and phosphate metabolism, and acquiring the genes associated with electron transport and the genes encoding ATP-binding cassette (ABC) transporter, aquaporin, and thiol/disulfide interchange protein. Notably, gene acquisition was probably a primary mechanism of environmental adaptation in K. flava S43. Furthermore, this study also indicated that the Kocuria isolates from various marine and hyperosmotic environments possessed common genetic basis for environmental adaptation.


September 22, 2019  |  

Draft genome of Glyptosternon maculatum, an endemic fish from Tibet Plateau.

Mechanisms for high-altitude adaption have attracted widespread interest among evolutionary biologists. Several genome-wide studies have been carried out for endemic vertebrates in Tibet, including mammals, birds, and amphibians. However, little information is available about the adaptive evolution of highland fishes. Glyptosternon maculatum (Regan 1905), also known as Regan or barkley and endemic to the Tibetan Plateau, belongs to the Sisoridae family, order Siluriformes (catfishes). This species lives at an elevation ranging from roughly 2,800 m to 4,200 m. Hence, a high-quality reference genome of G. maculatum provides an opportunity to investigate high-altitude adaption mechanisms of fishes.To obtain a high-quality reference genome sequence of G. maculatum, we combined Pacific Bioscience single-molecule real-time sequencing, Illumina paired-end sequencing, 10X Genomics linked-reads, and BioNano optical map techniques. In total, 603.99 Gb sequencing data were generated. The assembled genome was about 662.34 Mb with scaffold and contig N50 sizes of 20.90 Mb and 993.67 kb, respectively, which captured 83% complete and 3.9% partial vertebrate Benchmarking Universal Single-Copy Orthologs. Repetitive elements account for 35.88% of the genome, and ?22,066 protein-coding genes were predicted from the genome, of which 91.7% have been functionally annotated.We present the first comprehensive de novo genome of G. maculatum. This genetic resource is fundamental for investigating the origin of G. maculatum and will improve our understanding of high-altitude adaption of fishes. The assembled genome can also be used as reference for future population genetic studies of G. maculatum.


July 19, 2019  |  

The mitochondrial genome map of Nelumbo nucifera reveals ancient evolutionary features.

Nelumbo nucifera is an evolutionary relic from the Late Cretaceous period. Sequencing the N. nucifera mitochondrial genome is important for elucidating the evolutionary characteristics of basal eudicots. Here, the N. nucifera mitochondrial genome was sequenced using single molecule real-time sequencing technology (SMRT), and the mitochondrial genome map was constructed after de novo assembly and annotation. The results showed that the 524,797-bp N. nucifera mitochondrial genome has a total of 63 genes, including 40 protein-coding genes, three rRNA genes and 20 tRNA genes. Fifteen collinear gene clusters were conserved across different plant species. Approximately 700 RNA editing sites in the protein-coding genes were identified. Positively selected genes were identified with selection pressure analysis. Nineteen chloroplast-derived fragments were identified, and seven tRNAs were derived from the chloroplast. These results suggest that the N. nucifera mitochondrial genome retains evolutionarily conserved characteristics, including ancient gene content and gene clusters, high levels of RNA editing, and low levels of chloroplast-derived fragment insertions. As the first publicly available basal eudicot mitochondrial genome, the N. nucifera mitochondrial genome facilitates further analysis of the characteristics of basal eudicots and provides clues of the evolutionary trajectory from basal angiosperms to advanced eudicots.


July 19, 2019  |  

Resequencing of 243 diploid cotton accessions based on an updated A genome identifies the genetic basis of key agronomic traits.

The ancestors of Gossypium arboreum and Gossypium herbaceum provided the A subgenome for the modern cultivated allotetraploid cotton. Here, we upgraded the G. arboreum genome assembly by integrating different technologies. We resequenced 243?G. arboreum and G. herbaceum accessions to generate a map of genome variations and found that they are equally diverged from Gossypium raimondii. Independent analysis suggested that Chinese G. arboreum originated in South China and was subsequently introduced to the Yangtze and Yellow River regions. Most accessions with domestication-related traits experienced geographic isolation. Genome-wide association study (GWAS) identified 98 significant peak associations for 11 agronomically important traits in G. arboreum. A nonsynonymous substitution (cysteine-to-arginine substitution) of GaKASIII seems to confer substantial fatty acid composition (C16:0 and C16:1) changes in cotton seeds. Resistance to fusarium wilt disease is associated with activation of GaGSTF9 expression. Our work represents a major step toward understanding the evolution of the A genome of cotton.


July 7, 2019  |  

A precise chloroplast genome of Nelumbo nucifera (Nelumbonaceae) evaluated with Sanger, Illumina MiSeq, and PacBio RS II sequencing platforms: insight into the plastid evolution of basal eudicots.

BackgroundThe chloroplast genome is important for plant development and plant evolution. Nelumbo nucifera is one member of relict plants surviving from the late Cretaceous. Recently, a new sequencing platform PacBio RS II, known as `SMRT (Single Molecule, Real-Time) sequencing¿, has been developed. Using the SMRT sequencing to investigate the chloroplast genome of N. nucifera will help to elucidate the plastid evolution of basal eudicots.ResultsThe sizes of the de novo assembled complete chloroplast genome of N. nucifera were 163,307 bp, 163,747 bp and 163,600 bp with average depths of coverage of 7×, 712× and 105× sequenced by Sanger, Illumina MiSeq and PacBio RS II, respectively. The precise chloroplast genome of N. nucifera was obtained from PacBio RS II data proofread by Illumina MiSeq reads, with a quadripartite structure containing a large single copy region (91,846 bp) and a small single copy region (19,626 bp) separated by two inverted repeat regions (26,064 bp). The genome contains 113 different genes, including four distinct rRNAs, 30 distinct tRNAs and 79 distinct peptide-coding genes. A phylogenetic analysis of 133 taxa from 56 orders indicated that Nelumbo with an age of 177 million years is a sister clade to Platanus, which belongs to the basal eudicots. Basal eudicots began to emerge during the early Jurassic with estimated divergence times at 197 million years using MCMCTree. IR expansions/contractions within the basal eudicots seem to have occurred independently.ConclusionsBecause of long reads and lack of bias in coverage of AT-rich regions, PacBio RS II showed a great promise for highly accurate `finished¿ genomes, especially for a de novo assembly of genomes. N. nucifera is one member of basal eudicots, however, evolutionary analyses of IR structural variations of N. nucifera and other basal eudicots suggested that IR expansions/contractions occurred independently in these basal eudicots or were caused by independent insertions and deletions. The precise chloroplast genome of N. nucifera will present new information for structural variation of chloroplast genomes and provide new insight into the evolution of basal eudicots at the primary sequence and structural level.


July 7, 2019  |  

Genomic mapping of phosphorothioates reveals partial modification of short consensus sequences.

Bacterial phosphorothioate (PT) DNA modifications are incorporated by Dnd proteins A-E and often function with DndF-H as a restriction-modification (R-M) system, as in Escherichia coli B7A. However, bacteria such as Vibrio cyclitrophicus FF75 lack dndF-H, which points to other PT functions. Here we report two novel, orthogonal technologies to map PTs across the genomes of B7A and FF75 with >90% agreement: single molecule, real-time sequencing and deep sequencing of iodine-induced cleavage at PT (ICDS). In B7A, we detect PT on both strands of GpsAAC/GpsTTC motifs, but with only 12% of 40,701 possible sites modified. In contrast, PT in FF75 occurs as a single-strand modification at CpsCA, again with only 14% of 160,541 sites modified. Single-molecule analysis indicates that modification could be partial at any particular genomic site even with active restriction by DndF-H, with direct interaction of modification proteins with GAAC/GTTC sites demonstrated with oligonucleotides. These results point to highly unusual target selection by PT-modification proteins and rule out known R-M mechanisms.


July 7, 2019  |  

Comparative genomics reveals insights into avian genome evolution and adaptation.

Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. Copyright © 2014, American Association for the Advancement of Science.


July 7, 2019  |  

Identification of YfiH and the catalase CatA as polyphenol oxidases of Aeromonas media and CatA as a regulator of pigmentation by Its peroxyl radical scavenging capacity.

Pyomelanin is the major constituent of pigment in melanogenic Aeromonas strains of bacteria. However, eumelanin, synthesized from tyrosine via L-DOPA and polyphenol oxidases (PPOs), may also be present in this genus since L-DOPA is frequently detected in culture fluids of several species. To address this question, we used a deletion mutant of Aeromonas media strain WS, in which pyomelanin synthesis is completely blocked under normal culture conditions. When tyrosine was supplied to the medium, we observed residual melanin accumulation, which we interpret as evidence for existence of the DOPA-melanin pathway. We traced enzymatic activity in this bacterium using native-polyacrylamide gel electrophoresis. Two PPOs: YfiH, a laccase-like protein, and CatA, a catalase, were identified. However, neither protein was critical for the residual pigmentation in pyomelanin-deficient mutant. We speculate that eumelanin synthesis may require other unknown enzymes. Deletion of yfiH did not affect pigmentation in A. media strain WS, while deletion of the CatA-encoding gene katE resulted in a reduction of melanin accumulation, but it started 9 h earlier than in the wild-type. Since catalases regulate reactive oxygen species levels during melanogenesis, we speculated that CatA affects pigmentation through its peroxyl radical scavenging capacity. Consistent with this, expression of the catalases Hpi or Hpii from Escherichia coli in the katE deletion strain of A. media strain WS restored pigmentation to the wild-type level. Hpi and Hpii also exhibited PPO activity, suggesting that catalase may represent a new class of PPOs.


July 7, 2019  |  

Genome mining of astaxanthin biosynthetic genes from Sphingomonas sp. ATCC 55669 for heterologous overproduction in Escherichia coli.

As a highly valued keto-carotenoid, astaxanthin is widely used in nutritional supplements and pharmaceuticals. Therefore, the demand for biosynthetic astaxanthin and improved efficiency of astaxanthin biosynthesis has driven the investigation of metabolic engineering of native astaxanthin producers and heterologous hosts. However, microbial resources for astaxanthin are limited. In this study, we found that the a-Proteobacterium Sphingomonas sp. ATCC 55669 could produce astaxanthin naturally. We used whole-genome sequencing to identify the astaxanthin biosynthetic pathway using a combined PacBio-Illumina approach. The putative astaxanthin biosynthetic pathway in Sphingomonas sp. ATCC 55669 was predicted. For further confirmation, a high-efficiency targeted engineering carotenoid synthesis platform was constructed in E. coli for identifying the functional roles of candidate genes. All genes involved in astaxanthin biosynthesis showed discrete distributions on the chromosome. Moreover, the overexpression of exogenous E. coli idi in Sphingomonas sp. ATCC 55669 increased astaxanthin production by 5.4-fold. This study described a new astaxanthin producer and provided more biosynthesis components for bioengineering of astaxanthin in the future. © 2015 The Authors. Biotechnology Journal published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


July 7, 2019  |  

Development of Streptomyces sp. FR-008 as an emerging chassis

Microbial-derived natural products are important in both the pharmaceutical industry and academic research. As the metabolic potential of original producer especially Streptomyces is often limited by slow growth rate, complicated cultivation profile, and unfeasible genetic manipulation, so exploring a Streptomyces as a super industrial chassis is valuable and urgent. Streptomyces sp. FR-008 is a fast-growing microorganism and can also produce a considerable amount of macrolide candicidin via modular polyketide synthase. In this study, we evaluated Streptomyces sp. FR-008 as a potential industrial-production chassis. First, PacBio sequencing and transcriptome analyses indicated that the Streptomyces sp. FR-008 genome size is 7.26 Mb, which represents one of the smallest of currently sequenced Streptomyces genomes. In addition, we simplified the conjugation procedure without heat-shock and pre-germination treatments but with high conjugation efficiency, suggesting it is inherently capable of accepting heterologous DNA. In addition, a series of promoters selected from literatures was assessed based on GusA activity in Streptomyces sp. FR-008. Compared with the common used promoter ermE*-p, the strength of these promoters comprise a library with a constitutive range of 60–860%, thus providing the useful regulatory elements for future genetic engineering purpose. In order to minimum the genome, we also target deleted three endogenous polyketide synthase (PKS) gene clusters to generate a mutant LQ3. LQ3 is thus an “updated” version of Streptomyces sp. FR-008, producing fewer secondary metabolites profiles than Streptomyces sp. FR-008. We believe this work could facilitate further development of Streptomyces sp. FR-008 for use in biotechnological applications.


July 7, 2019  |  

Complete genome sequence of the poly-?-glutamate-synthesizing Bacterium Bacillus subtilis Bs-115.

Bacillus subtilis Bs-115 was isolated from the soil of a corn field in Yutai County, Jinan City, Shandong Province, People’s Republic of China, and is characterized by the efficient synthesis of poly-?-glutamate (?-PGA), with corn saccharification liquid as the sole energy and carbon source during the process of ?-PGA formation. Here, we report the complete genome sequence of Bacillus subtilis Bs-115 and the genes associated with poly-?-glutamate synthesis. Copyright © 2018 Wang et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.