Menu
April 21, 2020

Identification of Initial Colonizing Bacteria in Dental Plaques from Young Adults Using Full-Length 16S rRNA Gene Sequencing.

Development of dental plaque begins with the adhesion of salivary bacteria to the acquired pellicle covering the tooth surface. In this study, we collected in vivo dental plaque formed on hydroxyapatite disks for 6 h from 74 young adults and identified initial colonizing taxa based on full-length 16S rRNA gene sequences. A long-read, single-molecule sequencer, PacBio Sequel, provided 100,109 high-quality full-length 16S rRNA gene sequence reads from the early plaque microbiota, which were assigned to 90 oral bacterial taxa. The microbiota obtained from every individual mostly comprised the 21 predominant taxa with the maximum relative abundance of over 10% (95.8?±?6.2%, mean ± SD), which included Streptococcus species as well as nonstreptococcal species. A hierarchical cluster analysis of their relative abundance distribution suggested three major patterns of microbiota compositions: a Streptococcus mitis/Streptococcus sp. HMT-423-dominant profile, a Neisseria sicca/Neisseria flava/Neisseria mucosa-dominant profile, and a complex profile with high diversity. No notable variations in the community structures were associated with the dental caries status, although the total bacterial amounts were larger in the subjects with a high number of caries-experienced teeth (=8) than in those with no or a low number of caries-experienced teeth. Our results revealed the bacterial taxa primarily involved in early plaque formation on hydroxyapatite disks in young adults.IMPORTANCE Selective attachment of salivary bacteria to the tooth surface is an initial and repetitive phase in dental plaque development. We employed full-length 16S rRNA gene sequence analysis with a high taxonomic resolution using a third-generation sequencer, PacBio Sequel, to determine the bacterial composition during early plaque formation in 74 young adults accurately and in detail. The results revealed 21 bacterial taxa primarily involved in early plaque formation on hydroxyapatite disks in young adults, which include several streptococcal species as well as nonstreptococcal species, such as Neisseria sicca/Nflava/Nmucosa and Rothia dentocariosa Given that no notable variations in the microbiota composition were associated with the dental caries status, the maturation process, rather than the specific bacterial species that are the initial colonizers, is likely to play an important role in the development of dysbiotic microbiota associated with dental caries. Copyright © 2019 Ihara et al.


April 21, 2020

Pacbio Sequencing Reveals Identical Organelle Genomes between American Cranberry (Vaccinium macrocarpon Ait.) and a Wild Relative.

Breeding efforts in the American cranberry (Vaccinium macrocarpon Ait.), a North American perennial fruit crop of great importance, have been hampered by the limited genetic and phenotypic variability observed among cultivars and experimental materials. Most of the cultivars commercially used by cranberry growers today were derived from a few wild accessions bred in the 1950s. In different crops, wild germplasm has been used as an important genetic resource to incorporate novel traits and increase the phenotypic diversity of breeding materials. Vaccinium microcarpum (Turcz. ex Rupr.) Schmalh. and V. oxycoccos L., two closely related species, may be cross-compatible with the American cranberry, and could be useful to improve fruit quality such as phytochemical content. Furthermore, given their northern distribution, they could also help develop cold hardy cultivars. Although these species have previously been analyzed in diversity studies, genomic characterization and comparative studies are still lacking. In this study, we sequenced and assembled the organelle genomes of the cultivated American cranberry and its wild relative, V. microcarpum. PacBio sequencing technology allowed us to assemble both mitochondrial and plastid genomes at very high coverage and in a single circular scaffold. A comparative analysis revealed that the mitochondrial genome sequences were identical between both species and that the plastids presented only two synonymous single nucleotide polymorphisms (SNPs). Moreover, the Illumina resequencing of additional accessions of V. microcarpum and V. oxycoccos revealed high genetic variation in both species. Based on these results, we provided a hypothesis involving the extension and dynamics of the last glaciation period in North America, and how this could have shaped the distribution and dispersal of V. microcarpum. Finally, we provided important data regarding the polyploid origin of V. oxycoccos.


April 21, 2020

Genome sequence of the corn leaf aphid (Rhopalosiphum maidis Fitch).

The corn leaf aphid (Rhopalosiphum maidis Fitch) is the most economically damaging aphid pest on maize (Zea mays), one of the world’s most important grain crops. In addition to causing direct damage by removing photoassimilates, R. maidis transmits several destructive maize viruses, including maize yellow dwarf virus, barley yellow dwarf virus, sugarcane mosaic virus, and cucumber mosaic virus.The genome of a parthenogenetically reproducing R. maidis clone was assembled with a combination of Pacific Biosciences (207-fold coverage) and Illumina (83-fold coverage) sequencing. The 689 assembled contigs, which have an N50 size of 9.0 megabases (Mb) and a low level of heterozygosity, were clustered using Phase Genomics Hi-C interaction maps. Consistent with the commonly observed 2n = 8 karyotype of R. maidis, most of the contigs (473 spanning 321 Mb) were successfully oriented into 4 scaffolds. The genome assembly captured the full length of 95.8% of the core eukaryotic genes, indicating that it is highly complete. Repetitive sequences accounted for 21.2% of the assembly, and a total of 17,629 protein-coding genes were predicted with integrated evidence from ab initio and homology-based gene predictions and transcriptome sequences generated with both Pacific Biosciences and Illumina. An analysis of likely horizontally transferred genes identified 2 from bacteria, 7 from fungi, 2 from protozoa, and 9 from algae. Repeat elements, transposons, and genes encoding likely detoxification enzymes (cytochrome P450s, glutathione S-transferases, carboxylesterases, uridine diphosphate-glucosyltransferases, and ABC transporters) were identified in the genome sequence. Other than Buchnera aphidicola (642,929 base pairs, 602 genes), no endosymbiont bacteria were found in R. maidis.A high-quality R. maidis genome was assembled at the chromosome level. This genome sequence will enable further research related to ecological interactions, virus transmission, pesticide resistance, and other aspects of R. maidis biology. It also serves as a valuable resource for comparative investigation of other aphid species. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Genome Sequencing Illustrates the Genetic Basis of the Pharmacological Properties of Gloeostereum incarnatum.

Gloeostereum incarnatum is a precious edible mushroom that is widely grown in Asia and known for its useful medicinal properties. Here, we present a high-quality genome of G. incarnatum using the single-molecule real-time (SMRT) sequencing platform. The G. incarnatum genome, which is the first complete genome to be sequenced in the family Cyphellaceae, was 38.67 Mbp, with an N50 of 3.5 Mbp, encoding 15,251 proteins. Based on our phylogenetic analysis, the Cyphellaceae diverged ~174 million years ago. Several genes and gene clusters associated with lignocellulose degradation, secondary metabolites, and polysaccharide biosynthesis were identified in G. incarnatum, and compared with other medicinal mushrooms. In particular, we identified two terpenoid-associated gene clusters, each containing a gene encoding a sesterterpenoid synthase adjacent to a gene encoding a cytochrome P450 enzyme. These clusters might participate in the biosynthesis of incarnal, a known bioactive sesterterpenoid produced by G. incarnatum. Through a transcriptomic analysis comparing the G. incarnatum mycelium and fruiting body, we also demonstrated that the genes associated with terpenoid biosynthesis were generally upregulated in the mycelium, while those associated with polysaccharide biosynthesis were generally upregulated in the fruiting body. This study provides insights into the genetic basis of the medicinal properties of G. incarnatum, laying a framework for future characterization of bioactive proteins and pharmaceutical uses of this fungus.


April 21, 2020

A chromosomal-scale genome assembly of Tectona grandis reveals the importance of tandem gene duplication and enables discovery of genes in natural product biosynthetic pathways.

Teak, a member of the Lamiaceae family, produces one of the most expensive hardwoods in the world. High demand coupled with deforestation have caused a decrease in natural teak forests, and future supplies will be reliant on teak plantations. Hence, selection of teak tree varieties for clonal propagation with superior growth performance is of great importance, and access to high-quality genetic and genomic resources can accelerate the selection process by identifying genes underlying desired traits.To facilitate teak research and variety improvement, we generated a highly contiguous, chromosomal-scale genome assembly using high-coverage Pacific Biosciences long reads coupled with high-throughput chromatin conformation capture. Of the 18 teak chromosomes, we generated 17 near-complete pseudomolecules with one chromosome present as two chromosome arm scaffolds. Genome annotation yielded 31,168 genes encoding 46,826 gene models, of which, 39,930 and 41,155 had Pfam domain and expression evidence, respectively. We identified 14 clusters of tandem-duplicated terpene synthases (TPSs), genes central to the biosynthesis of terpenes, which are involved in plant defense and pollinator attraction. Transcriptome analysis revealed 10 TPSs highly expressed in woody tissues, of which, 8 were in tandem, revealing the importance of resolving tandemly duplicated genes and the quality of the assembly and annotation. We also validated the enzymatic activity of four TPSs to demonstrate the function of key TPSs.In summary, this high-quality chromosomal-scale assembly and functional annotation of the teak genome will facilitate the discovery of candidate genes related to traits critical for sustainable production of teak and for anti-insecticidal natural products. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Candidate Gene Selection for Cytoplasmic Male Sterility in Pepper (Capsicum annuum L.) through Whole Mitochondrial Genome Sequencing.

Cytoplasmic male sterility (CMS), which is controlled by mitochondrial genes, is an important trait for commercial hybrid seed production. So far, genes controlling this trait are still not clear in pepper. In this study, complete mitochondrial genomes were sequenced and assembled for the CMS line 138A and its maintainer line 138B. The genome size of 138A is 504,210 bp, which is 8618 bp shorter than that of 138B. Meanwhile, more than 214 and 215 open reading frames longer than 100 amino acids (aas) were identified in 138A and 138B, respectively. Mitochondrial genome structure of 138A was quite different from that of 138B, indicating the existence of recombination and rearrangement events. Based on the mitochondrial genome sequence and structure variations, mitochondrion of 138A and FS4401, a Korean origin CMS line, may have inherited from a common female ancestor, but their CMS traits did originate separately. Candidate gene selection was performed according to the published characteristics of the CMS genes, including the presence SNPs and InDels, located in unique regions, their chimeric structure, co-transcription, and transmembrane domain. A total of 35 ORFs were considered as potential candidate genes and 14 of these were selected, with orf300a and 0rf314a as strong candidates. A new marker, orf300a, was developed which did co-segregate with the CMS trait.


April 21, 2020

Genome Sequencing of Cladobotryum protrusum Provides Insights into the Evolution and Pathogenic Mechanisms of the Cobweb Disease Pathogen on Cultivated Mushroom.

Cladobotryum protrusum is one of the mycoparasites that cause cobweb disease on cultivated edible mushrooms. However, the molecular mechanisms of evolution and pathogenesis of C. protrusum on mushrooms are largely unknown. Here, we report a high-quality genome sequence of C. protrusum using the single-molecule, real-time sequencing platform of PacBio and perform a comparative analysis with closely related fungi in the family Hypocreaceae. The C. protrusum genome, the first complete genome to be sequenced in the genus Cladobotryum, is 39.09 Mb long, with an N50 of 4.97 Mb, encoding 11,003 proteins. The phylogenomic analysis confirmed its inclusion in Hypocreaceae, with its evolutionary divergence time estimated to be ~170.1 million years ago. The genome encodes a large and diverse set of genes involved in secreted peptidases, carbohydrate-active enzymes, cytochrome P450 enzymes, pathogen?host interactions, mycotoxins, and pigments. Moreover, C. protrusum harbors arrays of genes with the potential to produce bioactive secondary metabolites and stress response-related proteins that are significant for adaptation to hostile environments. Knowledge of the genome will foster a better understanding of the biology of C. protrusum and mycoparasitism in general, as well as help with the development of effective disease control strategies to minimize economic losses from cobweb disease in cultivated edible mushrooms.


April 21, 2020

Genome sequence of Malania oleifera, a tree with great value for nervonic acid production.

Malania oleifera, a member of the Olacaceae family, is an IUCN red listed tree, endemic and restricted to the Karst region of southwest China. This tree’s seed is valued for its high content of precious fatty acids (especially nervonic acid). However, studies on its genetic makeup and fatty acid biogenesis are severely hampered by a lack of molecular and genetic tools.We generated 51 Gb and 135 Gb of raw DNA sequences, using Pacific Biosciences (PacBio) single-molecule real-time and 10× Genomics sequencing, respectively. A final genome assembly, with a scaffold N50 size of 4.65 Mb and a total length of 1.51 Gb, was obtained by primary assembly based on PacBio long reads plus scaffolding with 10× Genomics reads. Identified repeats constituted ~82% of the genome, and 24,064 protein-coding genes were predicted with high support. The genome has low heterozygosity and shows no evidence for recent whole genome duplication. Metabolic pathway genes relating to the accumulation of long-chain fatty acid were identified and studied in detail.Here, we provide the first genome assembly and gene annotation for M. oleifera. The availability of these resources will be of great importance for conservation biology and for the functional genomics of nervonic acid biosynthesis. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Genome assembly and annotation of the Trichoplusia ni Tni-FNL insect cell line enabled by long-read technologies.

Trichoplusiani derived cell lines are commonly used to enable recombinant protein expression via baculovirus infection to generate materials approved for clinical use and in clinical trials. In order to develop systems biology and genome engineering tools to improve protein expression in this host, we performed de novo genome assembly of the Trichoplusiani-derived cell line Tni-FNL.By integration of PacBio single-molecule sequencing, Bionano optical mapping, and 10X Genomics linked-reads data, we have produced a draft genome assembly of Tni-FNL.Our assembly contains 280 scaffolds, with a N50 scaffold size of 2.3 Mb and a total length of 359 Mb. Annotation of the Tni-FNL genome resulted in 14,101 predicted genes and 93.2% of the predicted proteome contained recognizable protein domains. Ortholog searches within the superorder Holometabola provided further evidence of high accuracy and completeness of the Tni-FNL genome assembly.This first draft Tni-FNL genome assembly was enabled by complementary long-read technologies and represents a high-quality, well-annotated genome that provides novel insight into the complexity of this insect cell line and can serve as a reference for future large-scale genome engineering work in this and other similar recombinant protein production hosts.


April 21, 2020

Hidden genomic evolution in a morphospecies-The landscape of rapidly evolving genes in Tetrahymena.

A morphospecies is defined as a taxonomic species based wholly on morphology, but often morphospecies consist of clusters of cryptic species that can be identified genetically or molecularly. The nature of the evolutionary novelty that accompanies speciation in a morphospecies is an intriguing question. Morphospecies are particularly common among ciliates, a group of unicellular eukaryotes that separates 2 kinds of nuclei-the silenced germline nucleus (micronucleus [MIC]) and the actively expressed somatic nucleus (macronucleus [MAC])-within a common cytoplasm. Because of their very similar morphologies, members of the Tetrahymena genus are considered a morphospecies. We explored the hidden genomic evolution within this genus by performing a comprehensive comparative analysis of the somatic genomes of 10 species and the germline genomes of 2 species of Tetrahymena. These species show high genetic divergence; phylogenomic analysis suggests that the genus originated about 300 million years ago (Mya). Seven universal protein domains are preferentially included among the species-specific (i.e., the youngest) Tetrahymena genes. In particular, leucine-rich repeat (LRR) genes make the largest contribution to the high level of genome divergence of the 10 species. LRR genes can be sorted into 3 different age groups. Parallel evolutionary trajectories have independently occurred among LRR genes in the different Tetrahymena species. Thousands of young LRR genes contain tandem arrays of exactly 90-bp exons. The introns separating these exons show a unique, extreme phase 2 bias, suggesting a clonal origin and successive expansions of 90-bp-exon LRR genes. Identifying LRR gene age groups allowed us to document a Tetrahymena intron length cycle. The youngest 90-bp exon LRR genes in T. thermophila are concentrated in pericentromeric and subtelomeric regions of the 5 micronuclear chromosomes, suggesting that these regions act as genome innovation centers. Copies of a Tetrahymena Long interspersed element (LINE)-like retrotransposon are very frequently found physically adjacent to 90-bp exon/intron repeat units of the youngest LRR genes. We propose that Tetrahymena species have used a massive exon-shuffling mechanism, involving unequal crossing over possibly in concert with retrotransposition, to create the unique 90-bp exon array LRR genes.


April 21, 2020

High-throughput amplicon sequencing of the full-length 16S rRNA gene with single-nucleotide resolution.

Targeted PCR amplification and high-throughput sequencing (amplicon sequencing) of 16S rRNA gene fragments is widely used to profile microbial communities. New long-read sequencing technologies can sequence the entire 16S rRNA gene, but higher error rates have limited their attractiveness when accuracy is important. Here we present a high-throughput amplicon sequencing methodology based on PacBio circular consensus sequencing and the DADA2 sample inference method that measures the full-length 16S rRNA gene with single-nucleotide resolution and a near-zero error rate. In two artificial communities of known composition, our method recovered the full complement of full-length 16S sequence variants from expected community members without residual errors. The measured abundances of intra-genomic sequence variants were in the integral ratios expected from the genuine allelic variants within a genome. The full-length 16S gene sequences recovered by our approach allowed Escherichia coli strains to be correctly classified to the O157:H7 and K12 sub-species clades. In human fecal samples, our method showed strong technical replication and was able to recover the full complement of 16S rRNA alleles in several E. coli strains. There are likely many applications beyond microbial profiling for which high-throughput amplicon sequencing of complete genes with single-nucleotide resolution will be of use. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020

Infection mechanisms and putative effector repertoire of the mosquito pathogenic oomycete Pythium guiyangense uncovered by genomic analysis.

Pythium guiyangense, an oomycete from a genus of mostly plant pathogens, is an effective biological control agent that has wide potential to manage diverse mosquitoes. However, its mosquito-killing mechanisms are almost unknown. In this study, we observed that P. guiyangense could utilize cuticle penetration and ingestion of mycelia into the digestive system to infect mosquito larvae. To explore pathogenic mechanisms, a high-quality genome sequence with 239 contigs and an N50 contig length of 1,009 kb was generated. The genome assembly is approximately 110 Mb, which is almost twice the size of other sequenced Pythium genomes. Further genome analysis suggests that P. guiyangense may arise from a hybridization of two related but distinct parental species. Phylogenetic analysis demonstrated that P. guiyangense likely evolved from common ancestors shared with plant pathogens. Comparative genome analysis coupled with transcriptome sequencing data suggested that P. guiyangense may employ multiple virulence mechanisms to infect mosquitoes, including secreted proteases and kazal-type protease inhibitors. It also shares intracellular Crinkler (CRN) effectors used by plant pathogenic oomycetes to facilitate the colonization of plant hosts. Our experimental evidence demonstrates that CRN effectors of P. guiyangense can be toxic to insect cells. The infection mechanisms and putative virulence effectors of P. guiyangense uncovered by this study provide the basis to develop improved mosquito control strategies. These data also provide useful knowledge on host adaptation and evolution of the entomopathogenic lifestyle within the oomycete lineage. A deeper understanding of the biology of P. guiyangense effectors might also be useful for management of other important agricultural pests.


April 21, 2020

Genome sequence of Methylocystis hirsuta CSC1, a polyhydroxyalkanoate producing methanotroph.

Polyhydroxyalkanoates (PHAs) are biodegradable plastics that can be produced by some methanotrophic organisms such as those of the genus Methylocystis. This allows the conversion of a detrimental greenhouse gas into an environmentally friendly high added-value bioproduct. This study presents the genome sequence of Methylocystis hirsuta CSC1 (a high yield PHB producer). The genome comprises 4,213,043 bp in 4 contigs, with the largest contig being 3,776,027 bp long. Two of the other contigs are likely to correspond to large size plasmids. A total of 4,664 coding sequences were annotated, revealing a PHA production cluster, two distinct particulate methane monooxygenases with active catalytic sites, as well as a nitrogen fixation operon and a partial denitrification pathway. © 2018 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.


April 21, 2020

Complete Genome Sequence of Actinosynnema pretiosum X47, An Industrial Strain that Produces the Antibiotic Ansamitocin AP-3.

Ansamitocins are extraordinarily potent antitumor agents. Ansamitocin P-3 (AP-3), which is produced by Actinosynnema pretiosum, has been developed as a cytotoxic drug for breast cancer. Despite its importance, AP-3 is of limited applicability because of the low production yield. A. pretiosum strain X47 was developed from A. pretiosum ATCC 31565 by mutation breeding and shows a relatively high AP-3 yield. Here, we analyzed the A. pretiosum X47 genome, which is ~8.13 Mb in length with 6693 coding sequences, 58 tRNA genes, and 15 rRNA genes. The DNA sequence of the ansamitocin biosynthetic gene cluster is highly similar to that of the corresponding cluster in A. pretiosum ATCC 31565, with 99.9% identity. However, RT-qPCR analysis showed that the expression levels of ansamitocin biosynthetic genes were significantly increased in X47 compared with the levels in the wild-type strain, consistent with the higher yield of AP-3 in X47. The annotated complete genome sequence of this strain will facilitate understanding the molecular mechanisms of ansamitocin biosynthesis and regulation in A. pretiosum and help further genetic engineering studies to enhance the production of AP-3.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.