September 22, 2019  |  

Single molecule RNA sequencing uncovers trans-splicing and improves annotations in Anopheles stephensi.

Single molecule real-time (SMRT) sequencing has recently been used to obtain full-length cDNA sequences that improve genome annotation and reveal RNA isoforms. Here, we used one such method called isoform sequencing from Pacific Biosciences (PacBio) to sequence a cDNA library from the Asian malaria mosquito Anopheles stephensi. More than 600 000 full-length cDNAs, referred to as reads of insert, were identified. Owing to the inherently high error rate of PacBio sequencing, we tested different approaches for error correction. We found that error correction using Illumina RNA sequencing (RNA-seq) generated more data than using the default SMRT pipeline. The full-length error-corrected PacBio reads greatly improved the gene annotation of Anopheles stephensi: 4867 gene models were updated and 1785 alternatively spliced isoforms were added to the annotation. In addition, six trans-splicing events, where exons from different primary transcripts were joined together, were identified in An. stephensi. All six trans-splicing events appear to be conserved in Culicidae, as they are also found in Anopheles gambiae and Aedes aegypti. The proteins encoded by trans-splicing events are also highly conserved and the orthologues of these proteins are cis-spliced in outgroup species, indicating that trans-splicing may arise as a mechanism to rescue genes that broke up during evolution. © 2017 The Royal Entomological Society.


September 22, 2019  |  

Complete genome sequencing of the luminescent bacterium, Vibrio qinghaiensis sp. Q67 using PacBio technology.

Vibrio qinghaiensis sp.-Q67 (Vqin-Q67) is a freshwater luminescent bacterium that continuously emits blue-green light (485 nm). The bacterium has been widely used for detecting toxic contaminants. Here, we report the complete genome sequence of Vqin-Q67, obtained using third-generation PacBio sequencing technology. Continuous long reads were attained from three PacBio sequencing runs and reads >500 bp with a quality value of >0.75 were merged together into a single dataset. This resultant highly-contiguous de novo assembly has no genome gaps, and comprises two chromosomes with substantial genetic information, including protein-coding genes, non-coding RNA, transposon and gene islands. Our dataset can be useful as a comparative genome for evolution and speciation studies, as well as for the analysis of protein-coding gene families, the pathogenicity of different Vibrio species in fish, the evolution of non-coding RNA and transposon, and the regulation of gene expression in relation to the bioluminescence of Vqin-Q67.


September 22, 2019  |  

First draft genome of an iconic clownfish species (Amphiprion frenatus).

Clownfishes (or anemonefishes) form an iconic group of coral reef fishes, principally known for their mutualistic interaction with sea anemones. They are characterized by particular life history traits, such as a complex social structure and mating system involving sequential hermaphroditism, coupled with an exceptionally long lifespan. Additionally, clownfishes are considered to be one of the rare groups to have experienced an adaptive radiation in the marine environment. Here, we assembled and annotated the first genome of a clownfish species, the tomato clownfish (Amphiprion frenatus). We obtained 17,801 assembled scaffolds, containing a total of 26,917 genes. The completeness of the assembly and annotation was satisfying, with 96.5% of the Actinopterygii Benchmarking Universal Single-Copy Orthologs (BUSCOs) being retrieved in A. frenatus assembly. The quality of the resulting assembly is comparable to other bony fish assemblies. This resource is valuable for advancing studies of the particular life history traits of clownfishes, as well as being useful for population genetic studies and the development of new phylogenetic markers. It will also open the way to comparative genomics. Indeed, future genomic comparison among closely related fishes may provide means to identify genes related to the unique adaptations to different sea anemone hosts, as well as better characterize the genomic signatures of an adaptive radiation. © 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.


September 22, 2019  |  

Screening and genomic characterization of filamentous hemagglutinin-deficient Bordetella pertussis.

Despite high vaccine coverage, pertussis cases in the United States have increased over the last decade. Growing evidence suggests that disease resurgence results, in part, from genetic divergence of circulating strain populations away from vaccine references. The United States employs acellular vaccines exclusively, and current Bordetella pertussis isolates are predominantly deficient in at least one immunogen, pertactin (Prn). First detected in the United States retrospectively in a 1994 isolate, the rapid spread of Prn deficiency is likely vaccine driven, raising concerns about whether other acellular vaccine immunogens experience similar pressures, as further antigenic changes could potentially threaten vaccine efficacy. We developed an electrochemiluminescent antibody capture assay to monitor the production of the acellular vaccine immunogen filamentous hemagglutinin (Fha). Screening 722 U.S. surveillance isolates collected from 2010 to 2016 identified two that were both Prn and Fha deficient. Three additional Fha-deficient laboratory strains were also identified from a historic collection of 65 isolates dating back to 1935. Whole-genome sequencing of deficient isolates revealed putative, underlying genetic changes. Only four isolates harbored mutations to known genes involved in Fha production, highlighting the complexity of its regulation. The chromosomes of two Fha-deficient isolates included unexpected structural variation that did not appear to influence Fha production. Furthermore, insertion sequence disruption of fhaB was also detected in a previously identified pertussis toxin-deficient isolate that still produced normal levels of Fha. These results demonstrate the genetic potential for additional vaccine immunogen deficiency and underscore the importance of continued surveillance of circulating B. pertussis evolution in response to vaccine pressure. Copyright © 2018 American Society for Microbiology.


September 22, 2019  |  

Complete genome sequence and analysis of the industrial Saccharomyces cerevisiae strain N85 used in Chinese rice wine production.

Chinese rice wine is a popular traditional alcoholic beverage in China, while its brewing processes have rarely been explored. We herein report the first gapless, near-finished genome sequence of the yeast strain Saccharomyces cerevisiae N85 for Chinese rice wine production. Several assembly methods were used to integrate Pacific Bioscience (PacBio) and Illumina sequencing data to achieve high-quality genome sequencing of the strain. The genome encodes more than 6,000 predicted proteins, and 238 long non-coding RNAs, which are validated by RNA-sequencing data. Moreover, our annotation predicts 171 novel genes that are not present in the reference S288c genome. We also identified 65,902 single nucleotide polymorphisms and small indels, many of which are located within genic regions. Dozens of larger copy-number variations and translocations were detected, mainly enriched in the subtelomeres, suggesting these regions may be related to genomic evolution. This study will serve as a milestone in studying of Chinese rice wine and related beverages in China and in other countries. It will help to develop more scientific and modern fermentation processes of Chinese rice wine, and explore metabolism pathways of desired and harmful components in Chinese rice wine to improve its taste and nutritional value. © The Author(s) 2018. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


September 22, 2019  |  

Complete genome of Cobetia marina JCM 21022T and phylogenomic analysis of the family Halomonadaceae

Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. To better understand the role of the surface-associated lifestyle of C. marina and the phylogeny of the family Halomonadaceae, we sequenced the entire genome of C. marina JCM 21022T using single molecule real-time sequencing technology (SMRT) and performed comparative genomics and phylogenomics analyses. The circular chromosome was 4 176 300 bp with an average GC content of 62.44% and contained 3 611 predicted coding sequences, 72 tRNA genes, and 21 rRNA genes. The C. marina JCM 21022T genome contained a set of crucial genes involved in surface colonization processes. The comparative genome analysis indicated the significant differences between C. marina JCM 21022T and Cobetia amphilecti KMM 296 (formerly named C. marina KMM 296) resulted from sequence insertions or deletions and chromosomal recombination. Despite these differences, pan and core genome analysis showed similar gene functions between the two strains. The phylogenomic study of the family Halomonadaceae is reported here for the first time. We found that the relationships were well resolved among every genera tested, including Chromohalobacter, Halomonas, Cobetia, Kushneria, Zymobacter, and Halotalea.


September 22, 2019  |  

Assembly and analysis of a qingke reference genome demonstrate its close genetic relation to modern cultivated barley.

Qingke, the local name of hulless barley in the Tibetan Plateau, is a staple food for Tibetans. The availability of its reference genome sequences could be useful for studies on breeding and molecular evolution. Taking advantage of the third-generation sequencer (PacBio), we de novo assembled a 4.84-Gb genome sequence of qingke, cv. Zangqing320 and anchored a 4.59-Gb sequence to seven chromosomes. Of the 46,787 annotated ‘high-confidence’ genes, 31 564 were validated by RNA-sequencing data of 39 wild and cultivated barley genotypes with wide genetic diversity, and the results were also confirmed by nonredundant protein database from NCBI. As some gaps in the reference genome of Morex were covered in the reference genome of Zangqing320 by PacBio reads, we believe that the Zangqing320 genome provides the useful supplements for the Morex genome. Using the qingke genome as a reference, we conducted a genome comparison, revealing a close genetic relationship between a hulled barley (cv. Morex) and a hulless barley (cv. Zangqing320), which is strongly supported by the low-diversity regions in the two genomes. Considering the origin of Morex from its breeding pedigree, we then demonstrated a close genomic relationship between modern cultivated barley and qingke. Given this genomic relationship and the large genetic diversity between qingke and modern cultivated barley, we propose that qingke could provide elite genes for barley improvement. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


September 22, 2019  |  

Functional genomics of lipid metabolism in the oleaginous yeast Rhodosporidium toruloides.

The basidiomycete yeast Rhodosporidium toruloides (also known as Rhodotorula toruloides) accumulates high concentrations of lipids and carotenoids from diverse carbon sources. It has great potential as a model for the cellular biology of lipid droplets and for sustainable chemical production. We developed a method for high-throughput genetics (RB-TDNAseq), using sequence-barcoded Agrobacterium tumefaciens T-DNA insertions. We identified 1,337 putative essential genes with low T-DNA insertion rates. We functionally profiled genes required for fatty acid catabolism and lipid accumulation, validating results with 35 targeted deletion strains. We identified a high-confidence set of 150 genes affecting lipid accumulation, including genes with predicted function in signaling cascades, gene expression, protein modification and vesicular trafficking, autophagy, amino acid synthesis and tRNA modification, and genes of unknown function. These results greatly advance our understanding of lipid metabolism in this oleaginous species and demonstrate a general approach for barcoded mutagenesis that should enable functional genomics in diverse fungi.


September 22, 2019  |  

Enterobacter bugandensis: a novel enterobacterial species associated with severe clinical infection.

Nosocomial pathogens can cause life-threatening infections in neonates and immunocompromised patients. E. bugandensis (EB-247) is a recently described species of Enterobacter, associated with neonatal sepsis. Here we demonstrate that the extended spectrum β-lactam (ESBL) producing isolate EB-247 is highly virulent in both Galleria mellonella and mouse models of infection. Infection studies in a streptomycin-treated mouse model showed that EB-247 is as efficient as Salmonella Typhimurium in inducing systemic infection and release of proinflammatory cytokines. Sequencing and analysis of the complete genome and plasmid revealed that virulence properties are associated with the chromosome, while antibiotic-resistance genes are exclusively present on a 299 kb IncHI plasmid. EB-247 grew in high concentrations of human serum indicating septicemic potential. Using whole genome-based transcriptome analysis we found 7% of the genome was mobilized for growth in serum. Upregulated genes include those involved in the iron uptake and storage as well as metabolism. The lasso peptide microcin J25 (MccJ25), an inhibitor of iron-uptake and RNA polymerase activity, inhibited EB-247 growth. Our studies indicate that Enterobacter bugandensis is a highly pathogenic species of the genus Enterobacter. Further studies on the colonization and virulence potential of E. bugandensis and its association with septicemic infection is now warranted.


September 22, 2019  |  

The hardy rubber tree genome provides insights into the evolution of polyisoprene biosynthesis.

Eucommia ulmoides, also called hardy rubber tree, is an economically important tree; however, the lack of its genome sequence restricts the fundamental biological research and applied studies of this plant species. Here, we present a high-quality assembly of its ~1.2-Gb genome (scaffold N50 = 1.88 Mb) with at least 26 723 predicted genes for E. ulmoides, the first sequenced genome of the order Garryales, which was obtained using an integrated strategy combining Illumina sequencing, PacBio sequencing, and BioNano mapping. As a sister taxon to lamiids and campanulids, E. ulmoides underwent an ancient genome triplication shared by core eudicots but no further whole-genome duplication in the last ~125 million years. E. ulmoides exhibits high expression levels and/or gene number expansion for multiple genes involved in stress responses and the biosynthesis of secondary metabolites, which may account for its considerable environmental adaptability. In contrast to the rubber tree (Hevea brasiliensis), which produces cis-polyisoprene, E. ulmoides has evolved to synthesize long-chain trans-polyisoprene via farnesyl diphosphate synthases (FPSs). Moreover, FPS and rubber elongation factor/small rubber particle protein gene families were expanded independently from the H. brasiliensis lineage. These results provide new insights into the biology of E. ulmoides and the origin of polyisoprene biosynthesis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019  |  

The genome sequence of a new strain of Mycobacterium ulcerans ecovar Liflandii, emerging as a sturgeon pathogen

Mycobacterium ulcerans ecovar Liflandii (MuLiflandii) is emerging as a non-mycobacterial pathogen in amphibians. Here, we make the first report on the prevalence of a new strain of MuLiflandii infection in Chinese sturgeon. All the diseased fish showed the classic clinical symptoms of ascites and/or muscle ulceration. A new slow-growing and acid-fast bacillus ASM001 strain was obtained from the ascites of infected fish; this strain demonstrated pathogenicity when tested in hybrid sturgeon. The complete genome sequence of MuLiflandii ASM001 is a circular chromosome of 6,167,296 bp, with a G + C content of 65.57%, containing 4518 predicted coding DNA sequences and 999 pseudo-genes, 3 rRNA operons, and 47 transfer RNA sequences. In addition, we found 245 copies of IS2404, 34 microsatellites, and 36 CRISPR sequences in the whole MuLiflandii ASM001 genome. Among the predicted genes of MuLiflandii ASM001, we found orthologs of 203 virulence factors of clinical MuLiflandii 128FXT operating in host cell invasion, modulation of phagocyte function, and survival inside the macrophages. These virulence factor candidates provide a key basis for understanding their pathogenic mechanisms at the molecular level. A comparative analysis that used complete, existing genomes showed that MuLiflandii ASM001 has high synteny with MuLiflandii 128FXT. We anticipate the availability of the complete MuLiflandii ASM001 genome sequence will provide a valuable resource for comparative genomic studies of MuLiflandii isolates, as well as provide new insights into the host, ecological, and functional diversity of the genus Mycobacterium.


September 22, 2019  |  

Microsatellite polymorphism in the endangered snail kite reveals a panmictic, low diversity population

Genetic structure and genetic diversity are key population characteristics that can inform conservation decisions, such as delineating management units or assessing potential risks for inbreeding depression. Evidence of genetic structuring or low genetic diversity in the critically endangered snail kite (Rostrhamus sociabilis plumbeus) would have implications for monitoring and planning decisions. Recent work on understanding connectivity across the snail kite range indicated that there is less dispersal between northern and southern parts of the current range, and that dispersal is shaped by individual habitat preference. We examine whether there is neutral genetic structure and the amount of genetic variation in the population by non-lethally sampling 235 nestlings from unique nests across the entire breeding range between 2013 and 2014. Data on 15 microsatellite revealed low diversity (e.g., Na = 2.54, He = 0.37) and range-wide panmixia based on AMOVA, Bayesian clustering, spatial autocorrelation, isolation by distance, and spatially explicit ordination analyses. Our results emphasize that long-term recovery goals and management strategies should be based on viewing snail kites as a single genetic population, despite evidence for non-random dispersal between wetlands over ecological time scales. These results also highlight the need to understand potential effects of low genetic diversity on population dynamics and viability of snail kites. More broadly, these results add to the growing evidence for potential discrepancies between dispersal and genetic patterns, emphasizing that care should be taken if using one to interpret the other, particularly for widely-ranging species.


September 22, 2019  |  

Expansions of intronic TTTCA and TTTTA repeats in benign adult familial myoclonic epilepsy.

Epilepsy is a common neurological disorder, and mutations in genes encoding ion channels or neurotransmitter receptors are frequent causes of monogenic forms of epilepsy. Here we show that abnormal expansions of TTTCA and TTTTA repeats in intron 4 of SAMD12 cause benign adult familial myoclonic epilepsy (BAFME). Single-molecule, real-time sequencing of BAC clones and nanopore sequencing of genomic DNA identified two repeat configurations in SAMD12. Intriguingly, in two families with a clinical diagnosis of BAFME in which no repeat expansions in SAMD12 were observed, we identified similar expansions of TTTCA and TTTTA repeats in introns of TNRC6A and RAPGEF2, indicating that expansions of the same repeat motifs are involved in the pathogenesis of BAFME regardless of the genes in which the expanded repeats are located. This discovery that expansions of noncoding repeats lead to neuronal dysfunction responsible for myoclonic tremor and epilepsy extends the understanding of diseases with such repeat expansion.


September 22, 2019  |  

Massive lateral transfer of genes encoding plant cell wall-degrading enzymes to the mycoparasitic fungus Trichoderma from its plant-associated hosts.

Unlike most other fungi, molds of the genus Trichoderma (Hypocreales, Ascomycota) are aggressive parasites of other fungi and efficient decomposers of plant biomass. Although nutritional shifts are common among hypocrealean fungi, there are no examples of such broad substrate versatility as that observed in Trichoderma. A phylogenomic analysis of 23 hypocrealean fungi (including nine Trichoderma spp. and the related Escovopsis weberi) revealed that the genus Trichoderma has evolved from an ancestor with limited cellulolytic capability that fed on either fungi or arthropods. The evolutionary analysis of Trichoderma genes encoding plant cell wall-degrading carbohydrate-active enzymes and auxiliary proteins (pcwdCAZome, 122 gene families) based on a gene tree / species tree reconciliation demonstrated that the formation of the genus was accompanied by an unprecedented extent of lateral gene transfer (LGT). Nearly one-half of the genes in Trichoderma pcwdCAZome (41%) were obtained via LGT from plant-associated filamentous fungi belonging to different classes of Ascomycota, while no LGT was observed from other potential donors. In addition to the ability to feed on unrelated fungi (such as Basidiomycota), we also showed that Trichoderma is capable of endoparasitism on a broad range of Ascomycota, including extant LGT donors. This phenomenon was not observed in E. weberi and rarely in other mycoparasitic hypocrealean fungi. Thus, our study suggests that LGT is linked to the ability of Trichoderma to parasitize taxonomically related fungi (up to adelphoparasitism in strict sense). This may have allowed primarily mycotrophic Trichoderma fungi to evolve into decomposers of plant biomass.


September 22, 2019  |  

Epigenetic landscape influences the liver cancer genome architecture.

The accumulations of different types of genetic alterations such as nucleotide substitutions, structural rearrangements and viral genome integrations and epigenetic alterations contribute to carcinogenesis. Here, we report correlation between the occurrence of epigenetic features and genetic aberrations by whole-genome bisulfite, whole-genome shotgun, long-read, and virus capture sequencing of 373 liver cancers. Somatic substitutions and rearrangement breakpoints are enriched in tumor-specific hypo-methylated regions with inactive chromatin marks and actively transcribed highly methylated regions in the cancer genome. Individual mutation signatures depend on chromatin status, especially, signatures with a higher transcriptional strand bias occur within active chromatic areas. Hepatitis B virus (HBV) integration sites are frequently detected within inactive chromatin regions in cancer cells, as a consequence of negative selection for integrations in active chromatin regions. Ultra-high structural instability and preserved unmethylation of integrated HBV genomes are observed. We conclude that both precancerous and somatic epigenetic features contribute to the cancer genome architecture.

