April 21, 2020

Targeted Long-Read RNA Sequencing Demonstrates Transcriptional Diversity Driven by Splice-Site Variation in MYBPC3.

To date, clinical sequencing has focused on genomic DNA using targeted panels and exome sequencing. Sequencing of a large hypertrophic cardiomyopathy (HCM) cohort revealed that positive identification of a disease-associated variant was returned in only 32% of patients, with an additional 15% receiving inconclusive results. When genome sequencing fails to reveal causative variants, the transcriptome may provide additional diagnostic clarity. A recent study examining patients with genetically undiagnosed muscle disorders found that RNA sequencing, when used as a complement to exome and whole genome sequencing, had an overall diagnosis rate of 35%.


April 21, 2020

Comparative Genome Analysis of an Extensively Drug-Resistant Isolate of Avian Sequence Type 167 Escherichia coli Strain Sanji with Novel In Silico Serotype O89b:H9.

Extensive drug resistance (XDR) is an escalating global problem. Escherichia coli strain Sanji was isolated from an outbreak of pheasant colibacillosis in Fujian province, China, in 2011. This strain has XDR properties, exhibiting sensitivity to carbapenems but no other classes of known antibiotics. Whole-genome sequencing revealed a total of 32 known antibiotic resistance genes, many associated with insertion sequence 26 (IS26) elements. These were found on the Sanji chromosome and 2 of its 6 plasmids, pSJ_255 and pSJ_82. The Sanji chromosome also harbors a type 2 secretion system (T2SS), a type 3 secretion system (T3SS), a type 6 secretion system (T6SS), and several putative prophages. Sanji and other ST167 strains have a previously uncharacterized O-antigen (O89b) that is most closely related to serotype O89 as determined on the basis of analysis of the wzm-wzt genes and in silico serotyping. This O89b-antigen gene cluster was also found in the genomes of a few other pathogenic sequence type 617 (ST617) and ST10 complex strains. A time-scaled phylogeny inferred from comparative single nucleotide variant analysis indicated that these O89b-containing lineages emerged about 30 years ago. Comparative sequence analysis revealed that the core genome of Sanji is nearly identical to that of several recently sequenced strains of pathogenic XDR E. coli belonging to the ST167 group. Comparison of the mobile elements among the different ST167 genomes revealed that each genome carries a distinct set of multidrug resistance genes on different types of plasmids, indicating that there are multiple paths toward the emergence of XDR in E. coli.
IMPORTANCE E. coli strain Sanji is the first sequenced and analyzed genome of the recently emerged pathogenic XDR strains with sequence type ST167 and novel in silico serotype O89b:H9. Comparison of the genomes of Sanji with other ST167 strains revealed distinct sets of different plasmids, mobile IS elements, and antibiotic resistance genes in each genome, indicating that there exist multiple paths toward achieving XDR. The emergence of these pathogenic ST167 E. coli strains with diverse XDR capabilities highlights the difficulty of preventing or mitigating the development of XDR properties in bacteria and points to the importance of better understanding of the shared underlying virulence mechanisms and physiology of pathogenic bacteria.


April 21, 2020

Complete Genome Sequence of “Candidatus Thioglobus sp.” Strain NP1, an Open-Ocean Isolate from the SUP05 Clade of Marine Gammaproteobacteria

“Candidatus Thioglobus sp.” strain NP1 is an open-ocean isolate from the SUP05 clade of Gammaproteobacteria. Whole-genome comparisons of strain NP1 to other sequenced isolates from the SUP05 clade indicate that it represents a new species of SUP05 that lacks the ability to fix inorganic carbon using the Calvin-Benson-Bassham cycle.


April 21, 2020

An improved genome assembly of the fluke Schistosoma japonicum.

Schistosoma japonicum is a parasitic flatworm that causes human schistosomiasis, which is a significant cause of morbidity in China and the Philippines. A single draft genome was available for S. japonicum, yet this assembly is very fragmented and covers only 90% of the genome, which makes it difficult to use as a reference for functional genome analysis and gene discovery. In this study, we present a high-quality assembly of the fluke S. japonicum genome by combining 20 G (~53X) long single molecule real time sequencing reads with 80 G (~213X) Illumina paired-end reads. This improved genome assembly is approximately 370.5 Mb, with contig and scaffold N50 lengths of 871.9 kb and 1.09 Mb, representing 142.4-fold and 6.2-fold improvements over the released WGS-based assembly, respectively. Additionally, our assembly captured 85.2% complete and 4.6% partial eukaryotic Benchmarking Universal Single-Copy Orthologs. Repetitive elements account for 46.80% of the genome, and 10,089 protein-coding genes were predicted from the improved genome, of which 96.5% have been functionally annotated. Lastly, using the improved assembly, we identified 20 significantly expanded gene families in S. japonicum, and those genes were primarily enriched in functions of proteolysis and protein glycosylation. Using the combination of PacBio and Illumina sequencing technologies, we provided an improved high-quality genome of S. japonicum. This improved genome assembly, as well as the annotation, will be useful for the comparative genomics of the flukes and, more importantly, will facilitate molecular studies of this important parasite in the future.
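
The contig and scaffold N50 values quoted in this abstract follow the usual assembly-statistics definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The Python sketch below illustrates that definition only; the function name `n50` and the toy lengths are assumptions for illustration and are not taken from the study or its pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Hypothetical helper for illustration: sorts lengths from longest to
    shortest and returns the length at which the running total first
    reaches half of the summed length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    # Only reached when no lengths were supplied.
    raise ValueError("lengths must be non-empty")


# Toy example (made-up lengths, not data from the assembly):
toy_contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_contigs))
```

A larger N50 at a similar total assembly size generally means the same genome is represented by fewer, longer pieces, which is the sense in which the fold improvements above are reported.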


April 21, 2020

Leishmania tarentolae: Taxonomic classification and its application as a promising biotechnological expression host.

In this review, we summarize the current knowledge concerning the eukaryotic protozoan parasite Leishmania tarentolae, with a main focus on its potential for biotechnological applications. We will also discuss the genus, subgenus, and species-level classification of this parasite, its life cycle and geographical distribution, and similarities and differences to human-pathogenic species, as these aspects are relevant for the evaluation of biosafety aspects of L. tarentolae as host for recombinant DNA/protein applications. Studies indicate that strain LEM-125 but not strain TARII/UC of L. tarentolae might also be capable of infecting mammals, at least transiently. This could raise the question of whether the current biosafety level of this strain should be reevaluated. In addition, we will summarize the current state of biotechnological research involving L. tarentolae and explain why this eukaryotic parasite is an advantageous and promising human recombinant protein expression host. This summary includes overall biotechnological applications, insights into its protein expression machinery (especially on glycoprotein and antibody fragment expression), available expression vectors, cell culture conditions, and its potential as an immunotherapy agent for human leishmaniasis treatment. Furthermore, we will highlight useful online tools and, finally, discuss possible future applications such as the humanization of the glycosylation profile of L. tarentolae or the expression of mammalian recombinant proteins in amastigote-like cells of this species or in amastigotes of avirulent human-pathogenic Leishmania species.


April 21, 2020

Application of long read sequencing to determine expressed antigen diversity in Trypanosoma brucei infections.

Antigenic variation is employed by many pathogens to evade the host immune response, and Trypanosoma brucei has evolved a complex system to achieve this phenotype, involving sequential use of variant surface glycoprotein (VSG) genes encoded from a large repertoire of ~2,000 genes. T. brucei expresses multiple, sometimes closely related, VSGs in a population at any one time, and the ability to resolve and analyse this diversity has been limited. We applied long read sequencing (PacBio) to VSG amplicons generated from blood extracted from batches of mice sacrificed at time points (days 3, 6, 10 and 12) post-infection with T. brucei TREU927. The data showed that long read sequencing is reliable for resolving variant differences between VSGs, and demonstrated that there is significant expressed diversity (449 VSGs detected across 20 mice) and that across the timeframe of study there was a clear semi-reproducible pattern of expressed diversity (median of 27 VSGs per sample at day 3 post infection (p.i.), 82 VSGs at day 6 p.i., 187 VSGs at day 10 p.i. and 132 VSGs by day 12 p.i.). There was also consistent detection of one VSG dominating expression across replicates at days 3 and 6, and emergence of a second dominant VSG across replicates by day 12. The innovative application of ecological diversity analysis to VSG reads enabled characterisation of hierarchical VSG expression in the dataset, and resulted in a novel method for analysing such patterns of variation. Additionally, the long read approach allowed detection of mosaic VSG expression from very few reads, the earliest in infection that such events have been detected. Therefore, our results indicate that long read analysis is a reliable tool for resolving diverse gene expression profiles, and provides novel insights into the complexity and nature of VSG expression in trypanosomes, revealing significantly higher diversity than previously shown and the ability to identify mosaic gene formation early during the infection process.
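
The abstract does not say which ecological diversity measures were applied to the VSG reads. As one possible illustration, the Shannon index is a widely used ecological measure that rises with both the number of variants and the evenness of their read counts. The sketch below is an assumption-laden example, not the authors' method: the function name `shannon_diversity` and the read counts are hypothetical.

```python
import math


def shannon_diversity(read_counts):
    """Shannon index H = -sum(p_i * ln(p_i)) over per-variant read counts.

    Hypothetical helper for illustration: `read_counts` holds the number
    of reads assigned to each distinct variant in one sample. Variants
    with zero reads contribute nothing and are skipped.
    """
    total = sum(read_counts)
    if total <= 0:
        raise ValueError("at least one read is required")
    proportions = [count / total for count in read_counts if count > 0]
    return -sum(p * math.log(p) for p in proportions)


# Made-up samples: one dominated by a single variant, one more even.
dominated_sample = [970, 10, 10, 10]
even_sample = [250, 250, 250, 250]
print(shannon_diversity(dominated_sample))
print(shannon_diversity(even_sample))
```

In this toy comparison, both samples contain four variants, but the sample dominated by one variant scores lower than the even one, which is the kind of contrast a dominance hierarchy in expression would produce.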


April 21, 2020

The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications.

UNITE (https://unite.ut.ee/) is a web-based database and sequence management environment for the molecular identification of fungi. It targets the formal fungal barcode, the nuclear ribosomal internal transcribed spacer (ITS) region, and offers all ~1 000 000 public fungal ITS sequences for reference. These are clustered into ~459 000 species hypotheses and assigned digital object identifiers (DOIs) to promote unambiguous reference across studies. In-house and web-based third-party sequence curation and annotation have resulted in more than 275 000 improvements to the data over the past 15 years. UNITE serves as a data provider for a range of metabarcoding software pipelines and regularly exchanges data with all major fungal sequence databases and other community resources. Recent improvements include redesigned handling of unclassifiable species hypotheses, integration with the taxonomic backbone of the Global Biodiversity Information Facility, and support for an unlimited number of parallel taxonomic classification systems.


April 21, 2020

The genetic architecture underlying diapause termination in a planktonic crustacean.

Diapause is a feature of the life cycle of many invertebrates by which unfavourable environmental conditions can be outlived. The seasonal timing of diapause allows organisms to adapt to seasonal changes in habitat suitability and thus is key to their fitness. In the planktonic crustacean Daphnia, various cues can induce the production of diapause stages that are resistant to heat, drought or freezing and contain one to two embryos in developmental arrest. Daphnia is a keystone species of many freshwater ecosystems, where it acts as the main link between phytoplankton and higher trophic levels. The correct seasonal timing of diapause termination is essential to maintain trophic interactions and is achieved via a genetically based interpretation of environmental cues like photoperiod and temperature. Field monitoring and modelling studies have raised concerns about whether populations can advance their seasonal release from diapause in response to advances in spring phenology under global change, or whether a failure to adapt will cause trophic mismatches that negatively affect ecosystem functioning. Our capacity to understand and predict the evolution of diapause timing requires information about the genetic architecture underlying this trait. In this study, we used QTL mapping to identify eight quantitative trait loci (QTLs) and four epistatic interactions that together explained 66.5% of the variation in diapause termination in Daphnia magna. Our results suggest that the most significant QTL modulates diapause termination dependent on photoperiod and is involved in three of the four detected epistatic interactions. Candidate genes at this QTL could be identified through integration with genome data and included the presynaptic active zone protein bruchpilot. Our findings contribute to understanding the genomic control of seasonal diapause timing in an ecologically relevant species. © 2019 John Wiley & Sons Ltd.


April 21, 2020

Genome mining identifies cepacin as a plant-protective metabolite of the biopesticidal bacterium Burkholderia ambifaria.

Beneficial microorganisms are widely used in agriculture for control of plant pathogens, but a lack of efficacy and safety information has limited the exploitation of multiple promising biopesticides. We applied phylogeny-led genome mining, metabolite analyses and biological control assays to define the efficacy of Burkholderia ambifaria, a naturally beneficial bacterium with proven biocontrol properties but potential pathogenic risk. A panel of 64 B. ambifaria strains demonstrated significant antimicrobial activity against priority plant pathogens. Genome sequencing, specialized metabolite biosynthetic gene cluster mining and metabolite analysis revealed an armoury of known and unknown pathways within B. ambifaria. The biosynthetic gene cluster responsible for the production of the metabolite cepacin was identified and directly shown to mediate protection of germinating crops against Pythium damping-off disease. B. ambifaria maintained biopesticidal protection and overall fitness in the soil after deletion of its third replicon, a non-essential plasmid associated with virulence in Burkholderia cepacia complex bacteria. Removal of the third replicon reduced B. ambifaria persistence in a murine respiratory infection model. Here, we show that by using interdisciplinary phylogenomic, metabolomic and functional approaches, the mode of action of natural biological control agents related to pathogens can be systematically established to facilitate their future exploitation.


April 21, 2020

Genome Analyses of a New Mycoplasma Species from the Scorpion Centruroides vittatus.

Arthropod Mycoplasma are little known endosymbionts in insects, primarily known as plant disease vectors. Mycoplasma in other arthropods such as arachnids are unknown. We report the first complete Mycoplasma genome sequenced, identified, and annotated from a scorpion, Centruroides vittatus, and designate it as Mycoplasma vittatus. We find the genome is at least a 683,827 bp single circular chromosome with a GC content of 42.7% and with 987 protein-coding genes. The putative virulence determinants include 11 genes associated with the virulence operon associated with protein synthesis or DNA transcription and ten genes with antibiotic and toxic compound resistance. Comparative analysis revealed that the M. vittatus genome is smaller than other Mycoplasma genomes and exhibits a higher GC content. Phylogenetic analysis shows M. vittatus as part of the Hominis group of Mycoplasma. As arthropod genomes accumulate, further novel Mycoplasma genomes may be identified and characterized. Copyright © 2019 Yamashita et al.


April 21, 2020

Characterization and Complete Genome Analysis of the Carbazomycin B-Producing Strain Streptomyces luteoverticillatus SZJ61.

Members of marine Actinobacteria have been highly regarded as potentially important sources of antimicrobial compounds. Here, we isolated a strain of Actinobacteria, SZJ61, and showed that it inhibits the in vitro growth of fungi pathogenic to plants. This new isolate was identified as Streptomyces luteoverticillatus by morphological, biochemical and genetic analyses. Antifungal compounds were isolated from S. luteoverticillatus strain SZJ61 and characterized as carbazomycin B by nuclear magnetic resonance spectra. We then sequenced the genome of the S. luteoverticillatus SZJ61 strain, which consists of only one 7,367,863 bp linear chromosome that has a G+C content of 72.05%. Thirty-five putative biosynthetic gene clusters for secondary metabolites, including a variety of bioactive products, were found. Mining of the genome sequence information revealed the putative biosynthetic gene cluster of carbazomycin B. This genomic information is valuable for interpreting the biosynthetic mechanisms of diverse bioactive compounds that have potential applications in the pharmaceutical industry.


April 21, 2020

The population genetics of structural variants in grapevine domestication.

Structural variants (SVs) are a largely unexplored feature of plant genomes. Little is known about the type and size of SVs, their distribution among individuals and, especially, their population dynamics. Understanding these dynamics is critical for understanding both the contributions of SVs to phenotypes and the likelihood of identifying them as causal genetic variants in genome-wide associations. Here, we identify SVs and study their evolutionary genomics in clonally propagated grapevine cultivars and their outcrossing wild progenitors. To catalogue SVs, we assembled the highly heterozygous Chardonnay genome, for which one in seven genes is hemizygous based on SVs. Using an integrative comparison between Chardonnay and Cabernet Sauvignon genomes by whole-genome, long-read and short-read alignment, we extended SV detection to population samples. We found that strong purifying selection acts against SVs but particularly against inversion and translocation events. SVs nonetheless accrue as recessive heterozygotes in clonally propagated lineages. They also define outlier regions of genomic divergence between wild and cultivated grapevines, suggesting roles in domestication. Outlier regions include the sex-determination region and the berry colour locus, where independent large, complex inversions have driven convergent phenotypic evolution.


April 21, 2020

Parallels between natural selection in the cold-adapted crop-wild relative Tripsacum dactyloides and artificial selection in temperate adapted maize.

Artificial selection has produced varieties of domesticated maize that thrive in temperate climates around the world. However, the direct progenitor of maize, teosinte, is indigenous only to a relatively small range of tropical and subtropical latitudes and grows poorly or not at all outside of this region. Tripsacum, a sister genus to maize and teosinte, is naturally endemic to the majority of areas in the western hemisphere where maize is cultivated. A full-length reference transcriptome for Tripsacum dactyloides generated using long-read Iso-Seq data was used to characterize independent adaptation to temperate climates in this clade. Genes related to phospholipid biosynthesis, a critical component of cold acclimation in other cold-adapted plant lineages, were enriched among those genes experiencing more rapid rates of protein sequence evolution in T. dactyloides. In contrast with previous studies of parallel selection, we find that there is a significant overlap between the genes that were targets of artificial selection during the adaptation of maize to temperate climates and those that were targets of natural selection in temperate-adapted T. dactyloides. Genes related to growth, development, response to stimulus, signaling, and organelles were enriched in the set of genes identified as both targets of natural and artificial selection. © 2019 The Authors The Plant Journal © 2019 John Wiley & Sons Ltd.


April 21, 2020

Estimating the Fitness Effect of Deleterious Mutations During the Two Phases of the Life Cycle: A New Method Applied to the Root-Rot Fungus Heterobasidion parviporum.

Many eukaryote species, including taxa such as fungi or algae, have a lifecycle with substantial haploid and diploid phases. A recent theoretical model predicts that such haploid-diploid lifecycles are stable over long evolutionary time scales when segregating deleterious mutations have stronger effects in homozygous diploids than in haploids and when they are partially recessive in heterozygous diploids. The model predicts that effective dominance, a measure that accounts for these two effects, should be close to 0.5 in these species. It also predicts that diploids should have higher fitness than haploids on average. However, an appropriate statistical framework to conjointly investigate these predictions is currently lacking. In this study, we derive a new quantitative genetic model to test these predictions using fitness data of two haploid parents and their diploid offspring, and genome-wide genetic distance between haploid parents. We apply this model to the root-rot basidiomycete fungus Heterobasidion parviporum, a species where the heterokaryotic (equivalent to the diploid) phase is longer than the homokaryotic (haploid) phase. We measured two fitness-related traits (mycelium growth rate and the ability to degrade wood) in both homokaryons and heterokaryons, and we used whole-genome sequencing to estimate nuclear genetic distance between parents. Possibly due to a lack of power, we did not find that deleterious mutations were recessive or more deleterious when expressed during the heterokaryotic phase. Using this model to compare effective dominance among haploid-diploid species where the relative importance of the two phases varies should help better understand the evolution of haploid-diploid life cycles. Copyright © 2019 by the Genetics Society of America.

