Menu
April 21, 2020

Diploid Genome Assembly of the Wine Grape Carménère.

In this genome report, we describe the sequencing and annotation of the genome of the wine grape Carménère (clone 02, VCR-702). Long considered extinct, this old French wine grape variety is now cultivated mostly in Chile where it was imported in the 1850s just before the European phylloxera epidemic. Genomic DNA was sequenced using Single Molecule Real Time technology and assembled with FALCON-Unzip, a diploid-aware assembly pipeline. To optimize the contiguity and completeness of the assembly, we tested about a thousand combinations of assembly parameters, sequencing coverage, error correction and repeat masking methods. The final scaffolds provide a complete and phased representation of the diploid genome of this wine grape. Comparison of the two haplotypes revealed numerous heterozygous variants, including loss-of-function ones, some of which in genes associated with polyphenol biosynthesis. Comparisons with other publicly available grape genomes and transcriptomes showed the impact of structural variation on gene content differences between Carménère and other wine grape cultivars. Among the putative cultivar-specific genes, we identified genes potentially involved in aroma production and stress responses. The genome assembly of Carménère expands the representation of the genomic variability in grapes and will enable studies that aim to understand its distinctive organoleptic and agronomical features and assess its still elusive extant genetic variability. A genome browser for Carménère, its annotation, and an associated blast tool are available at http://cantulab.github.io/data.Copyright © 2019 Minio et al.


April 21, 2020

Megabase Length Hypermutation Accompanies Human Structural Variation at 17p11.2.

DNA rearrangements resulting in human genome structural variants (SVs) are caused by diverse mutational mechanisms. We used long- and short-read sequencing technologies to investigate end products of de novo chromosome 17p11.2 rearrangements and query the molecular mechanisms underlying both recurrent and non-recurrent events. Evidence for an increased rate of clustered single-nucleotide variant (SNV) mutation in cis with non-recurrent rearrangements was found. Indel and SNV formation are associated with both copy-number gains and losses of 17p11.2, occur up to ~1 Mb away from the breakpoint junctions, and favor C > G transversion substitutions; results suggest that single-stranded DNA is formed during the genesis of the SV and provide compelling support for a microhomology-mediated break-induced replication (MMBIR) mechanism for SV formation. Our data show an additional mutational burden of MMBIR consisting of hypermutation confined to the locus and manifesting as SNVs and indels predominantly within genes. Copyright © 2019 Elsevier Inc. All rights reserved.


April 21, 2020

A Species-Wide Inventory of NLR Genes and Alleles in Arabidopsis thaliana.

Infectious disease is both a major force of selection in nature and a prime cause of yield loss in agriculture. In plants, disease resistance is often conferred by nucleotide-binding leucine-rich repeat (NLR) proteins, intracellular immune receptors that recognize pathogen proteins and their effects on the host. Consistent with extensive balancing and positive selection, NLRs are encoded by one of the most variable gene families in plants, but the true extent of intraspecific NLR diversity has been unclear. Here, we define a nearly complete species-wide pan-NLRome in Arabidopsis thaliana based on sequence enrichment and long-read sequencing. The pan-NLRome largely saturates with approximately 40 well-chosen wild strains, with half of the pan-NLRome being present in most accessions. We chart NLR architectural diversity, identify new architectures, and quantify selective forces that act on specific NLRs and NLR domains. Our study provides a blueprint for defining pan-NLRomes.Copyright © 2019 The Author(s). Published by Elsevier Inc. All rights reserved.


April 21, 2020

Genome of the Komodo dragon reveals adaptations in the cardiovascular and chemosensory systems of monitor lizards.

Monitor lizards are unique among ectothermic reptiles in that they have high aerobic capacity and distinctive cardiovascular physiology resembling that of endothermic mammals. Here, we sequence the genome of the Komodo dragon Varanus komodoensis, the largest extant monitor lizard, and generate a high-resolution de novo chromosome-assigned genome assembly for V. komodoensis using a hybrid approach of long-range sequencing and single-molecule optical mapping. Comparing the genome of V. komodoensis with those of related species, we find evidence of positive selection in pathways related to energy metabolism, cardiovascular homoeostasis, and haemostasis. We also show species-specific expansions of a chemoreceptor gene family related to pheromone and kairomone sensing in V. komodoensis and other lizard lineages. Together, these evolutionary signatures of adaptation reveal the genetic underpinnings of the unique Komodo dragon sensory and cardiovascular systems, and suggest that selective pressure altered haemostasis genes to help Komodo dragons evade the anticoagulant effects of their own saliva. The Komodo dragon genome is an important resource for understanding the biology of monitor lizards and reptiles worldwide.


April 21, 2020

Blast Fungal Genomes Show Frequent Chromosomal Changes, Gene Gains and Losses, and Effector Gene Turnover.

Pyricularia is a fungal genus comprising several pathogenic species causing the blast disease in monocots. Pyricularia oryzae, the best-known species, infects rice, wheat, finger millet, and other crops. As past comparative and population genomics studies mainly focused on isolates of P. oryzae, the genomes of the other Pyricularia species have not been well explored. In this study, we obtained a chromosomal-level genome assembly of the finger millet isolate P. oryzae MZ5-1-6 and also highly contiguous assemblies of Pyricularia sp. LS, P. grisea, and P. pennisetigena. The differences in the genomic content of repetitive DNA sequences could largely explain the variation in genome size among these new genomes. Moreover, we found extensive gene gains and losses and structural changes among Pyricularia genomes, including a large interchromosomal translocation. We searched for homologs of known blast effectors across fungal taxa and found that most avirulence effectors are specific to Pyricularia, whereas many other effectors share homologs with distant fungal taxa. In particular, we discovered a novel effector family with metalloprotease activity, distinct from the well-known AVR-Pita family. We predicted 751 gene families containing putative effectors in 7 Pyricularia genomes and found that 60 of them showed differential expression in the P. oryzae MZ5-1-6 transcriptomes obtained under experimental conditions mimicking the pathogen infection process. In summary, this study increased our understanding of the structural, functional, and evolutionary genomics of the blast pathogen and identified new potential effector genes, providing useful data for developing crops with durable resistance. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


April 21, 2020

Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.

Metagenomic samples are snapshots of complex ecosystems at work. They comprise hundreds of known and unknown species, contain multiple strain variants and vary greatly within and across environments. Many microbes found in microbial communities are not easily grown in culture making their DNA sequence our only clue into their evolutionary history and biological function. Metagenomic assembly is a computational process aimed at reconstructing genes and genomes from metagenomic mixtures. Current methods have made significant strides in reconstructing DNA segments comprising operons, tandem gene arrays and syntenic blocks. Shorter, higher-throughput sequencing technologies have become the de facto standard in the field. Sequencers are now able to generate billions of short reads in only a few days. Multiple metagenomic assembly strategies, pipelines and assemblers have appeared in recent years. Owing to the inherent complexity of metagenome assembly, regardless of the assembly algorithm and sequencing method, metagenome assemblies contain errors. Recent developments in assembly validation tools have played a pivotal role in improving metagenomics assemblers. Here, we survey recent progress in the field of metagenomic assembly, provide an overview of key approaches for genomic and metagenomic assembly validation and demonstrate the insights that can be derived from assemblies through the use of assembly validation strategies. We also discuss the potential for impact of long-read technologies in metagenomics. We conclude with a discussion of future challenges and opportunities in the field of metagenomic assembly and validation. © The Author 2017. Published by Oxford University Press.


April 21, 2020

Real time monitoring of Aeromonas salmonicida evolution in response to successive antibiotic therapies in a commercial fish farm.

Our ability to predict evolutionary trajectories of pathogens in response to antibiotic pressure is one of the promising leverage to fight against the present antibiotic resistance worldwide crisis. Yet, few studies tackled this question in situ at the outbreak level, due to the difficulty to link a given pathogenic clone evolution with its precise antibiotic exposure over time. In this study, we monitored the real-time evolution of an Aeromonas salmonicida clone in response to successive antibiotic and vaccine therapies in a commercial fish farm. The clone was responsible for a four-year outbreak of furunculosis within a Recirculating Aquaculture System Salmo salar farm in China, and we reconstructed the precise tempo of mobile genetic elements (MGEs) acquisition events during this period. The resistance profile provided by the acquired MGEs closely mirrored the antibiotics used to treat the outbreak, and we evidenced that two subclonal groups developed similar resistances although unrelated MGE acquisitions. Finally, we also demonstrated the efficiency of vaccination in outbreak management and its positive effect on antibiotic resistance prevalence. Our study provides unprecedented knowledge critical to understand evolutionary trajectories of resistant pathogens outside the laboratory. © 2019 Society for Applied Microbiology and John Wiley & Sons Ltd.


April 21, 2020

Single-molecule real-time sequencing reveals diverse allelic variations in carotenoid biosynthetic genes in pepper (Capsicum spp.).

The diverse colours of mature pepper (Capsicum spp.) fruit result from the accumulation of different carotenoids. The carotenoid biosynthetic pathway has been well elucidated in Solanaceous plants, and analysis of candidate genes involved in this process has revealed variations in carotenoid biosynthetic genes in Capsicum spp. However, the allelic variations revealed by previous studies could not fully explain the variation in fruit colour in Capsicum spp. due to technical difficulties in detecting allelic variation in multiple candidate genes in numerous samples. In this study, we uncovered allelic variations in six carotenoid biosynthetic genes, including phytoene synthase (PSY1, PSY2), lycopene ß-cyclase, ß-carotene hydroxylase, zeaxanthin epoxidase and capsanthin-capsorubin synthase (CCS) genes, in 94 pepper accessions by single-molecule real-time (SMRT) sequencing. To investigate the relationship between allelic variations in the candidate genes and differences in fruit colour, we performed ultra-performance liquid chromatography analysis using 43 accessions representing each allelic variation. Different combinations of dysfunctional mutations in PSY1 and CCS could explain variation in the compositions and levels of carotenoids in the accessions examined in this study. Our results demonstrate that SMRT sequencing technology can be used to rapidly identify allelic variation in target genes in various germplasms. The newly identified allelic variants will be useful for pepper breeding and for further analysis of carotenoid biosynthesis pathways. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Paraburkholderia dokdonella sp. nov., isolated from a plant from the genus Campanula.

The novel Gram-stain-negative, rod-shaped, aerobic bacterial strain DCR-13T was isolated from a native plant belonging to the genus Campanula on Dokdo, an island in the Republic of Korea. Comparative analysis of the 16S rRNA gene sequence indicated that this strain is closely related to Paraburkholderia peleae PP52-1T (98.43% 16S rRNA gene sequence similarity), Paraburkholderia oxyphila NBRC 105797T (98.42%), Paraburkholderia sacchari IPT 101T (98.28%), Paraburkholderia mimosarum NBRC 106338T (97.80%), Paraburkholderia denitrificans KIS30-44T (97.46%), and Paraburkholderia paradise WAT (97.45%). This analysis of the 16S rRNA gene sequence also suggested that DCR-13T and the six closely related strains formed a clade within the genus Paraburkholderia, but that DCR-13T was clearly separated from the established species. DCR-13T had ubiquinone 8 as its predominant respiratory quinone, and its genomic DNA G + C content was 63.9 mol%. The isolated strain grew at a pH of 6.0-8.0 (with an optimal pH of 6.5), 0-4% w/v NaCl (with an optimal level of 0%), and a temperature of 18-42°C (with an optimal temperature of 30°C). The predominant fatty acids were C16:0, summed feature 8 (C18:1?7c/C18:1?6c), C17:0 cyclo, C19:0 cyclo ?8c, summed feature 3 (C16:1?6c/C16:1?7c) and summed feature 2 (C12:0 aldehyde), and the major polar lipids were phosphatidylglycerol and phosphatidylethanolamine. On the basis of polyphasic evidence, it is proposed that strain DCR-13T (= KCTC 62811T = LMG 30889T) represents the type strain of a novel species, Paraburkholderia dokdonella sp. nov.


April 21, 2020

Isolation, cloning and characterization of an azoreductase and the effect of salinity on its expression in a halophilic bacterium.

Understanding the molecular mechanisms of azo dye decolorization is important for the development of effective bioremediation for textile-colored wastewater. A halophilic bacterium Halomonas sp. strain GT was isolated, which could degrade the azo dye Acid Brilliant Scarlet GR at 10% NaCl. The complete genome sequence of this strain was obtained using the PacBio RS II platform. Genome annotation revealed that four proteins are related to decolorization of azo dyes, such as azoreductase, laccases, benzene 1,2-dioxygenase, and catechol 1,2-dioxygenase. The putative azoreductase gene of Halomonas sp. strain GT responsible for the decolorization of azo dye in high salt environment was isolated. Phylogenetic tree analysis showed that the azoG (azoreductase gene of Halomonas sp. strain GT) and its homologs constituted a new branch of the NADH depending azoreductases, with all the homologous sequence of the protein from halophilic bacteria. At high NaCl concentrations, azoreductase gene expression and azoreductase activity were restrained in Halomonas sp. strain GT, which resulted in low a decolorization rate. Copyright © 2018. Published by Elsevier B.V.


April 21, 2020

Chromosome-level genome assembly of Triplophysa tibetana, a fish adapted to the harsh high-altitude environment of the Tibetan Plateau.

Triplophysa is an endemic fish genus of the Tibetan Plateau in China. Triplophysa tibetana, which lives at a recorded altitude of ~4,000 m and plays an important role in the highland aquatic ecosystem, serves as an excellent model for investigating high-altitude environmental adaptation. However, evolutionary and conservation studies of T. tibetana have been limited by scarce genomic resources for the genus Triplophysa. In the present study, we applied PacBio sequencing and the Hi-C technique to assemble the T. tibetana genome. A 652-Mb genome with 1,325 contigs with an N50 length of 3.1 Mb was obtained. The 1,137 contigs were further assembled into 25 chromosomes, representing 98.7% and 80.47% of all contigs at the base and sequence number level, respectively. Approximately 260 Mb of sequence, accounting for ~39.8% of the genome, was identified as repetitive elements. DNA transposons (16.3%), long interspersed nuclear elements (12.4%) and long terminal repeats (11.0%) were the most repetitive types. In total, 24,372 protein-coding genes were predicted in the genome, and ~95% of the genes were functionally annotated via a search in public databases. Using whole genome sequence information, we found that T. tibetana diverged from its common ancestor with Danio rerio ~121.4 million years ago. The high-quality genome assembled in this work not only provides a valuable genomic resource for future population and conservation studies of T. tibetana, but it also lays a solid foundation for further investigation into the mechanisms of environmental adaptation of endemic fishes in the Tibetan Plateau. © 2019 John Wiley & Sons Ltd.


April 21, 2020

Transcriptome analysis reveals multiple signal network contributing to the Verticillium wilt resistance in eggplant

Verticillium wilt is a devastating disease in eggplants. In order to understand the molecular mechanism of disease resistance in eggplants, transcriptomes of Verticillium wilt infected eggplants were detected. A total of 480, 518, 887 and 1 046 Verticillium wilt related differentially expressed genes were identified at 6 (V6), 12 (V12), 24 (V24) and 48?h (V48), respectively. COG function classification revealed that most of DEGs functioned in “Amino acid transport and metabolism”, “Cytoskeleton” and “Cell motility”. In addition, compared the control plants (V0) to infected eggplants (V6-V48), a total of 111 common DEGs were identified. Except for “General function prediction only”, most of the DEGs enriched in “Signal transduction”. DEGs associated to different hormone signals, including GID1B, ROPGAP1, OPT3 and CDPK, were identified throughout the whole infection process. Cross-talk among defense signal pathways plays major roles in the Verticillium wilt disease resistance in eggplants.


April 21, 2020

The complete genome sequence of the denitrifying bacterium Marinobacter sp. Arc7-DN-1 isolated from Arctic Ocean sediment

The general features and genome characteristics of the denitrifying bacterium Marinobacter sp. Arc7-DN-1, isolated from Arctic Ocean sediment, are described. Marinobacter sp. Arc7-DN-1 uses NO3- or NH4+ as the sole nitrogen source to grow at low temperatures. The strain can grow at a wide range of temperatures (0–30?°C) and NaCl concentration (15–90‰). The genome has one circular chromosome of 4,300,456?bp (57.64?mol%?G?+?C content), consisting of 4012 coding genes, including 50 tRNAs and three rRNA operons as 16S-23S-5S rRNA. On the basis of the KEGG analysis, strain Arc7-DN-1 encodes 43 proteins related to nitrogen metabolism, including a complete denitrifying pathway and an assimilatory nitrate reduction pathway.


April 21, 2020

Improvement of the Pacific bluefin tuna (Thunnus orientalis) reference genome and development of male-specific DNA markers.

The Pacific bluefin tuna, Thunnus orientalis, is a highly migratory species that is widely distributed in the North Pacific Ocean. Like other marine species, T. orientalis has no external sexual dimorphism; thus, identifying sex-specific variants from whole genome sequence data is a useful approach to develop an effective sex identification method. Here, we report an improved draft genome of T. orientalis and male-specific DNA markers. Combining PacBio long reads and Illumina short reads sufficiently improved genome assembly, with a 38-fold increase in scaffold contiguity (to 444 scaffolds) compared to the first published draft genome. Through analysing re-sequence data of 15 males and 16 females, 250 male-specific SNPs were identified from more than 30 million polymorphisms. All male-specific variants were male-heterozygous, suggesting that T. orientalis has a male heterogametic sex-determination system. The largest linkage disequilibrium block (3,174?bp on scaffold_064) contained 51 male-specific variants. PCR primers and a PCR-based sex identification assay were developed using these male-specific variants. The sex of 115 individuals (56 males and 59 females; sex was diagnosed by visual examination of the gonads) was identified with high accuracy using the assay. This easy, accurate, and practical technique facilitates the control of sex ratios in tuna farms. Furthermore, this method could be used to estimate the sex ratio and/or the sex-specific growth rate of natural populations.


April 21, 2020

Whole Genome Sequencing of the Mutamouse Model Reveals Strain- and Colony-Level Variation, and Genomic Features of the Transgene Integration Site.

The MutaMouse transgenic rodent model is widely used for assessing in vivo mutagenicity. Here, we report the characterization of MutaMouse’s whole genome sequence and its genetic variants compared to the C57BL/6 reference genome. High coverage (>50X) next-generation sequencing (NGS) of whole genomes from multiple MutaMouse animals from the Health Canada (HC) colony showed ~5 million SNVs per genome, ~20% of which are putatively novel. Sequencing of two animals from a geographically separated colony at Covance indicated that, over the course of 23 years, each colony accumulated 47,847 (HC) and 17,677 (Covance) non-parental homozygous single nucleotide variants. We found no novel nonsense or missense mutations that impair the MutaMouse response to genotoxic agents. Pairing sequencing data with array comparative genomic hybridization (aCGH) improved the accuracy and resolution of copy number variants (CNVs) calls and identified 300 genomic regions with CNVs. We also used long-read sequence technology (PacBio) to show that the transgene integration site involved a large deletion event with multiple inversions and rearrangements near a retrotransposon. The MutaMouse genome gives important genetic context to studies using this model, offers insight on the mechanisms of structural variant formation, and contributes a framework to analyze aCGH results alongside NGS data.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.