Menu
July 7, 2019  |  

ThermoAlign: a genome-aware primer design tool for tiled amplicon resequencing.

Isolating and sequencing specific regions in a genome is a cornerstone of molecular biology. This has been facilitated by computationally encoding the thermodynamics of DNA hybridization for automated design of hybridization and priming oligonucleotides. However, the repetitive composition of genomes challenges the identification of target-specific oligonucleotides, which limits genetics and genomics research on many species. Here, a tool called ThermoAlign was developed that ensures the design of target-specific primer pairs for DNA amplification. This is achieved by evaluating the thermodynamics of hybridization for full-length oligonucleotide-template alignments – thermoalignments – across the genome to identify primers predicted to bind specifically to the target site. For amplification-based resequencing of regions that cannot be amplified by a single primer pair, a directed graph analysis method is used to identify minimum amplicon tiling paths. Laboratory validation by standard and long-range polymerase chain reaction and amplicon resequencing with maize, one of the most repetitive genomes sequenced to date (˜85% repeat content), demonstrated the specificity-by-design functionality of ThermoAlign. ThermoAlign is released under an open source license and bundled in a dependency-free container for wide distribution. It is anticipated that this tool will facilitate multiple applications in genetics and genomics and be useful in the workflow of high-throughput targeted resequencing studies.


July 7, 2019  |  

Genomic innovation for crop improvement.

Crop production needs to increase to secure future food supplies, while reducing its impact on ecosystems. Detailed characterization of plant genomes and genetic diversity is crucial for meeting these challenges. Advances in genome sequencing and assembly are being used to access the large and complex genomes of crops and their wild relatives. These have helped to identify a wide spectrum of genetic variation and permitted the association of genetic diversity with diverse agronomic phenotypes. In combination with improved and automated phenotyping assays and functional genomic studies, genomics is providing new foundations for crop-breeding systems.


July 7, 2019  |  

Hybrid assembly of the large and highly repetitive genome of Aegilops tauschii, a progenitor of bread wheat, with the MaSuRCA mega-reads algorithm.

Long sequencing reads generated by single-molecule sequencing technology offer the possibility of dramatically improving the contiguity of genome assemblies. The biggest challenge today is that long reads have relatively high error rates, currently around 15%. The high error rates make it difficult to use this data alone, particularly with highly repetitive plant genomes. Errors in the raw data can lead to insertion or deletion errors (indels) in the consensus genome sequence, which in turn create significant problems for downstream analysis; for example, a single indel may shift the reading frame and incorrectly truncate a protein sequence. Here, we describe an algorithm that solves the high error rate problem by combining long, high-error reads with shorter but much more accurate Illumina sequencing reads, whose error rates average <1%. Our hybrid assembly algorithm combines these two types of reads to construct mega-reads, which are both long and accurate, and then assembles the mega-reads using the CABOG assembler, which was designed for long reads. We apply this technique to a large data set of Illumina and PacBio sequences from the species Aegilops tauschii, a large and extremely repetitive plant genome that has resisted previous attempts at assembly. We show that the resulting assembled contigs are far larger than in any previous assembly, with an N50 contig size of 486,807 nucleotides. We compare the contigs to independently produced optical maps to evaluate their large-scale accuracy, and to a set of high-quality bacterial artificial chromosome (BAC)-based assemblies to evaluate base-level accuracy. © 2017 Zimin et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019  |  

Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism.

Plasmopara viticola causes downy mildew disease of grapevine which is one of the most devastating diseases of viticulture worldwide. Here we report a 101.3?Mb whole genome sequence of P. viticola isolate ‘JL-7-2’ obtained by a combination of Illumina and PacBio sequencing technologies. The P. viticola genome contains 17,014 putative protein-coding genes and has ~26% repetitive sequences. A total of 1,301 putative secreted proteins, including 100 putative RXLR effectors and 90 CRN effectors were identified in this genome. In the secretome, 261 potential pathogenicity genes and 95 carbohydrate-active enzymes were predicted. Transcriptional analysis revealed that most of the RXLR effectors, pathogenicity genes and carbohydrate-active enzymes were significantly up-regulated during infection. Comparative genomic analysis revealed that P. viticola evolved independently from the Arabidopsis downy mildew pathogen Hyaloperonospora arabidopsidis. The availability of the P. viticola genome provides a valuable resource not only for comparative genomic analysis and evolutionary studies among oomycetes, but also enhance our knowledge on the mechanism of interactions between this biotrophic pathogen and its host.


July 7, 2019  |  

Terpene synthases from Cannabis sativa.

Cannabis (Cannabis sativa) plants produce and accumulate a terpene-rich resin in glandular trichomes, which are abundant on the surface of the female inflorescence. Bouquets of different monoterpenes and sesquiterpenes are important components of cannabis resin as they define some of the unique organoleptic properties and may also influence medicinal qualities of different cannabis strains and varieties. Transcriptome analysis of trichomes of the cannabis hemp variety ‘Finola’ revealed sequences of all stages of terpene biosynthesis. Nine cannabis terpene synthases (CsTPS) were identified in subfamilies TPS-a and TPS-b. Functional characterization identified mono- and sesqui-TPS, whose products collectively comprise most of the terpenes of ‘Finola’ resin, including major compounds such as ß-myrcene, (E)-ß-ocimene, (-)-limonene, (+)-a-pinene, ß-caryophyllene, and a-humulene. Transcripts associated with terpene biosynthesis are highly expressed in trichomes compared to non-resin producing tissues. Knowledge of the CsTPS gene family may offer opportunities for selection and improvement of terpene profiles of interest in different cannabis strains and varieties.


July 7, 2019  |  

Genetic and genomic tools for Cannabis sativa

The Cannabis industry is currently one of the fastest growing industries in the United States. Given the changing legal status of the plant, and the rapidly advancing research, updated information on the advancement of Cannabis genomics is needed. This versatile plant is used as medicine and for food, fiber, and bioremediation. Insights from modern, high-throughput genomic technology are revolutionizing our understanding of the plant and are providing new tools to further improve our knowledge and utilization of this unique species. This review quantifies and evaluates the currently available genomic resources for Cannabis research, including six whole-genome assemblies, two transcriptomes, and 393 other substantial genomic resources, as well as other smaller publicly available genetic and genomic resources. The open-source approaches followed by many leading scientists in the field promote collaboration and facilitate these rapid advances.


July 7, 2019  |  

An improved assembly of the loblolly pine mega-genome using long-read single-molecule sequencing.

The 22-gigabase genome of loblolly pine (Pinus taeda) is one of the largest ever sequenced. The draft assembly published in 2014 was built entirely from short Illumina reads, with lengths ranging from 100 to 250 base pairs (bp). The assembly was quite fragmented, containing over 11 million contigs whose weighted average (N50) size was 8206 bp. To improve this result, we generated approximately 12-fold coverage in long reads using the Single Molecule Real Time sequencing technology developed at Pacific Biosciences. We assembled the long and short reads together using the MaSuRCA mega-reads assembly algorithm, which produced a substantially better assembly, P. taeda version 2.0. The new assembly has an N50 contig size of 25?361, more than three times as large as achieved in the original assembly, and an N50 scaffold size of 107?821, 61% larger than the previous assembly. © The Author 2017. Published by Oxford University Press.


July 7, 2019  |  

The two-component monooxygenase MeaXY initiates the downstream pathway of chloroacetanilide herbicide catabolism in sphingomonads.

Due to the extensive use of chloroacetanilide herbicides over the past 60 years, bacteria have evolved catabolic pathways to mineralize these compounds. In the upstream catabolic pathway, chloroacetanilide herbicides are transformed into the two common metabolites 2-methyl-6-ethylaniline (MEA) and 2,6-diethylaniline (DEA) through N-dealkylation and amide hydrolysis. The pathway downstream of MEA is initiated by the hydroxylation of aromatic rings, followed by its conversion to a substrate for ring cleavage after several steps. Most of the key genes in the pathway have been identified. However, the genes involved in the initial hydroxylation step of MEA are still unknown. As a special aniline derivative, MEA cannot be transformed by the aniline dioxygenases that have been characterized. Sphingobium baderi DE-13 can completely degrade MEA and use it as a sole carbon source for growth. In this work, an MEA degradation-deficient mutant of S. baderi DE-13 was isolated. MEA catabolism genes were predicted through comparative genomic analysis. The results of genetic complementation and heterologous expression demonstrated that the products of meaX and meaY are responsible for the initial step of MEA degradation in S. baderi DE-13. MeaXY is a two-component flavoprotein monooxygenase system that catalyzes the hydroxylation of MEA and DEA using NADH and flavin mononucleotide (FMN) as cofactors. Nuclear magnetic resonance (NMR) analysis confirmed that MeaXY hydroxylates MEA and DEA at the para-position. Transcription of meaX was enhanced remarkably upon induction of MEA or DEA in S. baderi DE-13. Additionally, meaX and meaY were highly conserved among other MEA-degrading sphingomonads. This study fills a gap in our knowledge of the biochemical pathway that carries out mineralization of chloroacetanilide herbicides in sphingomonads. IMPORTANCE Much attention has been paid to the environmental fate of chloroacetanilide herbicides used for the past 60 years. Microbial degradation is considered an important mechanism in the degradation of these compounds. Bacterial degradation of chloroacetanilide herbicides has been investigated in many recent studies. Pure cultures or consortia able to mineralize these herbicides have been obtained. The catabolic pathway has been proposed, and most key genes involved have been identified. However, the genes responsible for the initiation step (from MEA to hydroxylated MEA or from DEA to hydroxylated DEA) of the downstream pathway have not been reported. The present study demonstrates that a two-component flavin-dependent monooxygenase system, MeaXY, catalyzes the para-hydroxylation of MEA or DEA in sphingomonads. Therefore, this work finds a missing link in the biochemical pathway that carries out the mineralization of chloroacetanilide herbicides in sphingomonads. Additionally, the results expand our understanding of the degradation of a special kind of aniline derivative. Copyright © 2017 American Society for Microbiology.


July 7, 2019  |  

Draft genome sequence of the rhizobacterium Pseudomonas chlororaphis PCL1601, displaying biocontrol against soilborne phytopathogens.

In this study, we present the draft genome sequence of the bacterial strain Pseudomonas chlororaphis PCL1601. This bacterium was isolated from the rhizosphere of healthy avocado trees and displayed antagonistic and biological control activities against different soilborne phytopathogenic fungi and oomycetes. Copyright © 2017 Vida et al.


July 7, 2019  |  

Complete genome sequence of Spiroplasma citri strain R8-A2T, causal agent of stubborn disease in Citrus species.

Spiroplasma citri causes stubborn disease in Citrus spp. and diseases in other plants. Here, we report the nucleotide sequence of the 1,599,709-bp circular chromosome and two plasmids of S. citri strain R8-A2(T) This information will facilitate analyses to understand spiroplasmal pathogenicity and evolutionary adaptations to lifestyles in plants and arthropod hosts. Copyright © 2017 Davis et al.


July 7, 2019  |  

Complete genome sequence of Kosakonia oryzae type strain Ola 51(T).

Strain Ola 51(T) (=LMG 24251(T)?=?CGMCC 1.7012(T)) is the type strain of the species Kosakonia oryzae and was isolated from surface-sterilized roots of the wild rice species Oryza latifolia grown in Guangdong, China. Here we summarize the features of the strain Ola 51(T) and describe its complete genome sequence. The genome contains one circular chromosome of 5,303,342 nucleotides with 54.01% GC content, 4773 protein-coding genes, 16 rRNA genes, 76 tRNA genes, 13 ncRNA genes, 48 pseudo genes, and 1 CRISPR array.


July 7, 2019  |  

The complete chloroplast genome sequence of tung tree (Vernicia fordii): Organization and phylogenetic relationships with other angiosperms.

Tung tree (Vernicia fordii) is an economically important tree widely cultivated for industrial oil production in China. To better understand the molecular basis of tung tree chloroplasts, we sequenced and characterized its genome using PacBio RS II sequencing platforms. The chloroplast genome was sequenced with 161,528?bp in length, composed with one pair of inverted repeats (IRs) of 26,819?bp, which were separated by one small single copy (SSC; 18,758?bp) and one large single copy (LSC; 89,132?bp). The genome contains 114 genes, coding for 81 protein, four ribosomal RNAs and 29 transfer RNAs. An expansion with integration of an additional rps19 gene in the IR regions was identified. Compared to the chloroplast genome of Jatropha curcas, a species from the same family, the tung tree chloroplast genome is distinct with 85 single nucleotide polymorphisms (SNPs) and 82 indels. Phylogenetic analysis suggests that V. fordii is a sister species with J. curcas within the Eurosids I. The nucleotide sequence provides vital molecular information for understanding the biology of this important oil tree.


July 7, 2019  |  

SMRT sequencing data for Garcinia mangostana L. variety Mesta.

The “Queen of Fruits” mangosteen (Garcinia mangostana L.) produces commercially important fruits with desirable taste of flesh and pericarp rich in xanthones with medicinal properties. To date, only limited knowledge is available on the cytogenetics and genome sequences of a common variety of mangosteen (Abu Bakar et al., 2016 [1]). Here, we report the first single-molecule real-time (SMRT) sequencing data from whole genome sequencing of mangosteen of Mesta variety. Raw reads of the SMRT sequencing project can be obtained from SRA database with the accession numbers SRX2718652 until SRX2718659.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.