Menu
July 7, 2019  |  

Tigmint: correcting assembly errors using linked reads from large molecules.

Genome sequencing yields the sequence of many short snippets of DNA (reads) from a genome. Genome assembly attempts to reconstruct the original genome from which these reads were derived. This task is difficult due to gaps and errors in the sequencing data, repetitive sequence in the underlying genome, and heterozygosity. As a result, assembly errors are common. In the absence of a reference genome, these misassemblies may be identified by comparing the sequencing data to the assembly and looking for discrepancies between the two. Once identified, these misassemblies may be corrected, improving the quality of the assembled sequence. Although tools exist to identify and correct misassemblies using Illumina paired-end and mate-pair sequencing, no such tool yet exists that makes use of the long distance information of the large molecules provided by linked reads, such as those offered by the 10x Genomics Chromium platform. We have developed the tool Tigmint to address this gap.To demonstrate the effectiveness of Tigmint, we applied it to assemblies of a human genome using short reads assembled with ABySS 2.0 and other assemblers. Tigmint reduced the number of misassemblies identified by QUAST in the ABySS assembly by 216 (27%). While scaffolding with ARCS alone more than doubled the scaffold NGA50 of the assembly from 3 to 8 Mbp, the combination of Tigmint and ARCS improved the scaffold NGA50 of the assembly over five-fold to 16.4 Mbp. This notable improvement in contiguity highlights the utility of assembly correction in refining assemblies. We demonstrate the utility of Tigmint in correcting the assemblies of multiple tools, as well as in using Chromium reads to correct and scaffold assemblies of long single-molecule sequencing.Scaffolding an assembly that has been corrected with Tigmint yields a final assembly that is both more correct and substantially more contiguous than an assembly that has not been corrected. Using single-molecule sequencing in combination with linked reads enables a genome sequence assembly that achieves both a high sequence contiguity as well as high scaffold contiguity, a feat not currently achievable with either technology alone.


July 7, 2019  |  

The sequenced angiosperm genomes and genome databases.

Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.


July 7, 2019  |  

Draft genome sequence of Streptomyces sp. strain DH-12, a soilborneisolate from the Thar Desert with broad-spectrum antibacterial activity.

Strain DH-12 exhibits broad-spectrum antibacterial activity toward Gram-positive and Gram-negative pathogens. The 7.6-Mb draft genome sequence gives insight into the complete secondary metabolite production capacity and reveals genes putatively responsible for its antibacterial activity, as well as genes which enable the survival of the organism in an extreme arid environment. Copyright © 2018 Jiao et al.


July 7, 2019  |  

Genome sequence of Pseudomonas chlororaphis Lzh-T5, a plant growth-promoting rhizobacterium with antimicrobial activity.

Pseudomonas chlororaphis Lzh-T5 is a plant growth-promoting rhizobacterium (PGPR) with antimicrobial activity isolated from tomato rhizosphere in the city of Dezhou, Shandong Province, China. Here, the draft genome sequence of P. chlororaphis Lzh-T5 is reported, and several functional genes related to antifungal antibiotics and siderophore biosynthesis have been found in the genome. Copyright © 2018 Li et al.


July 7, 2019  |  

Identification of Pseudomonas mosselii BS011 gene clusters required for suppression of Rice Blast Fungus Magnaporthe oryzae.

Pseudomonas is a Gram-negative, rod-shaped bacteria. Many members of this genus displayed remarkable physiological and metabolic activity against different plant pathogens. However, Pseudomonas mosselii has not yet been characterized in biocontrol against plant disease. Here we isolated a strain of P. mosselii BS011 from the rhizosphere soil of rice plants, and the isolate showed strong inhibitory activity against the rice blast fungus Magnaporthe oryzae. Further we sequenced the complete genome of BS011, which consist of 5.75?Mb with a circular chromosome, 5,170 protein-coding genes, 23 rRNA and 78 tRNA operons. Bioinformatic analysis revealed that seven gene clusters may be involved in the biosynthesis of metabolites. Gene deletion experiments demonstrated that the gene cluster c-xtl is required for inhibitory activity against M. oryzae. Bioassay showed that the crude extract from BS011 fermentation sample significantly inhibited the development of M. oryzae at a concentration of 10?µg/ml. Besides, we illustrated that the crude extract of BS011 impaired the appressorial formation in a dose dependent manner. Collectively our results revealed that P. mosselii BS011 is a promising biocontrol agent and the gene cluster c-xtl is essential for inhibiting the development of M. oryzae. Copyright © 2018. Published by Elsevier B.V.


July 7, 2019  |  

Probiotic genomes: Sequencing and annotation in the past decade

Probiotics are live microorganisms that confer many health benefits to the host when administered in adequate quantities. These health benefits have garnered much attention towards Probiotics and have given an impetus to their use as dietary supplements for the improvement of general health and as adjuvant therapies for certain diseases. The increased demand for probiotic products in the recent times has provided the thrust for probiotic research applied to several areas of human biology. The advances in genomic technologies have further facilitated the sequencing of the genomes of such probiotic bacteria and their genomic analyses to identify the genes that endow the beneficial effects they are known to exert. This work reviews the application of genomic strategies on probiotic bacteria, while providing the details about the probiotic strains whose genome sequences are available. It also consolidates the Genomic tools used for the sequencing, assembly and annotation of the probiotic genes and how it has helped in comparative genomic analyses.


July 7, 2019  |  

Optimise wheat A-genome.

The wild einkorn wheat Triticum urartu (Tu) is the A-genome progenitor of tetraploid (AABB) and hexaploid (AABBDD) wheat. A draft genome of Tu was published in 2013, but a better reference sequence is urgently needed by scientists and breeders. Hong-Qing Ling, from the Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and colleagues have now completed a high-quality Tu genome using multiple methods.


July 7, 2019  |  

The challenge of analyzing the sugarcane genome.

Reference genome sequences have become key platforms for genetics and breeding of the major crop species. Sugarcane is probably the largest crop produced in the world (in weight of crop harvested) but lacks a reference genome sequence. Sugarcane has one of the most complex genomes in crop plants due to the extreme level of polyploidy. The genome of modern sugarcane hybrids includes sub-genomes from two progenitors Saccharum officinarum and S. spontaneum with some chromosomes resulting from recombination between these sub-genomes. Advancing DNA sequencing technologies and strategies for genome assembly are making the sugarcane genome more tractable. Advances in long read sequencing have allowed the generation of a more complete set of sugarcane gene transcripts. This is supporting transcript profiling in genetic research. The progenitor genomes are being sequenced. A monoploid coverage of the hybrid genome has been obtained by sequencing BAC clones that cover the gene space of the closely related sorghum genome. The complete polyploid genome is now being sequenced and assembled. The emerging genome will allow comparison of related genomes and increase understanding of the functioning of this polyploidy system. Sugarcane breeding for traditional sugar and new energy and biomaterial uses will be enhanced by the availability of these genomic resources.


July 7, 2019  |  

Improved draft genome sequence of a monoteliosporic culture of the karnal bunt (Tilletia indica) pathogen of wheat.

Karnal bunt of wheat is an internationally quarantined fungal pathogen disease caused by Tilletia indica and affects the international commercial seed trade of wheat. We announce here the first improved draft genome assembly of a monoteliosporic culture of the Tilletia indica fungus, consisting of 787 scaffolds with an approximate total genome size of 31.83 Mbp, which is more accurate and near to complete than the previous version. Copyright © 2018 Kumar et al.


July 7, 2019  |  

Low-level antimicrobials in the medicinal leech select for resistant pathogens that spread to patients.

Fluoroquinolones (FQs) and ciprofloxacin (Cp) are important antimicrobials that pollute the environment in trace amounts. Although Cp has been recommended as prophylaxis for patients undergoing leech therapy to prevent infections by the leech gut symbiont Aeromonas, a puzzling rise in Cp-resistant (Cpr) Aeromonas infections has been reported. We report on the effects of subtherapeutic FQ concentrations on bacteria in an environmental reservoir, the medicinal leech, and describe the presence of multiple antibiotic resistance mutations and a gain-of-function resistance gene. We link the rise of CprAeromonas isolates to exposure of the leech microbiota to very low levels of Cp (0.01 to 0.04 µg/ml), <1/100 of the clinical resistance breakpoint for Aeromonas Using competition experiments and comparative genomics of 37 strains, we determined the mechanisms of resistance in clinical and leech-derived Aeromonas isolates, traced their origin, and determined that the presence of merely 0.01 µg/ml Cp provides a strong competitive advantage for Cpr strains. Deep-sequencing the Cpr-conferring region of gyrA enabled tracing of the mutation-harboring Aeromonas population in archived gut samples, and an increase in the frequency of the Cpr-conferring mutation in 2011 coincides with the initial reports of CprAeromonas infections in patients receiving leech therapy.IMPORTANCE The role of subtherapeutic antimicrobial contamination in selecting for resistant strains has received increasing attention and is an important clinical matter. This study describes the relationship of resistant bacteria from the medicinal leech, Hirudo verbana, with patient infections following leech therapy. While our results highlight the need for alternative antibiotic therapies, the rise of Cpr bacteria demonstrates the importance of restricting the exposure of animals to antibiotics approved for veterinary use. The shift to a more resistant community and the dispersion of Cpr-conferring mechanisms via mobile elements occurred in a natural setting due to the presence of very low levels of fluoroquinolones, revealing the challenges of controlling the spread of antibiotic-resistant bacteria and highlighting the importance of a holistic approach in the management of antibiotic use. Copyright © 2018 Beka et al.


July 7, 2019  |  

TriPoly: haplotype estimation for polyploids using sequencing data of related individuals.

Knowledge of haplotypes, i.e. phased and ordered marker alleles on a chromosome, is essential to answer many questions in genetics and genomics. By generating short pieces of DNA sequence, high-throughput modern sequencing technologies make estimation of haplotypes possible for single individuals. In polyploids, however, haplotype estimation methods usually require deep coverage to achieve sufficient accuracy. This often renders sequencing-based approaches too costly to be applied to large populations needed in studies of Quantitative Trait Loci.We propose a novel haplotype estimation method for polyploids, TriPoly, that combines sequencing data with Mendelian inheritance rules to infer haplotypes in parent-offspring trios. Using realistic simulations of both short and long-read sequencing data for banana (Musa acuminata) and potato (Solanum tuberosum) trios, we show that TriPoly yields more accurate progeny haplotypes at low coverages compared to existing methods that work on single individuals. We also apply TriPoly to phase Single Nucleotide Polymorphisms on chromosome 5 for a family of tetraploid potato with 2 parents and 37 offspring sequenced with an RNA capture approach. We show that TriPoly haplotype estimates differ from those of the other methods mainly in regions with imperfect sequencing or mapping difficulties, as it does not rely solely on sequence reads and aims to avoid phasings that are not likely to have been passed from the parents to the offspring.TriPoly has been implemented in Python 3.5.2 (also compatible with Python 2.7.3 and higher) and can be freely downloaded at https://github.com/EhsanMotazedi/TriPoly.Supplementary data are available at Bioinformatics online.


July 7, 2019  |  

Genomes and transcriptomes of duckweeds.

Duckweeds (Lemnaceae family) are the smallest flowering plants that adapt to the aquatic environment. They are regarded as the promising sustainable feedstock with the characteristics of high starch storage, fast propagation, and global distribution. The duckweed genome size varies 13-fold ranging from 150 Mb in Spirodela polyrhiza to 1,881 Mb in Wolffia arrhiza. With the development of sequencing technology and bioinformatics, five duckweed genomes from Spirodela and Lemna genera are sequenced and assembled. The genome annotations discover that they share similar protein orthologs, whereas the repeat contents could mainly explain the genome size difference. The gene families responsible for cell growth and expansion, lignin biosynthesis, and flowering are greatly contracted. However, the gene family of glutamate synthase has experienced expansion, indicating their significance in ammonia assimilation and nitrogen transport. The transcriptome is comprehensively sequenced for the genera of Spirodela, Landoltia, and Lemna, including various treatments such as abscisic acid, radiation, heavy metal, and starvation. The analysis of the underlying molecular mechanism and the regulatory network would accelerate their applications in the fields of bioenergy and phytoremediation. The comparative genomics has shown that duckweed genomes contain relatively low gene numbers and more contracted gene families, which may be in parallel with their highly reduced morphology with a simple leaf and primary roots. Still, we are waiting for the advancement of the long read sequencing technology to resolve the complex genomes and transcriptomes for unsequenced Wolffiella and Wolffia due to the large genome sizes and the similarity in their polyploidy.


July 7, 2019  |  

The complete genome sequence of Bacillus halotolerans ZB201702 isolated from a drought- and salt-stressed rhizosphere soil.

Bacillus halotolerans is a rhizobacterium with the potential to promote plant growth and tolerance to drought and salinity stress. Here, we present the complete genome sequence of B. halotolerans ZB201702, which consists of 4,150,000 bp in a linear chromosome, including 3074 protein-coding sequences, 30 rRNAs, and 85 tRNAs. Genome analysis revealed many putative gene clusters involved in defense mechanisms. Activity analysis of the strain under salt and simulated drought stress suggests tolerance to abiotic stresses. The complete genome information of B. halotolerans ZB201702 could provide valuable insights into rhizobacteria-mediated plant salt and drought tolerance and rhizobacteria-based solutions for abiotic stress agriculture. Copyright © 2018 Elsevier Ltd. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.