Menu
July 7, 2019

De novo genome assembly of the economically important weed horseweed using integrated data from multiple sequencing platforms.

Horseweed (Conyza canadensis), a member of the Compositae (Asteraceae) family, was the first broadleaf weed to evolve resistance to glyphosate. Horseweed, one of the most problematic weeds in the world, is a true diploid (2n = 2x = 18), with the smallest genome of any known agricultural weed (335 Mb). Thus, it is an appropriate candidate to help us understand the genetic and genomic bases of weediness. We undertook a draft de novo genome assembly of horseweed by combining data from multiple sequencing platforms (454 GS-FLX, Illumina HiSeq 2000, and PacBio RS) using various libraries with different insertion sizes (approximately 350 bp, 600 bp, 3 kb, and 10 kb) of a Tennessee-accessed, glyphosate-resistant horseweed biotype. From 116.3 Gb (approximately 350× coverage) of data, the genome was assembled into 13,966 scaffolds with 50% of the assembly = 33,561 bp. The assembly covered 92.3% of the genome, including the complete chloroplast genome (approximately 153 kb) and a nearly complete mitochondrial genome (approximately 450 kb in 120 scaffolds). The nuclear genome is composed of 44,592 protein-coding genes. Genome resequencing of seven additional horseweed biotypes was performed. These sequence data were assembled and used to analyze genome variation. Simple sequence repeat and single-nucleotide polymorphisms were surveyed. Genomic patterns were detected that associated with glyphosate-resistant or -susceptible biotypes. The draft genome will be useful to better understand weediness and the evolution of herbicide resistance and to devise new management strategies. The genome will also be useful as another reference genome in the Compositae. To our knowledge, this article represents the first published draft genome of an agricultural weed.© 2014 American Society of Plant Biologists. All Rights Reserved.


July 7, 2019

Complete genome sequence of the cyanide-degrading bacterium Pseudomonas pseudoalcaligenes CECT5344.

Pseudomonas pseudoalcaligenes CECT5344, a Gram-negative bacterium isolated from the Guadalquir River (Córdoba, Spain), is able to utilize different cyano-derivatives. Here, the complete genome sequence of P. pseudoalcaligenes CECT5344 harboring a 4,686,340bp circular chromosome encoding 4513 genes and featuring a GC-content of 62.34% is reported. Necessarily, remaining gaps in the genome had to be closed by assembly of few long reads obtained from PacBio single molecule real-time sequencing. Here, the first complete genome sequence for the species P. pseudoalcaligenes is presented. Copyright © 2014 Elsevier B.V. All rights reserved.


July 7, 2019

Draft genome sequence of Pantoea agglomerans R190, a producer of antibiotics against phytopathogens and foodborne pathogens.

Pantoea agglomerans R190, isolated from an apple orchard, showed antibacterial activity against various spoilage bacteria, including Pectobacterium carotovorum subsp. carotovorum, and foodborne pathogens such as Escherichia coli O157:H7. Here, we report the genome sequence of P. agglomerans R190. This report will raise the value of P. agglomerans as an agent for biocontrol of disease. Copyright © 2014. Published by Elsevier B.V.


July 7, 2019

The odd one out: Bacillus ACT bacteriophage CP-51 exhibits unusual properties compared to related Spounavirinae W.Ph. and Bastille.

The Bacillus ACT group includes three important pathogenic species of Bacillus: anthracis, cereus and thuringiensis. We characterized three virulent bacteriophages, Bastille, W.Ph. and CP-51, that infect various strains of these three species. We have determined the complete genome sequences of CP-51, W.Ph. and Bastille, and their physical genome structures. The CP-51 genome sequence could only be obtained using a combination of conventional and second and third next generation sequencing technologies – illustrating the problems associated with sequencing highly modified DNA. We present evidence that the generalized transduction facilitated by CP-51 is independent of a specific genome structure, but likely due to sporadic packaging errors of the terminase. There is clear correlation of the genetic and morphological features of these phages validating their placement in the Spounavirinae subfamily (SPO1-related phages) of the Myoviridae. This study also provides tools for the development of phage-based diagnostics/therapeutics for this group of pathogens. Copyright © 2014 Elsevier Inc. All rights reserved.


July 7, 2019

De novo assembly and characterization of the complete chloroplast genome of radish (Raphanus sativus L.).

Radish (Raphanus sativus L.) is an edible root vegetable crop that is cultivated worldwide and whose genome has been sequenced. Here we report the complete nucleotide sequence of the radish cultivar WK10039 chloroplast (cp) genome, along with a de novo assembly strategy using whole genome shotgun sequence reads obtained by next generation sequencing. The radish cp genome is 153,368 bp in length and has a typical quadripartite structure, composed of a pair of inverted repeat regions (26,217 bp each), a large single copy region (83,170 bp), and a small single copy region (17,764 bp). The radish cp genome contains 87 predicted protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Sequence analysis revealed the presence of 91 simple sequence repeats (SSRs) in the radish cp genome. Phylogenetic analysis of 62 protein-coding gene sequences from the 17 cp genomes of the Brassicaceae family suggested that the radish cp genome is most closely related to the cp genomes of Brassica rapa and Brassicanapus. Comparisons with the B. rapa and B. napus cp genomes revealed highly divergent intergenic sequences and introns that can potentially be developed as diagnostic cp markers. Synonymous and nonsynonymous substitutions of cp genes suggested that nucleotide substitutions have occurred at similar rates in most genes. The complete sequence of the radish cp genome would serve as a valuable resource for the development of new molecular markers and the study of the phylogenetic relationships of Raphanus species in the Brassicaceae family. Copyright © 2014 Elsevier B.V. All rights reserved.


July 7, 2019

Get your high-quality low-cost genome sequence.

The study of whole-genome sequences has become essential for almost all branches of biological research. Next-generation sequencing (NGS) has revolutionized the scalability, speed, and resolution of sequencing and brought genomic science within reach of academic laboratories that study non-model organisms. Here, we show that a high-quality draft genome of a eukaryote can be obtained at relatively low cost by exploiting a hybrid combination of sequencing strategies. Copyright © 2014 Elsevier Ltd. All rights reserved.


July 7, 2019

Whole-genome assemblies of 56 Burkholderia species.

Burkholderia is a genus of betaproteobacteria that includes three notable human pathogens: B. cepacia, B. pseudomallei, and B. mallei. While B. pseudomallei and B. mallei are considered potential biowarfare agents, B. cepacia infections are largely limited to cystic fibrosis patients. Here, we present 56 Burkholderia genomes from 8 distinct species. Copyright © 2014 Daligault et al.


July 7, 2019

Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Advances in modern sequencing technologies allow us to generate sufficient data to analyze hundreds of bacterial genomes from a single machine in a single day. This potential for sequencing massive numbers of genomes calls for fully automated methods to produce high-quality assemblies and variant calls. We introduce Pilon, a fully automated, all-in-one tool for correcting draft assemblies and calling sequence variants of multiple sizes, including very large insertions and deletions. Pilon works with many types of sequence data, but is particularly strong when supplied with paired end data from two Illumina libraries with small e.g., 180 bp and large e.g., 3-5 Kb inserts. Pilon significantly improves draft genome assemblies by correcting bases, fixing mis-assemblies and filling gaps. For both haploid and diploid genomes, Pilon produces more contiguous genomes with fewer errors, enabling identification of more biologically relevant genes. Furthermore, Pilon identifies small variants with high accuracy as compared to state-of-the-art tools and is unique in its ability to accurately identify large sequence variants including duplications and resolve large insertions. Pilon is being used to improve the assemblies of thousands of new genomes and to identify variants from thousands of clinically relevant bacterial strains. Pilon is freely available as open source software.


July 7, 2019

Complete genome sequence of the lignin-degrading bacterium Klebsiella sp. strain BRL6-2.

In an effort to discover anaerobic bacteria capable of lignin degradation, we isolated Klebsiella sp. strain BRL6-2 on minimal media with alkali lignin as the sole carbon source. This organism was isolated anaerobically from tropical forest soils collected from the Bisley watershed at the Ridge site in the El Yunque National Forest in Puerto Rico, USA, part of the Luquillo Long-Term Ecological Research Station. At this site, the soils experience strong fluctuations in redox potential and are characterized by cycles of iron oxidation and reduction. Genome sequencing was targeted because of its ability to grow on lignin anaerobically and lignocellulolytic activity via in vitro enzyme assays. The genome of Klebsiella sp. strain BRL6-2 is 5.80 Mbp with no detected plasmids, and includes a relatively small arsenal of genes encoding lignocellulolytic carbohydrate active enzymes. The genome revealed four putative peroxidases including glutathione and DyP-type peroxidases, and a complete protocatechuate pathway encoded in a single gene cluster. Physiological studies revealed Klebsiella sp. strain BRL6-2 to be relatively stress tolerant to high ionic strength conditions. It grows in increasing concentrations of ionic liquid (1-ethyl-3-methyl-imidazolium acetate) up to 73.44 mM and NaCl up to 1.5 M.


July 7, 2019

Genome sequence of the dark pink pigmented Listia bainesii microsymbiont Methylobacterium sp. WSM2598.

Strains of a pink-pigmented Methylobacterium sp. are effective nitrogen- (N2) fixing microsymbionts of species of the African crotalarioid genus Listia. Strain WSM2598 is an aerobic, motile, Gram-negative, non-spore-forming rod isolated in 2002 from a Listia bainesii root nodule collected at Estcourt Research Station in South Africa. Here we describe the features of Methylobacterium sp. WSM2598, together with information and annotation of a high-quality draft genome sequence. The 7,669,765 bp draft genome is arranged in 5 scaffolds of 83 contigs, contains 7,236 protein-coding genes and 18 RNA-only encoding genes. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 G enomic E ncyclopedia for B acteria and A rchaea- R oot N odule B acteria (GEBA-RNB) project.


July 7, 2019

Complete genome sequence of Bifidobacterium longum 105-A, a strain with high transformation efficiency.

Bifidobacterium longum 105-A shows high transformation efficiency and allows for the generation of gene knockout mutants through homologous recombination. Here, we report the complete genome sequence of strain 105-A. Genes encoding at least four putative restriction-modification systems were found in this genome, which might contribute to its transformation efficiency. Copyright © 2014 Kanesaki et al.


July 7, 2019

Potential impact on kidney infection: a whole-genome analysis of Leptospira santarosai serovar Shermani.

Leptospira santarosai serovar Shermani is the most frequently encountered serovar, and it causes leptospirosis and tubulointerstitial nephritis in Taiwan. This study aims to complete the genome sequence of L. santarosai serovar Shermani and analyze the transcriptional responses of L. santarosai serovar Shermani to renal tubular cells. To assemble this highly repetitive genome, we combined reads that were generated from four next-generation sequencing platforms by using hybrid assembly approaches to finish two-chromosome contiguous sequences without gaps by validating the data with optical restriction maps and Sanger sequencing. Whole-genome comparison studies revealed a 28-kb region containing genes that encode transposases and hypothetical proteins in L. santarosai serovar Shermani, but this region is absent in other pathogenic Leptospira spp. We found that lipoprotein gene expression in both L. santarosai serovar Shermani and L. interrogans serovar Copenhageni were upregulated upon interaction with renal tubular cells, and LSS19962, a L. santarosai serovar Shermani-specific gene within a 28-kb region that encodes hypothetical proteins, was upregulated in L. santarosai serovar Shermani-infected renal tubular cells. Lipoprotein expression during leptospiral infection might facilitate the interactions of leptospires within kidneys. The availability of the whole-genome sequence of L. santarosai serovar Shermani would make it the first completed sequence of this species, and its comparison with that of other Leptospira spp. may provide invaluable information for further studies in leptospiral pathogenesis.


July 7, 2019

Comparative genome sequencing reveals genomic signature of extreme desiccation tolerance in the anhydrobiotic midge.

Anhydrobiosis represents an extreme example of tolerance adaptation to water loss, where an organism can survive in an ametabolic state until water returns. Here we report the first comparative analysis examining the genomic background of extreme desiccation tolerance, which is exclusively found in larvae of the only anhydrobiotic insect, Polypedilum vanderplanki. We compare the genomes of P. vanderplanki and a congeneric desiccation-sensitive midge P. nubifer. We determine that the genome of the anhydrobiotic species specifically contains clusters of multi-copy genes with products that act as molecular shields. In addition, the genome possesses several groups of genes with high similarity to known protective proteins. However, these genes are located in distinct paralogous clusters in the genome apart from the classical orthologues of the corresponding genes shared by both chironomids and other insects. The transcripts of these clustered paralogues contribute to a large majority of the mRNA pool in the desiccating larvae and most likely define successful anhydrobiosis. Comparison of expression patterns of orthologues between two chironomid species provides evidence for the existence of desiccation-specific gene expression systems in P. vanderplanki.


July 7, 2019

Comparative genomics reveals insights into avian genome evolution and adaptation.

Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. Copyright © 2014, American Association for the Advancement of Science.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.