Menu
July 7, 2019

Single-molecule sequencing assists genome assembly improvement and structural variation inference.

Dear editor, The single-molecule real-time (SMRT) sequencing platform presented by Pacific Biosciences (PacBio) is regarded as a third-generation sequencing technology (Eid et al., 2009, Roberts et al., 2013). PacBio delivers long reads from several to tens of kilobases (kbs), which are ideal for filling unsequenced gaps due to unusual sequence contexts, such as high-GC content or repeat-rich regions (Bashir et al., 2012, Berlin et al., 2015, Chaisson et al., 2015). PacBio long reads are also favorable for detecting large DNA fragments harboring structural variations (SVs), such as inversions, translocations, duplications, and large insertions/deletions (indels) (Ritz et al., 2010, English et al., 2014). However, one drawback of PacBio is the high error rate of base calling for single pass coverage of the genome (Au et al., 2012, Koren et al., 2012). This drawback can be mitigated by increasing sequencing coverage to achieve high consensus accuracy, but the requirements may be prohibitive for the de novo assembly of large- or medium-size genomes using only PacBio when considering both budgetary and computational costs. Alternatively, PacBio may be used for assembly improvement of near-finished reference genomes, especially for filling gaps in which unsequenced bases are represented by the letter N (English et al., 2012). Here, we combined PacBio (~15x) with Illumina reads (~40x) to improve the genome assemblies of African wild (Oryza barthii) and cultivated rice (O. glaberrima), and to infer large SVs between O. barthii and O. glaberrima.


July 7, 2019

First complete genome sequence of the Dutch veterinary Coxiella burnetii strain NL3262, originating from the largest global Q fever outbreak, and draft genome sequence of its epidemiologically linked chronic human isolate NLhu3345937

The largest global Q fever outbreak occurred in The Netherlands during 2007 to 2010. Goats and sheep were identified as the major sources of disease. Here, we report the first complete genome sequence of Coxiella burnetiigoat outbreak strain NL3262 and that of an epidemiologically linked chronic human strain, both having the outbreak-related CbNL01multilocus variable-number tandem-repeat analysis (MLVA) genotype. Copyright © 2016 Kuley et al.


July 7, 2019

Complete genome sequence of a low-temperature active and alkaline-stable endoglucanase-producing Paenibacillus sp. strain IHB B 3084 from the Indian Trans-Himalayas.

A genome of 5.88Mb with 46.83% G+C content is reported for an endoglucanase-producing bacterium Paenibacillus sp. strain IHB B 3084 isolated from the cold environments of the Indian Trans-Himalayas. The psychrotrophic bacterium produces low-temperature active and alkaline-stable endoglucanases of industrial importance. The genomic data has provided insight into genomic basis of cellulase production and survival of the bacterium in the cold environments. Copyright © 2016. Published by Elsevier B.V.


July 7, 2019

De novo assembly of complete genome sequence of Planococcus kocurii ATCC 43650(T), a potential plant growth promoting bacterium.

Planococcus kocurii ATCC 43650(T) is a halotolerant and psychrotolerant bacterium isolated from the skin of a North sea cod. Here, we present the first complete genome and annotation of P. kocurii ATCC 43650(T), identifying its potential as a plant growth promoting bacterium and its capability in the biosynthesis of butanol. Copyright © 2016 Elsevier B.V. All rights reserved.


July 7, 2019

Evolutionary redesign of the Atlantic cod (Gadus morhua L.) Toll-like receptor repertoire by gene losses and expansions.

Genome sequencing of the teleost Atlantic cod demonstrated loss of the Major Histocompatibility Complex (MHC) class II, an extreme gene expansion of MHC class I and gene expansions and losses in the innate pattern recognition receptor (PRR) family of Toll-like receptors (TLR). In a comparative genomic setting, using an improved version of the genome, we characterize PRRs in Atlantic cod with emphasis on TLRs demonstrating the loss of TLR1/6, TLR2 and TLR5 and expansion of TLR7, TLR8, TLR9, TLR22 and TLR25. We find that Atlantic cod TLR expansions are strongly influenced by diversifying selection likely to increase the detectable ligand repertoire through neo- and subfunctionalization. Using RNAseq we find that Atlantic cod TLRs display likely tissue or developmental stage-specific expression patterns. In a broader perspective, a comprehensive vertebrate TLR phylogeny reveals that the Atlantic cod TLR repertoire is extreme with regards to losses and expansions compared to other teleosts. In addition we identify a substantial shift in TLR repertoires following the evolutionary transition from an aquatic vertebrate (fish) to a terrestrial (tetrapod) life style. Collectively, our findings provide new insight into the function and evolution of TLRs in Atlantic cod as well as the evolutionary history of vertebrate innate immunity.


July 7, 2019

Mechanisms involved in acquisition of blaNDM genes by IncA/C2 and IncFIIY plasmids.

blaNDM genes confer carbapenem resistance and have been identified on transferable plasmids belonging to different incompatibility (Inc) groups. Here we present the complete sequences of four plasmids carrying a blaNDM gene, pKP1-NDM-1, pEC2-NDM-3, pECL3-NDM-1 and pEC4-NDM-6, from four clinical samples originating from four different patients. Different plasmids carry segments that align to different parts of the blaNDM region found on Acinetobacter plasmids. pKP1-NDM-1 and pEC2-NDM-3, from Klebsiella pneumoniae and Escherichia coli, respectively, were identified as type 1 IncA/C2 plasmids with almost identical backbones. Different regions carrying blaNDM are inserted in different locations in the antibiotic resistance island known as ARI-A and ISCR1 may have been involved in acquisition of blaNDM-3 by pEC2-NDM-3. pECL3-NDM-1 and pEC4-NDM-6, from Enterobacter cloacae and E. coli, respectively, have similar IncFIIY backbones but different regions carrying blaNDM are found in different locations. Tn3-derived Inverted-repeat Transposable Elements (TIME) appear to have been involved in acquisition of blaNDM-6 by pEC4-NDM-6 and the rmtC 16S rRNA methylase gene by IncFIIY plasmids. Characterisation of these plasmids further demonstrates that even very closely related plasmids may have acquired blaNDM genes by different mechanisms. These findings also illustrate the complex relationships between antimicrobial resistance genes, transposable elements and plasmids and provide insights into the possible routes for transmission of blaNDM genes amongst species of the Enterobacteriaceae family. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Variable presence of the inverted repeat and plastome stability in Erodium.

Several unrelated lineages such as plastids, viruses and plasmids, have converged on quadripartite genomes of similar size with large and small single copy regions and a large inverted repeat (IR). Except for Erodium (Geraniaceae), saguaro cactus and some legumes, the plastomes of all photosynthetic angiosperms display this structure. The functional significance of the IR is not understood and Erodium provides a system to examine the role of the IR in the long-term stability of these genomes. We compared the degree of genomic rearrangement in plastomes of Erodium that differ in the presence and absence of the IR.We sequenced 17 new Erodium plastomes. Using 454, Illumina, PacBio and Sanger sequences, 16 genomes were assembled and categorized along with one incomplete and two previously published Erodium plastomes. We conducted phylogenetic analyses among these species using a dataset of 19 protein-coding genes and determined if significantly higher evolutionary rates had caused the long branch seen previously in phylogenetic reconstructions within the genus. Bioinformatic comparisons were also performed to evaluate plastome evolution across the genus.Erodium plastomes fell into four types (Type 1-4) that differ in their substitution rates, short dispersed repeat content and degree of genomic rearrangement, gene and intron content and GC content. Type 4 plastomes had significantly higher rates of synonymous substitutions (dS) for all genes and for 14 of the 19 genes non-synonymous substitutions (dN) were significantly accelerated. We evaluated the evidence for a single IR loss in Erodium and in doing so discovered that Type 4 plastomes contain a novel IR.The presence or absence of the IR does not affect plastome stability in Erodium. Rather, the overall repeat content shows a negative correlation with genome stability, a pattern in agreement with other angiosperm groups and recent findings on genome stability in bacterial endosymbionts.© The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 7, 2019

Haemonchus contortus: genome structure, organization and comparative genomics

One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. Copyright © 2016 Elsevier Ltd. All rights reserved.


July 7, 2019

Complete nucleotide sequence of pH11, an IncHI2 plasmid conferring multi-antibiotic resistance and multi-heavy metal resistance genes in a clinical Klebsiella pneumoniae isolate.

The complete 284,628bp sequence of pH11, an IncHI2 plasmid, was determined through single-molecule, real-time (SMRT) sequencing. Harbored by a clinical Klebsiella pneumoniae strain H11, and isolated in Beijing, this plasmid contains multiple antibiotic resistance genes, including catA2, aac(6′)-Ib, strB, strA, dfrA19, blaTEM-1, blaSHV-12, sul1, qacE delta 1, ereA, arr2, and aac3. The aac(6′)-Ib is carried by a class I integron. Plasmid pH11 also carries several genes associated with resistance to heavy metals, such as tellurium, mercury, cobalt, zinc, nickel, copper, lead and cadmium. This plasmid exhibits numerous characteristics, including HipBA and RelBE toxin-antitoxin systems, two major transfer (Tra) regions closely related to those of Salmonella enterica serovar plasmid pRH-R27, a type II restriction modification system (EcoRII R-M system), several methyltransferases and methylases and genes encoding Hha and StpA. These characteristics suggest that pH11 may adapt to various hosts and environments. Multiple insertion sequence elements, transposases, recombinases, resolvases and integrases are scattered throughout pH11. The presence of these genes may indicate that horizontal gene transfer occurs frequently in pH11 and thus may facilitate the dissemination of antimicrobial resistance determinants. Our data suggest that pH11 is a chimera gradually assembled through the integration of different horizontally acquired DNA segments via transposition or homologous recombination. Copyright © 2016 Elsevier Inc. All rights reserved.


July 7, 2019

Complete genome sequence of probiotic Lactobacillus reuteri ZLR003 isolated from healthy weaned pig.

Lactobacillus reuteri ZLR003 was isolated from the caecum mucosa of healthy weaned pigs with displaying probiotic properties in our laboratory. Here, we present the complete genome sequence of L. reuteri ZLR003, which consists of a circular 2, 234, 097bp chromosome (G+C content of 38.66%). Such information will provide insights into the molecular mechanism of its probiotic activity and facilitate its application in animal production. Copyright © 2016. Published by Elsevier B.V.


July 7, 2019

Complete genome sequence of the novel thermophilic polyhydroxyalkanoates producer Aneurinibacillus sp. XH2 isolated from Gudao oilfield in China.

Aneurinibacillus sp. XH2 (CGMCC 1.15535) was isolated from Gudao oilfield in China. It is able to use simple carbon resources to accumulate Polyhydroxyalkanoates (PHAs) in a thermophilic fashion. Here, we describe the genomic features of this strain. The total genome size of Aneurinibacillus sp. XH2 is 3,664,835bp and contains 3441 coding sequences and 114 tRNAs. The annotated genome sequence of this strain provides the genetic basis for revealing its role as a themophilic PHAs producing bacterium. Copyright © 2016 Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of Streptomyces venezuelae ATCC 15439, producer of the methymycin/pikromycin family of macrolide antibiotics, using PacBio technology.

Here, we report the complete genome sequence of Streptomyces venezuelae ATCC 15439, a producer of the methymycin/pikromycin family of macrolide antibiotics and a model host for natural product studies, obtained exclusively using PacBio sequencing technology. The 9.03-Mbp genome harbors 8,775 genes and 11 polyketide and nonribosomal peptide natural product gene clusters. Copyright © 2016 He et al.


July 7, 2019

Analysis of the genome sequence of the medicinal plant Salvia miltiorrhiza.

Salvia miltiorrhiza Bunge (Danshen) is a medicinal plant of the Lamiaceae family, and its dried roots have long been used in traditional Chinese medicine with hydrophilic phenolic acids and tanshinones as pharmaceutically active components (Zhang et al., 2014; Xu et al., 2016). The first step of tanshinone biosynthesis is bicyclization of the general diterpene precursor (E,E,E)-geranylgeranyl diphosphate (GGPP) to copalyl diphosphate (CPP) by CPP synthases (CPSs), which is followed by a cyclization or rearrangement reaction catalyzed by kaurene synthase-like enzymes (KSL).


July 7, 2019

Genome sequence and analysis of a stress-tolerant, wild-derived strain of Saccharomyces cerevisiae used in biofuels research

The genome sequences of more than 100 strains of the yeast Saccharomyces cerevisiae have been published. Unfortunately, most of these genome assemblies contain dozens to hundreds of gaps at repetitive sequences, including transposable elements, tRNAs, and subtelomeric regions, which is where novel genes generally reside. Relatively few strains have been chosen for genome sequencing based on their biofuel production potential, leaving an additional knowledge gap. Here, we describe the nearly complete genome sequence of GLBRCY22-3 (Y22-3), a strain of S. cerevisiae derived from the stress-tolerant wild strain NRRL YB-210 and subsequently engineered for xylose metabolism. After benchmarking several genome assembly approaches, we developed a pipeline to integrate Pacific Biosciences (PacBio) and Illumina sequencing data and achieved one of the highest quality genome assemblies for any S. cerevisiae strain. Specifically, the contig N50 is 693 kbp, and the sequences of most chromosomes, the mitochondrial genome, and the 2-micron plasmid are complete. Our annotation predicts 92 genes that are not present in the reference genome of the laboratory strain S288c, over 70% of which were expressed. We predicted functions for 43 of these genes, 28 of which were previously uncharacterized and unnamed. Remarkably, many of these genes are predicted to be involved in stress tolerance and carbon metabolism and are shared with a Brazilian bioethanol production strain, even though the strains differ dramatically at most genetic loci. The Y22-3 genome sequence provides an exceptionally high-quality resource for basic and applied research in bioenergy and genetics. Copyright © 2016 McIlwain et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.