July 19, 2019

PacBio but not Illumina technology can achieve fast, accurate and complete closure of the high GC, complex Burkholderia pseudomallei two-chromosome genome

Although PacBio third-generation sequencers have improved read lengths in genome sequencing, which facilitates the assembly of complete genomes, no study has reported success in using PacBio data alone to completely sequence a two-chromosome bacterial genome from a single library in a single run. Previous studies using earlier versions of sequencing chemistries have at most been able to finish bacterial genomes containing only one chromosome with de novo assembly. In this study, we compared the robustness of PacBio RS II, using one SMRT cell and the latest P6-C4 chemistry, with Illumina HiSeq 1500 in sequencing the genome of Burkholderia pseudomallei, a bacterium which contains two large circular chromosomes, very high G+C content of 68–69%, highly repetitive regions and substantial genomic diversity, and represents one of the largest and most complex bacterial genomes sequenced, using a reference genome generated by hybrid assembly using PacBio and Illumina datasets with subsequent manual validation. Results showed that PacBio data with de novo assembly, but not Illumina, was able to completely sequence the B. pseudomallei genome without any gaps or mis-assemblies. The two large contigs of the PacBio assembly aligned unambiguously to the reference genome, sharing >99.9% nucleotide identities. Conversely, Illumina data assembled using three different assemblers resulted in fragmented assemblies (201–366 contigs), sharing only 92.2–100% and 92.0–100% nucleotide identities to chromosomes I and II reference sequences, respectively, with no indication that the B. pseudomallei genome consisted of two chromosomes with four copies of ribosomal operons. Among all assemblies, the PacBio assembly recovered the highest number of core and virulence proteins, and housekeeping genes based on whole-genome multilocus sequence typing (wgMLST). Most notably, assembly solely based on PacBio outperformed even hybrid assembly using both PacBio and Illumina datasets. The hybrid approach generated only 74 contigs, while the PacBio data alone with de novo assembly achieved complete closure of the two-chromosome B. pseudomallei genome without additional costly bench work and further sequencing. PacBio RS II using P6-C4 chemistry is highly robust and cost-effective and should be the platform of choice in sequencing bacterial genomes, particularly for those that are well known to be difficult to sequence.


July 19, 2019

Comparison between complete genomes of an isolate of Pseudomonas syringae pv. actinidiae from Japan and a New Zealand isolate of the pandemic lineage.

The modern pandemic of the bacterial kiwifruit pathogen Pseudomonas syringae pv. actinidiae (Psa) is caused by a particular Psa lineage. To better understand the genetic basis of the virulence of this lineage, we compare the completely assembled genome of a pandemic New Zealand strain with that of the Psa type strain first isolated in Japan in 1983. Aligning the two genomes shows numerous translocations, constrained so as to retain the appropriate orientation of the Architecture Imparting Sequences (AIMs). There are several large horizontally acquired regions, some of which include Type I, Type II or Type III restriction systems. The activity of these systems is reflected in the methylation patterns of the two strains. The pandemic strain carries an Integrative Conjugative Element (ICE) located at a tRNA-Lys site. Two other complex elements are also present at tRNA-Lys sites in the genome. These elements are derived from ICE but have now acquired some alternative secretion function. There are numerous types of mobile element in the two genomes. Analysis of these elements reveals no evidence of recombination between the two Psa lineages.


July 7, 2019

Genomes of ‘Candidatus Liberibacter solanacearum’ Haplotype A from New Zealand and the United States Suggest Significant Genome Plasticity in the Species.

‘Candidatus Liberibacter solanacearum’ contains two solanaceous crop-infecting haplotypes, A and B. Two haplotype A draft genomes were assembled and compared with ZC1 (haplotype B), revealing inversion and relocation genomic rearrangements, numerous single-nucleotide polymorphisms, and differences in phage-related regions. Differences in prophage location and sequence were seen both within and between haplotype comparisons. OrthoMCL and BLAST analyses identified 46 putative coding sequences present in haplotype A that were not present in haplotype B. Thirty-eight of these loci were not found in sequences from other Liberibacter spp. Quantitative polymerase chain reaction (qPCR) assays designed to amplify sequences from 15 of these loci were screened against a panel of ‘Ca. L. solanacearum’-positive samples to investigate genetic diversity. Seven of the assays demonstrated within-haplotype diversity; five failed to amplify loci in at least one haplotype A sample while three assays produced amplicons from some haplotype B samples. Eight of the loci assays showed consistent A-B differentiation. Differences in genome arrangements, prophage, and qPCR results suggesting locus diversity within the haplotypes provide more evidence for genetic complexity in this emerging bacterial species.


July 7, 2019

Complete genome sequence of Spirosoma radiotolerans, a gamma-radiation-resistant bacterium isolated from rice field in South Korea.

Spirosoma radiotolerans is a Gram-negative, short rod-shaped, gamma-radiation-resistant bacterium isolated from a rice field in South Korea (GPS: 37°34′30″N, 127°00′30″E). The complete genome of S. radiotolerans consists of one chromosome (7,029,352 bp). From the genome sequence database, we identified the cluster of genes responsible for DNA recovery from ionizing radiation. The key enzymes for nucleotide excision repair (NER) were identified, suggesting that S. radiotolerans DG5A uses NER pathways for efficient removal of pyrimidine dimers, which are the most abundant type of UV-induced damage. Complete genome information enables further studies on DNA repair mechanisms during exposure to ionizing radiation. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Whole-genome sequence of Serratia liquefaciens HUMV-21, a cytotoxic, quorum-sensing, and biofilm-producing clinical isolate.

A clinical isolate of Serratia liquefaciens (strain HUMV-21) was obtained from a skin ulcer of an adult patient. We report here its complete genome assembly using PacBio single-molecule real-time (SMRT) sequencing, which resulted in a single circular chromosome of 5.3 Mb. About 5,844 protein-coding genes are predicted from this assembly. Copyright © 2015 Lázaro-Díez et al.


July 7, 2019

Complete chloroplast genome sequence of MD-2 pineapple and its comparative analysis among nine other plants from the subclass Commelinidae.

Pineapple (Ananas comosus var. comosus) is known as the king of fruits for its crown and is the third most important tropical fruit after banana and citrus. The plant, which is indigenous to South America, is the most important species in the Bromeliaceae family and is largely traded for fresh fruit consumption. Here, we report the complete chloroplast sequence of the MD-2 pineapple that was sequenced using the PacBio sequencing technology. In this study, the high error rate of PacBio long sequence reads of A. comosus total genomic DNA was reduced by leveraging the highly accurate but short Illumina reads for error correction via the latest error correction module from Novocraft. Error-corrected long PacBio reads were assembled using a single tool to produce a contig representing the pineapple chloroplast genome. The genome, 159,636 bp in length, has the conserved quadripartite structure of chloroplasts, containing a large single copy region (LSC) of 87,482 bp, a small single copy region (SSC) of 18,622 bp and two inverted repeat regions (IRA and IRB) of 26,766 bp each. Overall, the genome contained 117 unique coding regions, and 30 were repeated in the IR region, with gene content, structure and arrangement similar to those of its sister taxon, Typha latifolia. A total of 35 repeat structures were detected in both the coding and non-coding regions, the majority being tandem repeats. In addition, 205 SSRs were detected in the genome, with six protein-coding genes containing more than two SSRs. Comparison of chloroplast genomes from the subclass Commelinidae revealed a conserved protein-coding gene, albeit located in a highly divergent region. Analysis of selection pressure on protein-coding genes using the Ka/Ks ratio showed significant positive selection exerted on the rps7 gene of the pineapple chloroplast (P < 0.05). Phylogenetic analysis confirmed the recent taxonomic relationships among the members of the commelinids, which support the monophyletic relationship between Arecales and Dasypogonaceae and between Zingiberales and Poales, which includes A. comosus. The complete sequence of the pineapple chloroplast provides insights into the divergence of genic chloroplast sequences among members of the subclass Commelinidae. The complete pineapple chloroplast will serve as a reference for in-depth taxonomic studies in the Bromeliaceae family when more species in the family are sequenced in the future. The genetic sequence information will also make feasible other molecular applications of the pineapple chloroplast for plant genetic improvement.
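The region sizes reported in this abstract can be checked against the stated genome length, since a quadripartite chloroplast genome is the LSC plus the SSC plus two copies of the inverted repeat. The following is only a minimal arithmetic sketch; the function and constant names are illustrative and not taken from the paper.

```python
# Region lengths in bp, as reported in the abstract.
LSC_BP = 87_482            # large single copy region
SSC_BP = 18_622            # small single copy region
IR_BP = 26_766             # each inverted repeat (IRA and IRB)
REPORTED_TOTAL_BP = 159_636


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a genome made of LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


computed_total = quadripartite_length(LSC_BP, SSC_BP, IR_BP)
print(computed_total, computed_total == REPORTED_TOTAL_BP)
```

The computed sum agrees with the reported 159,636 bp, so the four region sizes account for the whole genome.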


July 7, 2019

Methyltransferases acquired by lactococcal 936-type phage provide protection against restriction endonuclease activity

BACKGROUND: So-called 936-type phages are among the most frequently isolated phages in dairy facilities utilising Lactococcus lactis starter cultures. Despite extensive efforts to control phage proliferation and decades of research, these phages continue to negatively impact cheese production in terms of the final product quality and consequently, monetary return. RESULTS: Whole genome sequencing and in silico analysis of three 936-type phage genomes identified several putative (orphan) methyltransferase (MTase)-encoding genes located within the packaging and replication regions of the genome. Utilising SMRT sequencing, methylome analysis was performed on all three phages, allowing the identification of adenine modifications consistent with N-6 methyladenine sequence methylation, which in some cases could be attributed to these phage-encoded MTases. Heterologous gene expression revealed that M.Phi145I/M.Phi93I and M.Phi93DAM, encoded by genes located within the packaging module, provide protection against the restriction enzymes HphI and DpnII, respectively, representing the first functional MTases identified in members of 936-type phages. CONCLUSIONS: SMRT sequencing technology enabled the identification of the target motifs of MTases encoded by the genomes of three lytic 936-type phages and these MTases represent the first functional MTases identified in this species of phage. The presence of these MTase-encoding genes on 936-type phage genomes is assumed to represent an adaptive response to circumvent host encoded restriction-modification systems thereby increasing the fitness of the phages in a dynamic dairy environment.


July 7, 2019

The odd one out: Bacillus ACT bacteriophage CP-51 exhibits unusual properties compared to related Spounavirinae W.Ph. and Bastille.

The Bacillus ACT group includes three important pathogenic species of Bacillus: anthracis, cereus and thuringiensis. We characterized three virulent bacteriophages, Bastille, W.Ph. and CP-51, that infect various strains of these three species. We have determined the complete genome sequences of CP-51, W.Ph. and Bastille, and their physical genome structures. The CP-51 genome sequence could only be obtained using a combination of conventional, second-generation and third-generation sequencing technologies, illustrating the problems associated with sequencing highly modified DNA. We present evidence that the generalized transduction facilitated by CP-51 is independent of a specific genome structure, but likely due to sporadic packaging errors of the terminase. There is a clear correlation between the genetic and morphological features of these phages, validating their placement in the Spounavirinae subfamily (SPO1-related phages) of the Myoviridae. This study also provides tools for the development of phage-based diagnostics/therapeutics for this group of pathogens. Copyright © 2014 Elsevier Inc. All rights reserved.


July 7, 2019

Transcriptome analysis of the entomopathogenic oomycete Lagenidium giganteum reveals putative virulence factors.

A combination of 454 pyrosequencing and Sanger sequencing was used to sample and characterize the transcriptome of the entomopathogenic oomycete Lagenidium giganteum. More than 50,000 high-throughput reads were annotated through homology searches. Several selected reads served as seeds for the amplification and sequencing of full-length transcripts. Phylogenetic analyses inferred from full-length cellulose synthase alignments revealed that L. giganteum is nested within the peronosporalean galaxy and as such appears to have evolved from a phytopathogenic ancestor. In agreement with the phylogeny reconstructions, full-length L. giganteum oomycete effector orthologs, corresponding to the cellulose-binding elicitor lectin (CBEL), crinkler (CRN), and elicitin proteins, were characterized by domain organizations similar to those of pathogenicity factors of plant-pathogenic oomycetes. Importantly, the L. giganteum effectors provide a basis for detailing the roles of canonical CRN, CBEL, and elicitin proteins in the infectious process of an oomycete known principally as an animal pathogen. Finally, phylogenetic analyses and genome mining identified members of glycoside hydrolase family 5 subfamily 27 (GH5_27) as putative virulence factors active on the host insect cuticle, based in part on the fact that GH5_27 genes are shared by entomopathogenic oomycetes and fungi but are underrepresented in nonentomopathogenic genomes. The genomic resources gathered from the L. giganteum transcriptome analysis strongly suggest that filamentous entomopathogens (oomycetes and fungi) exhibit convergent evolution: they have evolved independently from plant-associated microbes, have retained genes indicative of plant associations, and may share similar cores of virulence factors, such as GH5_27 enzymes, that are absent from the genomes of their plant-pathogenic relatives. Copyright © 2014, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Comparative mitogenomic analysis of three species of periwinkles: Littorina fabalis, L. obtusata and L. saxatilis.

The flat periwinkles, Littorina fabalis and L. obtusata, offer an interesting system for local adaptation and ecological speciation studies. In order to provide genomic resources for these species, we sequenced their mitogenomes together with that of the rough periwinkle L. saxatilis by means of next-generation sequencing technologies. The three mitogenomes present the typical repertoire of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes and a putative control region. Although the latter could not be fully recovered in flat periwinkles using short-reads due to a highly repetitive fragment, in L. saxatilis this problem was overcome with additional long-reads and we were able to assemble the complete mitogenome. Both gene order and nucleotide composition are similar between the three species as well as compared to other Littorinimorpha. A large variance in divergence was observed across mitochondrial regions, with six- to ten-fold difference between the highest and the lowest divergence rates. Based on nucleotide changes on the whole molecule and assuming a molecular clock, L. fabalis and L. obtusata started to diverge around 0.8 Mya (0.4-1.1 Mya). The evolution of the mitochondrial protein-coding genes in the three Littorina species appears mainly influenced by purifying selection as revealed by phylogenetic tests based on dN/dS ratios that did not detect any evidence for positive selection, although some caution is required given the limited power of the dataset and the implemented approaches. Copyright © 2016 Elsevier B.V. All rights reserved.


July 7, 2019

The complete genome sequence of Cronobacter sakazakii ATCC 29544(T), a food-borne pathogen, isolated from a child’s throat.

Cronobacter sakazakii is an emerging opportunistic pathogen that is associated with rare but life-threatening cases of severe diseases: meningitis, necrotizing enterocolitis, and sepsis in premature and full-term infants. However, the pathogenesis mechanism of this pathogen remains largely unknown. To determine its pathogenesis at the genomic level, the genome of C. sakazakii ATCC 29544(T) was completely sequenced and analyzed. The genomic DNA, containing a circular chromosome and three plasmids, is composed of 4,511,265 bp with a GC content of 56.71%, containing 4380 predicted open reading frames (ORFs), 22 rRNA genes, and 83 tRNA genes. The plasmids, designated pCSK29544_p1, pCSK29544_p2, and pCSK29544_p3, were 93,905 bp, 4938 bp, and 53,457 bp with GC contents of 57.02, 54.88, and 50.07%, respectively. They were also predicted to have 72, 6, and 57 ORFs without RNA genes. The strain ATCC 29544(T) genome has ompA and ibeB-homologous cusC genes, probably associated with the invasion of human brain microvascular endothelial cells (BMECs). In addition, gene clusters for siderophore production (iucABCD/iutA) and the related transport system (eitCBAD) were detected in the pCSK29544_p1 plasmid, indicating better iron uptake ability for survival. Furthermore, to survive under extremely dry conditions such as milk powder, this genome has gene clusters for biosynthesis of capsular proteins (CSK29544_00281-00284) and cellulose (CSK29544_01124-01127) for biofilm formation and a gene cluster for utilization of sialic acid in the milk (nanKTAR). The genome information of C. sakazakii ATCC 29544(T) would provide further understanding of its pathogenesis at the molecular level for the regulation of pathogenicity and the development of a rapid detection method using biomarkers.


July 7, 2019

The secondary resistome of multidrug-resistant Klebsiella pneumoniae.

Klebsiella pneumoniae causes severe lung and bloodstream infections that are difficult to treat due to multidrug resistance. We hypothesized that antimicrobial resistance can be reversed by targeting chromosomal non-essential genes that are not responsible for acquired resistance but essential for resistant bacteria under therapeutic concentrations of antimicrobials. Conditional essentiality of individual genes to antimicrobial resistance was evaluated in an epidemic multidrug-resistant clone of K. pneumoniae (ST258). We constructed a high-density transposon mutant library of >430,000 unique Tn5 insertions and measured mutant depletion upon exposure to three clinically relevant antimicrobials (colistin, imipenem or ciprofloxacin) by Transposon Directed Insertion-site Sequencing (TraDIS). Using this high-throughput approach, we defined three sets of chromosomal non-essential genes essential for growth during exposure to colistin (n = 35), imipenem (n = 1) or ciprofloxacin (n = 1) in addition to known resistance determinants, collectively termed the “secondary resistome”. As proof of principle, we demonstrated that inactivation of a non-essential gene not previously found linked to colistin resistance (dedA) restored colistin susceptibility by reducing the minimum inhibitory concentration from 8 to 0.5 µg/ml, 4-fold below the susceptibility breakpoint (S = 2 µg/ml). This finding suggests that the secondary resistome is a potential target for developing antimicrobial “helper” drugs that restore the efficacy of existing antimicrobials.
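The fold changes quoted for the dedA result follow directly from the three concentrations in the abstract. A minimal sketch of that arithmetic is given below; the names are illustrative, and treating the breakpoint as an inclusive upper bound for susceptibility is an assumption made here, not something stated in the abstract.

```python
# Concentrations in µg/ml, as reported in the abstract.
MIC_BEFORE = 8.0        # colistin MIC before dedA inactivation
MIC_AFTER = 0.5         # colistin MIC after dedA inactivation
BREAKPOINT = 2.0        # susceptibility breakpoint


def fold_change(higher: float, lower: float) -> float:
    """How many times larger `higher` is than `lower`."""
    return higher / lower


def is_susceptible(mic: float, breakpoint: float) -> bool:
    # Assumption: an MIC at or below the breakpoint counts as susceptible.
    return mic <= breakpoint


mic_reduction = fold_change(MIC_BEFORE, MIC_AFTER)     # 16-fold drop in MIC
breakpoint_margin = fold_change(BREAKPOINT, MIC_AFTER)  # 4-fold below breakpoint
print(mic_reduction, breakpoint_margin, is_susceptible(MIC_AFTER, BREAKPOINT))
```

Under these figures the MIC falls 16-fold, and the resulting value sits 4-fold below the breakpoint, matching the abstract's wording.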


July 7, 2019

De novo hybrid assembly of the rubber tree genome reveals evidence of paleotetraploidy in Hevea species.

Para rubber tree (Hevea brasiliensis) is an important economic species as it is the sole commercial producer of high-quality natural rubber. Here, we report a de novo hybrid assembly of BPM24 accession, which exhibits resistance to major fungal pathogens in Southeast Asia. Deep-coverage 454/Illumina short-read and Pacific Biosciences (PacBio) long-read sequence data were acquired to generate a preliminary draft, which was subsequently scaffolded using a long-range “Chicago” technique to obtain a final assembly of 1.26 Gb (N50 = 96.8 kb). The assembled genome contains 69.2% repetitive sequences and has a GC content of 34.31%. Using a high-density SNP-based genetic map, we were able to anchor 28.9% of the genome assembly (363 Mb) associated with over two thirds of the predicted protein-coding genes into rubber tree’s 18 linkage groups. These genetically anchored sequences allowed comparative analyses of the intragenomic homeologous synteny, providing the first concrete evidence to demonstrate the presence of paleotetraploidy in Hevea species. Additionally, the degree of macrosynteny conservation observed between rubber tree and cassava strongly supports the hypothesis that the paleotetraploidization event took place prior to the divergence of the Hevea and Manihot species.
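The N50 figure quoted for this assembly is a standard contiguity statistic: the length of the sequence at which the running total of lengths, sorted from longest to shortest, first reaches half of the assembly size. The sketch below shows that definition on made-up lengths; the toy values are hypothetical and unrelated to the rubber tree data.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first sequence that pushes the running total to half
        # of the assembly size defines the N50.
        if running >= half_total:
            return length


# Hypothetical toy assembly (lengths in kb), for illustration only.
toy_lengths_kb = [100, 80, 60, 40, 20]
toy_n50 = n50(toy_lengths_kb)
print(toy_n50)
```

For the toy list the total is 300 kb, and the running sum passes 150 kb at the second-longest sequence, so the N50 is 80 kb.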


July 7, 2019

Genomic sequence of ‘Candidatus Liberibacter solanacearum’ haplotype C and its comparison with haplotype A and B genomes.

Haplotypes A and B of ‘Candidatus Liberibacter solanacearum’ (CLso) are associated with diseases of solanaceous plants, especially Zebra chip disease of potato, and haplotypes C, D and E are associated with symptoms on apiaceous plants. To date, one complete genome of haplotype B and two high quality draft genomes of haplotype A have been obtained for these unculturable bacteria using metagenomics from the psyllid vector Bactericera cockerelli. Here, we present the first genomic sequences obtained for the carrot-associated CLso. These two genomic sequences of haplotype C, FIN114 (1.24 Mbp) and FIN111 (1.20 Mbp), were obtained from carrot psyllids (Trioza apicalis) harboring CLso. Genomic comparisons between haplotypes A, B and C revealed that the genome organization differs between these haplotypes, due to large inversions and other recombinations. Comparison of protein-coding genes indicated that the core genome of CLso consists of 885 ortholog groups, with the pan-genome consisting of 1327 ortholog groups. Twenty-seven ortholog groups are unique to CLso haplotype C, whilst 11 ortholog groups shared by haplotypes A and B are not found in haplotype C. Some of these ortholog groups that are not part of the core genome may encode functions related to interactions with the different host plant and psyllid species.
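The core genome, pan-genome and haplotype-specific counts in this abstract correspond to simple set operations over ortholog groups: the core is the intersection across haplotypes and the pan-genome is the union. The following sketch illustrates this with invented group identifiers; the toy data and variable names are hypothetical and do not reproduce the paper's pipeline or its counts.

```python
# Hypothetical ortholog-group identifiers per haplotype, for illustration only.
ortholog_groups = {
    "A": {"og1", "og2", "og3", "og5"},
    "B": {"og1", "og2", "og3", "og6"},
    "C": {"og1", "og2", "og4"},
}

# Core genome: groups present in every haplotype.
core_genome = set.intersection(*ortholog_groups.values())

# Pan-genome: groups present in at least one haplotype.
pan_genome = set.union(*ortholog_groups.values())

# Groups found only in haplotype C.
unique_to_c = ortholog_groups["C"] - (ortholog_groups["A"] | ortholog_groups["B"])

# Groups shared by haplotypes A and B but missing from C.
shared_ab_absent_c = (ortholog_groups["A"] & ortholog_groups["B"]) - ortholog_groups["C"]

print(len(core_genome), len(pan_genome), sorted(unique_to_c), sorted(shared_ab_absent_c))
```

In the abstract, the analogous quantities are 885 core groups, 1327 pan-genome groups, 27 groups unique to haplotype C, and 11 groups shared by A and B but absent from C.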


July 7, 2019

Genome sequence of Streptomyces sp. H-KF8, a marine actinobacterium isolated from a northern Chilean Patagonian fjord.

Streptomyces sp. H-KF8 is a fjord-derived marine actinobacterium capable of producing antimicrobial activity. Streptomyces sp. H-KF8 was isolated from sediments of the Comau fjord, located in the northern Chilean Patagonia. Here, we report the 7.7-Mb genome assembly, which represents the first genome of a Chilean marine actinobacterium. Copyright © 2017 Undabarrena et al.

