Menu
July 7, 2019

Complete genome sequence of the drought resistance-promoting endophyte Klebsiella sp. LTGPAF-6F.

Bacterial endophytes with capacity to promote plant growth and improve plant tolerance against biotic and abiotic stresses have importance in agricultural practice and phytoremediation. A plant growth-promoting endophyte named Klebsiella sp. LTGPAF-6F, which was isolated from the roots of the desert plant Alhagi sparsifolia in north-west China, exhibits the ability to enhance the growth of wheat under drought stress. The complete genome sequence of this strain consists of one circular chromosome and two circular plasmids. From the genome, we identified genes related to the plant growth promotion and stress tolerance, such as nitrogen fixation, production of indole-3-acetic acid, acetoin, 2,3-butanediol, spermidine and trehalose. This genome sequence provides a basis for understanding the beneficial interactions between LTGPAF-6F and host plants, and will facilitate its applications as biotechnological agents in agriculture. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Genomesequencing of Ralstonia solanacearum CQPS-1, a phylotype I strain collected from a highland area with continuous cropping of tobacco.

Ralstonia solanacearum, an agent of bacterial wilt, is a highly variable species with a broad host range and wide geographic distribution. As a species complex, it has extensive genetic diversity and its living environment is polymorphic like the lowland and the highland area, so more genomes are needed for studying population evolution and environment adaptation. In this paper, we reported the genome sequencing of R. solanacearum strain CQPS-1 isolated from wilted tobacco in Pengshui, Chongqing, China, a highland area with severely acidified soil and continuous cropping of tobacco more than 20 years. The comparative genomic analysis among different R. solanacearum strains was also performed. The completed genome size of CQPS-1 was 5.89 Mb and contained the chromosome (3.83 Mb) and the megaplasmid (2.06 Mb). A total of 5229 coding sequences were predicted (the chromosome and megaplasmid encoded 3573 and 1656 genes, respectively). A comparative analysis with eight strains from four phylotypes showed that there was some variation among the species, e.g., a large set of specific genes in CQPS-1. Type III secretion system gene cluster (hrp gene cluster) was conserved in CQPS-1 compared with the reference strain GMI1000. In addition, most genes coding core type III effectors were also conserved with GMI1000, but significant gene variation was found in the gene ripAA: the identity compared with strain GMI1000 was 75% and the hrpII box promoter in the upstream had significantly mutated. This study provided a potential resource for further understanding of the relationship between variation of pathogenicity factors and adaptation to the host environment.


July 7, 2019

Emergence of a new Neisseria meningitidis clonal complex 11 lineage 11.2 clade as an effective urogenital pathogen.

Neisseria meningitidis (Nm) clonal complex 11 (cc11) lineage is a hypervirulent pathogen responsible for outbreaks of invasive meningococcal disease, including among men who have sex with men, and is increasingly associated with urogenital infections. Recently, clusters of Nm urethritis have emerged primarily among heterosexual males in the United States. We determined that nonencapsulated meningococcal isolates from an ongoing Nm urethritis outbreak among epidemiologically unrelated men in Columbus, Ohio, are linked to increased Nm urethritis cases in multiple US cities, including Atlanta and Indianapolis, and that they form a unique clade (the US Nm urethritis clade, US_NmUC). The isolates belonged to the cc11 lineage 11.2/ET-15 with fine type of PorA P1.5-1, 10-8; FetA F3-6; PorB 2-2 and express a unique FHbp allele. A common molecular fingerprint of US_NmUC isolates was an IS1301 element in the intergenic region separating the capsule ctr-css operons and adjacent deletion of cssA/B/C and a part of csc, encoding the serogroup C capsule polymerase. This resulted in the loss of encapsulation and intrinsic lipooligosaccharide sialylation that may promote adherence to mucosal surfaces. Furthermore, we detected an IS1301-mediated inversion of an ~20-kb sequence near the cps locus. Surprisingly, these isolates had acquired by gene conversion the complete gonococcal denitrification norB-aniA gene cassette, and strains grow well anaerobically. The cc11 US_NmUC isolates causing urethritis clusters in the United States may have adapted to a urogenital environment by loss of capsule and gene conversion of the Neisseria gonorrheae norB-aniA cassette promoting anaerobic growth.


July 7, 2019

PipeCraft: Flexible open-source toolkit for bioinformatics analysis of custom high-throughput amplicon sequencing data.

High-throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in public and private sector. These methods provide vast amount of data, which need to be analysed in several steps. Although the bioinformatics may be applied using several public tools, many analytical pipelines allow too few options for the optimal analysis for more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user-friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24 hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable.© 2017 John Wiley & Sons Ltd.


July 7, 2019

Resolving multicopy duplications de novo using polyploid phasing

While the rise of single-molecule sequencing systems has enabled an unprecedented rise in the ability to assemble complex regions of the genome, long segmental duplications in the genome still remain a challenging frontier in assembly. Segmental duplications are at the same time both gene rich and prone to large structural rearrangements, making the resolution of their sequences important in medical and evolutionary studies. Duplicated sequences that are collapsed in mammalian de novo assemblies are rarely identical; after a sequence is duplicated, it begins to acquire paralog-specific variants. In this paper, we study the problem of resolving the variations in multicopy, long segmental duplications by developing and utilizing algorithms for polyploid phasing. We develop two algorithms: the first one is targeted at maximizing the likelihood of observing the reads given the underlying haplotypes using discrete matrix completion. The second algorithm is based on correlation clustering and exploits an assumption, which is often satisfied in these duplications, that each paralog has a sizable number of paralog-specific variants. We develop a detailed simulation methodology and demonstrate the superior performance of the proposed algorithms on an array of simulated datasets. We measure the likelihood score as well as reconstruction accuracy, i.e., what fraction of the reads are clustered correctly. In both the performance metrics, we find that our algorithms dominate existing algorithms on more than 93% of the datasets. While the discrete matrix completion performs better on likelihood score, the correlation-clustering algorithm performs better on reconstruction accuracy due to the stronger regularization inherent in the algorithm. We also show that our correlation-clustering algorithm can reconstruct on average 7.0 haplotypes in 10-copy duplication datasets whereas existing algorithms reconstruct less than one copy on average.


July 7, 2019

Neisseria lactamica Y92-1009 complete genome sequence.

We present the high quality, complete genome assembly of Neisseria lactamica Y92-1009 used to manufacture an outer membrane vesicle (OMV)-based vaccine, and a member of the Neisseria genus. The strain is available on request from the Public Health England Meningococcal Reference Unit. This Gram negative, dipplococcoid bacterium is an organism of worldwide clinical interest because human nasopharyngeal carriage is related inversely to the incidence of meningococcal disease, caused by Neisseria meningitidis. The organism sequenced was isolated during a school carriage survey in Northern Ireland in 1992 and has been the subject of a variety of laboratory and clinical studies. Four SMRT cells on a RSII machine by Pacific Biosystems were used to produce a complete, closed genome assembly. Sequence data were obtained for a total of 30,180,391 bases from 2621 reads and assembled using the HGAP algorithm. The assembly was corrected using short reads obtained from an Illumina HiSeq 2000instrument. This resulted in a 2,146,723 bp assembly with approximately 460 fold mean coverage depth and a GC ratio of 52.3%.


July 7, 2019

Single-molecule sequencing and Hi-C-based proximity-guided assembly of amaranth (Amaranthus hypochondriacus) chromosomes provide insights into genome evolution.

Amaranth (Amaranthus hypochondriacus) was a food staple among the ancient civilizations of Central and South America that has recently received increased attention due to the high nutritional value of the seeds, with the potential to help alleviate malnutrition and food security concerns, particularly in arid and semiarid regions of the developing world. Here, we present a reference-quality assembly of the amaranth genome which will assist the agronomic development of the species.Utilizing single-molecule, real-time sequencing (Pacific Biosciences) and chromatin interaction mapping (Hi-C) to close assembly gaps and scaffold contigs, respectively, we improved our previously reported Illumina-based assembly to produce a chromosome-scale assembly with a scaffold N50 of 24.4 Mb. The 16 largest scaffolds contain 98% of the assembly and likely represent the haploid chromosomes (n?=?16). To demonstrate the accuracy and utility of this approach, we produced physical and genetic maps and identified candidate genes for the betalain pigmentation pathway. The chromosome-scale assembly facilitated a genome-wide syntenic comparison of amaranth with other Amaranthaceae species, revealing chromosome loss and fusion events in amaranth that explain the reduction from the ancestral haploid chromosome number (n?=?18) for a tetraploid member of the Amaranthaceae.The assembly method reported here minimizes cost by relying primarily on short-read technology and is one of the first reported uses of in vivo Hi-C for assembly of a plant genome. Our analyses implicate chromosome loss and fusion as major evolutionary events in the 2n?=?32 amaranths and clearly establish the homoeologous relationship among most of the subgenome chromosomes, which will facilitate future investigations of intragenomic changes that occurred post polyploidization.


July 7, 2019

Phase-variable methylation and epigenetic regulation by type I restriction-modification systems.

Epigenetic modifications in bacteria, such as DNA methylation, have been shown to affect gene regulation, thereby generating cells that are isogenic but with distinctly different phenotypes. Restriction-modification (RM) systems contain prototypic methylases that are responsible for much of bacterial DNA methylation. This review focuses on a distinctive group of type I RM loci that , through phase variation, can modify their methylation target specificity and can thereby switch bacteria between alternative patterns of DNA methylation. Phase variation occurs at the level of the target recognition domains of the hsdS (specificity) gene via reversible recombination processes acting upon multiple hsdS alleles. We describe the global distribution of such loci throughout the prokaryotic kingdom and highlight the differences in loci structure across the various bacterial species. Although RM systems are often considered simply as an evolutionary response to bacteriophages, these multi-hsdS type I systems have also shown the capacity to change bacterial phenotypes. The ability of these RM systems to allow bacteria to reversibly switch between different physiological states, combined with the existence of such loci across many species of medical and industrial importance, highlights the potential of phase-variable DNA methylation to act as a global regulatory mechanism in bacteria.© FEMS 2017.


July 7, 2019

The cacao Criollo genome v2.0: an improved version of the genome for genetic and functional genomic studies.

Theobroma cacao L., native to the Amazonian basin of South America, is an economically important fruit tree crop for tropical countries as a source of chocolate. The first draft genome of the species, from a Criollo cultivar, was published in 2011. Although a useful resource, some improvements are possible, including identifying misassemblies, reducing the number of scaffolds and gaps, and anchoring un-anchored sequences to the 10 chromosomes.We used a NGS-based approach to significantly improve the assembly of the Belizian Criollo B97-61/B2 genome. We combined four Illumina large insert size mate paired libraries with 52x of Pacific Biosciences long reads to correct misassembled regions and reduced the number of scaffolds. We then used genotyping by sequencing (GBS) methods to increase the proportion of the assembly anchored to chromosomes.The scaffold number decreased from 4,792 in assembly V1 to 554 in V2 while the scaffold N50 size has increased from 0.47 Mb in V1 to 6.5 Mb in V2. A total of 96.7% of the assembly was anchored to the 10 chromosomes compared to 66.8% in the previous version. Unknown sites (Ns) were reduced from 10.8% to 5.7%. In addition, we updated the functional annotations and performed a new RefSeq structural annotation based on RNAseq evidence.Theobroma cacao Criollo genome version 2 will be a valuable resource for the investigation of complex traits at the genomic level and for future comparative genomics and genetics studies in cacao tree. New functional tools and annotations are available on the Cocoa Genome Hub ( http://cocoa-genome-hub.southgreen.fr ).


July 7, 2019

Draft genome sequences of Trichophyton rubrum CMCC(F)T1i and Trichophyton violaceum CMCC(F)T3l by Illumina 2000 and Pacific Biosciences.

One strain of Trichophyton rubrum CMCC(F)T1i (=CBS 139224) isolated from onychomycosis and one strain of Trichophyton violaceum CMCC(F)T3l (=CBS 141829) isolated from tinea capitis in China were whole-genome sequenced by Illumina/Solexa, while the former was also sequenced by Pacific Biosciences sequencing in parallel. Copyright © 2017 Zhan et al.


July 7, 2019

Archetype JC polyomavirus prevails in a rare case of JC polyomavirus nephropathy and in stable renal transplant recipients with JC polyomavirus viruria.

JC polyomavirus (JCPyV) is reactivated in approximately 20% of renal transplant recipients and it may rarely cause JCPyV-associated nephropathy (JCPyVAN). Whereas progressive multifocal leukoencephalopathy of the brain is caused by rearranged neurotropic JCPyV, little is known about viral sequence variation in JCPyVAN due to the rarity of this condition.Using single-molecule real-time sequencing, characterization of full-length JCPyV genomes from urine and plasma of one JCPyVAN patient and twenty stable renal transplant recipients with JCPyV viruria was attempted. Sequence analysis of JCPyV strains was performed with the emphasis on the NCCR region, the major capsid protein gene VP1 and the large T antigen (LTag) gene.Exclusively archetype strains were identified in urine of the JCPyVAN patient. Full-length JCPyV sequences were not retrieved from plasma. Archetype strains were found in urine of nineteen stable renal transplant recipients, with JCPyV quasispecies detected in five samples. In a patient with minor graft dysfunction, a strain with archetype-like NCCR region was discovered. Individual point mutations were detected in both VP1 and LTag genes.Archetype JCPyV was dominant in the JCPyVAN patient and in stable renal transplant recipients. Archetype rather than rearranged JCPyV seems to drive the pathogenesis of JCPyVAN.


July 7, 2019

Rapid and affordable size-selected PacBio single-molecule real-time sequencing template library construction using the bead-beating DNA extraction method

This study demonstrated that bead-beating method facilitates a simple and rapid protocol for genomic DNA isolation for Pacific BioSciences (PacBio) sequencing with library construction of sufficient length. The protocol may also be beneficial for inactivating pathogens by simultaneous and instant DNA fragmentation, with no special equipment required to obtain large DNA fragments. This protocol was comparable in terms of quality to the standard protocol suggested by PacBioand represents an alternative, rapid shortcut for performing accurate PacBio sequencing.


July 7, 2019

Bioinformatics analysis and characterization of highly efficient polyvinyl alcohol (PVA)-degrading enzymes from the novel PVA degrader Stenotrophomonas rhizophila QL-P4.

Polyvinyl alcohol (PVA) is used widely in industry, and associated environmental pollution is a serious problem. Herein, we report a novel, efficient PVA degrader, Stenotrophomonas rhizophila QL-P4, isolated from fallen leaves from virgin forest in the Qinling Mountains. The complete genome was obtained using single-molecule real-time (SMRT) technology and corrected using Illumina sequencing. Bioinformatics analysis revealed eight PVA/OVA (vinyl alcohol oligomer)-degrading genes. Of these, seven genes were predicted to be involved in the classical intracellular PVA/OVA degradation pathway, and one (BAY15_3292) was identified as a novel PVA oxidase. Five PVA/OVA-degrading enzymes were purified and characterised. Among which, BAY15_1712, a PVA dehydrogenase (PVADH), displayed high catalytic efficiency towards PVA and OVA substrate. All reported PVADHs only have PVA-degrading ability. Most importantly, we discovered a novel PVA oxidase (BAY15_3292) that exhibited highest PVA-degrading efficiency than the reported PVADHs. Further investigation indicated that BAY15_3292 plays a crucial role in PVA degradation in S. rhizophila QL-P4. Knocking out BAY15_3292 resulted in a significant decline in PVA-degrading activity in S. rhizophila QL-P4. Interestingly, we found that BAY15_3292 possesses exocrine activity, which distinguishes it from classical PVADHs. Transparent circle experiments further proved that BAY15_3292 greatly affects extracellular PVA degradation in S. rhizophila QL-P4. The exocrine characteristics of BAY15_3292 facilitate its potential application to PVA bioremediation. In addition, we report three new efficient secondary alcohol dehydrogenases (SADHs) with OVA-degrading ability in S. rhizophila QL-P4, compared with only one OVA-degrading SADH as reported previously.Importance With the widespread application of PVA in industry, PVA-related environmental pollution is an increasingly serious issue. Because PVA is difficult to degrade, it accumulates in aquatic environments and causes chronic toxicity to aquatic organisms. Biodegradation of PVA, as an economical and environment-friendly method, has attracted much interest. To date, effective and applicable PVA-degrading bacteria/enzymes have not been reported. Herein, we report a new efficient PVA degrader (S. rhizophila QL-P4) that has five PVA/OVA-degrading enzymes with high catalytic efficiency, among which BAY15_1712 is the only reported PVADH with both PVA- and OVA-degrading abilities. Importantly, we discovered a novel PVA oxidase (BAY15_3292) that is not only more efficient than other reported PVA-degrading PVADHs, but also has exocrine activity. Overall, our findings provide new insight into PVA-degrading pathways in microorganisms, and suggest S. rhizophila QL-P4 and its enzymes have potential for application to PVA bioremediation to reduce or eliminate PVA-related environmental pollution. Copyright © 2017 American Society for Microbiology.


July 7, 2019

Public health surveillance in the UK revolutionises our understanding of the invasive Salmonella Typhimurium epidemic in Africa.

The ST313 sequence type of Salmonella Typhimurium causes invasive non-typhoidal salmonellosis and was thought to be confined to sub-Saharan Africa. Two distinct phylogenetic lineages of African ST313 have been identified.We analysed the whole genome sequences of S. Typhimurium isolates from UK patients that were generated following the introduction of routine whole-genome sequencing (WGS) of Salmonella enterica by Public Health England in 2014.We found that 2.7% (84/3147) of S. Typhimurium from patients in England and Wales were ST313 and were associated with gastrointestinal infection. Phylogenetic analysis revealed novel diversity of ST313 that distinguished UK-linked gastrointestinal isolates from African-associated extra-intestinal isolates. The majority of genome degradation of African ST313 lineage 2 was conserved in the UK-ST313, but the African lineages carried a characteristic prophage and antibiotic resistance gene repertoire. These findings suggest that a strong selection pressure exists for certain horizontally acquired genetic elements in the African setting. One UK-isolated lineage 2 strain that probably originated in Kenya carried a chromosomally located bla CTX-M-15, demonstrating the continual evolution of this sequence type in Africa in response to widespread antibiotic usage.The discovery of ST313 isolates responsible for gastroenteritis in the UK reveals new diversity in this important sequence type. This study highlights the power of routine WGS by public health agencies to make epidemiologically significant deductions that would be missed by conventional microbiological methods. We speculate that the niche specialisation of sub-Saharan African ST313 lineages is driven in part by the acquisition of accessory genome elements.


July 7, 2019

Highly accurate fluorogenic DNA sequencing with information theory-based error correction.

Eliminating errors in next-generation DNA sequencing has proved challenging. Here we present error-correction code (ECC) sequencing, a method to greatly improve sequencing accuracy by combining fluorogenic sequencing-by-synthesis (SBS) with an information theory-based error-correction algorithm. ECC embeds redundancy in sequencing reads by creating three orthogonal degenerate sequences, generated by alternate dual-base reactions. This is similar to encoding and decoding strategies that have proved effective in detecting and correcting errors in information communication and storage. We show that, when combined with a fluorogenic SBS chemistry with raw accuracy of 98.1%, ECC sequencing provides single-end, error-free sequences up to 200 bp. ECC approaches should enable accurate identification of extremely rare genomic variations in various applications in biology and medicine.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.