pacbio data Archives - Page 11 of 21

July 7, 2019

A gapless genome sequence of the fungus Botrytis cinerea.

Following earlier incomplete and fragmented versions of a genome sequence for the grey mould Botrytis cinerea, we here report a gapless, near-finished genome sequence for B. cinerea strain B05.10. The assembly comprises 18 chromosomes and was confirmed by an optical map and a genetic map based on ~75 000 SNP markers. All chromosomes contain fully assembled centromeric regions, and 10 chromosomes have telomeres on both ends. The genetic map consisted of 4153 cM and comparison of genetic distances with the physical distances identified 40 recombination hotspots. The linkage map also identified two mutations, located in the previously described genes Bos1 and BcsdhB, that confer resistance to the fungicides boscalid and iprodione. The genome was predicted to encode 11 701 proteins. RNAseq data from >20 different samples were used to validate and improve gene models. Manual curation of chromosome 1 revealed interesting features, such as the occurrence of a dicistronic transcript and fully overlapping genes in opposite orientations, as well as many spliced antisense transcripts. Manual curation also revealed that UTRs of genes can be complex and long, with many UTRs exceeding lengths of 1 kb and possessing multiple introns. Community annotation is in progress. This article is protected by copyright. All rights reserved. © 2016 BSPP AND JOHN WILEY & SONS LTD.

July 7, 2019

Genomic analysis of 495 vancomycin-resistant Enterococcus faecium reveals broad dissemination of a vanA plasmid in more than 19 clones from Copenhagen, Denmark.

From 2012 to 2014, there has been a huge increase in vancomycin-resistant (vanA) Enterococcus faecium (VREfm) in Copenhagen, Denmark, with 602 patients infected or colonized with VREfm in 2014 compared with just 22 in 2012. The objective of this study was to describe the genetic epidemiology of VREfm to assess the contribution of clonal spread and horizontal transfer of the vanA transposon (Tn1546) and plasmid in the dissemination of VREfm in hospitals.VREfm from Copenhagen, Denmark (2012-14) were whole-genome sequenced. The clonal structure was determined and the structure of Tn1546-like transposons was characterized. One VREfm isolate belonging to the largest clonal group was sequenced using long-read technology to close a 37 kb vanA plasmid.Phylogeny revealed a polyclonal structure where 495 VREfm isolates were divided into 13 main groups and 7 small groups. The majority of the isolates were located in three groups (n?=?44, 100 and 218) and clonal spread of VREfm between wards and hospitals was identified. Five Tn1546-like transposon types were identified. A dominant truncated transposon (type 4, 92%) was spread across all but one VREfm group. The closed vanA plasmid was highly covered by reads from isolates containing the type 4 transposon.This study suggests that it was the dissemination of the type 4 Tn1546-like transposon and plasmid via horizontal transfer to multiple populations of E. faecium, followed by clonal spread of new VREfm clones, that contributed to the increase in and diversity of VREfm in Danish hospitals.© The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

July 7, 2019

Spike gene deletion quasispecies in serum of patient with acute MERS-CoV infection.

The spike glycoprotein of the Middle East respiratory coronavirus (MERS-CoV) facilitates receptor binding and cell entry. During investigation of a multi-facility outbreak of MERS-CoV in Taif, Saudi Arabia, we identified a mixed population of wild-type and variant sequences with a large 530 nucleotide deletion in the spike gene from the serum of one patient. The out of frame deletion predicted loss of most of the S2 subunit of the spike protein leaving the S1 subunit with an intact receptor binding domain. This finding documents human infection with a novel genetic variant of MERS-CoV present as a quasispecies. J. Med. Virol. 89:542-545, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

July 7, 2019

Methods for genome-wide methylome profiling of Campylobacter jejuni.

Methylation has a profound role in the regulation of numerous biological processes in bacteria including virulence. The study of methylation in bacteria has greatly advanced thanks to next-generation sequencing technologies. These technologies have expedited the process of uncovering unique features of many bacterial methylomes such as characterizing previously uncharacterized methyltransferases, cataloging genome-wide DNA methylations in bacteria, identifying the frequency of methylation at particular genomic loci, and revealing regulatory roles of methylation in the biology of various bacterial species. For instance, methylation has been cited as a potential source for the pathogenicity differences observed in C. jejuni strains with syntenic genomes as seen in recent publications. Here, we describe the methodology for the use of Pacific Biosciences’ single molecule real-time (SMRT) sequencing for detecting methylation patterns in C. jejuni and bioinformatics tools to profile its methylome.

July 7, 2019

The comparative landscape of duplications in Heliconius melpomene and Heliconius cydno.

Gene duplications can facilitate adaptation and may lead to interpopulation divergence, causing reproductive isolation. We used whole-genome resequencing data from 34 butterflies to detect duplications in two Heliconius species, Heliconius cydno and Heliconius melpomene. Taking advantage of three distinctive signals of duplication in short-read sequencing data, we identified 744 duplicated loci in H. cydno and H. melpomene and evaluated the accuracy of our approach using single-molecule sequencing. We have found that duplications overlap genes significantly less than expected at random in H. melpomene, consistent with the action of background selection against duplicates in functional regions of the genome. Duplicate loci that are highly differentiated between H. melpomene and H. cydno map to four different chromosomes. Four duplications were identified with a strong signal of divergent selection, including an odorant binding protein and another in close proximity with a known wing colour pattern locus that differs between the two species. Heredity advance online publication, 7 December 2016; doi:10.1038/hdy.2016.107.

July 7, 2019

Brassica rapa genome 2.0: a reference upgrade through sequence re-assembly and gene re-annotation.

Brassica rapa includes many important crops that are cultivated as vegetables, condiments, and oilseeds. Recently, the Brassica genomes have been sequenced extensively: a B. rapa draft reference genome in 2011 (Wang et al., 2011), a Brassica oleracea in 2014 (Liu et al., 2014), a Brassica napus in 2014 (Chalhoub et al., 2014), and Brassica nigra and Brassica juncea in 2016 (Yang et al., 2016). The first released B. rapa genome reference served as a valuable resource in the genome assembly and annotation of the other Brassicas (Chalhoub et al., 2014, Liu et al., 2014, Parkin et al., 2014). B. rapa has been used widely in Brassica comparative and evolutionary genomics among the Brassicaceae (Cheng et al., 2013). However, the first B. rapa genome assembly (version 1.5) is only about 283.8 Mb, 58.52% of the estimated genome size (485 Mb) (Wang et al., 2011). Considering that much of the genome assembly is still missing (41.48%), there is a considerable possibility that important genes have been missed.

July 7, 2019

Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data.

The development of long-read sequencing technologies, such as single-molecule real-time (SMRT) sequencing by PacBio, has produced a revolution in the sequencing of small genomes. Sequencing organelle genomes using PacBio long-read data is a cost effective, straightforward approach. Nevertheless, the availability of simple-to-use software to perform the assembly from raw reads is limited at present.We present Organelle-PBA, a Perl program designed specifically for the assembly of chloroplast and mitochondrial genomes. For chloroplast genomes, the program selects the chloroplast reads from a whole genome sequencing pool, maps the reads to a reference sequence from a closely related species, and then performs read correction and de novo assembly using Sprai. Organelle-PBA completes the assembly process with the additional step of scaffolding by SSPACE-LongRead. The program then detects the chloroplast inverted repeats and reassembles and re-orients the assembly based on the organelle origin of the reference. We have evaluated the performance of the software using PacBio reads from different species, read coverage, and reference genomes. Finally, we present the assembly of two novel chloroplast genomes from the species Picea glauca (Pinaceae) and Sinningia speciosa (Gesneriaceae).Organelle-PBA is an easy-to-use Perl-based software pipeline that was written specifically to assemble mitochondrial and chloroplast genomes from whole genome PacBio reads. The program is available at https://github.com/aubombarely/Organelle_PBA .

July 7, 2019

Identification of small RNAs in extracellular vesicles from the commensal yeast Malassezia sympodialis.

Malassezia is the dominant fungus in the human skin mycobiome and is associated with common skin disorders including atopic eczema (AE)/dermatitis. Recently, it was found that Malassezia sympodialis secretes nanosized exosome-like vesicles, designated MalaEx, that carry allergens and can induce inflammatory cytokine responses. Extracellular vesicles from different cell-types including fungi have been found to deliver functional RNAs to recipient cells. In this study we assessed the presence of small RNAs in MalaEx and addressed if the levels of these RNAs differ when M. sympodialis is cultured at normal human skin pH versus the elevated pH present on the skin of patients with AE. The total number and the protein concentration of the released MalaEx harvested after 48?h culture did not differ significantly between the two pH conditions nor did the size of the vesicles. From small RNA sequence data, we identified a set of reads with well-defined start and stop positions, in a length range of 16 to 22 nucleotides consistently present in the MalaEx. The levels of small RNAs were not significantly differentially expressed between the two different pH conditions indicating that they are not influenced by the elevated pH level observed on the AE skin.

July 7, 2019

Evolutionary genomics of the cold-adapted diatom Fragilariopsis cylindrus.

The Southern Ocean houses a diverse and productive community of organisms. Unicellular eukaryotic diatoms are the main primary producers in this environment, where photosynthesis is limited by low concentrations of dissolved iron and large seasonal fluctuations in light, temperature and the extent of sea ice. How diatoms have adapted to this extreme environment is largely unknown. Here we present insights into the genome evolution of a cold-adapted diatom from the Southern Ocean, Fragilariopsis cylindrus, based on a comparison with temperate diatoms. We find that approximately 24.7 per cent of the diploid F. cylindrus genome consists of genetic loci with alleles that are highly divergent (15.1 megabases of the total genome size of 61.1 megabases). These divergent alleles were differentially expressed across environmental conditions, including darkness, low iron, freezing, elevated temperature and increased CO2. Alleles with the largest ratio of non-synonymous to synonymous nucleotide substitutions also show the most pronounced condition-dependent expression, suggesting a correlation between diversifying selection and allelic differentiation. Divergent alleles may be involved in adaptation to environmental fluctuations in the Southern Ocean.

July 7, 2019

De novo hybrid assembly of the rubber tree genome reveals evidence of paleotetraploidy in Hevea species.

Para rubber tree (Hevea brasiliensis) is an important economic species as it is the sole commercial producer of high-quality natural rubber. Here, we report a de novo hybrid assembly of BPM24 accession, which exhibits resistance to major fungal pathogens in Southeast Asia. Deep-coverage 454/Illumina short-read and Pacific Biosciences (PacBio) long-read sequence data were acquired to generate a preliminary draft, which was subsequently scaffolded using a long-range “Chicago” technique to obtain a final assembly of 1.26?Gb (N50?=?96.8?kb). The assembled genome contains 69.2% repetitive sequences and has a GC content of 34.31%. Using a high-density SNP-based genetic map, we were able to anchor 28.9% of the genome assembly (363?Mb) associated with over two thirds of the predicted protein-coding genes into rubber tree’s 18 linkage groups. These genetically anchored sequences allowed comparative analyses of the intragenomic homeologous synteny, providing the first concrete evidence to demonstrate the presence of paleotetraploidy in Hevea species. Additionally, the degree of macrosynteny conservation observed between rubber tree and cassava strongly supports the hypothesis that the paleotetraploidization event took place prior to the divergence of the Hevea and Manihot species.

July 7, 2019

Complex modular architecture around a simple toolkit of wing pattern genes

Identifying the genomic changes that control morphological variation and understanding how they generate diversity is a major goal of evolutionary biology. In Heliconius butterflies, a small number of genes control the development of diverse wing colour patterns. Here, we used full-genome sequencing of individuals across the Heliconius erato radiation and closely related species to characterize genomic variation associated with wing pattern diversity. We show that variation around colour pattern genes is highly modular, with narrow genomic intervals associated with specific differences in colour and pattern. This modular architecture explains the diversity of colour patterns and provides a flexible mechanism for rapid morphological diversification.

July 7, 2019

The genome sequence of Barbarea vulgaris facilitates the study of ecological biochemistry.

The genus Barbarea has emerged as a model for evolution and ecology of plant defense compounds, due to its unusual glucosinolate profile and production of saponins, unique to the Brassicaceae. One species, B. vulgaris, includes two ‘types’, G-type and P-type that differ in trichome density, and their glucosinolate and saponin profiles. A key difference is the stereochemistry of hydroxylation of their common phenethylglucosinolate backbone, leading to epimeric glucobarbarins. Here we report a draft genome sequence of the G-type, and re-sequencing of the P-type for comparison. This enables us to identify candidate genes underlying glucosinolate diversity, trichome density, and study the genetics of biochemical variation for glucosinolate and saponins. B. vulgaris is resistant to the diamondback moth, and may be exploited for “dead-end” trap cropping where glucosinolates stimulate oviposition and saponins deter larvae to the extent that they die. The B. vulgaris genome will promote the study of mechanisms in ecological biochemistry to benefit crop resistance breeding.

July 7, 2019

Fallacy of the unique genome: sequence diversity within single Helicobacter pylori strains.

Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB a-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.IMPORTANCE Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish “the genome” of a bacterial strain. Variability is usually reduced (“only sequence from a single colony”), ignored (“just publish the consensus”), or placed in the “too-hard” basket (“analysis of raw read data is more robust”). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen Helicobacter pylori model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as “the genome” of a bacterial strain may be misleading. Copyright © 2017 Draper et al.

July 7, 2019

Complete genome sequences of Legionella pneumophila subsp. fraseri strains Detroit-1 and Dallas 1E.

We report here the complete genome sequences of two of the earliest known strains of Legionella pneumophila subsp. fraseri Detroit-1 is serogroup 1 and was isolated from a lung biopsy specimen in 1977. Dallas 1E is serogroup 5 and was isolated in 1978 from a cooling tower. Copyright © 2017 Raphael et al.

July 7, 2019

Non hybrid long read consensus using local De Bruijn graph assembly

While second generation sequencing led to a vast increase in sequenced data, the shorter reads which came with it made assembly a much harder task and for some regions impossible with only short read data. This changed again with the advent of third generation long read sequencers. The length of the long reads allows a much better resolution of repetitive regions, their high error rate however is a major challenge. Using the data successfully requires to remove most of the sequencing errors. The first hybrid correction methods used low noise second generation data to correct third generation data, but this approach has issues when it is unclear where to place the short reads due to repeats and also because second generation sequencers fail to sequence some regions which third generation sequencers work on. Later non hybrid methods appeared. We present a new method for non hybrid long read error correction based on De Bruijn graph assembly of short windows of long reads with subsequent combination of these correct windows to corrected long reads. Our experiments show that this method yields a better correction than other state of the art non hybrid correction approaches.

Auto Tag: pacbio data

A gapless genome sequence of the fungus Botrytis cinerea.

Genomic analysis of 495 vancomycin-resistant Enterococcus faecium reveals broad dissemination of a vanA plasmid in more than 19 clones from Copenhagen, Denmark.

Spike gene deletion quasispecies in serum of patient with acute MERS-CoV infection.

Methods for genome-wide methylome profiling of Campylobacter jejuni.

The comparative landscape of duplications in Heliconius melpomene and Heliconius cydno.

Brassica rapa genome 2.0: a reference upgrade through sequence re-assembly and gene re-annotation.

Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data.

Identification of small RNAs in extracellular vesicles from the commensal yeast Malassezia sympodialis.

Evolutionary genomics of the cold-adapted diatom Fragilariopsis cylindrus.

De novo hybrid assembly of the rubber tree genome reveals evidence of paleotetraploidy in Hevea species.

Complex modular architecture around a simple toolkit of wing pattern genes

The genome sequence of Barbarea vulgaris facilitates the study of ecological biochemistry.

Fallacy of the unique genome: sequence diversity within single Helicobacter pylori strains.

Complete genome sequences of Legionella pneumophila subsp. fraseri strains Detroit-1 and Dallas 1E.

Non hybrid long read consensus using local De Bruijn graph assembly

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert