Pacbio reads Archives - Page 32 of 53

July 7, 2019

Sequencing and de novo assembly of a near complete indica rice genome.

A high-quality reference genome is critical for understanding genome structure, genetic variation and evolution of an organism. Here we report the de novo assembly of an indica rice genome Shuhui498 (R498) through the integration of single-molecule sequencing and mapping data, genetic map and fosmid sequence tags. The 390.3 Mb assembly is estimated to cover more than 99% of the R498 genome and is more continuous than the current reference genomes of japonica rice Nipponbare (MSU7) and Arabidopsis thaliana (TAIR10). We annotate high-quality protein-coding genes in R498 and identify genetic variations between R498 and Nipponbare and presence/absence variations by comparing them to 17 draft genomes in cultivated rice and its closest wild relatives. Our results demonstrate how to de novo assemble a highly contiguous and near-complete plant genome through an integrative strategy. The R498 genome will serve as a reference for the discovery of genes and structural variations in rice.

July 7, 2019

De novo genome and transcriptome assembly of the Canadian beaver (Castor canadensis).

The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon-gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology. Copyright © 2017 Lok et al.
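The contig and scaffold N50 values quoted above follow the usual definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. A minimal sketch of that calculation is shown here; the function name `n50` and the toy lengths are illustrative assumptions, not part of the published pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Illustrative helper (not from the paper): sort lengths from longest
    to shortest and return the length at which the running total first
    reaches half of the summed assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one sequence length is required")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example with made-up lengths (in bp), not the beaver assembly.
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_lengths))
```

Note that N50 depends only on the assembled sequence lengths. The 92% figure above is a separate measure that compares total scaffold length with the k-mer-based genome size estimate.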

July 7, 2019

Detection and assessment of copy number variation using PacBio long-read and Illumina sequencing in New Zealand dairy cattle.

Single nucleotide polymorphisms have been the DNA variant of choice for genomic prediction, largely because of the ease of single nucleotide polymorphism genotype collection. In contrast, structural variants (SV), which include copy number variants (CNV), translocations, insertions, and inversions, have eluded easy detection and characterization, particularly in nonhuman species. However, evidence increasingly shows that SV not only contribute a substantial proportion of genetic variation but also have significant influence on phenotypes. Here we present the discovery of CNV in a prominent New Zealand dairy bull using long-read PacBio (Pacific Biosciences, Menlo Park, CA) sequencing technology and the Sniffles SV discovery tool (version 0.0.1; https://github.com/fritzsedlazeck/Sniffles). The CNV identified from long reads were compared with CNV discovered in the same bull from Illumina sequencing using CNVnator (read depth-based tool; Illumina Inc., San Diego, CA) as a means of validation. Subsequently, further validation was undertaken using whole-genome Illumina sequencing of 556 cattle representing the wider New Zealand dairy cattle population. Very limited overlap was observed in CNV discovered from the 2 sequencing platforms, in part because of the differences in size of CNV detected. Only a few CNV were therefore able to be validated using this approach. However, the ability to use CNVnator to genotype the 557 cattle for copy number across all regions identified as putative CNV allowed a genome-wide assessment of transmission level of copy number based on pedigree. The more highly transmissible a putative CNV region was observed to be, the more likely the distribution of copy number was multimodal across the 557 sequenced animals. Furthermore, visual assessment of highly transmissible CNV regions provided evidence supporting the presence of CNV across the sequenced animals. 
This transmission-based approach was able to confirm a subset of CNV that segregates in the New Zealand dairy cattle population. Genome-wide identification and validation of CNV is an important step toward their inclusion in genomic selection strategies. The Authors. Published by the Federation of Animal Science Societies and Elsevier Inc. on behalf of the American Dairy Science Association®. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/).

July 7, 2019

The evolutionary life cycle of the polysaccharide biosynthetic gene cluster based on the Sphingomonadaceae.

Although clustering of genes from the same metabolic pathway is a widespread phenomenon, the evolution of the polysaccharide biosynthetic gene cluster remains poorly understood. To determine the evolution of this pathway, we identified a scattered production pathway of the polysaccharide sanxan by Sphingomonas sanxanigenens NX02, and compared the distribution of genes between sphingan-producing and other Sphingomonadaceae strains. This allowed us to determine how the scattered sanxan pathway developed, and how the polysaccharide gene cluster evolved. Our findings suggested that the evolution of microbial polysaccharide biosynthesis gene clusters is a lengthy cyclic process comprising cluster 1 → scatter → cluster 2. The sanxan biosynthetic pathway proved the existence of a dispersive process. We also report the complete genome sequence of NX02, in which we identified many unstable genetic elements and powerful secretion systems. Furthermore, nine enzymes for the formation of activated precursors, four glycosyltransferases, four acyltransferases, and four polymerization and export proteins were identified. These genes were scattered in the NX02 genome, and the positive regulator SpnA of sphingans synthesis could not regulate sanxan production. Finally, we concluded that the evolution of the sanxan pathway was independent. NX02 evolved naturally as a polysaccharide producing strain over a long-time evolution involving gene acquisitions and adaptive mutations.

July 7, 2019

Genome sequencing and population genomic analyses provide insights into the adaptive landscape of silver birch.

Silver birch (Betula pendula) is a pioneer boreal tree that can be induced to flower within 1 year. Its rapid life cycle, small (440-Mb) genome, and advanced germplasm resources make birch an attractive model for forest biotechnology. We assembled and chromosomally anchored the nuclear genome of an inbred B. pendula individual. Gene duplicates from the paleohexaploid event were enriched for transcriptional regulation, whereas tandem duplicates were overrepresented by environmental responses. Population resequencing of 80 individuals showed effective population size crashes at major points of climatic upheaval. Selective sweeps were enriched among polyploid duplicates encoding key developmental and physiological triggering functions, suggesting that local adaptation has tuned the timing of and cross-talk between fundamental plant processes. Variation around the tightly-linked light response genes PHYC and FRS10 correlated with latitude and longitude and temperature, and with precipitation for PHYC. Similar associations characterized the growth-promoting cytokinin response regulator ARR1, and the wood development genes KAK and MED5A.

July 7, 2019

High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development.

Using the latest sequencing and optical mapping technologies, we have produced a high-quality de novo assembly of the apple (Malus domestica Borkh.) genome. Repeat sequences, which represented over half of the assembly, provided an unprecedented opportunity to investigate the uncharacterized regions of a tree genome; we identified a new hyper-repetitive retrotransposon sequence that was over-represented in heterochromatic regions and estimated that a major burst of different transposable elements (TEs) occurred 21 million years ago. Notably, the timing of this TE burst coincided with the uplift of the Tian Shan mountains, which is thought to be the center of the location where the apple originated, suggesting that TEs and associated processes may have contributed to the diversification of the apple ancestor and possibly to its divergence from pear. Finally, genome-wide DNA methylation data suggest that epigenetic marks may contribute to agronomically relevant aspects, such as apple fruit development.

July 7, 2019

Complete genome sequences of 12 isolates of Listeria monocytogenes belonging to serotypes 1/2a, 1/2b, and 4b obtained from food products and food-processing environments in Canada.

Listeria monocytogenes is the etiological agent for an often fatal foodborne illness known as listeriosis. Here, we present the complete genome sequences of 12 L. monocytogenes isolates representing the three most common serotypes of this pathogen (1/2a, 1/2b, and 4b), collected in Canada from different food products and environmental sources. © Crown copyright 2017.

July 7, 2019

Draft genome sequence of Microbacterium foliorum strain 122 isolated from a plant growing in a chronically hydrocarbon-contaminated site.

Microbacterium foliorum strain 122 is a bacterial endophyte isolated from a Dactylis glomerata plant growing in a natural oil seep soil located in Oil Springs, Ontario, Canada. We present here a draft genome sequence of an endophytic strain that has promising potential in hydrocarbon degradation and plant growth promotion. Copyright © 2017 Lumactud et al.

July 7, 2019

Evolutionary origin of the staphylococcal cassette chromosome mec (SCCmec).

Several lines of evidence indicate that the most primitive staphylococcal species, those of the Staphylococcus sciuri group, were involved in the first stages of evolution of the staphylococcal cassette chromosome mec (SCCmec), the genetic element carrying the β-lactam resistance gene mecA. However, many steps are still missing from this evolutionary history. In particular, it is not known how mecA was incorporated into the mobile element SCC prior to dissemination among Staphylococcus aureus and other pathogenic staphylococcal species. To gain insights into the possible contribution of several species of the Staphylococcus sciuri group to the assembly of SCCmec, we sequenced the genomes of 106 isolates, comprising S. sciuri (n = 76), Staphylococcus vitulinus (n = 18), and Staphylococcus fleurettii (n = 12) from animal and human sources, and characterized the native location of mecA and the SCC insertion site by using a variety of comparative genomic approaches. Moreover, we performed a single nucleotide polymorphism (SNP) analysis of the genomes in order to understand SCCmec evolution in relation to phylogeny. We found that each of three species of the S. sciuri group contributed to the evolution of SCCmec: S. vitulinus and S. fleurettii contributed to the assembly of the mec complex, and S. sciuri most likely provided the mobile element in which mecA was later incorporated. We hypothesize that an ancestral SCCmec III cassette (an element carried by one of the most epidemic methicillin-resistant S. aureus clones) originated in S. sciuri possibly by a recombination event in a human host or a human-created environment and later was transferred to S. aureus. Copyright © 2017 American Society for Microbiology.

July 7, 2019

Draft genome sequence of Plantibacter flavus strain 251 isolated from a plant growing in a chronically hydrocarbon-contaminated site.

Plantibacter flavus isolate 251 is a bacterial endophyte isolated from an Achillea millefolium plant growing in a natural oil seep soil located in Oil Springs, Ontario, Canada. We present here a draft genome sequence of an infrequently reported genus Plantibacter, highlighting an endophytic lifestyle and biotechnological potential. Copyright © 2017 Lumactud et al.

July 7, 2019

Draft genome sequence of Acidihalobacter ferrooxidans DSM 14175 (strain V8), a new iron- and sulfur-oxidizing, halotolerant, acidophilic species.

The use of halotolerant acidophiles for bioleaching provides a biotechnical approach for the extraction of metals from regions where high salinity exists in the ores and source water. Here, we describe the first draft genome of a new species of a halotolerant and iron- and sulfur-oxidizing acidophile, Acidihalobacter ferrooxidans DSM 14175 (strain V8). Copyright © 2017 Khaleque et al.

July 7, 2019

Whole-genome sequence of Coxiella burnetii Nine Mile RSA439 (phase II, clone 4), a laboratory workhorse strain.

Here, we report the whole-genome sequence of Coxiella burnetii Nine Mile RSA439 (phase II, clone 4), a laboratory strain used extensively to investigate the biology of this intracellular bacterial pathogen. The genome consists of a 1.97-Mb chromosome and a 37.32-kb plasmid. Copyright © 2017 Millar et al.

July 7, 2019

Hybrid de novo genome assembly of the Chinese herbal fleabane Erigeron breviscapus.

The plants in the Erigeron genus of the Compositae (Asteraceae) family are commonly called fleabanes, possibly due to the belief that certain chemicals in these plants repel fleas. In traditional Chinese medicine, Erigeron breviscapus, which is native to China, was widely used in the treatment of cerebrovascular disease. A handful of bioactive compounds, including scutellarin, 3,5-dicaffeoylquinic acid, and 3,4-dicaffeoylquinic acid, have been isolated from the plant. With the purpose of finding novel medicinal compounds and understanding their biosynthetic pathways, we propose to sequence the genome of E. breviscapus. We assembled the highly heterozygous E. breviscapus genome using a combination of PacBio single-molecule real-time sequencing and next-generation sequencing methods on the Illumina HiSeq platform. The final draft genome is approximately 1.2 Gb, with contig and scaffold N50 sizes of 18.8 kb and 31.5 kb, respectively. Further analyses predicted 37 504 protein-coding genes in the E. breviscapus genome and 8172 shared gene families among Compositae species. The E. breviscapus genome provides a valuable resource for the investigation of novel bioactive compounds in this Chinese herb.

July 7, 2019

No evidence for maintenance of a sympatric Heliconius species barrier by chromosomal inversions.

Mechanisms that suppress recombination are known to help maintain species barriers by preventing the breakup of coadapted gene combinations. The sympatric butterfly species Heliconius melpomene and Heliconius cydno are separated by many strong barriers, but the species still hybridize infrequently in the wild, and around 40% of the genome is influenced by introgression. We tested the hypothesis that genetic barriers between the species are maintained by inversions or other mechanisms that reduce between-species recombination rate. We constructed fine-scale recombination maps for Panamanian populations of both species and their hybrids to directly measure recombination rate within and between species, and generated long sequence reads to detect inversions. We find no evidence for a systematic reduction in recombination rates in F1 hybrids, and also no evidence for inversions longer than 50 kb that might be involved in generating or maintaining species barriers. This suggests that mechanisms leading to global or local reduction in recombination do not play a significant role in the maintenance of species barriers between H. melpomene and H. cydno.

July 7, 2019

De novo yeast genome assemblies from MinION, PacBio and MiSeq platforms.

Long-read sequencing technologies such as Pacific Biosciences and Oxford Nanopore MinION are capable of producing long sequencing reads with average fragment lengths of over 10,000 base-pairs and maximum lengths reaching 100,000 base-pairs. Compared with short reads, the assemblies obtained from long-read sequencing platforms have much higher contig continuity and genome completeness as long fragments are able to extend paths into problematic or repetitive regions. Many successful assembly applications of the Pacific Biosciences technology have been reported ranging from small bacterial genomes to large plant and animal genomes. Recently, genome assemblies using Oxford Nanopore MinION data have attracted much attention due to the portability and low cost of this novel sequencing instrument. In this paper, we re-sequenced a well characterized genome, the Saccharomyces cerevisiae S288C strain using three different platforms: MinION, PacBio and MiSeq. We present a comprehensive metric comparison of assemblies generated by various pipelines and discuss how the platform associated data characteristics affect the assembly quality. With a given read depth of 31X, the assemblies from both Pacific Biosciences and Oxford Nanopore MinION show excellent continuity and completeness for the 16 nuclear chromosomes, but not for the mitochondrial genome, whose reconstruction still represents a significant challenge.
