Circular consensus sequencing Archives - Page 29 of 33

July 7, 2019

Spike gene deletion quasispecies in serum of patient with acute MERS-CoV infection.

The spike glycoprotein of the Middle East respiratory syndrome coronavirus (MERS-CoV) facilitates receptor binding and cell entry. During investigation of a multi-facility outbreak of MERS-CoV in Taif, Saudi Arabia, we identified a mixed population of wild-type and variant sequences with a large 530-nucleotide deletion in the spike gene from the serum of one patient. The out-of-frame deletion predicted loss of most of the S2 subunit of the spike protein, leaving the S1 subunit with an intact receptor binding domain. This finding documents human infection with a novel genetic variant of MERS-CoV present as a quasispecies. J. Med. Virol. 89:542-545, 2017. © 2016 Wiley Periodicals, Inc.

July 7, 2019

Genomic sequencing of a strain of Acinetobacter baumannii and potential mechanisms to antibiotics resistance.

Acinetobacter baumannii has become a great challenge to clinicians because of its resistance to almost all available antibiotics. In this study, we used SMRT sequencing technology to sequence the genome of a multiple-antibiotic-resistant Acinetobacter baumannii strain, named A. baumannii-1 and isolated in China, to explore its potential mechanisms of antibiotic resistance. We found that several mechanisms might contribute to the antibiotic resistance of Acinetobacter baumannii. Specifically, SNPs in genes associated with nucleotide excision repair and ABC transporters might contribute to its resistance to multiple antibiotics, as might specific genes associated with bacterial DNA integration and recombination, DNA-mediated transposition and response to antibiotics. Furthermore, specific genes associated with the penicillin and cephalosporin biosynthetic pathway and with CHDL and MBL β-lactamases might also contribute. Thus, the detailed mechanisms by which Acinetobacter baumannii shows extensive resistance to multiple antibiotics are very complicated. Such a study might help in developing new strategies to control Acinetobacter baumannii infection. Copyright © 2017 Elsevier B.V. All rights reserved.

July 7, 2019

Genetic adaptation of porcine circovirus type 1 to cultured porcine kidney cells revealed by single-molecule long-read sequencing technology.

Porcine circovirus type 1 (PCV1) is a nonpathogenic circovirus and a contaminant of the porcine kidney (PK-15) cell line. We present the complete and annotated genome sequence of strain Szeged of PCV1, determined using the Pacific Biosciences RSII long-read sequencing platform. Copyright © 2017 Tombácz et al.

July 7, 2019

Innovations and challenges in detecting long read overlaps: an evaluation of the state-of-the-art.

Identifying overlaps between error-prone long reads, specifically those from Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PB), is essential for certain downstream applications, including error correction and de novo assembly. Though akin to the read-to-reference alignment problem, read-to-read overlap detection is a distinct problem that can benefit from specialized algorithms that perform efficiently and robustly on high error rate long reads. Here, we review the current state-of-the-art read-to-read overlap tools for error-prone long reads, including BLASR, DALIGNER, MHAP, GraphMap and Minimap. These specialized bioinformatics tools differ not just in their algorithmic designs and methodology, but also in their robustness of performance on a variety of datasets, time and memory efficiency and scalability. We highlight the algorithmic features of these tools, as well as their potential issues and biases when utilizing any particular method. To supplement our review of the algorithms, we benchmarked these tools, tracking their resource needs and computational performance, and assessed the specificity and precision of each. In the versions of the tools tested, we observed that Minimap is the most computationally efficient, specific and sensitive method on the ONT datasets tested; whereas GraphMap and DALIGNER are the most specific and sensitive methods on the tested PB datasets. The concepts surveyed may apply to future sequencing technologies, as scalability is becoming more relevant with increased sequencing throughput. Contact: cjustin@bcgsc.ca, ibirol@bcgsc.ca. Supplementary data are available at Bioinformatics online.
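To make the read-to-read overlap problem concrete, here is a toy sketch of seed-based candidate detection. It is not the algorithm of any tool named above; it only illustrates the general idea of indexing reads by a sparse set of k-mers (window minimizers) and reporting read pairs that share enough of them. The function names and the parameters `k`, `w` and `min_shared` are assumptions chosen for illustration.

```python
from collections import defaultdict
from itertools import combinations


def minimizers(seq, k=5, w=4):
    """Return the set of window minimizers of seq: the lexicographically
    smallest k-mer in every window of w consecutive k-mers."""
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    if not kmers:
        return set()
    if len(kmers) < w:
        # Sequence too short for a full window: use its single smallest k-mer.
        return {min(kmers)}
    return {min(kmers[i:i + w]) for i in range(len(kmers) - w + 1)}


def candidate_overlaps(reads, k=5, w=4, min_shared=2):
    """Map each read pair to its number of shared minimizers, keeping only
    pairs that share at least min_shared. `reads` maps read name -> sequence."""
    index = defaultdict(set)  # minimizer -> names of reads containing it
    for name, seq in reads.items():
        for m in minimizers(seq, k, w):
            index[m].add(name)

    shared = defaultdict(int)
    for names in index.values():
        for a, b in combinations(sorted(names), 2):
            shared[(a, b)] += 1
    return {pair: n for pair, n in shared.items() if n >= min_shared}


if __name__ == "__main__":
    s = "ACGTTGCATGGACCTAACGT"
    reads = {"a": s, "b": s[8:] + "CCCCCCCC", "c": "G" * 20}
    print(candidate_overlaps(reads, min_shared=1))
```

Real overlappers add steps this sketch omits, such as tolerating sequencing errors in seeds, checking that shared seeds are collinear, and estimating overlap coordinates.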

July 7, 2019

ThermoAlign: a genome-aware primer design tool for tiled amplicon resequencing.

Isolating and sequencing specific regions in a genome is a cornerstone of molecular biology. This has been facilitated by computationally encoding the thermodynamics of DNA hybridization for automated design of hybridization and priming oligonucleotides. However, the repetitive composition of genomes challenges the identification of target-specific oligonucleotides, which limits genetics and genomics research on many species. Here, a tool called ThermoAlign was developed that ensures the design of target-specific primer pairs for DNA amplification. This is achieved by evaluating the thermodynamics of hybridization for full-length oligonucleotide-template alignments – thermoalignments – across the genome to identify primers predicted to bind specifically to the target site. For amplification-based resequencing of regions that cannot be amplified by a single primer pair, a directed graph analysis method is used to identify minimum amplicon tiling paths. Laboratory validation by standard and long-range polymerase chain reaction and amplicon resequencing with maize, one of the most repetitive genomes sequenced to date (~85% repeat content), demonstrated the specificity-by-design functionality of ThermoAlign. ThermoAlign is released under an open source license and bundled in a dependency-free container for wide distribution. It is anticipated that this tool will facilitate multiple applications in genetics and genomics and be useful in the workflow of high-throughput targeted resequencing studies.
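The idea of a minimum amplicon tiling path can be illustrated with a small directed-graph search. The sketch below is an assumption-laden toy, not ThermoAlign's implementation: amplicons are plain `(start, end)` intervals, an edge links an amplicon to any amplicon that overlaps it and extends further right, and a breadth-first search returns a path that covers the target with the fewest amplicons. The function name `min_tiling_path` is hypothetical.

```python
from collections import deque


def min_tiling_path(amplicons, target_start, target_end):
    """Return the fewest overlapping amplicons covering
    [target_start, target_end], or None if no tiling exists.

    amplicons: list of (start, end) tuples with inclusive coordinates.
    """
    # Nodes that can begin a path: amplicons spanning the target start.
    starts = [i for i, (s, e) in enumerate(amplicons)
              if s <= target_start <= e]
    queue = deque((i, [i]) for i in starts)
    seen = set(starts)

    while queue:
        i, path = queue.popleft()
        end_i = amplicons[i][1]
        if end_i >= target_end:
            # Breadth-first order guarantees this path has the fewest nodes.
            return [amplicons[p] for p in path]
        for j, (s, e) in enumerate(amplicons):
            # Directed edge i -> j: j overlaps i and reaches further right.
            if j not in seen and s <= end_i and e > end_i:
                seen.add(j)
                queue.append((j, path + [j]))
    return None


if __name__ == "__main__":
    amps = [(0, 100), (50, 200), (90, 300), (180, 400), (290, 500)]
    print(min_tiling_path(amps, 0, 500))
```

A practical design would also weight edges, for example by primer quality or required overlap length, which turns the search into a weighted shortest-path problem.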

July 7, 2019

Combination of short-read, long-read and optical mapping assemblies reveals large-scale tandem repeat arrays with population genetic implications.

Accurate and contiguous genome assembly is key to a comprehensive understanding of the processes shaping genomic diversity and evolution. Yet, it is frequently constrained by constitutive heterochromatin, usually characterized by highly repetitive DNA. As a key feature of genome architecture associated with centromeric and telomeric regions it influences meiotic recombination. In this study, we assess the impact of large tandem repeat arrays on the recombination rate landscape in an avian speciation model, the Eurasian crow. We assembled two high-quality genome references using single-molecule real-time sequencing (long-read assembly, LR) and single-molecule restriction maps (optical map assembly, OM). A three-way comparison including the published short-read assembly (SR) constructed for the same individual allowed assessing assembly properties and pinpointing mis-assemblies. Combining information from all three assemblies, we characterized 36 previously unidentified large repetitive regions in the proximity of sequence assembly breakpoints, the majority of which contained complex arrays of a 14-kb satellite repeat or its 1.2-kb subunit. Using genome-wide population re-sequencing data, we estimated the population-scaled recombination rate (ρ) and found it to be significantly reduced in these regions. These findings are consistent with an effect of low recombination in regions adjacent to centromeric or subtelomeric heterochromatin, and add to our understanding of the processes generating widespread heterogeneity in genetic diversity and differentiation along the genome. By combining three independent technologies, our results highlight the importance of adding a layer of information on genome structure inaccessible to each approach independently. Published by Cold Spring Harbor Laboratory Press.

July 7, 2019

Isolation and genomic characterization of a Dehalococcoides strain suggests genomic rearrangement during culture.

We have developed and characterized a bacterial consortium that reductively dechlorinates trichloroethene to ethene. Quantitative PCR analysis for the 16S rRNA and reductive dehalogenase genes showed that the consortium is highly enriched with Dehalococcoides spp. that have two vinyl chloride reductive dehalogenase genes, bvcA and vcrA, and a trichloroethene reductive dehalogenase gene, tceA. The metagenome analysis of the consortium by the next generation sequencer SOLiD 3 Plus suggests that a Dehalococcoides sp. that is highly homologous to D. mccartyi 195 and equipped with vcrA and tceA exists in the consortium. We isolated this Dehalococcoides sp. and designated it as D. mccartyi UCH-ATV1. As the growth of D. mccartyi UCH-ATV1 is too slow under isolated conditions, we constructed a consortium by mixing D. mccartyi UCH-ATV1 with several other bacteria and performed metagenomic sequencing using the single molecule DNA sequencer PacBio RS II. We successfully determined the complete genome sequence of D. mccartyi UCH-ATV1. The strain is equipped with vcrA and tceA, but lacks bvcA. Comparison with tag sequences of SOLiD 3 Plus from the original consortium shows a few differences between the sequences. This suggests that a genome rearrangement of Dehalococcoides sp. occurred during culture.

July 7, 2019

MHC class I diversity in chimpanzees and bonobos.

Major histocompatibility complex (MHC) class I genes are critically involved in the defense against intracellular pathogens. MHC diversity comparisons among samples of closely related taxa may reveal traces of past or ongoing selective processes. The bonobo and chimpanzee are the closest living evolutionary relatives of humans and last shared a common ancestor some 1 mya. However, little is known concerning MHC class I diversity in bonobos or in central chimpanzees, the most numerous and genetically diverse chimpanzee subspecies. Here, we used a long-read sequencing technology (PacBio) to sequence the classical MHC class I genes A, B, C, and A-like in 20 and 30 wild-born bonobos and chimpanzees, respectively, with a main focus on central chimpanzees to assess and compare diversity in those two species. We describe in total 21 and 42 novel coding region sequences for the two species, respectively. In addition, we found evidence for a reduced MHC class I diversity in bonobos as compared to central chimpanzees as well as to western chimpanzees and humans. The reduced bonobo MHC class I diversity may be the result of a selective process in their evolutionary past since their split from chimpanzees.

July 7, 2019

Designing robust watermark barcodes for multiplex long-read sequencing.

To attain acceptable sample misassignment rates, current approaches to multiplex single-molecule real-time sequencing require upstream quality improvement, which is obtained from multiple passes over the sequenced insert and significantly reduces the effective read length. In order to fully exploit the raw read length on multiplex applications, robust barcodes capable of dealing with the full single-pass error rates are needed. We present a method for designing sequencing barcodes that can withstand a large number of insertion, deletion and substitution errors and are suitable for use in multiplex single-molecule real-time sequencing. The manuscript focuses on the design of barcodes for full-length single-pass reads, impaired by challenging error rates in the order of 11%. The proposed barcodes can multiplex hundreds or thousands of samples while achieving sample misassignment probabilities as low as 10⁻⁷ under the above conditions, and are designed to be compatible with chemical constraints imposed by the sequencing process. Software tools for constructing watermark barcode sets and demultiplexing barcoded reads, together with example sets of barcodes and synthetic barcoded reads, are freely available at www.cifasis-conicet.gov.ar/ezpeleta/NS-watermark. Contact: ezpeleta@cifasis-conicet.gov.ar.
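As a rough illustration of why barcodes must tolerate insertions, deletions and substitutions, the sketch below assigns a read to the nearest barcode by Levenshtein (edit) distance. This is a deliberately simple stand-in, not the watermark decoding scheme described in the abstract; the barcode sequences, the `max_dist` threshold and the function names are all hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum number of insertions, deletions and
    substitutions needed to turn string a into string b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion from a
                           cur[j - 1] + 1,              # insertion into a
                           prev[j - 1] + (ca != cb)))   # substitution or match
        prev = cur
    return prev[-1]


def assign_sample(read_prefix, barcodes, max_dist=3):
    """Return the sample whose barcode is uniquely closest to read_prefix,
    or None when the best match is too distant or tied.

    barcodes: dict mapping sample name -> barcode sequence.
    """
    scored = sorted((edit_distance(read_prefix, bc), name)
                    for name, bc in barcodes.items())
    best_dist, best_name = scored[0]
    if best_dist > max_dist:
        return None  # nothing close enough: leave the read unassigned
    if len(scored) > 1 and scored[1][0] == best_dist:
        return None  # ambiguous: two barcodes are equally close
    return best_name


if __name__ == "__main__":
    barcodes = {"s1": "ACGTACGTAC", "s2": "TTGGCCAATT"}
    print(assign_sample("ACGTACTAC", barcodes))   # one base deleted from s1
    print(assign_sample("GGGGGGGGGG", barcodes))  # far from both barcodes
```

Refusing to assign tied or distant reads trades yield for a lower misassignment rate, which is the quantity the abstract reports.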

July 7, 2019

Heterogeneous resistance to quizartinib in acute myeloid leukemia revealed by single-cell analysis.

Genomic studies have revealed significant branching heterogeneity in cancer. Studies of resistance to tyrosine kinase inhibitor therapy have not fully reflected this heterogeneity because resistance in individual patients has been ascribed to largely mutually exclusive on-target or off-target mechanisms in which tumors either retain dependency on the target oncogene or subvert it through a parallel pathway. Using targeted sequencing from single cells and colonies from patient samples, we demonstrate tremendous clonal diversity in the majority of acute myeloid leukemia (AML) patients with activating FLT3 internal tandem duplication mutations at the time of acquired resistance to the FLT3 inhibitor quizartinib. These findings establish that clinical resistance to quizartinib is highly complex and reflects the underlying clonal heterogeneity of AML. © 2017 by The American Society of Hematology.

July 7, 2019

Untangling heteroplasmy, structure, and evolution of an atypical mitochondrial genome by PacBio Sequencing.

The highly compact mitochondrial (mt) genome of terrestrial isopods (Oniscidae) presents two unusual features. First, several loci can individually encode two tRNAs, thanks to single nucleotide polymorphisms at anticodon sites. Within-individual variation (heteroplasmy) at these loci is thought to have been maintained for millions of years because individuals that do not carry all tRNA genes die, resulting in strong balancing selection. Second, the oniscid mtDNA genome comes in two conformations: a ~14 kb linear monomer and a ~28 kb circular dimer comprising two monomer units fused in palindrome. We hypothesized that heteroplasmy actually results from two genome units of the same dimeric molecule carrying different tRNA genes at mirrored loci. This hypothesis, however, contradicts the earlier proposition that dimeric molecules result from the replication of linear monomers, a process that should yield totally identical genome units within a dimer. To resolve this contradiction, we used the SMRT (PacBio) technology to sequence mirrored tRNA loci in single dimeric molecules. We show that dimers do present different tRNA genes at mirrored loci; thus covalent linkage, rather than balancing selection, maintains vital variation at anticodons. We also leveraged unique features of the SMRT technology to detect linear monomers closed by hairpins and carrying noncomplementary bases at anticodons. These molecules contain the necessary information to encode two tRNAs at the same locus, and suggest new mechanisms of transition between linear and circular mtDNA. Overall, our analyses clarify the evolution of an atypical mt genome where dimerization counterintuitively enabled further mtDNA compaction. Copyright © 2017 by the Genetics Society of America.

July 7, 2019

ConcatSeq: A method for increasing throughput of single molecule sequencing by concatenating short DNA fragments.

Single molecule sequencing (SMS) platforms enable base sequences to be read directly from individual strands of DNA in real-time. Though capable of long read lengths, SMS platforms currently suffer from low throughput compared to competing short-read sequencing technologies. Here, we present a novel strategy for sequencing library preparation, dubbed ConcatSeq, which increases the throughput of SMS platforms by generating long concatenated templates from pools of short DNA molecules. We demonstrate adaptation of this technique to two target enrichment workflows, commonly used for oncology applications, and feasibility using PacBio single molecule real-time (SMRT) technology. Our approach is capable of increasing the sequencing throughput of the PacBio RSII platform by more than five-fold, while maintaining the ability to correctly call allele frequencies of known single nucleotide variants. ConcatSeq provides a versatile new sample preparation tool for long-read sequencing technologies.

July 7, 2019

BusyBee Web: metagenomic data analysis by bootstrapped supervised binning and annotation.

Metagenomics-based studies of mixed microbial communities are impacting biotechnology, life sciences and medicine. Computational binning of metagenomic data is a powerful approach for the culture-independent recovery of population-resolved genomic sequences, i.e. from individual or closely related, constituent microorganisms. Existing binning solutions often require a priori characterized reference genomes and/or dedicated compute resources. Extending currently available reference-independent binning tools, we developed the BusyBee Web server for the automated deconvolution of metagenomic data into population-level genomic bins using assembled contigs (Illumina) or long reads (Pacific Biosciences, Oxford Nanopore Technologies). A reversible compression step as well as bootstrapped supervised binning enable quick turnaround times. The binning results are represented in interactive 2D scatterplots. Moreover, bin quality estimates, taxonomic annotations and annotations of antibiotic resistance genes are computed and visualized. Ground truth-based benchmarks of BusyBee Web demonstrate comparably high performance to state-of-the-art binning solutions for assembled contigs and markedly improved performance for long reads (median F1 scores: 70.02-95.21%). Furthermore, the applicability to real-world metagenomic datasets is shown. In conclusion, our reference-independent approach automatically bins assembled contigs or long reads, exhibits high sensitivity and precision, enables intuitive inspection of the results, and only requires FASTA-formatted input. The web-based application is freely accessible at: https://ccb-microbe.cs.uni-saarland.de/busybee. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

July 7, 2019

Characterization of miR-122-independent propagation of HCV.

miR-122, a liver-specific microRNA, is one of the determinants for liver tropism of hepatitis C virus (HCV) infection. Although miR-122 is required for efficient propagation of HCV, we have previously shown that HCV replicates at a low rate in miR-122-deficient cells, suggesting that HCV-RNA is capable of propagating in an miR-122-independent manner. We herein investigated the roles of miR-122 in both the replication of HCV-RNA and the production of infectious particles by using miR-122-knockout Huh7 (Huh7-122KO) cells. A slight increase of intracellular HCV-RNA levels and infectious titers in the culture supernatants was observed in Huh7-122KO cells upon infection with HCV. Moreover, after serial passages of HCV in miR-122-knockout Huh7.5.1 cells, we obtained an adaptive mutant, HCV122KO, possessing G28A substitution in the 5'UTR of the HCV genotype 2a JFH1 genome, and this mutant may help to enhance replication complex formation, a possibility supported by polysome analysis. We also found the introduction of adaptive mutation around miR-122 binding site in the genotype 1b/2a chimeric virus, which originally had an adenine at the nucleotide position 29. HCV122KO exhibited efficient RNA replication in miR-122-knockout cells and non-hepatic cells without exogenous expression of miR-122. Competition assay revealed that the G28A mutant was dominant in the absence of miR-122, but its effects were equivalent to those of the wild type in the presence of miR-122, suggesting that the G28A mutation does not confer an advantage for propagation in miR-122-rich hepatocytes. These observations may explain the clinical finding that the positive rate of G28A mutation was higher in miR-122-deficient PBMCs than in the patient serum, which mainly included the hepatocyte-derived virus from HCV-genotype-2a patients. These results suggest that the emergence of HCV mutants that can propagate in non-hepatic cells in an miR-122-independent manner may participate in the induction of extrahepatic manifestations in chronic hepatitis C patients.

July 7, 2019

PipeCraft: Flexible open-source toolkit for bioinformatics analysis of custom high-throughput amplicon sequencing data.

High-throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in the public and private sectors. These methods provide vast amounts of data, which need to be analysed in several steps. Although the bioinformatics may be applied using several public tools, many analytical pipelines allow too few options for the optimal analysis of more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user-friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24 hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable. © 2017 John Wiley & Sons Ltd.

