Menu
September 22, 2019

Identification of DNA base modifications by means of Pacific Biosciences RS Sequencing technology.

Whole phage genomes can be sequenced readily using one or a combination of next generation sequencing (NGS) technologies. One of the most recently developed NGS platforms, the so-called Single-Molecule Real-Time (SMRT) sequencing approach provided by the PacBio RS platform, is particularly useful in providing complete (i.e., un-gapped) genome sequences, but differs from other technologies in that the platform also allows for downstream analysis to identify nucleotides that have been modified by DNA methylation. Here, we describe the methodological approach for the detection of genomic methylation motifs by means of SMRT sequencing.


September 22, 2019

Three New Genome Assemblies Support a Rapid Radiation in Musa acuminata (Wild Banana).

Edible bananas result from interspecific hybridization between Musa acuminata and Musa balbisiana, as well as among subspecies in M. acuminata. Four particular M. acuminata subspecies have been proposed as the main contributors of edible bananas, all of which radiated in a short period of time in southeastern Asia. Clarifying the evolution of these lineages at a whole-genome scale is therefore an important step toward understanding the domestication and diversification of this crop. This study reports the de novo genome assembly and gene annotation of a representative genotype from three different subspecies of M. acuminata. These data are combined with the previously published genome of the fourth subspecies to investigate phylogenetic relationships. Analyses of shared and unique gene families reveal that the four subspecies are quite homogenous, with a core genome representing at least 50% of all genes and very few M. acuminata species-specific gene families. Multiple alignments indicate high sequence identity between homologous single copy-genes, supporting the close relationships of these lineages. Interestingly, phylogenomic analyses demonstrate high levels of gene tree discordance, due to both incomplete lineage sorting and introgression. This pattern suggests rapid radiation within Musa acuminata subspecies that occurred after the divergence with M. balbisiana. Introgression between M. a. ssp. malaccensis and M. a. ssp. burmannica was detected across the genome, though multiple approaches to resolve the subspecies tree converged on the same topology. To support evolutionary and functional analyses, we introduce the PanMusa database, which enables researchers to exploration of individual gene families and trees.


September 22, 2019

Extensive and deep sequencing of the Venter/HuRef genome for developing and benchmarking genome analysis tools.

We produced an extensive collection of deep re-sequencing datasets for the Venter/HuRef genome using the Illumina massively-parallel DNA sequencing platform. The original Venter genome sequence is a very-high quality phased assembly based on Sanger sequencing. Therefore, researchers developing novel computational tools for the analysis of human genome sequence variation for the dominant Illumina sequencing technology can test and hone their algorithms by making variant calls from these Venter/HuRef datasets and then immediately confirm the detected variants in the Sanger assembly, freeing them of the need for further experimental validation. This process also applies to implementing and benchmarking existing genome analysis pipelines. We prepared and sequenced 200?bp and 350?bp short-insert whole-genome sequencing libraries (sequenced to 100x and 40x genomic coverages respectively) as well as 2?kb, 5?kb, and 12?kb mate-pair libraries (49x, 122x, and 145x physical coverages respectively). Lastly, we produced a linked-read library (128x physical coverage) from which we also performed haplotype phasing.


September 22, 2019

Emergence of pathogenic and multiple-antibiotic-resistant Macrococcus caseolyticus in commercial broiler chickens.

Macrococcus caseolyticus is generally considered to be a non-pathogenic bacterium that does not cause human or animal diseases. However, recently, a strain of M. caseolyticus (SDLY strain) that causes high mortality rates was isolated from commercial broiler chickens in China. The main pathological changes caused by SDLY included caseous exudation in cranial cavities, inflammatory infiltration, haemorrhages and multifocal necrosis in various organs. The whole genome of the SDLY strain was sequenced and was compared with that of the non-pathogenic JCSC5402 strain of M. caseolyticus. The results showed that the SDLY strain harboured a large quantity of mutations, antibiotic resistance genes and numerous insertions and deletions of virulence genes. In particular, among the inserted genes, there is a cluster of eight connected genes associated with the synthesis of capsular polysaccharide. This cluster encodes a transferase and capsular polysaccharide synthase, promotes the formation of capsules and causes changes in pathogenicity. Electron microscopy revealed a distinct capsule surrounding the SDLY strain. The pathogenicity test showed that the SDLY strain could cause significant clinical symptoms and pathological changes in both SPF chickens and mice. In addition, these clinical symptoms and pathological changes were the same as those observed in field cases. Furthermore, the anti-microbial susceptibility test demonstrated that the SDLY strain exhibits multiple-antibiotic resistance. The emergence of pathogenic M. caseolyticus indicates that more attention should be paid to the effects of this micro-organism on both poultry and public health.© 2018 Blackwell Verlag GmbH.


September 22, 2019

Hybrid correction of highly noisy long reads using a variable-order de Bruijn graph.

The recent rise of long read sequencing technologies such as Pacific Biosciences and Oxford Nanopore allows to solve assembly problems for larger and more complex genomes than what allowed short reads technologies. However, these long reads are very noisy, reaching an error rate of around 10-15% for Pacific Biosciences, and up to 30% for Oxford Nanopore. The error correction problem has been tackled by either self-correcting the long reads, or using complementary short reads in a hybrid approach. However, even though sequencing technologies promise to lower the error rate of the long reads below 10%, it is still higher in practice, and correcting such noisy long reads remains an issue.We present HG-CoLoR, a hybrid error correction method that focuses on a seed-and-extend approach based on the alignment of the short reads to the long reads, followed by the traversal of a variable-order de Bruijn graph, built from the short reads. Our experiments show that HG-CoLoR manages to efficiently correct highly noisy long reads that display an error rate as high as 44%. When compared to other state-of-the-art long read error correction methods, our experiments also show that HG-CoLoR provides the best trade-off between runtime and quality of the results, and is the only method able to efficiently scale to eukaryotic genomes.HG-CoLoR is implemented is C++, supported on Linux platforms and freely available at https://github.com/morispi/HG-CoLoR.Supplementary data are available at Bioinformatics online.


September 22, 2019

Enterobacter cloacae Complex Sequence Type 171 Isolates Expressing KPC-4 Carbapenemase Recovered from Canine Patients in Ohio.

Companion animals are likely relevant in the transmission of antimicrobial-resistant bacteria. Enterobacter xiangfangensis sequence type 171 (ST171), a clone that has been implicated in clusters of infections in humans, was isolated from two dogs with clinical disease in Ohio. The canine isolates contained IncHI2 plasmids encoding blaKPC-4 Whole-genome sequencing was used to put the canine isolates in phylogenetic context with available human ST171 sequences, as well as to characterize their blaKPC-4 plasmids. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Impact of index hopping and bias towards the reference allele on accuracy of genotype calls from low-coverage sequencing.

Inherent sources of error and bias that affect the quality of sequence data include index hopping and bias towards the reference allele. The impact of these artefacts is likely greater for low-coverage data than for high-coverage data because low-coverage data has scant information and many standard tools for processing sequence data were designed for high-coverage data. With the proliferation of cost-effective low-coverage sequencing, there is a need to understand the impact of these errors and bias on resulting genotype calls from low-coverage sequencing.We used a dataset of 26 pigs sequenced both at 2× with multiplexing and at 30× without multiplexing to show that index hopping and bias towards the reference allele due to alignment had little impact on genotype calls. However, pruning of alternative haplotypes supported by a number of reads below a predefined threshold, which is a default and desired step of some variant callers for removing potential sequencing errors in high-coverage data, introduced an unexpected bias towards the reference allele when applied to low-coverage sequence data. This bias reduced best-guess genotype concordance of low-coverage sequence data by 19.0 absolute percentage points.We propose a simple pipeline to correct the preferential bias towards the reference allele that can occur during variant discovery and we recommend that users of low-coverage sequence data be wary of unexpected biases that may be produced by bioinformatic tools that were designed for high-coverage sequence data.


September 22, 2019

DNA Methylation by Restriction Modification Systems Affects the Global Transcriptome Profile in Borrelia burgdorferi.

Prokaryote restriction modification (RM) systems serve to protect bacteria from potentially detrimental foreign DNA. Recent evidence suggests that DNA methylation by the methyltransferase (MTase) components of RM systems can also have effects on transcriptome profiles. The type strain of the causative agent of Lyme disease, Borrelia burgdorferi B31, possesses two RM systems with N6-methyladenosine (m6A) MTase activity, which are encoded by the bbe02 gene located on linear plasmid lp25 and bbq67 on lp56. The specific recognition and/or methylation sequences had not been identified for either of these B. burgdorferi MTases, and it was not previously known whether these RM systems influence transcript levels. In the current study, single-molecule real-time sequencing was utilized to map genome-wide m6A sites and to identify consensus modified motifs in wild-type B. burgdorferi as well as MTase mutants lacking either the bbe02 gene alone or both bbe02 and bbq67 genes. Four novel conserved m6A motifs were identified and were fully attributable to the presence of specific MTases. Whole-genome transcriptome changes were observed in conjunction with the loss of MTase enzymes, indicating that DNA methylation by the RM systems has effects on gene expression. Genes with altered transcription in MTase mutants include those involved in vertebrate host colonization (e.g., rpoS regulon) and acquisition by/transmission from the tick vector (e.g., rrp1 and pdeB). The results of this study provide a comprehensive view of the DNA methylation pattern in B. burgdorferi, and the accompanying gene expression profiles add to the emerging body of research on RM systems and gene regulation in bacteria.IMPORTANCE Lyme disease is the most prevalent vector-borne disease in North America and is classified by the Centers for Disease Control and Prevention (CDC) as an emerging infectious disease with an expanding geographical area of occurrence. Previous studies have shown that the causative bacterium, Borrelia burgdorferi, methylates its genome using restriction modification systems that enable the distinction from foreign DNA. Although much research has focused on the regulation of gene expression in B. burgdorferi, the effect of DNA methylation on gene regulation has not been evaluated. The current study characterizes the patterns of DNA methylation by restriction modification systems in B. burgdorferi and evaluates the resulting effects on gene regulation in this important pathogen. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Meiotic drive of female-inherited supernumerary chromosomes in a pathogenic fungus.

Meiosis is a key cellular process of sexual reproduction that includes pairing of homologous sequences. In many species however, meiosis can also involve the segregation of supernumerary chromosomes, which can lack a homolog. How these unpaired chromosomes undergo meiosis is largely unknown. In this study we investigated chromosome segregation during meiosis in the haploid fungus Zymoseptoria tritici that possesses a large complement of supernumerary chromosomes. We used isogenic whole chromosome deletion strains to compare meiotic transmission of chromosomes when paired and unpaired. Unpaired chromosomes inherited from the male parent as well as paired supernumerary chromosomes in general showed Mendelian inheritance. In contrast, unpaired chromosomes inherited from the female parent showed non-Mendelian inheritance but were amplified and transmitted to all meiotic products. We concluded that the supernumerary chromosomes of Z. tritici show a meiotic drive and propose an additional feedback mechanism during meiosis, which initiates amplification of unpaired female-inherited chromosomes.© 2018, Habig et al.


September 22, 2019

The Genome of Opium Poppy Reveals Evolutionary History of Morphinan Pathway.

Plants, as primary producers, have been playing an indispensable role in other organisms’ survival and the balance of whole ecosystem on Earth. Especially, they provide the main source of energy, food, and medicine for human beings, some of which are derived from the primary or secondary metabolites [1]. Angiosperms, with more than 300,000 species on Earth, are the largest group of land plants by far. Most agricultural crops, fruits, ornamental plants, and medicinal herbs belong to this group. The medicinal herbs are usually rich in specialized metabolites that could provide safe and valuable resources for pharmaceutical development.


September 22, 2019

Emergence of an extensively drug-resistant (XDR) Streptococcus pneumoniae serotype 15A by capsular switching.

Recently, we have identified an extensively drug-resistant (XDR) Streptococcus pneumoniae serotype 15A isolate from a patient with bacterial meningitis. It belonged to sequence type 8279 (ST8279), a clone identified as XDR serotype 11A isolated in South Korea. We obtained and compared the genome sequences of an XDR 15A and an XDR 11A isolate. The genomes of two XDR isolates were highly identical, except for the capsular polysaccharide (cps) locus and another small region. Capsular switching from 11A to 15A may have occurred via recombination of the cps locus. The emergence of a new XDR clone via capsular switching would be a great concern for public health and in clinical settings. Copyright © 2018 Elsevier GmbH. All rights reserved.


September 22, 2019

Achieving Accurate Sequence and Annotation Data for Caulobacter vibrioides CB13.

Annotated sequence data are instrumental in nearly all realms of biology. However, the advent of next-generation sequencing has rapidly facilitated an imbalance between accurate sequence data and accurate annotation data. To increase the annotation accuracy of the Caulobacter vibrioides CB13b1a (CB13) genome, we compared the PGAP and RAST annotations of the CB13 genome. A total of 64 unique genes were identified in the PGAP annotation that were either completely or partially absent in the RAST annotation, and a total of 16 genes were identified in the RAST annotation that were not included in the PGAP annotation. Moreover, PGAP identified 73 frameshifted genes and 22 genes with an internal stop. In contrast, RAST annotated the larger segment of these frameshifted genes without indicating a change in reading frame may have occurred. The RAST annotation did not include any genes with internal stop codons, since it chose start codons that were after the internal stop. To confirm the discrepancies between the two annotations and verify the accuracy of the CB13 genome sequence data, we re-sequenced and re-annotated the entire genome and obtained an identical sequence, except in a small number of homopolymer regions. A genome sequence comparison between the two versions allowed us to determine the correct number of bases in each homopolymer region, which eliminated frameshifts for 31 genes annotated as frameshifted genes and removed 24 pseudogenes from the PGAP annotation. Both annotation systems correctly identified genes that were missed by the other system. In addition, PGAP identified conserved gene fragments that represented the beginning of genes, but it employed no corrective method to adjust the reading frame of frameshifted genes or the start sites of genes harboring an internal stop codon. In doing so, the PGAP annotation identified a large number of pseudogenes, which may reflect evolutionary history but likely do not produce gene products. These results demonstrate that re-sequencing and annotation comparisons can be used to increase the accuracy of genomic data and the corresponding gene annotation.


September 22, 2019

Genome-scale analysis of Acetobacterium bakii reveals the cold adaptation of psychrotolerant acetogens by post-transcriptional regulation.

Acetogens synthesize acetyl-CoA via CO2 or CO fixation, producing organic compounds. Despite their ecological and industrial importance, their transcriptional and post-transcriptional regulation has not been systematically studied. With completion of the genome sequence of Acetobacterium bakii (4.28-Mb), we measured changes in the transcriptome of this psychrotolerant acetogen in response to temperature variations under autotrophic and heterotrophic growth conditions. Unexpectedly, acetogenesis genes were highly up-regulated at low temperatures under heterotrophic, as well as autotrophic, growth conditions. To mechanistically understand the transcriptional regulation of acetogenesis genes via changes in RNA secondary structures of 5′-untranslated regions (5′-UTR), the primary transcriptome was experimentally determined, and 1379 transcription start sites (TSS) and 1100 5′-UTR were found. Interestingly, acetogenesis genes contained longer 5′-UTR with lower RNA-folding free energy than other genes, revealing that the 5′-UTRs control the RNA abundance of the acetogenesis genes under low temperature conditions. Our findings suggest that post-transcriptional regulation via RNA conformational changes of 5′-UTRs is necessary for cold-adaptive acetogenesis.© 2018 Shin et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.


September 22, 2019

Whole-genome landscape of Medicago truncatula symbiotic genes.

Advances in deciphering the functional architecture of eukaryotic genomes have been facilitated by recent breakthroughs in sequencing technologies, enabling a more comprehensive representation of genes and repeat elements in genome sequence assemblies, as well as more sensitive and tissue-specific analyses of gene expression. Here we show that PacBio sequencing has led to a substantially improved genome assembly of Medicago truncatula A17, a legume model species notable for endosymbiosis studies1, and has enabled the identification of genome rearrangements between genotypes at a near-base-pair resolution. Annotation of the new M. truncatula genome sequence has allowed for a thorough analysis of transposable elements and their dynamics, as well as the identification of new players involved in symbiotic nodule development, in particular 1,037 upregulated long non-coding RNAs (lncRNAs). We have also discovered that a substantial proportion (~35% and 38%, respectively) of the genes upregulated in nodules or expressed in the nodule differentiation zone colocalize in genomic clusters (270 and 211, respectively), here termed symbiotic islands. These islands contain numerous expressed lncRNA genes and display differentially both DNA methylation and histone marks. Epigenetic regulations and lncRNAs are therefore attractive candidate elements for the orchestration of symbiotic gene expression in the M. truncatula genome.


September 22, 2019

Mosaicism diminishes the value of pre-implantation embryo biopsies for detecting CRISPR/Cas9 induced mutations in sheep.

The production of knock-out (KO) livestock models is both expensive and time consuming due to their long gestational interval and low number of offspring. One alternative to increase efficiency is performing a genetic screening to select pre-implantation embryos that have incorporated the desired mutation. Here we report the use of sheep embryo biopsies for detecting CRISPR/Cas9-induced mutations targeting the gene PDX1 prior to embryo transfer. PDX1 is a critical gene for pancreas development and the target gene required for the creation of pancreatogenesis-disabled sheep. We evaluated the viability of biopsied embryos in vitro and in vivo, and we determined the mutation efficiency using PCR combined with gel electrophoresis and digital droplet PCR (ddPCR). Next, we determined the presence of mosaicism in?~?50% of the recovered fetuses employing a clonal sequencing methodology. While the use of biopsies did not compromise embryo viability, the presence of mosaicism diminished the diagnostic value of the technique. If mosaicism could be overcome, pre-implantation embryo biopsies for mutation screening represents a powerful approach that will streamline the creation of KO animals.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.