Menu
July 7, 2019

Chromosome assembly of large and complex genomes using multiple references

Despite the rapid development of sequencing technologies, assembly of mammalian-scale genomes into complete chromosomes remains one of the most challenging problems in bioinformatics. To help address this difficulty, we developed Ragout, a reference-assisted assembly tool that now works for large and complex genomes. Taking one or more target assemblies (generated from an NGS assembler) and one or multiple related reference genomes, Ragout infers the evolutionary relationships between the genomes and builds the final assemblies using a genome rearrangement approach. Using Ragout, we transformed NGS assemblies of 15 different Mus musculus and one Mus spretus genomes into sets of complete chromosomes, leaving less than 5% of sequence unlocalized per set. Various benchmarks, including PCR testing and realigning of long PacBio reads, suggest only a small number of structural errors in the final assemblies, comparable with direct assembly approaches. Additionally, we applied Ragout to Mus caroli and Mus pahari genomes, which exhibit karyotype-scale variations compared to other genomes from the Muridae family. Chromosome color maps confirmed most large-scale rearrangements that Ragout detected.


July 7, 2019

WhatsHap: fast and accurate read-based phasing

Read-based phasing allows to reconstruct the haplotype structure of a sample purely from sequencing reads. While phasing is a required step for answering questions about population genetics, compound heterozygosity, and to aid in clinical decision making, there has been a lack of an accurate, usable and standards-based software. WhatsHap is a production-ready tool for highly accurate read-based phasing. It was designed from the beginning to leverage third-generation sequencing technologies, whose long reads can span many variants and are therefore ideal for phasing. WhatsHap works also well with second-generation data, is easy to use and will phase not only SNVs, but also indels and other variants. It is unique in its ability to combine read-based with genetic phasing, allowing to further improve accuracy if multiple related samples are provided.


July 7, 2019

Complete genome sequence of Aggregatibacter actinomycetemcomitans strain IDH781.

We report here the complete genomic sequence and methylome of Aggregatibacter actinomycetemcomitans strain IDH781. This rough strain is used extensively as a model organism to characterize localized aggressive periodontitis pathogenesis, the basic biology and oral cavity colonization of A. actinomycetemcomitans, and its interactions with other members of the oral microbiome. Copyright © 2016 May et al.


July 7, 2019

High-quality draft genome sequence of the actinobacterium Nocardia terpenica IFM 0406, producer of the immunosuppressant brasilicardins, using Illumina and PacBio technologies.

The bacterium Nocardia terpenica IFM 0406 is known as the producer of the immunosuppressant brasilicardin A. Here, we report the completely sequenced genome of strain IFM 0406, which facilitates the heterologous expression of the brasilicardin biosynthetic gene cluster but also unveils the intriguing biosynthetic capacity of the strain to produce secondary metabolites. Copyright © 2016 Buchmann et al.


July 7, 2019

Genome sequence of Pseudomonas citronellolis SJTE-3, an estrogen- and polycyclic aromatic hydrocarbon-degrading bacterium.

Pseudomonas citronellolis SJTE-3, isolated from the active sludge of a wastewater treatment plant in China, can utilize a series of environmental estrogens and estrogen-like toxicants. Here, we report its whole-genome sequence, containing one circular chromosome and one circular plasmid. Genes involved in estrogen biodegradation in this bacterium were predicted. Copyright © 2016 Zheng et al.


July 7, 2019

Complete genome sequence of a Burkholderia mallei isolate originating from a glanderous horse from the Kingdom of Bahrain.

Burkholderia mallei is a zoonotic agent causing glanders, a notifiable disease in equines. During the past decades glanders emerged, and the Kingdom of Bahrain reported outbreaks to the World Organization of Animal Health in 2010 and 2011. This paper presents the complete genome sequence of the Burkholderia mallei strain 11RR2811 Bahrain1. Copyright © 2016 Elschner et al.


July 7, 2019

Listeria monocytogenes in stone fruits linked to a multistate outbreak: enumeration of cells and whole-genome sequencing.

In 2014, the identification of stone fruits contaminated with Listeria monocytogenes led to the subsequent identification of a multistate outbreak. Simultaneous detection and enumeration of L. monocytogenes were performed on 105 fruits, each weighing 127 to 145 g, collected from 7 contaminated lots. The results showed that 53.3% of the fruits yielded L. monocytogenes (lower limit of detection, 5 CFU/fruit), and the levels ranged from 5 to 2,850 CFU/fruit, with a geometric mean of 11.3 CFU/fruit (0.1 CFU/g of fruit). Two serotypes, IVb-v1 and 1/2b, were identified by a combination of PCR- and antiserum-based serotyping among isolates from fruits and their packing environment; certain fruits contained a mixture of both serotypes. Single nucleotide polymorphism (SNP)-based whole-genome sequencing (WGS) analysis clustered isolates from two case-patients with the serotype IVb-v1 isolates and distinguished outbreak-associated isolates from pulsed-field gel electrophoresis (PFGE)-matched, but epidemiologically unrelated, clinical isolates. The outbreak-associated isolates differed by up to 42 SNPs. All but one serotype 1/2b isolate formed another WGS cluster and differed by up to 17 SNPs. Fully closed genomes of isolates from the stone fruits were used as references to maximize the resolution and to increase our confidence in prophage analysis. Putative prophages were conserved among isolates of each WGS cluster. All serotype IVb-v1 isolates belonged to singleton sequence type 382 (ST382); all but one serotype 1/2b isolate belonged to clonal complex 5.WGS proved to be an excellent tool to assist in the epidemiologic investigation of listeriosis outbreaks. The comparison at the genome level contributed to our understanding of the genetic diversity and variations among isolates involved in an outbreak or isolates associated with food and environmental samples from one facility. Fully closed genomes increased our confidence in the identification and comparison of accessory genomes. The diversity among the outbreak-associated isolates and the inclusion of PFGE-matched, but epidemiologically unrelated, isolates demonstrate the high resolution of WGS. The prevalence and enumeration data could contribute to our further understanding of the risk associated with Listeria monocytogenes contamination, especially among high-risk populations. Copyright © 2016 Chen et al.


July 7, 2019

Complete genome sequence of a copper-resistant bacterium from the citrus phyllosphere, Stenotrophomonas sp. strain LM091, obtained using long-read technology.

The Stenotrophomonas genus shows great adaptive potential including resistance to multiple antimicrobials, opportunistic pathogenicity, and production of numerous secondary metabolites. Using long-read technology, we report the sequence of a plant-associated Stenotrophomonas strain originating from the citrus phyllosphere that displays a copper resistance phenotype. Copyright © 2016 Richard et al.


July 7, 2019

Cell cycle constraints and environmental control of local DNA hypomethylation in a-proteobacteria.

Heritable DNA methylation imprints are ubiquitous and underlie genetic variability from bacteria to humans. In microbial genomes, DNA methylation has been implicated in gene transcription, DNA replication and repair, nucleoid segregation, transposition and virulence of pathogenic strains. Despite the importance of local (hypo)methylation at specific loci, how and when these patterns are established during the cell cycle remains poorly characterized. Taking advantage of the small genomes and the synchronizability of a-proteobacteria, we discovered that conserved determinants of the cell cycle transcriptional circuitry establish specific hypomethylation patterns in the cell cycle model system Caulobacter crescentus. We used genome-wide methyl-N6-adenine (m6A-) analyses by restriction-enzyme-cleavage sequencing (REC-Seq) and single-molecule real-time (SMRT) sequencing to show that MucR, a transcriptional regulator that represses virulence and cell cycle genes in S-phase but no longer in G1-phase, occludes 5′-GANTC-3′ sequence motifs that are methylated by the DNA adenine methyltransferase CcrM. Constitutive expression of CcrM or heterologous methylases in at least two different a-proteobacteria homogenizes m6A patterns even when MucR is present and affects promoter activity. Environmental stress (phosphate limitation) can override and reconfigure local hypomethylation patterns imposed by the cell cycle circuitry that dictate when and where local hypomethylation is instated.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.