Menu
July 7, 2019

Hybrid assembly with long and short reads improves discovery of gene family expansions.

Long-read and short-read sequencing technologies offer competing advantages for eukaryotic genome sequencing projects. Combinations of both may be appropriate for surveys of within-species genomic variation.We developed a hybrid assembly pipeline called “Alpaca” that can operate on 20X long-read coverage plus about 50X short-insert and 50X long-insert short-read coverage. To preclude collapse of tandem repeats, Alpaca relies on base-call-corrected long reads for contig formation.Compared to two other assembly protocols, Alpaca demonstrated the most reference agreement and repeat capture on the rice genome. On three accessions of the model legume Medicago truncatula, Alpaca generated the most agreement to a conspecific reference and predicted tandemly repeated genes absent from the other assemblies.Our results suggest Alpaca is a useful tool for investigating structural and copy number variation within de novo assemblies of sampled populations.


July 7, 2019

Complete genome sequences of two acetic acid-producing Acetobacter pasteurianus strains (subsp. ascendens LMG 1590(T) and subsp. paradoxus LMG 1591(T)).

Foods and beverages produced by fermentation are essential to human nutrition worldwide and, therefore, have been extensively studied (Sõukand et al., 2015). Vinegar, kombucha beverage, milk kefir, water kefir, and cocoa are the products of acetic acid fermentation (Li et al., 2015). Acetic acid bacteria (AAB) oxidize sugars or ethanol to produce acetic acid, playing an important role in fermentation. AAB have been used historically for various fermentation processes and are Gram-negative obligate aerobic bacteria of the family Acetobacteraceae of Alphaproteobacteria (Saichana et al., 2015). Although various bacteria can produce acetic acid, most commercially used bacteria are species of Acetobacter, Gluconacetobacter, and Gluconobacter (Raspor and Goranovic, 2008). Among these organisms, Acetobacter species have attracted much attention in the field of biotechnology because these species are able to tolerate high acetic acid concentrations in the environment (Matsutani et al., 2011).


July 7, 2019

Complete genome sequence of Campylobacter concisus ATCC 33237T and draft genome sequences for an additional eight well-characterized C. concisus strains.

We report the complete genome sequence of the Campylobacter concisus type strain ATCC 33237 and the draft genome sequences of eight additional well-characterized C. concisus strains. C. concisus has been shown to be a genetically heterogeneous species, and these nine genomes provide valuable information regarding the diversity within this taxon. Copyright © 2017 Cornelius et al.


July 7, 2019

Whole-genome sequence of Staphylococcus hominis strain J31 isolated from healthy human skin.

We report here the first whole-genome sequence of a skin-associated strain of Staphylococcus hominis determined using the PacBio long-read sequencing platform. S. hominis is a major commensal of the skin microflora. This genome sequence adds to our understanding of this species and will aid studies of gene traffic between staphylococci. Copyright © 2017 Coates-Brown and Horsburgh.


July 7, 2019

Repeated divergent selection on pigmentation genes in a rapid finch radiation.

Instances of recent and rapid speciation are suitable for associating phenotypes with their causal genotypes, especially if gene flow homogenizes areas of the genome that are not under divergent selection. We study a rapid radiation of nine sympatric bird species known as capuchino seedeaters, which are differentiated in sexually selected characters of male plumage and song. We sequenced the genomes of a phenotypically diverse set of species to search for differentiated genomic regions. Capuchinos show differences in a small proportion of their genomes, yet selection has acted independently on the same targets in different members of this radiation. Many divergent regions contain genes involved in the melanogenesis pathway, with the strongest signal originating from putative regulatory regions. Selection has acted on these same genomic regions in different lineages, likely shaping the evolution of cis-regulatory elements, which control how more conserved genes are expressed and thereby generate diversity in classically sexually selected traits.


July 7, 2019

Whole-genome sequence of Acinetobacter pittii HUMV-6483 isolated from human urine.

Acinetobacter pittii strain HUMV-6483 was obtained from urine from an adult patient. We report here its complete genome assembly using PacBio single-molecule real-time sequencing, which resulted in a chromosome with 4.07 Mb and a circular contig of 112 kb. About 3,953 protein-coding genes are predicted from this assembly. Copyright © 2017 Chapartegui-González et al.


July 7, 2019

Virulence and genomic feature of a virulent Klebsiella pneumoniae sequence type 14 strain of serotype K2 harboring blaNDM-5 in China.

The objective of this study was to reveal the molecular mechanism involved in carbapenem resistance and virulence of a K2 Klebsiella pneumoniae clinical isolate 24835. The virulence of the strain was determined by in vitro and in vivo methods. The de novo whole-genome sequencing technology and molecular biology methods were used to analyze the genomic features associated with the carbapenem resistance and virulence of K. pneumoniae 24835. Strain 24835 was highly resistant to carbapenems and belonged to ST14, exhibited hypermucoviscous and unique K2-aerobactin-kfu-rmpA positive phenotype. As the only carbapenemase gene in strain 24835, blaNDM-5 was located on a 46-kb IncX3 self-transmissible plasmid, which is a very close relation of pNDM-MGR194 from India. Genetic context of blaNDM-5 in strain 24835 was closely related to those on IncX3 plasmids in various Enterobacteriaceae species in China. The combination of multiple virulence genes may work together to confer the relative higher virulence in K. pneumoniae 24835. Significantly increased resistance to serum killing and mice mortality were found in the virulent New Delhi metallo-ß-lactamase (NDM)-producing K. pneumoniae strain compared to the other NDM-producing K. pneumoniae strain. Our study provides basic information of phenotypic and genomic features of K. pneumoniae 24835, a strain displaying carbapenem resistance and relatively high level of virulence. These findings are concerning for the potential of NDM-like genes to disseminate among virulent K. pneumoniae isolates.


July 7, 2019

Multiple genome sequences of Lactobacillus plantarum strains.

We report here the genome sequences of four Lactobacillus plantarum strains which vary in surface hydrophobicity. Bioinformatic analysis, using additional genomes of Lactobacillus plantarum strains, revealed a possible correlation between the cell wall teichoic acid-type and cell surface hydrophobicity and provide the basis for consecutive analyses. Copyright © 2017 Kafka et al.


July 7, 2019

Genomics and comparative genomic analyses provide insight into the taxonomy and pathogenic potential of novel Emmonsia pathogens.

Over the last 50 years, newly described species of Emmonsia-like fungi have been implicated globally as sources of systemic human mycosis (emmonsiosis). Their ability to convert into yeast-like cells capable of replication and extra-pulmonary dissemination during the course of infection differentiates them from classical Emmonsia species. Immunocompromised patients are at highest risk of emmonsiosis and exhibit high mortality rates. In order to investigate the molecular basis for pathogenicity of the newly described Emmonsia species, genomic sequencing and comparative genomic analyses of Emmonsia sp. 5z489, which was isolated from a non-deliberately immunosuppressed diabetic patient in China and represents a novel seventh isolate of Emmonsia-like fungi, was performed. The genome size of 5z489 was 35.5 Mbp in length, which is ~5 Mbp larger than other Emmonsia strains. Further, 9,188 protein genes were predicted in the 5z489 genome and 16% of the assembly was identified as repetitive elements, which is the largest abundance in Emmonsia species. Phylogenetic analyses based on whole genome data classified 5z489 and CAC-2015a, another novel isolate, as members of the genus Emmonsia. Our analyses showed that divergences among Emmonsia occurred much earlier than other genera within the family Ajellomycetaceae, suggesting relatively distant evolutionary relationships among the genus. Through comparisons of Emmonsia species, we discovered significant pathogenicity characteristics within the genus as well as putative virulence factors that may play a role in the infection and pathogenicity of the novel Emmonsia strains. Moreover, our analyses revealed a novel distribution mode of DNA methylation patterns across the genome of 5z489, with >50% of methylated bases located in intergenic regions. These methylation patterns differ considerably from other reported fungi, where most methylation occurs in repetitive loci. It is unclear if this difference is related to physiological adaptations of new Emmonsia, but this question warrants further investigation. Overall, our analyses provide a framework from which to further study the evolutionary dynamics of Emmonsia strains and identity the underlying molecular mechanisms that determine the infectious and pathogenic potency of these fungal pathogens, and also provide insight into potential targets for therapeutic intervention of emmonsiosis and further research.


July 7, 2019

CLOVE: classification of genomic fusions into structural variation events.

A precise understanding of structural variants (SVs) in DNA is important in the study of cancer and population diversity. Many methods have been designed to identify SVs from DNA sequencing data. However, the problem remains challenging because existing approaches suffer from low sensitivity, precision, and positional accuracy. Furthermore, many existing tools only identify breakpoints, and so not collect related breakpoints and classify them as a particular type of SV. Due to the rapidly increasing usage of high throughput sequencing technologies in this area, there is an urgent need for algorithms that can accurately classify complex genomic rearrangements (involving more than one breakpoint or fusion).We present CLOVE, an algorithm for integrating the results of multiple breakpoint or SV callers and classifying the results as a particular SV. CLOVE is based on a graph data structure that is created from the breakpoint information. The algorithm looks for patterns in the graph that are characteristic of more complex rearrangement types. CLOVE is able to integrate the results of multiple callers, producing a consensus call.We demonstrate using simulated and real data that re-classified SV calls produced by CLOVE improve on the raw call set of existing SV algorithms, particularly in terms of accuracy. CLOVE is freely available from http://www.github.com/PapenfussLab .


July 7, 2019

Genome graphs

There is increasing recognition that a single, monoploid reference genome is a poor universal reference structure for human genetics, because it represents only a tiny fraction of human variation. Adding this missing variation results in a structure that can be described as a mathematical graph: a genome graph. We demonstrate that, in comparison to the existing reference genome (GRCh38), genome graphs can substantially improve the fractions of reads that map uniquely and perfectly. Furthermore, we show that this fundamental simplification of read mapping transforms the variant calling problem from one in which many non-reference variants must be discovered de-novo to one in which the vast majority of variants are simply re-identified within the graph. Using standard benchmarks as well as a novel reference-free evaluation, we show that a simplistic variant calling procedure on a genome graph can already call variants at least as well as, and in many cases better than, a state-of-the-art method on the linear human reference genome. We anticipate that graph-based references will supplant linear references in humans and in other applications where cohorts of sequenced individuals are available.


July 7, 2019

Evolutionary dynamics of pathoadaptation revealed by three independent acquisitions of the VirB/D4 type IV secretion system in Bartonella.

The a-proteobacterial genus Bartonella comprises a group of ubiquitous mammalian pathogens that are studied as a model for the evolution of bacterial pathogenesis. Vast abundance of two particular phylogenetic lineages of Bartonella had been linked to enhanced host adaptability enabled by lineage-specific acquisition of a VirB/D4 type IV secretion system (T4SS) and parallel evolution of complex effector repertoires. However, the limited availability of genome sequences from one of those lineages as well as other, remote branches of Bartonella has so far hampered comprehensive understanding of how the VirB/D4 T4SS and its effectors called Beps have shaped Bartonella evolution. Here, we report the discovery of a third repertoire of Beps associated with the VirB/D4 T4SS of B. ancashensis, a novel human pathogen that lacks any signs of host adaptability and is only distantly related to the two species-rich lineages encoding a VirB/D4 T4SS. Furthermore, sequencing of ten new Bartonella isolates from under-sampled lineages enabled combined in silico analyses and wet lab experiments that suggest several parallel layers of functional diversification during evolution of the three Bep repertoires from a single ancestral effector. Our analyses show that the Beps of B. ancashensis share many features with the two other repertoires, but may represent a more ancestral state that has not yet unleashed the adaptive potential of such an effector set. We anticipate that the effectors of B. ancashensis will enable future studies to dissect the evolutionary history of Bartonella effectors and help unraveling the evolutionary forces underlying bacterial host adaptation.© The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Free-living Enterobacterium Pragia fontium 24613: complete genome sequence and metabolic profiling.

Pragia fontium is one of the few species that belongs to the group of atypical hydrogen sulfide-producing enterobacteria. Unlike other members of this closely related group, P. fontium is not associated with any known host and has been reported as a free-living bacterium. Whole genome sequencing and metabolic fingerprinting confirmed the phylogenetic position of P. fontium inside the group of atypical H2S producers. Genomic data have revealed that P. fontium 24613 has limited pathogenic potential, although there are signs of genome decay. Although the lack of specific virulence factors and no association with a host species suggest a free-living style, the signs of genome decay suggest a process of adaptation to an as-yet-unknown host.


July 7, 2019

Improved high-quality draft genome sequence and annotation of Burkholderia contaminans LMG 23361T.

Burkholderia contaminans LMG 23361 is the type strain of the species isolated from the milk of a dairy sheep with mastitis. Some pharmaceutical products contain disinfectants such as benzalkonium chloride (BZK) and previously we reported that B. contaminans LMG 23361(T) possesses the ability to inactivate BZK with high biodegradation rates. Here, we report an improved high-quality draft genome sequence of this strain. Copyright © 2017 Jung et al.


July 7, 2019

Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis.

Complete and accurate genome assembly and annotation is a crucial foundation for comparative and functional genomics. Despite this, few complete eukaryotic genomes are available, and genome annotation remains a major challenge. Here, we present a complete genome assembly of the skin commensal yeast Malassezia sympodialis and demonstrate how proteogenomics can substantially improve gene annotation. Through long-read DNA sequencing, we obtained a gap-free genome assembly for M. sympodialis (ATCC 42132), comprising eight nuclear and one mitochondrial chromosome. We also sequenced and assembled four M. sympodialis clinical isolates, and showed their value for understanding Malassezia reproduction by confirming four alternative allele combinations at the two mating-type loci. Importantly, we demonstrated how proteomics data could be readily integrated with transcriptomics data in standard annotation tools. This increased the number of annotated protein-coding genes by 14% (from 3612 to 4113), compared to using transcriptomics evidence alone. Manual curation further increased the number of protein-coding genes by 9% (to 4493). All of these genes have RNA-seq evidence and 87% were confirmed by proteomics. The M. sympodialis genome assembly and annotation presented here is at a quality yet achieved only for a few eukaryotic organisms, and constitutes an important reference for future host-microbe interaction studies.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.