July 7, 2019

An improved genome assembly of Azadirachta indica A. Juss.

Neem (Azadirachta indica A. Juss.), an evergreen tree of the Meliaceae family, is known for its medicinal, cosmetic, pesticidal and insecticidal properties. We had previously sequenced and published the draft genome of the plant, using mainly short read sequencing data. In this report, we present an improved genome assembly generated using additional short reads from Illumina and long reads from the Pacific Biosciences SMRT sequencer. We assembled short reads and error-corrected long reads using Platanus, an assembler designed to perform well for heterozygous genomes. Compared with the earlier assembly (v1.0), the updated genome assembly (v2.0) yielded 3- and 3.5-fold increases in N50 and N75, respectively; a 2.6-fold decrease in the total number of scaffolds; a 1.25-fold increase in the number of valid transcriptome alignments; 13.4-fold less mis-assembly; and a 1.85-fold increase in the percentage of repeats. The current assembly also maps better to the genes known to be involved in the terpenoid biosynthesis pathway. Together, the data represent an improved assembly of the A. indica genome. The raw data described in this manuscript are submitted to the NCBI Short Read Archive under the accession numbers SRX1074131, SRX1074132, SRX1074133, and SRX1074134 (SRP013453). Copyright © 2016 Author et al.
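
As a rough illustration of the N50 and N75 contiguity statistics cited in the abstract above, the following sketch computes them from a list of scaffold lengths. It uses the common definition (the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least the given fraction of the assembly). The function name `nx` and the example lengths are hypothetical and are not taken from the paper.

```python
def nx(lengths, fraction):
    """Return the Nx statistic for a collection of scaffold lengths.

    Scaffolds are taken from longest to shortest until their summed
    length reaches `fraction` of the total; the length of the last
    scaffold added is the result (fraction=0.5 gives N50).
    """
    if not lengths:
        raise ValueError("at least one scaffold length is required")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")

    ordered = sorted(lengths, reverse=True)
    threshold = fraction * sum(ordered)

    running = 0
    for length in ordered:
        running += length
        if running >= threshold:
            return length
    return ordered[-1]  # only reachable through floating-point rounding


# Hypothetical scaffold lengths (in bp), for illustration only.
example_lengths = [100, 80, 60, 40, 20]
n50 = nx(example_lengths, 0.50)
n75 = nx(example_lengths, 0.75)
```

Under this definition a larger N50 or N75 means that more of the assembly sits in long scaffolds, which is why both are reported as fold increases between assembly versions.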


July 7, 2019

Alpha-CENTAURI: assessing novel centromeric repeat sequence variation with long read sequencing.

Long arrays of near-identical tandem repeats are a common feature of centromeric and subtelomeric regions in complex genomes. These sequences present a source of repeat structure diversity that is commonly ignored by standard genomic tools. Unlike reads shorter than the underlying repeat structure that rely on indirect inference methods, e.g. assembly, long reads allow direct inference of satellite higher order repeat structure. To automate characterization of local centromeric tandem repeat sequence variation we have designed Alpha-CENTAURI (ALPHA satellite CENTromeric AUtomated Repeat Identification), that takes advantage of Pacific Bioscience long-reads from whole-genome sequencing datasets. By operating on reads prior to assembly, our approach provides a more comprehensive set of repeat-structure variants and is not impacted by rearrangements or sequence underrepresentation due to misassembly. We demonstrate the utility of Alpha-CENTAURI in characterizing repeat structure for alpha satellite containing reads in the hydatidiform mole (CHM1, haploid-like) genome. The pipeline is designed to report local repeat organization summaries for each read, thereby monitoring rearrangements in repeat units, shifts in repeat orientation and sites of array transition into non-satellite DNA, typically defined by transposable element insertion. We validate the method by showing consistency with existing centromere high order repeat references. Alpha-CENTAURI can, in principle, run on any sequence data, offering a method to generate a sequence repeat resolution that could be readily performed using consensus sequences available for other satellite families in genomes without high-quality reference assemblies. Documentation and source code for Alpha-CENTAURI are freely available at http://github.com/volkansevim/alpha-CENTAURI. Contact: ali.bashir@mssm.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.


July 7, 2019

Antibiotic resistance mechanisms of Myroides sp.

Bacteria of the genus Myroides (Myroides spp.) are rare opportunistic pathogens. Myroides sp. infections have been reported mainly in China. Myroides sp. is highly resistant to most available antibiotics, but the resistance mechanisms are not fully elucidated. Current strain identification methods based on biochemical traits are unable to identify strains accurately at the species level. While 16S ribosomal RNA (rRNA) gene sequencing can accurately achieve this, it fails to give information on the status and mechanisms of antibiotic resistance, because the 16S rRNA sequence contains no information on resistance genes, resistance islands or enzymes. We hypothesized that obtaining the whole genome sequence of Myroides sp., using next generation sequencing methods, would help to clarify the mechanisms of pathogenesis and antibiotic resistance, and guide antibiotic selection to treat Myroides sp. infections. As Myroides sp. can survive in hospitals and the environment, there is a risk of nosocomial infections and pandemics. For better management of Myroides sp. infections, it is imperative to apply next generation sequencing technologies to clarify the antibiotic resistance mechanisms in these bacteria.


July 7, 2019

Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes.

This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidence on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop, and its origin and phylogenetic position in the tribe Brassiceae are controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98% of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36% of the genome consisted of repetitive sequences, and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidence that the current form of nine chromosomes in radish was rearranged from the chromosomes of a hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into the evolution of the mesohexaploid genomes in the tribe Brassiceae.


July 7, 2019

OPERA-LG: efficient and exact scaffolding of large, repeat-rich eukaryotic genomes with performance guarantees.

The assembly of large, repeat-rich eukaryotic genomes represents a significant challenge in genomics. While long-read technologies have made the high-quality assembly of small, microbial genomes increasingly feasible, data generation can be expensive for larger genomes. OPERA-LG is a scalable, exact algorithm for the scaffold assembly of large, repeat-rich genomes, out-performing state-of-the-art programs for scaffold correctness and contiguity. It provides a rigorous framework for scaffolding of repetitive sequences and a systematic approach for combining data from different second-generation and third-generation sequencing technologies. OPERA-LG provides an avenue for systematic augmentation and improvement of thousands of existing draft eukaryotic genome assemblies.


July 7, 2019

Emergence of host-adapted Salmonella Enteritidis through rapid evolution in an immunocompromised host.

Host adaptation is a key factor contributing to the emergence of new bacterial, viral and parasitic pathogens. Many pathogens are considered promiscuous because they cause disease across a range of host species, while others are host-adapted, infecting particular hosts(1). Host adaptation can potentially progress to host restriction where the pathogen is strictly limited to a single host species and is frequently associated with more severe symptoms. Host-adapted and host-restricted bacterial clades evolve from within a broader host-promiscuous species and sometimes target different niches within their specialist hosts, such as adapting from a mucosal to a systemic lifestyle. Genome degradation, marked by gene inactivation and deletion, is a key feature of host adaptation, although the triggers initiating genome degradation are not well understood. Here, we show that a chronic systemic non-typhoidal Salmonella infection in an immunocompromised human patient resulted in genome degradation targeting genes that are expendable for a systemic lifestyle. We present a genome-based investigation of a recurrent blood-borne Salmonella enterica serotype Enteritidis (S. Enteritidis) infection covering 15 years in an interleukin (IL)-12 ß-1 receptor-deficient individual that developed into an asymptomatic chronic infection. The infecting S. Enteritidis harbored a mutation in the mismatch repair gene mutS that accelerated the genomic mutation rate. Phylogenetic analysis and phenotyping of multiple patient isolates provides evidence for a remarkable level of within-host evolution that parallels genome changes present in successful host-restricted bacterial pathogens but never before observed on this timescale. Our analysis identifies common pathways of host adaptation and demonstrates the role that immunocompromised individuals can play in this process.


July 7, 2019

Antibiotic failure mediated by a resistant subpopulation in Enterobacter cloacae.

Antibiotic resistance is a major public health threat, further complicated by unexplained treatment failures caused by bacteria that appear antibiotic susceptible. We describe an Enterobacter cloacae isolate harbouring a minor subpopulation that is highly resistant to the last-line antibiotic colistin. This subpopulation was distinct from persisters, became predominant in colistin, returned to baseline after colistin removal and was dependent on the histidine kinase PhoQ. During murine infection, but in the absence of colistin, innate immune defences led to an increased frequency of the resistant subpopulation, leading to inefficacy of subsequent colistin therapy. An isolate with a lower-frequency colistin-resistant subpopulation similarly caused treatment failure but was misclassified as susceptible by current diagnostics once cultured outside the host. These data demonstrate the ability of low-frequency bacterial subpopulations to contribute to clinically relevant antibiotic resistance, elucidating an enigmatic cause of antibiotic treatment failure and highlighting the critical need for more sensitive diagnostics.


July 7, 2019

Structure of Type IIL restriction-modification enzyme MmeI in complex with DNA has implications for engineering new specificities.

The creation of restriction enzymes with programmable DNA-binding and -cleavage specificities has long been a goal of modern biology. The recently discovered Type IIL MmeI family of restriction-and-modification (RM) enzymes that possess a shared target recognition domain provides a framework for engineering such new specificities. However, a lack of structural information on Type IIL enzymes has limited the repertoire that can be rationally engineered. We report here a crystal structure of MmeI in complex with its DNA substrate and an S-adenosylmethionine analog (Sinefungin). The structure uncovers for the first time the interactions that underlie MmeI-DNA recognition and methylation (5′-TCCRAC-3′; R = purine) and provides a molecular basis for changing specificity at four of the six base pairs of the recognition sequence (5′-TCCRAC-3′). Surprisingly, the enzyme is resilient to specificity changes at the first position of the recognition sequence (5′-TCCRAC-3′). Collectively, the structure provides a basis for engineering further derivatives of MmeI and delineates which base pairs of the recognition sequence are more amenable to alterations than others.
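
To make the recognition sequence quoted in the abstract above concrete, here is a minimal sketch that scans a DNA string for 5′-TCCRAC-3′ on both strands, expanding R to a purine (A or G) as stated in the abstract. The function name `find_mmei_sites` and the test sequence are hypothetical; the sketch only locates the motif and makes no assumption about where the enzyme cuts or methylates.

```python
import re

# 5'-TCCRAC-3' with R = purine (A or G), as given in the abstract.
FORWARD_SITE = re.compile(r"TCC[AG]AC")
# Reverse complement of the motif (5'-GTYGGA-3', Y = C or T), so that
# sites lying on the opposite strand are also reported.
REVERSE_SITE = re.compile(r"GT[CT]GGA")


def find_mmei_sites(sequence):
    """Return sorted (start, strand) tuples for each motif occurrence.

    `start` is the 0-based index of the leftmost base of the match on
    the given (top) strand; `strand` is "+" or "-".
    """
    seq = sequence.upper()
    hits = [(m.start(), "+") for m in FORWARD_SITE.finditer(seq)]
    hits += [(m.start(), "-") for m in REVERSE_SITE.finditer(seq)]
    return sorted(hits)


# Hypothetical sequence with one site on each strand.
sites = find_mmei_sites("AATCCGACTTGTTGGAAA")
```

Changing specificity at a given position, as discussed in the abstract, would correspond here to editing a single character class in the two patterns.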


July 7, 2019

The challenges of implementing next generation sequencing across a large healthcare system, and the molecular epidemiology and antibiotic susceptibilities of carbapenemase-producing bacteria in the healthcare system of the U.S. Department of Defense.

We sought to: 1) provide an overview of the genomic epidemiology of an extensive collection of carbapenemase-producing bacteria (CPB) collected in the U.S. Department of Defense health system; 2) increase awareness of the public availability of the sequences, isolates, and customized antimicrobial resistance database of that system; and 3) illustrate challenges and offer mitigations for implementing next generation sequencing (NGS) across large health systems. The study comprised prospective surveillance and system-wide implementation of NGS in a 288-hospital healthcare network. All phenotypically carbapenem-resistant bacteria underwent CarbaNP® testing and PCR, followed by NGS. Commercial (Newbler and Geneious), on-line (ResFinder), and open-source software (Btrim, FLASh, Bowtie2, and Samtools) were used for assembly, SNP detection and clustering. Laboratory capacity, throughput, and response time were assessed. From 2009 through 2015, 27,000 multidrug-resistant Gram-negative isolates were submitted. Of these, 225 contained carbapenemase-encoding genes (most commonly blaKPC, blaNDM, and blaOXA23). These were found in 15 species from 146 inpatients in 19 facilities. Genetically related CPB were found in more than one hospital. Other clusters or outbreaks were not clonal and involved genetically related plasmids, while some involved several unrelated plasmids. Relatedness depended on the clustering algorithm used. Transmission patterns of plasmids and other mobile genetic elements could not be determined without ultra-long read, single-molecule real-time sequencing. 80% of carbapenem-resistant phenotypes retained susceptibility to aminoglycosides, and 70% retained susceptibility to fluoroquinolones. However, among the CPB-confirmed genotypes, fewer than 25% retained susceptibility to aminoglycosides or fluoroquinolones. Although NGS is increasingly acclaimed to revolutionize clinical practice, resource-constrained environments, large or geographically dispersed healthcare networks, and military or government-funded public health laboratories are likely to encounter constraints and challenges as they implement NGS across their health systems. These include lack of standardized definitions and quality control metrics, limitations of short-read sequencing, insufficient bandwidth, and the currently limited availability of very expensive sequencing platforms. Possible solutions and mitigations are also proposed.


July 7, 2019

Resurgence of less-studied smut fungi as models of phytopathogenesis in the -omics era.

The smut fungi form a large, diverse, and non-monophyletic group of plant pathogens that have long served both as important pests of human agriculture and as fertile organisms of scientific investigation. As modern techniques of molecular genetic analysis became available, many previously studied species that proved refractory to these techniques fell by the wayside and became neglected. Now, as the advent of rapid and affordable next-generation sequencing provides genomic and transcriptomic resources for even these “forgotten” fungi, several species are making a comeback and retaking prominent places in phytopathogenic research. In this review, we highlight several of these smut fungi, with special emphasis on Microbotryum lychnidis-dioicae, an anther smut, whose molecular genetic tools have finally begun to catch up with its historical importance in classical genetics and now provide mechanistic insights for ecological studies, evolution of host/pathogen interaction, and investigations of emerging infectious disease.


July 7, 2019

Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences.

Single Molecule Real-Time (SMRT) sequencing technology and Oxford Nanopore technologies (ONT) produce reads over 10 kb in length, which have enabled high-quality genome assembly at an affordable cost. However, at present, long reads have an error rate as high as 10-15%. Complex and computationally intensive pipelines are required to assemble such reads. We present a new mapper, minimap, and a de novo assembler, miniasm, for efficiently mapping and assembling SMRT and ONT reads without an error correction stage. They can often assemble a sequencing run of bacterial data into a single contig in a few minutes, and assemble 45-fold Caenorhabditis elegans data in 9 min, orders of magnitude faster than the existing pipelines, though the consensus sequence error rate is as high as raw reads. We also introduce a pairwise read mapping format and a graphical fragment assembly format, and demonstrate the interoperability between ours and current tools. Availability: https://github.com/lh3/minimap and https://github.com/lh3/miniasm. Contact: hengli@broadinstitute.org. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Whole genome DNA sequence analysis of Salmonella subspecies enterica serotype Tennessee obtained from related peanut butter foodborne outbreaks.

Establishing an association between possible food sources and clinical isolates requires discriminating the suspected pathogen from an environmental background, and distinguishing it from other closely-related foodborne pathogens. We applied whole genome sequencing (WGS) to Salmonella subspecies enterica serotype Tennessee (S. Tennessee) to describe genomic diversity across the serovar as well as among and within outbreak clades of strains associated with contaminated peanut butter. We analyzed 71 isolates of S. Tennessee from disparate food, environmental, and clinical sources and 2 other closely-related Salmonella serovars as outgroups (S. Kentucky and S. Cubana), which were also shotgun sequenced. A whole genome single nucleotide polymorphism (SNP) analysis was performed using a maximum likelihood approach to infer phylogenetic relationships. Several monophyletic lineages of S. Tennessee with limited SNP variability were identified that recapitulated several food contamination events. S. Tennessee clades were separated from outgroup salmonellae by more than sixteen thousand SNPs. Intra-serovar diversity of S. Tennessee was small compared to the chosen outgroups (1,153 SNPs), suggesting recent divergence of some S. Tennessee clades. Analysis of all 1,153 SNPs structuring an S. Tennessee peanut butter outbreak cluster revealed that isolates from several food, plant, and clinical sources were very closely related, as they had only a few SNP differences between them. SNP-based cluster analyses linked specific food sources to several clinical S. Tennessee strains isolated in separate contamination events. Environmental and clinical isolates had very similar whole genome sequences; no markers were found that could be used to discriminate between these sources. Finally, we identified SNPs within variable S. Tennessee genes that may be useful markers for the development of rapid surveillance and typing methods, potentially aiding in traceback efforts during future outbreaks. Using WGS can delimit contamination sources for foodborne illnesses across multiple outbreaks and reveal otherwise undetected DNA sequence differences essential to the tracing of bacterial pathogens as they emerge.
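
The SNP counts quoted in the abstract above can be pictured with a simple pairwise comparison. The sketch below counts differing positions between already-aligned sequences; it is a deliberately simpler stand-in for illustration and is not the maximum likelihood phylogenetic analysis used in the study. The isolate names, the sequences, and the function names `snp_distance` and `pairwise_snp_distances` are all hypothetical.

```python
from itertools import combinations

UNAMBIGUOUS = set("ACGT")


def snp_distance(seq_a, seq_b):
    """Count positions where two aligned sequences carry different bases.

    Positions where either sequence has a gap or an ambiguous base
    (anything other than A, C, G, T) are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a != b and a in UNAMBIGUOUS and b in UNAMBIGUOUS
    )


def pairwise_snp_distances(alignment):
    """Map each pair of isolate names (sorted order) to its SNP distance."""
    return {
        (first, second): snp_distance(alignment[first], alignment[second])
        for first, second in combinations(sorted(alignment), 2)
    }


# Toy alignment with made-up isolates, for illustration only.
toy_alignment = {
    "food": "ACGTACGT",
    "clinical": "ACGTACGA",
    "outgroup": "TCGAACTA",
}
distances = pairwise_snp_distances(toy_alignment)
```

In this toy example the food and clinical isolates differ at a single position while both sit several SNPs from the outgroup, which mirrors the pattern the abstract describes at a much larger scale.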


July 7, 2019

Extensive sequencing of seven human genomes to characterize benchmark reference materials.

The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST), is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.


July 7, 2019

Resistance from relatives.

Crops are made resistant to pathogens such as wheat stem rust, Asian soybean rust and potato late blight by methods to access the pool of resistance genes present in related plants.


July 7, 2019

Ploidy influences the functional attributes of de novo lager yeast hybrids.

The genomes of hybrid organisms, such as lager yeast (Saccharomyces cerevisiae × Saccharomyces eubayanus), contain orthologous genes, the functionality and effect of which may differ depending on their origin and copy number. How the parental subgenomes in lager yeast contribute to important phenotypic traits such as fermentation performance, aroma production, and stress tolerance remains poorly understood. Here, three de novo lager yeast hybrids with different ploidy levels (allodiploid, allotriploid, and allotetraploid) were generated through hybridization techniques without genetic modification. The hybrids were characterized in fermentations of both high gravity wort (15 °P) and very high gravity wort (25 °P), which were monitored for aroma compound and sugar concentrations. The hybrid strains with higher DNA content performed better during fermentation and produced higher concentrations of flavor-active esters in both worts. The hybrid strains also outperformed both the parent strains. Genome sequencing revealed that several genes related to the formation of flavor-active esters (ATF1, ATF2, EHT1, EEB1, and BAT1) were present in higher copy numbers in the higher ploidy hybrid strains. A direct relationship between gene copy number and transcript level was also observed. The measured ester concentrations and transcript levels also suggest that the functionality of the S. cerevisiae- and S. eubayanus-derived gene products differs. The results contribute to our understanding of the complex molecular mechanisms that determine phenotypes in lager yeast hybrids and are expected to facilitate targeted strain development through interspecific hybridization.

