Menu
September 22, 2019

A hybrid-hierarchical genome assembly strategy to sequence the invasive golden mussel Limnoperna fortunei.

For more than 25 years, the golden mussel Limnoperna fortunei has aggressively invaded South American freshwaters, having travelled more than 5,000 km upstream across five countries. Along the way, the golden mussel has outcompeted native species and economically harmed aquaculture, hydroelectric powers, and ship transit. We have sequenced the complete genome of the golden mussel to understand the molecular basis of its invasiveness and search for ways to control it.We assembled the 1.6 Gb genome into 20548 scaffolds with an N50 length of 312 Kb using a hybrid and hierarchical assembly strategy from short and long DNA reads and transcriptomes. A total of 60717 coding genes were inferred from a customized transcriptome-trained AUGUSTUS run. We also compared predicted protein sets with those of complete molluscan genomes, revealing an exacerbation of protein-binding domains in L. fortunei. Conclusions: We built one of the best bivalve genome assemblies available using a cost-effective approach using Illumina pair-end, mate pair, and PacBio long reads. We expect that the continuous and careful annotation of L. fortunei’s genome will contribute to the investigation of bivalve genetics, evolution, and invasiveness, as well as to the development of biotechnological tools for aquatic pest control.© The Authors 2017. Published by Oxford University Press.


September 22, 2019

Topical antibiotic use coselects for the carriage of mobile genetic elements conferring resistance to unrelated antimicrobials in Staphylococcus aureus.

Topical antibiotics, such as mupirocin and fusidic acid, are commonly used in the prevention and treatment of skin infections, particularly those caused by staphylococci. However, the widespread use of these agents is associated with increased resistance to these agents, potentially limiting their efficacy. Of particular concern is the observation that resistance to topical antibiotics is often associated with multidrug resistance, suggesting that topical antibiotics may play a role in the emergence of multidrug-resistant (MDR) strains. New Zealand (NZ) has some of the highest globally recorded rates of topical antibiotic usage and resistance. Using a combination of Pacific Biosciences single-molecule real-time (SMRT) whole-genome sequencing, Illumina short-read sequencing, and Bayesian phylogenomic modeling on 118 new multilocus sequence type 1 (ST1) communityStaphylococcus aureusisolates from New Zealand and 61 publically available international ST1 genome sequences, we demonstrate a strong correlation between the clinical introduction of topical antibiotics and the emergence of MDR ST1S. aureusWe also providein vitroexperimental evidence showing that exposure to topical antibiotics can lead to the rapid selection of MDRS. aureusisolates carrying plasmids that confer resistance to multiple unrelated antibiotics, from within a mixed population of competitor strains. These findings have important implications regarding the impact of the indiscriminate use of topical antibiotics. Copyright © 2018 Carter et al.


September 22, 2019

Comparative genomics reveals new single-nucleotide polymorphisms that can assist in identification of adherent-invasive Escherichia coli.

Adherent-invasive Escherichia coli (AIEC) have been involved in Crohn’s disease (CD). Currently, AIEC are identified by time-consuming techniques based on in vitro infection of cell lines to determine their ability to adhere to and invade intestinal epithelial cells as well as to survive and replicate within macrophages. Our aim was to find signature sequences that can be used to identify the AIEC pathotype. Comparative genomics was performed between three E. coli strain pairs, each pair comprised one AIEC and one non-AIEC with identical pulsotype, sequence type and virulence gene carriage. Genetic differences were further analysed in 22 AIEC and 28 non-AIEC isolated from CD patients and controls. The strain pairs showed similar genome structures, and no gene was specific to AIEC. Three single nucleotide polymorphisms displayed different nucleotide distributions between AIEC and non-AIEC, and four correlated with increased adhesion and/or invasion indices. Here, we present a classification algorithm based on the identification of three allelic variants that can predict the AIEC phenotype with 84% accuracy. Our study corroborates the absence of an AIEC-specific genetic marker distributed across all AIEC strains. Nonetheless, point mutations putatively involved in the AIEC phenotype can be used for the molecular identification of the AIEC pathotype.


September 22, 2019

The sea lamprey germline genome provides insights into programmed genome rearrangement and vertebrate evolution.

The sea lamprey (Petromyzon marinus) serves as a comparative model for reconstructing vertebrate evolution. To enable more informed analyses, we developed a new assembly of the lamprey germline genome that integrates several complementary data sets. Analysis of this highly contiguous (chromosome-scale) assembly shows that both chromosomal and whole-genome duplications have played significant roles in the evolution of ancestral vertebrate and lamprey genomes, including chromosomes that carry the six lamprey HOX clusters. The assembly also contains several hundred genes that are reproducibly eliminated from somatic cells during early development in lamprey. Comparative analyses show that gnathostome (mouse) homologs of these genes are frequently marked by polycomb repressive complexes (PRCs) in embryonic stem cells, suggesting overlaps in the regulatory logic of somatic DNA elimination and bivalent states that are regulated by early embryonic PRCs. This new assembly will enhance diverse studies that are informed by lampreys’ unique biology and evolutionary/comparative perspective.


September 22, 2019

Early transmissible ampicillin resistance in zoonotic Salmonella enterica serotype Typhimurium in the late 1950s: a retrospective, whole-genome sequencing study.

Ampicillin, the first semi-synthetic penicillin active against Enterobacteriaceae, was released onto the market in 1961. The first outbreaks of disease caused by ampicillin-resistant strains of Salmonella enterica serotype Typhimurium were identified in the UK in 1962 and 1964. We aimed to date the emergence of this resistance in historical isolates of S enterica serotype Typhimurium.In this retrospective, whole-genome sequencing study, we analysed 288 S enterica serotype Typhimurium isolates collected between 1911 and 1969 from 31 countries on four continents and from various sources including human beings, animals, feed, and food. All isolates were tested for antimicrobial drug susceptibility with the disc diffusion method, and isolates shown to be resistant to ampicillin underwent resistance-transfer experiments. To provide insights into population structure and mechanisms of ampicillin resistance, we did whole-genome sequencing on a subset of 225 isolates, selected to maximise source, spatiotemporal, and genetic diversity.11 (4%) of 288 isolates were resistant to ampicillin because of acquisition of various ß lactamase genes, including blaTEM-1, carried by various plasmids, including the virulence plasmid of S enterica serotype Typhimurium. These 11 isolates were from three phylogenomic groups. One isolate producing TEM-1 ß lactamase was isolated in France in 1959 and two isolates producing TEM-1 ß lactamase were isolated in Tunisia in 1960, before ampicillin went on sale. The vectors for ampicillin resistance were different from those reported in the strains responsible for the outbreaks in the UK in the 1960s.The association between antibiotic use and selection of resistance determinants is not as direct as often presumed. Our results suggest that the non-clinical use of narrow-spectrum penicillins (eg, benzylpenicillin) might have favoured the diffusion of plasmids carrying the blaTEM-1gene in S enterica serotype Typhimurium in the late 1950s.Institut Pasteur, Santé publique France, the French Government’s Investissement d’Avenir programme, the Fondation Le Roch-Les Mousquetaires. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

LTR_retriever: A highly accurate and sensitive program for identification of long terminal repeat retrotransposons.

Long terminal repeat retrotransposons (LTR-RTs) are prevalent in plant genomes. The identification of LTR-RTs is critical for achieving high-quality gene annotation. Based on the well-conserved structure, multiple programs were developed for the de novo identification of LTR-RTs; however, these programs are associated with low specificity and high false discovery rates. Here, we report LTR_retriever, a multithreading-empowered Perl program that identifies LTR-RTs and generates high-quality LTR libraries from genomic sequences. LTR_retriever demonstrated significant improvements by achieving high levels of sensitivity (91%), specificity (97%), accuracy (96%), and precision (90%) in rice (Oryza sativa). LTR_retriever is also compatible with long sequencing reads. With 40k self-corrected PacBio reads equivalent to 4.5× genome coverage in Arabidopsis (Arabidopsis thaliana), the constructed LTR library showed excellent sensitivity and specificity. In addition to canonical LTR-RTs with 5′-TG…CA-3′ termini, LTR_retriever also identifies noncanonical LTR-RTs (non-TGCA), which have been largely ignored in genome-wide studies. We identified seven types of noncanonical LTRs from 42 out of 50 plant genomes. The majority of noncanonical LTRs areCopiaelements, with which the LTR is four times shorter than that of otherCopiaelements, which may be a result of their target specificity. Strikingly, non-TGCACopiaelements are often located in genic regions and preferentially insert nearby or within genes, indicating their impact on the evolution of genes and their potential as mutagenesis tools.© 2018 American Society of Plant Biologists. All Rights Reserved.


September 22, 2019

Sequence analysis of European maize inbred line F2 provides new insights into molecular and chromosomal characteristics of presence/absence variants.

Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level and the extent of presence/absence variation and relative chromosomal landscape of inbred-specific regions remain to be elucidated.De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed Linkage Disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups.We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. Pattern of linkage disequilibrium within PAVs strikingly differs from this of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.


September 22, 2019

Jointly aligning a group of DNA reads improves accuracy of identifying large deletions.

Performing sequence alignment to identify structural variants, such as large deletions, from genome sequencing data is a fundamental task, but current methods are far from perfect. The current practice is to independently align each DNA read to a reference genome. We show that the propensity of genomic rearrangements to accumulate in repeat-rich regions imposes severe ambiguities in these alignments, and consequently on the variant calls-with current read lengths, this affects more than one third of known large deletions in the C. Venter genome. We present a method to jointly align reads to a genome, whereby alignment ambiguity of one read can be disambiguated by other reads. We show this leads to a significant improvement in the accuracy of identifying large deletions (=20 bases), while imposing minimal computational overhead and maintaining an overall running time that is at par with current tools. A software implementation is available as an open-source Python program called JRA at https://bitbucket.org/jointreadalignment/jra-src.


September 22, 2019

Bat biology, genomes, and the Bat1K project: To generate chromosome-level genomes for all living bat species.

Bats are unique among mammals, possessing some of the rarest mammalian adaptations, including true self-powered flight, laryngeal echolocation, exceptional longevity, unique immunity, contracted genomes, and vocal learning. They provide key ecosystem services, pollinating tropical plants, dispersing seeds, and controlling insect pest populations, thus driving healthy ecosystems. They account for more than 20% of all living mammalian diversity, and their crown-group evolutionary history dates back to the Eocene. Despite their great numbers and diversity, many species are threatened and endangered. Here we announce Bat1K, an initiative to sequence the genomes of all living bat species (n~1,300) to chromosome-level assembly. The Bat1K genome consortium unites bat biologists (>148 members as of writing), computational scientists, conservation organizations, genome technologists, and any interested individuals committed to a better understanding of the genetic and evolutionary mechanisms that underlie the unique adaptations of bats. Our aim is to catalog the unique genetic diversity present in all living bats to better understand the molecular basis of their unique adaptations; uncover their evolutionary history; link genotype with phenotype; and ultimately better understand, promote, and conserve bats. Here we review the unique adaptations of bats and highlight how chromosome-level genome assemblies can uncover the molecular basis of these traits. We present a novel sequencing and assembly strategy and review the striking societal and scientific benefits that will result from the Bat1K initiative.


September 22, 2019

Insights on a founder effect: the case of Xylella fastidiosa in the Salento area of Apulia, Italy

Xylella fastidiosa causing disease on different plant species has been reported in several European countries, since 2013. Based on multilocus sequence typing (MLST) results, there is evidence of repeated introductions of the pathogen in Spain and France. In contrast, in the Salento area of Apulia (Puglia) in Southern Italy, the existence of a unique Apulian MLST genotype of X. fastidiosa, causing the olive quick decline syndrome (OQDS; also referred to as “CoDiRO” or “ST53”) was proven, and this was tentatively ascribed to X. fastidiosa subsp. pauca. In order to acquire information on intra population diversity European Food Safety Authority (EFSA) has strongly called for the characterization of X. fastidiosa isolates from Apulia to produce the necessary data to better understand strain diversity and evolution. In this work, for the first time the existence of sub-variants within a set of 14 “ST53” isolates of X. fastidiosa collected from different locations was searched using DNA typing methods targeting the whole pathogen genome. Invariably, VNTR, RAPD and rep-PCR (ERIC and BOX motifs) analyses indicated that all tested isolates possessed the same genomic fingerprint, supporting the existence of predominant epidemiological strain in Apulia. To further explore the degree of clonality within this population, two isolates from two different Salento areas (Taviano and Ugento) were completely sequenced using PacBio SMRT technology. The whole genome map and sequence comparisons revealed that both isolates are nearly identical, showing less than 0.001% nucleotide diversity. However, the complete and circularized Salento-1 and Salento-2 genome sequences were different, in genome and plasmid size, from the reference strain 9a5c of X. fastidiosa subsp. pauca (from citrus), and showed a PCR-proved large genome inversion of about 1.7 Mb. Genome-wide indices ANIm and dDDH indicated that the three isolates of X. fastidiosa from Salento (Apulia, Italy), namely Salento-1, Salento-2, and De Donno, whose complete genome sequence has been recently released, share a very recent common ancestor. This highlights the importance of continuous and extensive monitoring of molecular variation of this invasive pathogen to understand evolution of adaptive traits, and the necessity for adoption of all possible measures to reduce the risk of new introductions that may augment pathogen diversity.


September 22, 2019

A survey of localized sequence rearrangements in human DNA.

Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex ‘local’ mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur.


September 22, 2019

Using experimental evolution to identify druggable targets that could inhibit the evolution of antimicrobial resistance.

With multi-drug and pan-drug-resistant bacteria becoming increasingly common in hospitals, antibiotic resistance has threatened to return us to a pre-antibiotic era that would completely undermine modern medicine. There is an urgent need to develop new antibiotics and strategies to combat resistance that are substantially different from earlier drug discovery efforts. One such strategy that would complement current and future antibiotics would be a class of co-drugs that target the evolution of resistance and thereby extend the efficacy of specific classes of antibiotics. A critical step in the development of such strategies lies in understanding the critical evolutionary trajectories responsible for resistance and which proteins or biochemical pathways within those trajectories would be good candidates for co-drug discovery. We identify the most important steps in the evolution of resistance for a specific pathogen and antibiotic combination by evolving highly polymorphic populations of pathogens to resistance in a novel bioreactor that favors biofilm development. As the populations evolve to increasing drug concentrations, we use deep sequencing to elucidate the network of genetic changes responsible for resistance and subsequent in vitro biochemistry and often structure determination to determine how the adaptive mutations produce resistance. Importantly, the identification of the molecular steps, their frequency within the populations and their chronology within the evolutionary trajectory toward resistance is critical to assessing their relative importance. In this work, we discuss findings from the evolution of the ESKAPE pathogen, Pseudomonas aeruginosa to the drug of last resort, colistin to illustrate the power of this approach.


September 22, 2019

Cultivation-independent and cultivation-dependent analysis of microbes in the shallow-sea hydrothermal system off Kueishantao island, Taiwan: Unmasking heterotrophic bacterial diversity and functional capacity.

Shallow-sea hydrothermal systems experience continuous fluctuations of physicochemical conditions due to seawater influx which generates variable habitats, affecting the phylogenetic composition and metabolic potential of microbial communities. Until recently, studies of submarine hydrothermal communities have focused primarily on chemolithoautotrophic organisms, however, there have been limited studies on heterotrophic bacteria. Here, fluorescence in situ hybridization, high throughput 16S rRNA gene amplicon sequencing, and functional metagenomes were used to assess microbial communities from the shallow-sea hydrothermal system off Kueishantao Island, Taiwan. The results showed that the shallow-sea hydrothermal system harbored not only autotrophic bacteria but abundant heterotrophic bacteria. The potential for marker genes sulfur oxidation and carbon fixation were detected in the metagenome datasets, suggesting a role for sulfur and carbon cycling in the shallow-sea hydrothermal system. Furthermore, the presence of diverse genes that encode transporters, glycoside hydrolases, and peptidase indicates the genetic potential for heterotrophic utilization of organic substrates. A total of 408 cultivable heterotrophic bacteria were isolated, in which the taxonomic families typically associated with oligotrophy, copiotrophy, and phototrophy were frequently found. The cultivation-independent and -dependent analyses performed herein show that Alphaproteobacteria and Gammaproteobacteria represent the dominant heterotrophs in the investigated shallow-sea hydrothermal system. Genomic and physiological characterization of a novel strain P5 obtained in this study, belonging to the genus Rhodovulum within Alphaproteobacteria, provides an example of heterotrophic bacteria with major functional capacity presented in the metagenome datasets. Collectively, in addition to autotrophic bacteria, the shallow-sea hydrothermal system also harbors many heterotrophic bacteria with versatile genetic potential to adapt to the unique environmental conditions.


September 22, 2019

Reference quality genome assemblies of three Parastagonospora nodorum isolates differing in virulence on wheat.

Parastagonospora nodorum, the causal agent of Septoria nodorum blotch in wheat, has emerged as a model necrotrophic fungal organism for the study of host-microbe interactions. To date, three necrotrophic effectors have been identified and characterized from this pathogen, including SnToxA, SnTox1, and SnTox3. Necrotrophic effector identification was greatly aided by the development of a draft genome of Australian isolate SN15 via Sanger sequencing, yet it remained largely fragmented. This research presents the development of nearly finished genomes of P. nodorum isolates Sn4, Sn2000, and Sn79-1087 using long-read sequencing technology. RNAseq analysis of isolate Sn4, consisting of eight time points covering various developmental and infection stages, mediated the annotation of 13,379 genes. Analysis of these genomes revealed large-scale polymorphism between the three isolates, including the complete absence of contig 23 from isolate Sn79-1087, and a region of genome expansion on contig 10 in isolates Sn4 and Sn2000. Additionally, these genomes exhibit the hallmark characteristics of a “two-speed” genome, being partitioned into two distinct GC-equilibrated and AT-rich compartments. Interestingly, isolate Sn79-1087 contains a lower proportion of AT-rich segments, indicating a potential lack of evolutionary hotspots. These newly sequenced genomes, consisting of telomere-to-telomere assemblies of nearly all 23 P. nodorum chromosomes, provide a robust foundation for the further examination of effector biology and genome evolution. Copyright © 2018 Richards et al.


September 22, 2019

De novo assembly and phasing of dikaryotic genomes from two isolates of Puccinia coronata f. sp. avenae, the causal agent of oat crown rust.

Oat crown rust, caused by the fungus Pucinnia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenaeIMPORTANCE Disease management strategies for oat crown rust are challenged by the rapid evolution of Puccinia coronata f. sp. avenae, which renders resistance genes in oat varieties ineffective. Despite the economic importance of understanding P. coronata f. sp. avenae, resources to study the molecular mechanisms underpinning pathogenicity and the emergence of new virulence traits are lacking. Such limitations are partly due to the obligate biotrophic lifestyle of P. coronata f. sp. avenae as well as the dikaryotic nature of the genome, features that are also shared with other important rust pathogens. This study reports the first release of a haplotype-phased genome assembly for a dikaryotic fungal species and demonstrates the amenability of using emerging technologies to investigate genetic diversity in populations of P. coronata f. sp. avenae. Copyright © 2018 Miller et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.