Menu
September 22, 2019  |  

A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.

Transposable elements (TEs) are mobile DNA sequences known as drivers of genome evolution. Their impacts have been widely studied in animals, plants and insects, but little is known about them in microalgae. In a previous study, we compared the genetic polymorphisms between strains of the haptophyte microalga Tisochrysis lutea and suggested the involvement of active autonomous TEs in their genome evolution.To identify potentially autonomous TEs, we designed a pipeline named PiRATE (Pipeline to Retrieve and Annotate Transposable Elements, download: https://doi.org/10.17882/51795 ), and conducted an accurate TE annotation on a new genome assembly of T. lutea. PiRATE is composed of detection, classification and annotation steps. Its detection step combines multiple, existing analysis packages representing all major approaches for TE detection and its classification step was optimized for microalgal genomes. The efficiency of the detection and classification steps was evaluated with data on the model species Arabidopsis thaliana. PiRATE detected 81% of the TE families of A. thaliana and correctly classified 75% of them. We applied PiRATE to T. lutea genomic data and established that its genome contains 15.89% Class I and 4.95% Class II TEs. In these, 3.79 and 17.05% correspond to potentially autonomous and non-autonomous TEs, respectively. Annotation data was combined with transcriptomic and proteomic data to identify potentially active autonomous TEs. We identified 17 expressed TE families and, among these, a TIR/Mariner and a TIR/hAT family were able to synthesize their transposase. Both these TE families were among the three highest expressed genes in a previous transcriptomic study and are composed of highly similar copies throughout the genome of T. lutea. This sum of evidence reveals that both these TE families could be capable of transposing or triggering the transposition of potential related MITE elements.This manuscript provides an example of a de novo transposable element annotation of a non-model organism characterized by a fragmented genome assembly and belonging to a poorly studied phylum at genomic level. Integration of multi-omics data enabled the discovery of potential mobile TEs and opens the way for new discoveries on the role of these repeated elements in genomic evolution of microalgae.


September 22, 2019  |  

Nucleotide-binding resistance gene signatures in sugar beet, insights from a new reference genome.

Nucleotide-binding (NB-ARC), leucine-rich-repeat genes (NLRs) account for 60.8% of resistance (R) genes molecularly characterized from plants. NLRs exist as large gene families prone to tandem duplication and transposition, with high sequence diversity among crops and their wild relatives. This diversity can be a source of new disease resistance, but difficulty in distinguishing specific sequences from homologous gene family members hinders characterization of resistance for improving crop varieties. Current genome sequencing and assembly technologies, especially those using long-read sequencing, are improving resolution of repeat-rich genomic regions and clarifying locations of duplicated genes, such as NLRs. Using the conserved NB-ARC domain as a model, 231 tentative NB-ARC loci were identified in a highly contiguous genome assembly of sugar beet, revealing diverged and truncated NB-ARC signatures as well as full-length sequences. The NB-ARC-associated proteins contained NLR resistance gene domains, including TIR, CC, and LRR, as well as other integrated domains. Phylogenetic relationships of partial and complete domains were determined, and patterns of physical clustering in the genome were evaluated. Comparison of sugar beet NB-ARC domains to validated R genes from monocots and eudicots suggested extensive B. vulgaris-specific subfamily expansions. The NLR landscape in the rhizomania resistance conferring Rz region of Chromosome 3 was characterized, identifying 26 NLR-like sequences spanning 20 MB. This work presents the first detailed view of NLR family composition in a member of the Caryophyllales, builds a foundation for additional disease resistance work in B. vulgaris, and demonstrates an additional nucleic-acid-based method for NLR prediction in non-model plant species. This article is protected by copyright. All rights reserved.This article is protected by copyright. All rights reserved.


September 22, 2019  |  

Genus-wide assessment of lignocellulose utilization in the extremely thermophilic Caldicellulosiruptor by genomic, pan-genomic and metagenomic analysis

Metagenomic data from Obsidian Pool (Yellowstone National Park, USA) and 13 genome sequences were used to reassess genus-wide biodiversity for the extremely thermophilic Caldicellulosiruptor The updated core genome contains 1,401 ortholog groups (average genome size for 13 species = 2,516 genes). The pangenome, which remains open with a revised total of 3,493 ortholog groups, encodes a variety of multidomain glycoside hydrolases (GHs). These include three cellulases with GH48 domains that are colocated in the glucan degradation locus (GDL) and are specific determinants for microcrystalline cellulose utilization. Three recently sequenced species, Caldicellulosiruptor sp. strain Rt8.B8 (renamed here Caldicellulosiruptor morganii), Thermoanaerobacter cellulolyticus strain NA10 (renamed here Caldicellulosiruptor naganoensis), and Caldicellulosiruptor sp. strain Wai35.B1 (renamed here Caldicellulosiruptor danielii), degraded Avicel and lignocellulose (switchgrass). C. morganii was more efficient than Caldicellulosiruptor bescii in this regard and differed from the other 12 species examined, both based on genome content and organization and in the specific domain features of conserved GHs. Metagenomic analysis of lignocellulose-enriched samples from Obsidian Pool revealed limited new information on genus biodiversity. Enrichments yielded genomic signatures closely related to that of Caldicellulosiruptor obsidiansis, but there was also evidence for other thermophilic fermentative anaerobes (Caldanaerobacter, Fervidobacterium, Caloramator, and Clostridium). One enrichment, containing 89.8% Caldicellulosiruptor and 9.7% Caloramator, had a capacity for switchgrass solubilization comparable to that of C. bescii These results refine the known biodiversity of Caldicellulosiruptor and indicate that microcrystalline cellulose degradation at temperatures above 70°C, based on current information, is limited to certain members of this genus that produce GH48 domain-containing enzymes.IMPORTANCE The genus Caldicellulosiruptor contains the most thermophilic bacteria capable of lignocellulose deconstruction, which are promising candidates for consolidated bioprocessing for the production of biofuels and bio-based chemicals. The focus here is on the extant capability of this genus for plant biomass degradation and the extent to which this can be inferred from the core and pangenomes, based on analysis of 13 species and metagenomic sequence information from environmental samples. Key to microcrystalline hydrolysis is the content of the glucan degradation locus (GDL), a set of genes encoding glycoside hydrolases (GHs), several of which have GH48 and family 3 carbohydrate binding module domains, that function as primary cellulases. Resolving the relationship between the GDL and lignocellulose degradation will inform efforts to identify more prolific members of the genus and to develop metabolic engineering strategies to improve this characteristic. Copyright © 2018 American Society for Microbiology.


September 22, 2019  |  

Comparative genomic analysis of Geosporobacter ferrireducens and its versatility of anaerobic energy metabolism.

Members of the family Clostridiaceae within phylum Firmicutes are ubiquitous in various iron-reducing environments. However, genomic data on iron-reducing bacteria of the family Clostridiaceae, particularly regarding their environmental distribution, are limited. Here, we report the analysis and comparison of the genomic properties of Geosporobacter ferrireducens IRF9, a strict anaerobe that ferments sugars and degrades toluene under iron-reducing conditions, with those of the closely related species, Geosporobacter subterraneus DSM 17957. Putative alkyl succinate synthase-encoding genes were observed in the genome of strain IRF9 instead of the typical benzyl succinate synthase-encoding genes. Canonical genes associated with iron reduction were not observed in either genome. The genomes of strains IRF9 and DMS 17957 harbored genes for acetogenesis, that encode two types of Rnf complexes mediating the translocation of H+ and Na+ ions, respectively. Strain IRF9 harbored two different types of ATPases (Na+-dependent F-type ATPase and H+-dependent V-type ATPase), which enable full exploitation of ion gradients. The versatile energy conservation potential of strain IRF9 promotes its survival in various environmental conditions.


September 22, 2019  |  

Genome-based evolutionary history of Pseudomonas spp.

Pseudomonas is a large and diverse genus of Gammaproteobacteria. To provide a framework for discovery of evolutionary and taxonomic relationships of these bacteria, we compared the genomes of type strains of 163 species and 3 additional subspecies of Pseudomonas, including 118 genomes sequenced herein. A maximum likelihood phylogeny of the 166 type strains based on protein sequences of 100 single-copy orthologous genes revealed thirteen groups of Pseudomonas, composed of two to sixty three species each. Pairwise average nucleotide identities and alignment fractions were calculated for the data set of the 166 type strains and 1224 genomes of Pseudomonas available in public databases. Results revealed that 394 of the 1224 genomes were distinct from any type strain, suggesting that the type strains represent only a fraction of the genomic diversity of the genus. The core genome of Pseudomonas was determined to contain 794 genes conferring primarily housekeeping functions. The results of this study provide a phylogenetic framework for future studies aiming to resolve the classification and phylogenetic relationships, identify new gene functions and phenotypes, and explore the ecological and metabolic potential of the Pseudomonas spp.© 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019  |  

Long-read sequencing data analysis for yeasts.

Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes ~41 h to generate a complete and well-annotated genome from ~100× Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.


September 22, 2019  |  

Diversity and evolution of the emerging Pandoraviridae family.

With DNA genomes reaching 2.5?Mb packed in particles of bacterium-like shape and dimension, the first two Acanthamoeba-infecting pandoraviruses remained up to now the most complex viruses since their discovery in 2013. Our isolation of three new strains from distant locations and environments is now used to perform the first comparative genomics analysis of the emerging worldwide-distributed Pandoraviridae family. Thorough annotation of the genomes combining transcriptomic, proteomic, and bioinformatic analyses reveals many non-coding transcripts and significantly reduces the former set of predicted protein-coding genes. Here we show that the pandoraviruses exhibit an open pan-genome, the enormous size of which is not adequately explained by gene duplications or horizontal transfers. As most of the strain-specific genes have no extant homolog and exhibit statistical features comparable to intergenic regions, we suggest that de novo gene creation could contribute to the evolution of the giant pandoravirus genomes.


September 22, 2019  |  

Parallels between experimental and natural evolution of legume symbionts.

The emergence of symbiotic interactions has been studied using population genomics in nature and experimental evolution in the laboratory, but the parallels between these processes remain unknown. Here we compare the emergence of rhizobia after the horizontal transfer of a symbiotic plasmid in natural populations of Cupriavidus taiwanensis, over 10 MY ago, with the experimental evolution of symbiotic Ralstonia solanacearum for a few hundred generations. In spite of major differences in terms of time span, environment, genetic background, and phenotypic achievement, both processes resulted in rapid genetic diversification dominated by purifying selection. We observe no adaptation in the plasmid carrying the genes responsible for the ecological transition. Instead, adaptation was associated with positive selection in a set of genes that led to the co-option of the same quorum-sensing system in both processes. Our results provide evidence for similarities in experimental and natural evolutionary transitions and highlight the potential of comparisons between both processes to understand symbiogenesis.


September 22, 2019  |  

Horizontal transfer and proliferation of Tsu4 in Saccharomyces paradoxus.

Recent evidence suggests that horizontal transfer plays a significant role in the evolution of of transposable elements (TEs) in eukaryotes. Many cases of horizontal TE transfer (HTT) been reported in animals and plants, however surprisingly few examples of HTT have been reported in fungi.Here I report evidence for a novel HTT event in fungi involving Tsu4 in Saccharomyces paradoxus based on (i) unexpectedly high similarity between Tsu4 elements in S. paradoxus and S. uvarum, (ii) a patchy distribution of Tsu4 in S. paradoxus and general absence from its sister species S. cerevisiae, and (iii) discordance between the phylogenetic history of Tsu4 sequences and species in the Saccharomyces sensu stricto group. Available data suggests the HTT event likely occurred somewhere in the Nearctic, Neotropic or Indo-Australian part of the S. paradoxus species range, and that a lineage related to S. uvarum or S. eubayanus was the likely donor species. The HTT event has led to massive proliferation of Tsu4 in the South American lineage of S. paradoxus, which exhibits partial reproductive isolation with other strains of this species because of multiple reciprocal translocations. Full-length Tsu4 elements are associated with both breakpoints of one of these reciprocal translocations.This work shows that comprehensive analysis of TE sequences in essentially-complete genome assemblies derived from long-read sequencing provides new opportunities to detect HTT events in fungi and other organisms. This work also provides support for the hypothesis that HTT and subsequent TE proliferation can induce genome rearrangements that contribute to post-zygotic isolation in yeast.


September 22, 2019  |  

Comparative genomics of Campylobacter concisus: Analysis of clinical strains reveals genome diversity and pathogenic potential.

In recent years, an increasing number of Campylobacter species have been associated with human gastrointestinal (GI) diseases including gastroenteritis, inflammatory bowel disease, and colorectal cancer. Campylobacter concisus, an oral commensal historically linked to gingivitis and periodontitis, has been increasingly detected in the lower GI tract. In the present study, we generated robust genome sequence data from C. concisus strains and undertook a comprehensive pangenome assessment to identify C. concisus virulence properties and to explain potential adaptations acquired while residing in specific ecological niche(s) of the GI tract. Genomes of 53 new C. concisus strains were sequenced, assembled, and annotated including 36 strains from gastroenteritis patients, 13 strains from Crohn’s disease patients and four strains from colitis patients (three collagenous colitis and one lymphocytic colitis). When compared with previous published sequences, strains clustered into two main groups/genomospecies (GS) with phylogenetic clustering explained neither by disease phenotype nor sample location. Paired oral/faecal isolates, from the same patient, indicated that there are few genetic differences between oral and gut isolates which suggests that gut isolates most likely reflect oral strain relocation. Type IV and VI secretion systems genes, genes known to be important for pathogenicity in the Campylobacter genus, were present in the genomes assemblies, with 82% containing Type VI secretion system genes. Our findings indicate that C. concisus strains are genetically diverse, and the variability in bacterial secretion system content may play an important role in their virulence potential.


September 22, 2019  |  

Long-read whole genome sequencing and comparative analysis of six strains of the human pathogen Orientia tsutsugamushi.

Orientia tsutsugamushi is a clinically important but neglected obligate intracellular bacterial pathogen of the Rickettsiaceae family that causes the potentially life-threatening human disease scrub typhus. In contrast to the genome reduction seen in many obligate intracellular bacteria, early genetic studies of Orientia have revealed one of the most repetitive bacterial genomes sequenced to date. The dramatic expansion of mobile elements has hampered efforts to generate complete genome sequences using short read sequencing methodologies, and consequently there have been few studies of the comparative genomics of this neglected species.We report new high-quality genomes of O. tsutsugamushi, generated using PacBio single molecule long read sequencing, for six strains: Karp, Kato, Gilliam, TA686, UT76 and UT176. In comparative genomics analyses of these strains together with existing reference genomes from Ikeda and Boryong strains, we identify a relatively small core genome of 657 genes, grouped into core gene islands and separated by repeat regions, and use the core genes to infer the first whole-genome phylogeny of Orientia.Complete assemblies of multiple Orientia genomes verify initial suggestions that these are remarkable organisms. They have larger genomes compared with most other Rickettsiaceae, with widespread amplification of repeat elements and massive chromosomal rearrangements between strains. At the gene level, Orientia has a relatively small set of universally conserved genes, similar to other obligate intracellular bacteria, and the relative expansion in genome size can be accounted for by gene duplication and repeat amplification. Our study demonstrates the utility of long read sequencing to investigate complex bacterial genomes and characterise genomic variation.


September 22, 2019  |  

Comparative genomics of Spiraeoideae-infecting Erwinia amylovora strains provides novel insight to genetic diversity and identifies the genetic basis of a low-virulence strain.

Erwinia amylovora is the causal agent of fire blight, one of the most devastating diseases of apple and pear. Erwinia amylovora is thought to have originated in North America and has now spread to at least 50 countries worldwide. An understanding of the diversity of the pathogen population and the transmission to different geographical regions is important for the future mitigation of this disease. In this research, we performed an expanded comparative genomic study of the Spiraeoideae-infecting (SI) E. amylovora population in North America and Europe. We discovered that, although still highly homogeneous, the genetic diversity of 30 E. amylovora genomes examined was about 30 times higher than previously determined. These isolates belong to four distinct clades, three of which display geographical clustering and one of which contains strains from various geographical locations (‘Widely Prevalent’ clade). Furthermore, we revealed that strains from the Widely Prevalent clade displayed a higher level of recombination with strains from a clade strictly from the eastern USA, which suggests that the Widely Prevalent clade probably originated from the eastern USA before it spread to other locations. Finally, we detected variations in virulence in the SI E. amylovora strains on immature pear, and identified the genetic basis of one of the low-virulence strains as being caused by a single nucleotide polymorphism in hfq, a gene encoding an important virulence regulator. Our results provide insights into the population structure, distribution and evolution of SI E. amylovora in North America and Europe.© 2017 BSPP AND JOHN WILEY & SONS LTD.


September 22, 2019  |  

Clonal emergence of invasive multidrug-resistant Staphylococcus epidermidis deconvoluted via a combination of whole-genome sequencing and microbiome analyses.

Pathobionts, bacteria that are typically human commensals but can cause disease, contribute significantly to antimicrobial resistance. Staphylococcus epidermidis is a prototypical pathobiont as it is a ubiquitous human commensal but also a leading cause of healthcare-associated bacteremia. We sought to determine the etiology of a recent increase in invasive S. epidermidis isolates resistant to linezolid.Whole-genome sequencing (WGS) was performed on 176 S. epidermidis bloodstream isolates collected at the MD Anderson Cancer Center in Houston, Texas, between 2013 and 2016. Molecular relationships were assessed via complementary phylogenomic approaches. Abundance of the linezolid resistance determinant cfr was determined in stool samples via reverse-transcription quantitative polymerase chain reaction.Thirty-nine of the 176 strains were linezolid resistant (22%). Thirty-one of the 39 linezolid-resistant S. epidermidis infections were caused by a particular clone resistant to multiple antimicrobials that spread among leukemia patients and carried cfr on a 49-kb plasmid (herein called pMB151a). The 6 kb of pMB151a surrounding the cfr gene was nearly 100% identical to a cfr-containing plasmid isolated from livestock-associated staphylococci in China. Analysis of serial stool samples from leukemia patients revealed progressive staphylococcal domination of the intestinal microflora and an increase in cfr abundance following linezolid use.The combination of linezolid use plus transmission of a multidrug-resistant clone drove expansion of invasive, linezolid-resistant S. epidermidis. Our results lend support to the notion that a combination of antibiotic stewardship plus infection control measures may help to control the spread of a multidrug-resistant pathobiont.


September 22, 2019  |  

Redefinition and unification of the SXT/R391 family of integrative and conjugative elements.

Integrative and conjugative elements (ICEs) of the SXT/R391 family are key drivers of the spread of antibiotic resistance in Vibrio cholerae, the infectious agent of cholera, and other pathogenic bacteria. The SXT/R391 family of ICEs was defined based on the conservation of a core set of 52 genes and site-specific integration into the 5′ end of the chromosomal gene prfC Hence, the integrase gene int has been intensively used as a marker to detect SXT/R391 ICEs in clinical isolates. ICEs sharing most core genes but differing by their integration site and integrase gene have been recently reported and excluded from the SXT/R391 family. Here we explored the prevalence and diversity of atypical ICEs in GenBank databases and their relationship with typical SXT/R391 ICEs. We found atypical ICEs in V. cholerae isolates that predate the emergence and expansion of typical SXT/R391 ICEs in the mid-1980s in seventh-pandemic toxigenic V. cholerae strains O1 and O139. Our analyses revealed that while atypical ICEs are not associated with antibiotic resistance genes, they often carry cation efflux pumps, suggesting heavy metal resistance. Atypical ICEs constitute a polyphyletic group likely because of occasional recombination events with typical ICEs. Furthermore, we show that the alternative integration and excision genes of atypical ICEs remain under the control of SetCD, the main activator of the conjugative functions of SXT/R391 ICEs. Together, these observations indicate that substitution of the integration/excision module and change of specificity of integration do not preclude atypical ICEs from inclusion into the SXT/R391 family.IMPORTANCEVibrio cholerae is the causative agent of cholera, an acute intestinal infection that remains to this day a world public health threat. Integrative and conjugative elements (ICEs) of the SXT/R391 family have played a major role in spreading antimicrobial resistance in seventh-pandemic V. cholerae but also in several species of Enterobacteriaceae Most epidemiological surveys use the integrase gene as a marker to screen for SXT/R391 ICEs in clinical or environmental strains. With the recent reports of closely related elements that carry an alternative integrase gene, it became urgent to investigate whether ICEs that have been left out of the family are a liability for the accuracy of such screenings. In this study, based on comparative genomics, we broaden the SXT/R391 family of ICEs to include atypical ICEs that are often associated with heavy metal resistance. Copyright © 2018 American Society for Microbiology.


September 22, 2019  |  

Complete genome sequencing and comparative genomic analysis of Helicobacter apodemus isolated from the wild Korean striped field mouse (Apodemus agrarius) for potential pathogenicity

The Helicobacter bacterial genus comprises of spiral-shaped gram-negative bacteria with flagella that colonize the gastro-intestinal (GI) tract of humans and various mammals (Solnick and Schauer, 2001). In particular, Helicobacter pylori was classified as a group 1 carcinogen by the International Agency for Research on Cancer (IARC) in 1994, and has been shown to occur with a high prevalence in humans, although this varies between geographical regions, ethnic groups, and various populations (Kusters et al., 2006; Goh et al., 2011). To date, more than 37 Helicobacter species have been identified in addition to H. pylori (Péré-Védrenne et al., 2017). Furthermore, non-H. pylori Helicobacters (NHPH) have been shown to infect both humans and animals, and NHPH infections are associated with intestinal carcinoma, and mucinous adenocarcinoma (Swennes et al., 2016). Despite the demonstrated association between NHPH and disease, most studies to date have investigated H. pylori in humans; thus, it is necessary to characterize NHPH and elucidate its role in the GI tract of wild rodents which are potential Helicobacter carriers (Taylor et al., 2007; Mladenova-Hristova et al., 2017).


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.