Menu
July 7, 2019

Complete genome sequence analysis of Enterobacter sp. SA187, a plant multi-stress tolerance promoting endophytic bacterium

Enterobacter sp. SA187 is an endophytic bacterium that has been isolated from root nodules of the indigenous desert plant Indigofera argentea. SA187 could survive in the rhizosphere as well as in association with different plant species, and was able to provide abiotic stress tolerance to Arabidopsis thaliana. The genome sequence of SA187 was obtained by using Pacific BioScience (PacBio) single-molecule sequencing technology, with average coverage of 275X. The genome of SA187 consists of one single 4,429,597 bp chromosome, with an average 56% GC content and 4,347 predicted protein coding DNA sequences (CDS), 153 ncRNA, 7 rRNA, and 84 tRNA. Functional analysis of the SA187 genome revealed a large number of genes involved in uptake and exchange of nutrients, chemotaxis, mobilization and plant colonization. A high number of genes were also found to be involved in survival, defense against oxidative stress and production of antimicrobial compounds and toxins. Moreover, different metabolic pathways were identified that potentially contribute to plant growth promotion. The information encoded in the genome of SA187 reveals the characteristics of a dualistic lifestyle of a bacterium that can adapt to different environments and promote the growth of plants. This information provides a better understanding of the mechanisms involved in plant-microbe interaction and could be further exploited to develop SA187 as a biological agent to improve agricultural practices in marginal and arid lands.


July 7, 2019

Remarkable diversity of Escherichia coli carrying mcr-1 from hospital sewage with the identification of two new mcr-1 variants.

The plasmid-borne colistin-resistant gene mcr-1 has rapidly become a worldwide public health concern. This study aims to determine the host bacterial strains, plasmids, and genetic contexts of mcr-1 in hospital sewage. A 1-ml hospital sewage sample was cultured. Colistin-resistant bacterial colonies were selected on agar plates and were subjected to whole genome sequencing and subsequent analysis. The transfer of mcr-1 between bacterial strains was tested using conjugation. New variants of mcr-1 were cloned to test the impact of variations on the function of mcr-1. Plasmids carrying mcr-1 were retrieved from GenBank for comparison based on concatenated backbone genes. In the sewage sample, we observed that mcr-1 was located in various genetic contexts on the chromosome, or plasmids of four different replicon types (IncHI2, IncI2, IncP, and IncX4), in Klebsiella pneumoniae, Kluyvera spp. and seven Escherichia coli strains of six different sequence types (ST10, ST34, ST48, ST1196, ST7086, and ST7087). We also identified two new variants of mcr-1, mcr-1.4 and mcr-1.7, both of which encode an amino acid variation from mcr-1. mcr-1-carrying IncX4 plasmids, which have a global distribution across the Enterobacteriaceae, are the result of global dissemination of a single common plasmid, while IncI2 mcr-1 plasmids appear to acquire mcr-1 in multiple events. In conclusion, the unprecedented remarkable diversity of species, strains, plasmids, and genetic contexts carrying mcr-1 present in a single sewage sample from a single healthcare site highlights the continued evolution and dynamic transmission of mcr-1 in healthcare-associated environments.


July 7, 2019

Genome sequence of the small brown planthopper, Laodelphax striatellus.

Laodelphax striatellus Fallén (Hemiptera: Delphacidae) is one of the most destructive rice pests. L. striatellus is different from 2 other rice planthoppers with a released genome sequence, Sogatella furcifera and Nilaparvata lugens, in many biological characteristics, such as host range, dispersal capacity, and vectoring plant viruses. Deciphering the genome of L. striatellus will further the understanding of the genetic basis of the biological differences among the 3 rice planthoppers.A total of 190 Gb of Illumina data and 32.4 Gb of Pacbio data were generated and used to assemble a high-quality L. striatellus genome sequence, which is 541 Mb in length and has a contig N50 of 118 Kb and a scaffold N50 of 1.08 Mb. Annotated repetitive elements account for 25.7% of the genome. A total of 17?736 protein-coding genes were annotated, capturing 97.6% and 98% of the BUSCO eukaryote and arthropoda genes, respectively. Compared with N. lugens and S. furcifera, L. striatellus has the smallest genome and the lowest gene number. Gene family expansion and transcriptomic analyses provided hints to the genomic basis of the differences in important traits such as host range, migratory habit, and plant virus transmission between L. striatellus and the other 2 planthoppers.We report a high-quality genome assembly of L. striatellus, which is an important genomic resource not only for the study of the biology of L. striatellus and its interactions with plant hosts and plant viruses, but also for comparison with other planthoppers.© The Authors 2017. Published by Oxford University Press.


July 7, 2019

The genome sequence of Bipolaris cookei reveals mechanisms of pathogenesis underlying target leaf spot of sorghum.

Bipolaris cookei (=Bipolaris sorghicola) causes target leaf spot, one of the most prevalent foliar diseases of sorghum. Little is known about the molecular basis of pathogenesis in B. cookei, in large part due to a paucity of resources for molecular genetics, such as a reference genome. Here, a draft genome sequence of B. cookei was obtained and analyzed. A hybrid assembly strategy utilizing Illumina and Pacific Biosciences sequencing technologies produced a draft nuclear genome of 36.1?Mb, organized into 321 scaffolds with L50 of 31 and N50 of 378?kb, from which 11,189 genes were predicted. Additionally, a finished mitochondrial genome sequence of 135,790?bp was obtained, which contained 75 predicted genes. Comparative genomics revealed that B. cookei possessed substantially fewer carbohydrate-active enzymes and secreted proteins than closely related Bipolaris species. Novel genes involved in secondary metabolism, including genes implicated in ophiobolin biosynthesis, were identified. Among 37 B. cookei genes induced during sorghum infection, one encodes a putative effector with a limited taxonomic distribution among plant pathogenic fungi. The draft genome sequence of B. cookei provided novel insights into target leaf spot of sorghum and is an important resource for future investigation.


July 7, 2019

Hidden genetic variation shapes the structure of functional elements in Drosophila.

Mutations that add, subtract, rearrange, or otherwise refashion genome structure often affect phenotypes, although the fragmented nature of most contemporary assemblies obscures them. To discover such mutations, we assembled the first new reference-quality genome of Drosophila melanogaster since its initial sequencing. By comparing this new genome to the existing D. melanogaster assembly, we created a structural variant map of unprecedented resolution and identified extensive genetic variation that has remained hidden until now. Many of these variants constitute candidates underlying phenotypic variation, including tandem duplications and a transposable element insertion that amplifies the expression of detoxification-related genes associated with nicotine resistance. The abundance of important genetic variation that still evades discovery highlights how crucial high-quality reference genomes are to deciphering phenotypes.


July 7, 2019

RNA-seq and Tn-seq reveal fitness determinants of vancomycin-resistant Enterococcus faecium during growth in human serum.

The Gram-positive bacterium Enterococcus faecium is a commensal of the human gastrointestinal tract and a frequent cause of bloodstream infections in hospitalized patients. The mechanisms by which E. faecium can survive and grow in blood during an infection have not yet been characterized. Here, we identify genes that contribute to growth of E. faecium in human serum through transcriptome profiling (RNA-seq) and a high-throughput transposon mutant library sequencing approach (Tn-seq).We first sequenced the genome of E. faecium E745, a vancomycin-resistant clinical isolate, using a combination of short- and long read sequencing, revealing a 2,765,010 nt chromosome and 6 plasmids, with sizes ranging between 9.3 kbp and 223.7 kbp. We then compared the transcriptome of E. faecium E745 during exponential growth in rich medium and in human serum by RNA-seq. This analysis revealed that 27.8% of genes on the E. faecium E745 genome were differentially expressed in these two conditions. A gene cluster with a role in purine biosynthesis was among the most upregulated genes in E. faecium E745 upon growth in serum. The E. faecium E745 transposon mutant library was then used to identify genes that were specifically required for growth of E. faecium in serum. Genes involved in de novo nucleotide biosynthesis (including pyrK_2, pyrF, purD, purH) and a gene encoding a phosphotransferase system subunit (manY_2) were thus identified to be contributing to E. faecium growth in human serum. Transposon mutants in pyrK_2, pyrF, purD, purH and manY_2 were isolated from the library and their impaired growth in human serum was confirmed. In addition, the pyrK_2 and manY_2 mutants were tested for their virulence in an intravenous zebrafish infection model and exhibited significantly attenuated virulence compared to E. faecium E745.Genes involved in carbohydrate metabolism and nucleotide biosynthesis of E. faecium are essential for growth in human serum and contribute to the pathogenesis of this organism. These genes may serve as targets for the development of novel anti-infectives for the treatment of E. faecium bloodstream infections.


July 7, 2019

Chromosome level assembly and secondary metabolite potential of the parasitic fungus Cordyceps militaris.

Cordyceps militaris is an insect pathogenic fungus that is prized for its use in traditional medicine. This and other entomopathogenic fungi are understudied sources for the discovery of new bioactive molecules. In this study, PacBio SMRT long read sequencing technology was used to sequence the genome of C. militaris with a focus on the genetic potential for secondary metabolite production in the genome assembly of this fungus.This is first chromosome level assembly of a species in the Cordyceps genera. In this seven chromosome assembly of 33.6 Mba there were 9371 genes identified. Cordyceps militaris was determined to have the MAT 1-1-1 and MAT 1-1-2 mating type genes. Secondary metabolite analysis revealed the potential for at least 36 distinct metabolites from a variety of classes. Three of these gene clusters had homology with clusters producing desmethylbassianin, equisetin and emericellamide that had been studied in other fungi.Our assembly and analysis has revealed that C. militaris has a wealth of gene clusters for secondary metabolite production distributed among seven chromosomes. The identification of these gene clusters will facilitate the future study and identification of the secondary metabolites produced by this entomopathogenic fungus.


July 7, 2019

The complete genome sequence of Ensifer meliloti strain CCMM B554 (FSM-MA), a highly effective nitrogen-fixing microsymbiont of Medicago truncatula Gaertn.

Strain CCMM B554, also known as FSM-MA, is a soil dwelling and nodule forming, nitrogen-fixing bacterium isolated from the nodules of the legume Medicago arborea L. in the Maamora Forest, Morocco. The strain forms effective nitrogen fixing nodules on species of the Medicago, Melilotus and Trigonella genera and is exceptional because it is a highly effective symbiotic partner of the two most widely used accessions, A17 and R108, of the model legume Medicago truncatula Gaertn. Based on 16S rRNA gene sequence, multilocus sequence and average nucleotide identity analyses, FSM-MA is identified as a new Ensifer meliloti strain. The genome is 6,70 Mbp and is comprised of the chromosome (3,64 Mbp) harboring 3574 predicted genes and two megaplasmids, pSymA (1,42 Mbp) and pSymB (1,64 Mbp) with respectively 1481 and 1595 predicted genes. The average GC content of the genome is 61.93%. The FSM-MA genome structure is highly similar and co-linear to other E. meliloti strains in the chromosome and the pSymB megaplasmid while, in contrast, it shows high variability in the pSymA plasmid. The large number of strain-specific sequences in pSymA as well as strain-specific genes on pSymB involved in the biosynthesis of the lipopolysaccharide and capsular polysaccharide surface polysaccharides may encode novel symbiotic functions explaining the high symbiotic performance of FSM-MA.


July 7, 2019

Resequencing of the Leishmania infantum (strain JPCM5) genome and de novo assembly into 36 contigs.

Leishmania parasites are the causative of leishmaniasis, a group of potentially fatal human diseases. Control strategies for leishmaniasis can be enhanced by genome based investigations. The publication in 2005 of the Leishmania major genome sequence, and two years later the genomes for the species Leishmania braziliensis and Leishmania infantum were major milestones. Since then, the L. infantum genome, although highly fragmented and incomplete, has been used widely as the reference genome to address whole transcriptomics and proteomics studies. Here, we report the sequencing of the L. infantum genome by two NGS methodologies and, as a result, the complete genome assembly on 36 contigs (chromosomes). Regarding the present L. infantum genome-draft, 495 new genes have been annotated, a hundred have been corrected and 75 previous annotated genes have been discontinued. These changes are not only the result of an increase in the genome size, but a significant contribution derives from the existence of a large number of incorrectly assembled regions in current chromosomal scaffolds. Furthermore, an improved assembly of tandemly repeated genes has been obtained. All these analyses support that the de novo assembled L. infantum genome represents a robust assembly and should replace the currently available in the databases.


July 7, 2019

Complete genome sequences of two plant-associated Pseudomonas putida isolates with increased heavy-metal tolerance.

We report here the complete genome sequences of two Pseudomonas putida isolates recovered from surfac e-sterilized roots of Sida hermaphrodita The two isolates were characterized by an increased tolerance to zinc, cadmium, and lead. Furthermore, the strains showed typical plant growth-promoting properties, such as the production of indole acetic acid, cellulolytic enzymes, and siderophores. Copyright © 2017 Nesme et al.


July 7, 2019

Draft sequencing of the heterozygous diploid genome of Satsuma (Citrus unshiu Marc.) using a hybrid assembly approach.

Satsuma (Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma (“Miyagawa Wase”) was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome.


July 7, 2019

Scaffolding of long read assemblies using long range contact information.

Long read technologies have revolutionized de novo genome assembly by generating contigs orders of magnitude longer than that of short read assemblies. Although assembly contiguity has increased, it usually does not reconstruct a full chromosome or an arm of the chromosome, resulting in an unfinished chromosome level assembly. To increase the contiguity of the assembly to the chromosome level, different strategies are used which exploit long range contact information between chromosomes in the genome.We develop a scalable and computationally efficient scaffolding method that can boost the assembly contiguity to a large extent using genome-wide chromatin interaction data such as Hi-C.we demonstrate an algorithm that uses Hi-C data for longer-range scaffolding of de novo long read genome assemblies. We tested our methods on the human and goat genome assemblies. We compare our scaffolds with the scaffolds generated by LACHESIS based on various metrics.Our new algorithm SALSA produces more accurate scaffolds compared to the existing state of the art method LACHESIS.


July 7, 2019

SV2: Accurate structural variation genotyping and de novo mutation detection from whole genomes.

Structural Variation (SV) detection from short-read whole genome sequencing is error prone, presenting significant challenges for population or family-based studies of disease.Here we describe SV2, a machine-learning algorithm for genotyping deletions and duplications from paired-end sequencing data. SV2 can rapidly integrate variant calls from multiple structural variant discovery algorithms into a unified call set with high genotyping accuracy and capability to detect de novo mutations. SV2 is freely available on GitHub (https://github.com/dantaki/SV2).Supplementary data are available at Bioinformatics online.© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com


July 7, 2019

Study of mesophilic Aeromonas salmonicida A527 strain sheds light on the species’ lifestyles and taxonomic dilemma.

The Gram-negative bacterium Aeromonas salmonicida contains five subspecies: salmonicida, smithia, achromogenes, masoucida and pectinolytica. Pectinolytica is a mesophilic subspecies with the ability to thrive at a wide range of temperatures, including 37°C, while the four other subspecies are psychrophilic, restricted to lower temperatures. The psychrophilic subspecies are known to infect a wide range of fishes. However, there is no evidence of pathogenicity for the mesophilic subspecies pectinolytica. Study of the differences between the mesophilic and psychrophilic subspecies is hampered by the lack of completely sequenced and closed genomes from the mesophilic subspecies. A previous study reported that insertion sequences, which can induce genomic rearrangements at temperatures around 25°C, could be one of the determinants explaining the differences in lifestyle (mesophilic or psychrophilic) between the subspecies. In this study, the genome of mesophilic strain A527 of A. salmonicida was sequenced, closed and analyzed to investigate the mesophilic-psychrophilic discrepancy. This reference genome supports the hypothesis that insertion sequences are major determinants of the lifestyle differences between the A. salmonicida subspecies. Moreover, the phylogenetic analysis performed to position strain A527 within the taxonomy raises an issue regarding the intraspecies structure of A. salmonicida.© FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Trajectories and drivers of genome evolution in surface-associated marine Phaeobacter.

The extent of genome divergence and the evolutionary events leading to speciation of marine bacteria have mostly been studied for (locally) abundant, free-living groups. The genus Phaeobacter is found on different marine surfaces, seems to occupy geographically disjunct habitats, and is involved in different biotic interactions, and was therefore targeted in the present study. The analysis of the chromosomes of 32 closely related but geographically spread Phaeobacter strains revealed an exceptionally large, highly syntenic core genome. The flexible gene pool is constantly but slightly expanding across all Phaeobacter lineages. The horizontally transferred genes mostly originated from bacteria of the Roseobacter group and horizontal transfer most likely was mediated by gene transfer agents. No evidence for geographic isolation and habitat specificity of the different phylogenomic Phaeobacter clades was detected based on the sources of isolation. In contrast, the functional gene repertoire and physiological traits of different phylogenomic Phaeobacter clades were sufficiently distinct to suggest an adaptation to an associated lifestyle with algae, to additional nutrient sources, or toxic heavy metals. Our study reveals that the evolutionary trajectories of surface-associated marine bacteria can differ significantly from free-living marine bacteria or marine generalists.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.