Gap filling Archives - Page 14 of 19

July 7, 2019

Genomic epidemiology of NDM-1-encoding plasmids in Latin American clinical isolates reveals insights into the evolution of multidrug resistance

Bacteria that produce the broad-spectrum Carbapenem antibiotic New Delhi Metallo-ß-lactamase (NDM) place a burden on health care systems worldwide, due to the limited treatment options for infections caused by them and the rapid global spread of this antibiotic resistance mechanism. Although it is believed that the associated resistance gene blaNDM-1 originated in Acinetobacter spp., the role of Enterobacteriaceae in its dissemination remains unclear. In this study, we used whole genome sequencing to investigate the dissemination dynamics of blaNDM-1-positive plasmids in a set of 21 clinical NDM-1-positive isolates from Colombia and Mexico (Providencia rettgeri, Klebsiella pneumoniae, and Acinetobacter baumannii) as well as six representative NDM-1-positive Escherichia coli transconjugants. Additionally, the plasmids from three representative P. rettgeri isolates were sequenced by PacBio sequencing and finished. Our results demonstrate the presence of previously reported plasmids from K. pneumoniae and A. baumannii in different genetic backgrounds and geographically distant locations in Colombia. Three new previously unclassified plasmids were also identified in P. rettgeri from Colombia and Mexico, plus an interesting genetic link between NDM-1-positive P. rettgeri from distant geographic locations (Canada, Mexico, Colombia, and Israel) without any reported epidemiological links was discovered. Finally, we detected a relationship between plasmids present in P. rettgeri and plasmids from A. baumannii and K. pneumoniae. Overall, our findings suggest a Russian doll model for the dissemination of blaNDM-1 in Latin America, with P. rettgeri playing a central role in this process, and reveal new insights into the evolution and dissemination of plasmids carrying such antibiotic resistance genes.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

July 7, 2019

Plasmid composition in Aeromonas salmonicida subsp. salmonicida 01-B526 unravels unsuspected type three secretion system loss patterns.

Aeromonas salmonicida subsp. salmonicida is a ubiquitous psychrophilic waterborne bacterium and a fish pathogen. The numerous mobile elements, especially insertion sequences (IS), in its genome promote rearrangements that impact its phenotype. One of the main virulence factors of this bacterium, its type three secretion system (TTSS), is affected by these rearrangements. In Aeromonas salmonicida subsp. salmonicida most of the TTSS genes are encoded in a single locus on a large plasmid called pAsa5, and may be lost when the bacterium is cultivated at a higher temperature (25 °C), producing non-virulent mutants. In a previous study, pAsa5-rearranged strains that lacked the TTSS locus on pAsa5 were produced using parental strains, including 01-B526. Some of the generated deletions were explained by homologous recombination between ISs found on pAsa5, whereas the others remained unresolved. To investigate those rearrangements, short- and long-read high-throughput sequencing technologies were used on the A. salmonicida subsp. salmonicida 01-B526 whole genome.Whole genome sequencing of the 01-B526 strain revealed that its pAsa5 has an additional IS copy, an ISAS5, compared to the reference strain (A449) sequence, which allowed for a previously unknown rearrangement to occur. It also appeared that 01-B526 bears a second large plasmid, named pAsa9, which shares 40 kbp of highly similar sequences with pAsa5. Following these discoveries, previously unexplained deletions were elucidated by genotyping. Furthermore, in one of the derived strains a fusion of pAsa5 and pAsa9, involving the newly discovered ISAS5 copy, was observed.The loss of TTSS and hence virulence is explained by one consistent mechanism: IS-driven homologous recombination. The similarities between pAsa9 and pAsa5 also provide another example of genetic diversity driven by ISs.

July 7, 2019

Hybrid assembly with long and short reads improves discovery of gene family expansions.

Long-read and short-read sequencing technologies offer competing advantages for eukaryotic genome sequencing projects. Combinations of both may be appropriate for surveys of within-species genomic variation.We developed a hybrid assembly pipeline called “Alpaca” that can operate on 20X long-read coverage plus about 50X short-insert and 50X long-insert short-read coverage. To preclude collapse of tandem repeats, Alpaca relies on base-call-corrected long reads for contig formation.Compared to two other assembly protocols, Alpaca demonstrated the most reference agreement and repeat capture on the rice genome. On three accessions of the model legume Medicago truncatula, Alpaca generated the most agreement to a conspecific reference and predicted tandemly repeated genes absent from the other assemblies.Our results suggest Alpaca is a useful tool for investigating structural and copy number variation within de novo assemblies of sampled populations.

July 7, 2019

Repeated divergent selection on pigmentation genes in a rapid finch radiation.

Instances of recent and rapid speciation are suitable for associating phenotypes with their causal genotypes, especially if gene flow homogenizes areas of the genome that are not under divergent selection. We study a rapid radiation of nine sympatric bird species known as capuchino seedeaters, which are differentiated in sexually selected characters of male plumage and song. We sequenced the genomes of a phenotypically diverse set of species to search for differentiated genomic regions. Capuchinos show differences in a small proportion of their genomes, yet selection has acted independently on the same targets in different members of this radiation. Many divergent regions contain genes involved in the melanogenesis pathway, with the strongest signal originating from putative regulatory regions. Selection has acted on these same genomic regions in different lineages, likely shaping the evolution of cis-regulatory elements, which control how more conserved genes are expressed and thereby generate diversity in classically sexually selected traits.

July 7, 2019

The origin, diversification and adaptation of a major mangrove clade (Rhizophoreae) revealed by whole-genome sequencing

Mangroves invade some very marginal habitats for woody plants—at the interface between land and sea. Since mangroves anchor tropical coastal communities globally, their origin, diversification and adaptation are of scientific significance, particularly at a time of global climate change. In this study, a combination of single-molecule long reads and the more conventional short reads are generated from Rhizophora apiculata for the de novo assembly of its genome to a near chromosome level. The longest scaffold, N50 and N90 for the R. apiculata genome, are 13.3 Mb, 5.4 Mb and 1.0 Mb, respectively. Short reads for the genomes and transcriptomes of eight related species are also generated. We find that the ancestor of Rhizophoreae experienced a whole-genome duplication ~70 Myrs ago, which is followed rather quickly by colonization and species diversification. Mangroves exhibit pan-exome modifications of amino acid (AA) usage as well as unusual AA substitutions among closely related species. The usage and substitution of AAs, unique among plants surveyed, is correlated with the rapid evolution of proteins in mangroves. A small subset of these substitutions is associated with mangroves’ highly specialized traits (vivipary and red bark) thought to be adaptive in the intertidal habitats. Despite the many adaptive features, mangroves are among the least genetically diverse plants, likely the result of continual habitat turnovers caused by repeated rises and falls of sea level in the geologically recent past. Mangrove genomes thus inform about their past evolutionary success as well as portend a possibly difficult future.

July 7, 2019

Whole genome sequencing predicts novel human disease models in rhesus macaques.

Rhesus macaques are an important pre-clinical model of human disease. To advance our understanding of genomic variation that may influence disease, we surveyed genome-wide variation in 21 rhesus macaques. We employed best-practice variant calling, validated with Mendelian inheritance. Next, we used alignment data from our cohort to detect genomic regions likely to produce inaccurate genotypes, potentially due to either gene duplication or structural variation between individuals. We generated a final dataset of >16 million high confidence variants, including 13 million in Chinese-origin rhesus macaques, an increasingly important disease model. We detected an average of 131 mutations predicted to severely alter protein coding per animal, and identified 45 such variants that coincide with known pathogenic human variants. These data suggest that expanded screening of existing breeding colonies will identify novel models of human disease, and that increased genomic characterization can help inform research studies in macaques. Copyright © 2017 Elsevier Inc. All rights reserved.

July 7, 2019

Comparison of pseudorabies virus China reference strain with emerging variants reveals independent virus evolution within specific geographic regions.

Pseudorabies virus (PRV) China reference strain Ea is genetically closely related to newly emerged variants; however, there is limited information about PRV Ea. Here, we compared PRV Ea with new variant strains by growth kinetics, genome sequencing, and protein expression analysis. Growth analysis showed that strain Ea forms smaller plaques than strain HNX. The full-length genome sequence of Ea revealed that it is clustered in the same subgroup as HNX. Ea and HNX strains exhibited similar extracellular virion protein polymorphisms, whereas strain Bartha expressed less VP26 and more GAPDH. In infected cells, strain Ea expressed high levels of IE180 protein, and Ea and HNX produced higher levels of UL21 protein than strain Bartha. These findings provide evidence that PRV China reference strain Ea is genetically closely related to the newly emerged variant strains, indicating that strain PRV China may have evolved independently leading to the emergence of a variant strain. Copyright © 2017 Elsevier Inc. All rights reserved.

July 7, 2019

Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production.

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.

July 7, 2019

Characterization of the polymyxin D synthetase biosynthetic cluster and product profile of Paenibacillus polymyxa ATCC 10401.

The increasing prevalence of polymyxin-resistant bacteria has stimulated the search for improved polymyxin lipopeptides. Here we describe the sequence and product profile for polymyxin D nonribosomal peptide synthetase from Paenibacillus polymyxa ATCC 10401. The polymyxin D synthase gene cluster comprised five genes that encoded ABC transporters (pmxC and pmxD) and enzymes responsible for the biosynthesis of polymyxin D (pmxA, pmxB, and pmxE). Unlike polymyxins B and E, polymyxin D contains d-Ser at position 3 as opposed to l-a,?-diaminobutyric acid and has an l-Thr at position 7 rather than l-Leu. Module 3 of pmxE harbored an auxiliary epimerization domain that catalyzes the conversion of l-Ser to the d-form. Structural modeling suggested that the adenylation domains of module 3 in PmxE and modules 6 and 7 in PmxA could bind amino acids with larger side chains than their preferred substrate. Feeding individual amino acids into the culture media not only affected production of polymyxins D1 and D2 but also led to the incorporation of different amino acids at positions 3, 6, and 7 of polymyxin D. Interestingly, the unnatural polymyxin analogues did not show antibiotic activity against a panel of Gram-negative clinical isolates, while the natural polymyxins D1 and D2 exhibited excellent in vitro antibacterial activity and were efficacious against Klebsiella pneumoniae and Acinetobacter baumannii in a mouse blood infection model. The results demonstrate the excellent antibacterial activity of these unusual d-Ser(3) polymxyins and underscore the possibility of incorporating alternate amino acids at positions 3, 6, and 7 of polymyxin D via manipulation of the polymyxin nonribosomal biosynthetic machinery.

July 7, 2019

Euglena gracilis genome and transcriptome: organelles, nuclear genome assembly strategies and initial features.

Euglena gracilis is a major component of the aquatic ecosystem and together with closely related species, is ubiquitous worldwide. Euglenoids are an important group of protists, possessing a secondarily acquired plastid and are relatives to the Kinetoplastidae, which themselves have global impact as disease agents. To understand the biology of E. gracilis, as well as to provide further insight into the evolution and origins of the Kinetoplastidae, we embarked on sequencing the nuclear genome; the plastid and mitochondrial genomes are already in the public domain. Earlier studies suggested an extensive nuclear DNA content, with likely a high degree of repetitive sequence, together with significant extrachromosomal elements. To produce a list of coding sequences we have combined transcriptome data from both published and new sources, as well as embarked on de novo sequencing using a combination of 454, Illumina paired end libraries and long PacBio reads. Preliminary analysis suggests a surprisingly large genome approaching 2 Gbp, with a highly fragmented architecture and extensive repeat composition. Over 80% of the RNAseq reads from E. gracilis maps to the assembled genome sequence, which is comparable with the well assembled genomes of T. brucei and T. cruzi. In order to achieve this level of assembly we employed multiple informatics pipelines, which are discussed here. Finally, as a preliminary view of the genome architecture, we discuss the tubulin and calmodulin genes, which highlight potential novel splicing mechanisms.

July 7, 2019

Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula.

Third generation sequencing technologies, with sequencing reads in the tens- of kilo-bases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner.Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly.Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies.

July 7, 2019

Complete genome sequence of the sand-sediment actinobacterium Nocardioides dokdonensis FR1436(T).

Nocardioides dokdonensis, belonging to the class Actinobacteria, was first isolated from sand sediment of a beach in Dokdo, Korea, in 2005. In this study, we determined the genome sequence of FR1436, the type strain of N. dokdonensis, and analyzed its gene contents. The genome sequence is the second complete one in the genus Nocardioides after that of Nocardioides sp. JS614. It is composed of a 4,376,707-bp chromosome with a G + C content of 72.26%. From the genome sequence, 4,104 CDSs, three rRNA operons, 51 tRNAs, and one tmRNA were predicted, and 71.38% of the genes were assigned putative functions. Through the sequence analysis, dozens of genes involved in steroid metabolism, especially its degradation, were detected. Most of the identified genes were located in large gene clusters, which showed high similarities with the gene clusters in Pimelobacter simplex VKM Ac-2033D. Genomic features of N. dokdonensis associated with steroid catabolism indicate that it could be used for research and application of steroids in science and industry.

July 7, 2019

Complete genome sequence of the nematicidal Bacillus thuringiensis MYBT18246.

Bacillus thuringiensis is a rod-shaped facultative anaerobic spore forming bacterium of the genus Bacillus . The defining feature of the species is the ability to produce parasporal crystal inclusion bodies, consisting of d-endotoxins, encoded by cry-genes. Here we present the complete annotated genome sequence of the nematicidal B. thuringiensis strain MYBT18246. The genome comprises one 5,867,749 bp chromosome and 11 plasmids which vary in size from 6330 bp to 150,790 bp. The chromosome contains 6092 protein-coding and 150 RNA genes, including 36 rRNA genes. The plasmids encode 997 proteins and 4 t-RNA’s. Analysis of the genome revealed a large number of mobile elements involved in genome plasticity including 11 plasmids and 16 chromosomal prophages. Three different nematicidal toxin genes were identified and classified according to the Cry toxin naming committee as cry13Aa2, cry13Ba1, and cry13Ab1. Strikingly, these genes are located on the chromosome in close proximity to three separate prophages. Moreover, four putative toxin genes of different toxin classes were identified on the plasmids p120510 (Vip-like toxin), p120416 (Cry-like toxin) and p109822 (two Bin-like toxins). A comparative genome analysis of B. thuringiensis MYBT18246 with three closely related B. thuringiensis strains enabled determination of the pan-genome of B. thuringiensis MYBT18246, revealing a large number of singletons, mostly represented by phage genes, morons and cryptic genes.

July 7, 2019

High-quality genome sequence of the radioresistant bacterium Deinococcus ficus KS 0460.

The genetic platforms of Deinococcus species remain the only systems in which massive ionizing radiation (IR)-induced genome damage can be investigated in vivo at exposures commensurate with cellular survival. We report the whole genome sequence of the extremely IR-resistant rod-shaped bacterium Deinococcus ficus KS 0460 and its phenotypic characterization. Deinococcus ficus KS 0460 has been studied since 1987, first under the name Deinobacter grandis, then Deinococcus grandis. The D. ficus KS 0460 genome consists of a 4.019 Mbp sequence (69.7% GC content and 3894 predicted genes) divided into six genome partitions, five of which are confirmed to be circular. Circularity was determined manually by mate pair linkage. Approximately 76% of the predicted proteins contained identifiable Pfam domains and 72% were assigned to COGs. Of all D. ficus KS 0460 proteins, 79% and 70% had homologues in Deinococcus radiodurans ATCC BAA-816 and Deinococcus geothermalis DSM 11300, respectively. The most striking differences between D. ficus KS 0460 and D. radiodurans BAA-816 identified by the comparison of the KEGG pathways were as follows: (i) D. ficus lacks nine enzymes of purine degradation present in D. radiodurans, and (ii) D. ficus contains eight enzymes involved in nitrogen metabolism, including nitrate and nitrite reductases, that D. radiodurans lacks. Moreover, genes previously considered to be important to IR resistance are missing in D. ficus KS 0460, namely, for the Mn-transporter nramp, and proteins DdrF, DdrJ and DdrK, all of which are also missing in Deinococcus deserti. Otherwise, D. ficus KS 0460 exemplifies the Deinococcus lineage.

July 7, 2019

The Apostasia genome and the evolution of orchids.

Constituting approximately 10% of flowering plant species, orchids (Orchidaceae) display unique flower morphologies, possess an extraordinary diversity in lifestyle, and have successfully colonized almost every habitat on Earth. Here we report the draft genome sequence of Apostasia shenzhenica, a representative of one of two genera that form a sister lineage to the rest of the Orchidaceae, providing a reference for inferring the genome content and structure of the most recent common ancestor of all extant orchids and improving our understanding of their origins and evolution. In addition, we present transcriptome data for representatives of Vanilloideae, Cypripedioideae and Orchidoideae, and novel third-generation genome data for two species of Epidendroideae, covering all five orchid subfamilies. A. shenzhenica shows clear evidence of a whole-genome duplication, which is shared by all orchids and occurred shortly before their divergence. Comparisons between A. shenzhenica and other orchids and angiosperms also permitted the reconstruction of an ancestral orchid gene toolkit. We identify new gene families, gene family expansions and contractions, and changes within MADS-box gene classes, which control a diverse suite of developmental processes, during orchid evolution. This study sheds new light on the genetic mechanisms underpinning key orchid innovations, including the development of the labellum and gynostemium, pollinia, and seeds without endosperm, as well as the evolution of epiphytism; reveals relationships between the Orchidaceae subfamilies; and helps clarify the evolutionary history of orchids within the angiosperms.

Auto Tag: Gap filling

Genomic epidemiology of NDM-1-encoding plasmids in Latin American clinical isolates reveals insights into the evolution of multidrug resistance

Plasmid composition in Aeromonas salmonicida subsp. salmonicida 01-B526 unravels unsuspected type three secretion system loss patterns.

Hybrid assembly with long and short reads improves discovery of gene family expansions.

Repeated divergent selection on pigmentation genes in a rapid finch radiation.

The origin, diversification and adaptation of a major mangrove clade (Rhizophoreae) revealed by whole-genome sequencing

Whole genome sequencing predicts novel human disease models in rhesus macaques.

Comparison of pseudorabies virus China reference strain with emerging variants reveals independent virus evolution within specific geographic regions.

Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production.

Characterization of the polymyxin D synthetase biosynthetic cluster and product profile of Paenibacillus polymyxa ATCC 10401.

Euglena gracilis genome and transcriptome: organelles, nuclear genome assembly strategies and initial features.

Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula.

Complete genome sequence of the sand-sediment actinobacterium Nocardioides dokdonensis FR1436(T).

Complete genome sequence of the nematicidal Bacillus thuringiensis MYBT18246.

High-quality genome sequence of the radioresistant bacterium Deinococcus ficus KS 0460.

The Apostasia genome and the evolution of orchids.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert