Menu
April 21, 2020

Medusavirus, a Novel Large DNA Virus Discovered from Hot Spring Water.

Recent discoveries of new large DNA viruses reveal high diversity in their morphologies, genetic repertoires, and replication strategies. Here, we report the novel features of medusavirus, a large DNA virus newly isolated from hot spring water in Japan. Medusavirus, with a diameter of 260?nm, shows a T=277 icosahedral capsid with unique spherical-headed spikes on its surface. It has a 381-kb genome encoding 461 putative proteins, 86 of which have their closest homologs in Acanthamoeba, whereas 279 (61%) are orphan genes. The virus lacks the genes encoding DNA topoisomerase II and RNA polymerase, showing that DNA replication takes place in the host nucleus, whereas the progeny virions are assembled in the cytoplasm. Furthermore, the medusavirus genome harbored genes for all five types of histones (H1, H2A, H2B, H3, and H4) and one DNA polymerase, which are phylogenetically placed at the root of the eukaryotic clades. In contrast, the host amoeba encoded many medusavirus homologs, including the major capsid protein. These facts strongly suggested that amoebae are indeed the most promising natural hosts of medusavirus, and that lateral gene transfers have taken place repeatedly and bidirectionally between the virus and its host since the early stage of their coevolution. Medusavirus reflects the traces of direct evolutionary interactions between the virus and eukaryotic hosts, which may be caused by sharing the DNA replication compartment and by evolutionarily long lasting virus-host relationships. Based on its unique morphological characteristics and phylogenomic relationships with other known large DNA viruses, we propose that medusavirus represents a new family, MedusaviridaeIMPORTANCE We have isolated a new nucleocytoplasmic large DNA virus (NCLDV) from hot spring water in Japan, named medusavirus. This new NCLDV is phylogenetically placed at the root of the eukaryotic clades based on the phylogenies of several key genes, including that encoding DNA polymerase, and its genome surprisingly encodes the full set of histone homologs. Furthermore, its laboratory host, Acanthamoeba castellanii, encodes many medusavirus homologs in its genome, including the major capsid protein, suggesting that the amoeba is the genuine natural host from ancient times of this newly described virus and that lateral gene transfers have repeatedly occurred between the virus and amoeba. These results suggest that medusavirus is a unique NCLDV preserving ancient footprints of evolutionary interactions with its hosts, thus providing clues to elucidate the evolution of NCLDVs, eukaryotes, and virus-host interaction. Based on the dissimilarities with other known NCLDVs, we propose that medusavirus represents a new viral family, Medusaviridae.Copyright © 2019 Yoshikawa et al.


April 21, 2020

Nine Novel Phages from a Plateau Lake in Southwest China: Insights into Aeromonas Phage Diversity.

Aeromonas species are common pathogens of fish and some of them can opportunistically cause infectious diseases in humans. The overuse of antibiotics has led to the emergence of bacterial drug-resistance. To date, only 51 complete genome sequences of Aeromonas phages are available in GenBank. Here, we report the isolation of nine Aeromonas phages from a plateau lake in China. The protein cluster, dot plot and ANI analyses were performed on all 60 currently sequenced Aeromonas phage genomes and classified into nine clusters and thirteen singletons. Among the nine isolated phages, the DNA-packaging strategy of cluster 2L372D (including 2L372D, 2L372X, 4L372D, 4L372XY) is unknown, while the other five phages use the headful (P22/Sf6) DNA-packaging strategy. Notably, the isolated phages with larger genomes conservatively encode auxiliary metabolism genes, DNA replication and metabolism genes, while in smaller phage genomes, recombination-related genes were conserved. Finally, we propose a new classification scheme for Aeromonas phages.


April 21, 2020

Chromosomal-level assembly of the blolsod clam, Scapharca (Anadara) broughtonii, using long sequence reads and Hi-C.

The blood clam, Scapharca (Anadara) broughtonii, is an economically and ecologically important marine bivalve of the family Arcidae. Efforts to study their population genetics, breeding, cultivation, and stock enrichment have been somewhat hindered by the lack of a reference genome. Herein, we report the complete genome sequence of S. broughtonii, a first reference genome of the family Arcidae.A total of 75.79 Gb clean data were generated with the Pacific Biosciences and Oxford Nanopore platforms, which represented approximately 86× coverage of the S. broughtonii genome. De novo assembly of these long reads resulted in an 884.5-Mb genome, with a contig N50 of 1.80 Mb and scaffold N50 of 45.00 Mb. Genome Hi-C scaffolding resulted in 19 chromosomes containing 99.35% of bases in the assembled genome. Genome annotation revealed that nearly half of the genome (46.1%) is composed of repeated sequences, while 24,045 protein-coding genes were predicted and 84.7% of them were annotated.We report here a chromosomal-level assembly of the S. broughtonii genome based on long-read sequencing and Hi-C scaffolding. The genomic data can serve as a reference for the family Arcidae and will provide a valuable resource for the scientific community and aquaculture sector. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

De novo genome assembly of the endangered Acer yangbiense, a plant species with extremely small populations endemic to Yunnan Province, China.

Acer yangbiense is a newly described critically endangered endemic maple tree confined to Yangbi County in Yunnan Province in Southwest China. It was included in a programme for rescuing the most threatened species in China, focusing on “plant species with extremely small populations (PSESP)”.We generated 64, 94, and 110 Gb of raw DNA sequences and obtained a chromosome-level genome assembly of A. yangbiense through a combination of Pacific Biosciences Single-molecule Real-time, Illumina HiSeq X, and Hi-C mapping, respectively. The final genome assembly is ~666 Mb, with 13 chromosomes covering ~97% of the genome and scaffold N50 sizes of 45 Mb. Further, BUSCO analysis recovered 95.5% complete BUSCO genes. The total number of repetitive elements account for 68.0% of the A. yangbiense genome. Genome annotation generated 28,320 protein-coding genes, assisted by a combination of prediction and transcriptome sequencing. In addition, a nearly 1:1 orthology ratio of dot plots of longer syntenic blocks revealed a similar evolutionary history between A. yangbiense and grape, indicating that the genome has not undergone a whole-genome duplication event after the core eudicot common hexaploidization.Here, we report a high-quality de novo genome assembly of A. yangbiense, the first genome for the genus Acer and the family Aceraceae. This will provide fundamental conservation genomics resources, as well as representing a new high-quality reference genome for the economically important Acer lineage and the wider order of Sapindales. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Diversification and Evolution of Vancomycin-Resistant Enterococcus faecium during Intestinal Domination.

Vancomycin-resistant Enterococcus faecium (VRE) is a leading cause of hospital-acquired infections. This is particularly true in immunocompromised patients, where the damage to the microbiota caused by antibiotics can lead to VRE domination of the intestine, increasing a patient’s risk for bloodstream infection. In previous studies we observed that the intestinal domination by VRE of patients hospitalized to receive allogeneic bone marrow transplantation can persist for weeks, but little is known about subspecies diversification and evolution during prolonged domination. Here we combined a longitudinal analysis of patient data and in vivo experiments to reveal previously unappreciated subspecies dynamics during VRE domination that appeared to be stable from 16S rRNA microbiota analyses. Whole-genome sequencing of isolates obtained from sequential stool samples provided by VRE-dominated patients revealed an unanticipated level of VRE population complexity that evolved over time. In experiments with ampicillin-treated mice colonized with a single CFU, VRE rapidly diversified and expanded into distinct lineages that competed for dominance. Mathematical modeling shows that in vivo evolution follows mostly a parabolic fitness landscape, where each new mutation provides diminishing returns and, in the setting of continuous ampicillin treatment, reveals a fitness advantage for mutations in penicillin-binding protein 5 (pbp5) that increase resistance to ampicillin. Our results reveal the rapid diversification of host-colonizing VRE populations, with implications for epidemiologic tracking of in-hospital VRE transmission and susceptibility to antibiotic treatment.Copyright © 2019 Dubin et al.


April 21, 2020

Complete Genome Sequence of Halocella sp. Strain SP3-1, an Extremely Halophilic, Glycoside Hydrolase- and Bacteriocin-Producing Bacterium Isolated from a Salt Evaporation Pond.

Halocella sp. strain SP3-1, a cellulose-degrading bacterium, was isolated from a hypersaline evaporation pond in Thailand. Here, we report the first complete genome sequence of strain SP3-1. This species has a genome size of 4,035,760 bases, and the genome contains several genes encoding cellulose, hemicellulose, starch-degrading enzymes, and bacteriocins.


April 21, 2020

Clostridium scindens ATCC 35704: Integration of Nutritional Requirements, the Complete Genome Sequence, and Global Transcriptional Responses to Bile Acids.

In the human gut, Clostridium scindens ATCC 35704 is a predominant bacterium and one of the major bile acid 7a-dehydroxylating anaerobes. While this organism is well-studied relative to bile acid metabolism, little is known about the basic nutrition and physiology of C. scindens ATCC 35704. To determine the amino acid and vitamin requirements of C. scindens, the leave-one-out (one amino acid group or vitamin) technique was used to eliminate the nonessential amino acids and vitamins. With this approach, the amino acid tryptophan and three vitamins (riboflavin, pantothenate, and pyridoxal) were found to be required for the growth of C. scindens In the newly developed defined medium, C. scindens fermented glucose mainly to ethanol, acetate, formate, and H2. The genome of C. scindens ATCC 35704 was completed through PacBio sequencing. Pathway analysis of the genome sequence coupled with transcriptome sequencing (RNA-Seq) under defined culture conditions revealed consistency with the growth requirements and end products of glucose metabolism. Induction with bile acids revealed complex and differential responses to cholic acid and deoxycholic acid, including the expression of potentially novel bile acid-inducible genes involved in cholic acid metabolism. Responses to toxic deoxycholic acid included expression of genes predicted to be involved in DNA repair, oxidative stress, cell wall maintenance/metabolism, chaperone synthesis, and downregulation of one-third of the genome. These analyses provide valuable insight into the overall biology of C. scindens which may be important in treatment of disease associated with increased colonic secondary bile acids.IMPORTANCEC. scindens is one of a few identified gut bacterial species capable of converting host cholic acid into disease-associated secondary bile acids such as deoxycholic acid. The current work represents an important advance in understanding the nutritional requirements and response to bile acids of the medically important human gut bacterium, C. scindens ATCC 35704. A defined medium has been developed which will further the understanding of bile acid metabolism in the context of growth substrates, cofactors, and other metabolites in the vertebrate gut. Analysis of the complete genome supports the nutritional requirements reported here. Genome-wide transcriptomic analysis of gene expression in the presence of cholic acid and deoxycholic acid provides a unique insight into the complex response of C. scindens ATCC 35704 to primary and secondary bile acids. Also revealed are genes with the potential to function in bile acid transport and metabolism.Copyright © 2019 American Society for Microbiology.


April 21, 2020

A chromosome-scale genome assembly of cucumber (Cucumis sativus L.).

Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential.We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (~211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants.This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

The genome assembly and annotation of yellowhorn (Xanthoceras sorbifolium Bunge).

Yellowhorn (Xanthoceras sorbifolium Bunge), a deciduous shrub or small tree native to north China, is of great economic value. Seeds of yellowhorn are rich in oil containing unsaturated long-chain fatty acids that have been used for producing edible oil and nervonic acid capsules. However, the lack of a high-quality genome sequence hampers the understanding of its evolution and gene functions.In this study, a whole genome of yellowhorn was sequenced and assembled by integration of Illumina sequencing, Pacific Biosciences single-molecule real-time sequencing, 10X Genomics linked reads, Bionano optical maps, and Hi-C. The yellowhorn genome assembly was 439.97 Mb, which comprised 15 pseudo-chromosomes covering 95.42% (419.84 Mb) of the assembled genome. The repetitive fractions accounted for 56.39% of the yellowhorn genome. The genome contained 21,059 protein-coding genes. Of them, 18,503 (87.86%) genes were found to be functionally annotated with =1 “annotation” term by searching against other databases. Transcriptomic analysis showed that 341, 135, 125, 113, and 100 genes were specifically expressed in hermaphrodite flower, staminate flower, young fruit, leaf, and shoot, respectively. Phylogenetic analysis suggested that yellowhorn and Dimocarpus longan diverged from their most recent common ancestor ~46 million years ago.The availability and subsequent annotation of the yellowhorn genome, as well as the identification of tissue-specific functional genes, provides a valuable reference for plant comparative genomics, evolutionary studies, and molecular design breeding. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Pseudomolecule-level assembly of the Chinese oil tree yellowhorn (Xanthoceras sorbifolium) genome.

Yellowhorn (Xanthoceras sorbifolium) is a species of the Sapindaceae family native to China and is an oil tree that can withstand cold and drought conditions. A pseudomolecule-level genome assembly for this species will not only contribute to understanding the evolution of its genes and chromosomes but also bring yellowhorn breeding into the genomic era.Here, we generated 15 pseudomolecules of yellowhorn chromosomes, on which 97.04% of scaffolds were anchored, using the combined Illumina HiSeq, Pacific Biosciences Sequel, and Hi-C technologies. The length of the final yellowhorn genome assembly was 504.2 Mb with a contig N50 size of 1.04 Mb and a scaffold N50 size of 32.17 Mb. Genome annotation revealed that 68.67% of the yellowhorn genome was composed of repetitive elements. Gene modelling predicted 24,672 protein-coding genes. By comparing orthologous genes, the divergence time of yellowhorn and its close sister species longan (Dimocarpus longan) was estimated at ~33.07 million years ago. Gene cluster and chromosome synteny analysis demonstrated that the yellowhorn genome shared a conserved genome structure with its ancestor in some chromosomes.This genome assembly represents a high-quality reference genome for yellowhorn. Integrated genome annotations provide a valuable dataset for genetic and molecular research in this species. We did not detect whole-genome duplication in the genome. The yellowhorn genome carries syntenic blocks from ancient chromosomes. These data sources will enable this genome to serve as an initial platform for breeding better yellowhorn cultivars. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Genome Analysis of Hypomyces perniciosus, the Causal Agent of Wet Bubble Disease of Button Mushroom (Agaricus bisporus).

The mycoparasitic fungus Hypomyces perniciosus causes wet bubble disease of mushrooms, particularly Agaricus bisporus. The genome of a highly virulent strain of H. perniciosus HP10 was sequenced and compared to three other fungi from the order Hypocreales that cause disease on A. bisporus. H. perniciosus genome is ~44 Mb, encodes 10,077 genes and enriched with transposable elements up to 25.3%. Phylogenetic analysis revealed that H. perniciosus is closely related to Cladobotryum protrusum and diverged from their common ancestor ~156.7 million years ago. H. perniciosus has few secreted proteins compared to C. protrusum and Trichoderma virens, but significantly expanded protein families of transporters, protein kinases, CAZymes (GH 18), peptidases, cytochrome P450, and SMs that are essential for mycoparasitism and adaptation to harsh environments. This study provides insights into H. perniciosus evolution and pathogenesis and will contribute to the development of effective disease management strategies to control wet bubble disease.


April 21, 2020

Genomic Investigation of the Emergence of Invasive Multidrug-Resistant Salmonella enterica Serovar Dublin in Humans and Animals in Canada.

Salmonella enterica subsp. enterica serovar Dublin is a zoonotic pathogen that often leads to invasive bloodstream infections in humans that are multidrug resistant. Described here are the results of Canadian national surveillance of S Dublin from 2003 to 2015 in humans and bovines, principally collected through the Canadian Integrated Program for Antibiotic Resistance Surveillance (CIPARS). An increase in human infections due to multidrug-resistant (MDR) S Dublin was observed in 2010, many of which were bloodstream infections. Phylogenomic analysis of human and bovine isolates revealed a closely related network that differed by only 0 to 17 single nucleotide variants (SNVs), suggesting some potential transmission between humans and bovines. Phylogenomic comparison of global publicly available sequences of S Dublin showed that Canadian isolates clustered closely with those from the United States. A high correlation between phenotypic and genotypic antimicrobial susceptibility was observed in Canadian isolates. IS26 replication was widespread among U.S. and Canadian isolates and caused the truncation and inactivation of the resistance genes strA and blaTEM-1B A hybrid virulence and MDR plasmid (pN13-01125) isolated from a Canadian S Dublin isolate was searched against NCBI SRA data of bacteria. The pN13-01125 coding sequences were found in 13 Salmonella serovars, but S Dublin appears to be a specific reservoir. In summary, we have observed the rise of invasive MDR S Dublin in humans in Canada and found that they are closely related to bovine isolates and to American isolates in their mobile and chromosomal contents. © Crown copyright 2019.


April 21, 2020

Genomic and Functional Analysis of Emerging Virulent and Multidrug-Resistant Escherichia coli Lineage Sequence Type 648.

The pathogenic extended-spectrum-beta-lactamase (ESBL)-producing Escherichia coli lineage ST648 is increasingly reported from multiple origins. Our study of a large and global ST648 collection from various hosts (87 whole-genome sequences) combining core and accessory genomics with functional analyses and in vivo experiments suggests that ST648 is a nascent and generalist lineage, lacking clear phylogeographic and host association signals. By including large numbers of ST131 (n?=?107) and ST10 (n?=?96) strains for comparative genomics and phenotypic analysis, we demonstrate that the combination of multidrug resistance and high-level virulence are the hallmarks of ST648, similar to international high-risk clonal lineage ST131. Specifically, our in silico, in vitro, and in vivo results demonstrate that ST648 is well equipped with biofilm-associated features, while ST131 shows sophisticated signatures indicative of adaption to urinary tract infection, potentially conveying individual ecological niche adaptation. In addition, we used a recently developed NFDS (negative frequency-dependent selection) population model suggesting that ST648 will increase significantly in frequency as a cause of bacteremia within the next few years. Also, ESBL plasmids impacting biofilm formation aided in shaping and maintaining ST648 strains to successfully emerge worldwide across different ecologies. Our study contributes to understanding what factors drive the evolution and spread of emerging international high-risk clonal lineages.Copyright © 2019 American Society for Microbiology.


April 21, 2020

hicap: In Silico Serotyping of the Haemophilus influenzae Capsule Locus.

Haemophilus influenzae exclusively colonizes the human nasopharynx and can cause a variety of respiratory infections as well as invasive diseases, including meningitis and sepsis. A key virulence determinant of H. influenzae is the polysaccharide capsule, of which six serotypes are known, each encoded by a distinct variation of the capsule biosynthesis locus (cap-a to cap-f). H. influenzae type b (Hib) was historically responsible for the majority of invasive H. influenzae disease, and its prevalence has been markedly reduced in countries that have implemented vaccination programs targeting this serotype. In the postvaccine era, nontypeable H. influenzae emerged as the most dominant group causing disease, but in recent years a resurgence of encapsulated H. influenzae strains has also been observed, most notably serotype a. Given the increasing incidence of encapsulated strains and the high frequency of Hib in countries without vaccination programs, there is growing interest in genomic epidemiology of H. influenzae Here we present hicap, a software tool for rapid in silico serotype prediction from H. influenzae genome sequences. hicap is written using Python3 and is freely available at https://github.com/scwatts/hicap under the GNU General Public License v3 (GPL3). To demonstrate the utility of hicap, we used it to investigate the cap locus diversity and distribution in 691 high-quality H. influenzae genomes from GenBank. These analyses identified cap loci in 95 genomes and confirmed the general association of each serotype with a unique clonal lineage, and they also identified occasional recombination between lineages that gave rise to hybrid cap loci (2% of encapsulated strains).Copyright © 2019 Watts and Holt.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.