Menu
September 22, 2019

A whole genome assembly of the horn fly, Haematobia irritans, and prediction of genes with roles in metabolism and sex determination.

Haematobia irritans, commonly known as the horn fly, is a globally distributed blood-feeding pest of cattle that is responsible for significant economic losses to cattle producers. Chemical insecticides are the primary means for controlling this pest but problems with insecticide resistance have become common in the horn fly. To provide a foundation for identification of genomic loci for insecticide resistance and for discovery of new control technology, we report the sequencing, assembly, and annotation of the horn fly genome. The assembled genome is 1.14 Gb, comprising 76,616 scaffolds with N50 scaffold length of 23 Kb. Using RNA-Seq data, we have predicted 34,413 gene models of which 19,185 have been assigned functional annotations. Comparative genomics analysis with the Dipteran flies Musca domestica L., Drosophila melanogaster, and Lucilia cuprina, show that the horn fly is most closely related to M. domestica, sharing 8,748 orthologous clusters followed by D. melanogaster and L. cuprina, sharing 7,582 and 7,490 orthologous clusters respectively. We also identified a gene locus for the sodium channel protein in which mutations have been previously reported that confers target site resistance to the most common class of pesticides used in fly control. Additionally, we identified 276 genomic loci encoding members of metabolic enzyme gene families such as cytochrome P450s, esterases and glutathione S-transferases, and several genes orthologous to sex determination pathway genes in other Dipteran species. Copyright © 2018 Konganti et al.


September 22, 2019

Coordinated regulation of core and accessory genes in the multipartite genome of Sinorhizobium fredii.

Prokaryotes benefit from having accessory genes, but it is unclear how accessory genes can be linked with the core regulatory network when developing adaptations to new niches. Here we determined hierarchical core/accessory subsets in the multipartite pangenome (composed of genes from the chromosome, chromid and plasmids) of the soybean microsymbiont Sinorhizobium fredii by comparing twelve Sinorhizobium genomes. Transcriptomes of two S. fredii strains at mid-log and stationary growth phases and in symbiotic conditions were obtained. The average level of gene expression, variation of expression between different conditions, and gene connectivity within the co-expression network were positively correlated with the gene conservation level from strain-specific accessory genes to genus core. Condition-dependent transcriptomes exhibited adaptive transcriptional changes in pangenome subsets shared by the two strains, while strain-dependent transcriptomes were enriched with accessory genes on the chromid. Proportionally more chromid genes than plasmid genes were co-expressed with chromosomal genes, while plasmid genes had a higher within-replicon connectivity in expression than chromid ones. However, key nitrogen fixation genes on the symbiosis plasmid were characterized by high connectivity in both within- and between-replicon analyses. Among those genes with host-specific upregulation patterns, chromosomal znu and mdt operons, encoding a conserved high-affinity zinc transporter and an accessory multi-drug efflux system, respectively, were experimentally demonstrated to be involved in host-specific symbiotic adaptation. These findings highlight the importance of integrative regulation of hierarchical core/accessory components in the multipartite genome of bacteria during niche adaptation and in shaping the prokaryotic pangenome in the long run.


September 22, 2019

Whole-genome analysis of three yeast strains used for production of sherry-like wines revealed genetic traits specific to Flor yeasts.

Flor yeast strains represent a specialized group of Saccharomyces cerevisiae yeasts used for biological wine aging. We have sequenced the genomes of three flor strains originated from different geographic regions and used for production of sherry-like wines in Russia. According to the obtained phylogeny of 118 yeast strains, flor strains form very tight cluster adjacent to the main wine clade. SNP analysis versus available genomes of wine and flor strains revealed 2,270 genetic variants in 1,337 loci specific to flor strains. Gene ontology analysis in combination with gene content evaluation revealed a complex landscape of possibly adaptive genetic changes in flor yeast, related to genes associated with cell morphology, mitotic cell cycle, ion homeostasis, DNA repair, carbohydrate metabolism, lipid metabolism, and cell wall biogenesis. Pangenomic analysis discovered the presence of several well-known “non-reference” loci of potential industrial importance. Events of gene loss included deletions of asparaginase genes, maltose utilization locus, and FRE-FIT locus involved in iron transport. The latter in combination with a flor-yeast-specific mutation in the Aft1 transcription factor gene is likely to be responsible for the discovered phenotype of increased iron sensitivity and improved iron uptake of analyzed strains. Expansion of the coding region of the FLO11 flocullin gene and alteration of the balance between members of the FLO gene family are likely to positively affect the well-known propensity of flor strains for velum formation. Our study provides new insights in the nature of genetic variation in flor yeast strains and demonstrates that different adaptive properties of flor yeast strains could have evolved through different mechanisms of genetic variation.


September 22, 2019

Convergent loss of ABC transporter genes from Clostridioides difficile genomes is associated with impaired tyrosine uptake and p-cresol production.

We report the frequent, convergent loss of two genes encoding the substrate-binding protein and the ATP-binding protein of an ATP-binding cassette (ABC) transporter from the genomes of unrelated Clostridioides difficile strains. This specific genomic deletion was strongly associated with the reduced uptake of tyrosine and phenylalanine and production of derived Stickland fermentation products, including p-cresol, suggesting that the affected ABC transporter had been responsible for the import of aromatic amino acids. In contrast, the transporter gene loss did not measurably affect bacterial growth or production of enterotoxins. Phylogenomic analysis of publically available genome sequences indicated that this transporter gene deletion had occurred multiple times in diverse clonal lineages of C. difficile, with a particularly high prevalence in ribotype 027 isolates, where 48 of 195 genomes (25%) were affected. The transporter gene deletion likely was facilitated by the repetitive structure of its genomic location. While at least some of the observed transporter gene deletions are likely to have occurred during the natural life cycle of C. difficile, we also provide evidence for the emergence of this mutation during long-term laboratory cultivation of reference strain R20291.


September 22, 2019

Isolation and characterization of Bacillus sp. GFP-2, a novel Bacillus strain with antimicrobial activities, from Whitespotted bamboo shark intestine.

The abuse of antibiotics and following rapidly increasing of antibiotic-resistant pathogens is the serious threat to our society. Natural products from microorganism are regarded as the important substitution antimicrobial agents of antibiotics. We isolated a new strain, Bacillus sp. GFP-2, from the Chiloscyllium plagiosum (Whitespotted bamboo shark) intestine, which showed great inhibitory effects on the growth of both Gram-positive and Gram-negative bacteria. Additionally, the growth of salmon was effectively promoted when fed with inactivated strain GFP-2 as the inhibition agent of pathogenic bacteria. The genes encoding antimicrobial peptides like LCI, YFGAP and hGAPDH and gene clusters for secondary metabolites and bacteriocins, such as difficidin, bacillibactin, bacilysin, surfactin, butirosin, macrolactin, bacillaene, fengycin, lanthipeptides and LCI, were predicted in the genome of Bacillus sp. GFP-2, which might be expressed and contribute to the antimicrobial activities of this strain. The gene encoding ß-1,3-1,4-glucanase was successfully cloned from the genome and this protein was detected in the culture supernatant of Bacillus sp. GFP-2 by the antibody produced in rabbit immunized with the recombinant ß-1,3-1,4-glucanase, indicating that this strain could express ß-1,3-1,4-glucanase, which might partially contribute to its antimicrobial activities. This study can enhance a better understanding of the mechanism of antimicrobial activities in genus Bacillus and provide a useful material for the biotechnology study in antimicrobial agent development.


September 22, 2019

Genomic and probiotic characterization of SJP-SNU strain of Pichia kudriavzevii.

The yeast strain SJP-SNU was investigated as a probiotic and was characterized with respect to growth temperature, bile salt resistance, hydrogen sulfide reducing activity, intestinal survival ability and chicken embryo pathogenicity. In addition, we determined the complete genomic and mitochondrial sequences of SJP-SNU and conducted comparative genomics analyses. SJP-SNU grew rapidly at 37 °C and formed colonies on MacConkey agar containing bile salt. SJP-SNU reduced hydrogen sulfide produced by Salmonella serotype Enteritidis and, after being fed to 4-week-old chickens, could be isolated from cecal feces. SJP-SNU did not cause mortality in 10-day-old chicken embryos. From 13 initial contigs, 11 were finally assembled and represented 10 chromosomal sequences and 1 mitochondrial DNA sequence. Comparative genomic analyses revealed that SJP-SNU was a strain of Pichia kudriavzevii. Although SJP-SNU possesses pathogenicity-related genes, they showed very low amino acid sequence identities to those of Candida albicans. Furthermore, SJP-SNU possessed useful genes, such as phytases and cellulase. Thus, SJP-SNU is a useful yeast possessing the basic traits of a probiotic, and further studies to demonstrate its efficacy as a probiotic in the future may be warranted.


September 22, 2019

A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.

Transposable elements (TEs) are mobile DNA sequences known as drivers of genome evolution. Their impacts have been widely studied in animals, plants and insects, but little is known about them in microalgae. In a previous study, we compared the genetic polymorphisms between strains of the haptophyte microalga Tisochrysis lutea and suggested the involvement of active autonomous TEs in their genome evolution.To identify potentially autonomous TEs, we designed a pipeline named PiRATE (Pipeline to Retrieve and Annotate Transposable Elements, download: https://doi.org/10.17882/51795 ), and conducted an accurate TE annotation on a new genome assembly of T. lutea. PiRATE is composed of detection, classification and annotation steps. Its detection step combines multiple, existing analysis packages representing all major approaches for TE detection and its classification step was optimized for microalgal genomes. The efficiency of the detection and classification steps was evaluated with data on the model species Arabidopsis thaliana. PiRATE detected 81% of the TE families of A. thaliana and correctly classified 75% of them. We applied PiRATE to T. lutea genomic data and established that its genome contains 15.89% Class I and 4.95% Class II TEs. In these, 3.79 and 17.05% correspond to potentially autonomous and non-autonomous TEs, respectively. Annotation data was combined with transcriptomic and proteomic data to identify potentially active autonomous TEs. We identified 17 expressed TE families and, among these, a TIR/Mariner and a TIR/hAT family were able to synthesize their transposase. Both these TE families were among the three highest expressed genes in a previous transcriptomic study and are composed of highly similar copies throughout the genome of T. lutea. This sum of evidence reveals that both these TE families could be capable of transposing or triggering the transposition of potential related MITE elements.This manuscript provides an example of a de novo transposable element annotation of a non-model organism characterized by a fragmented genome assembly and belonging to a poorly studied phylum at genomic level. Integration of multi-omics data enabled the discovery of potential mobile TEs and opens the way for new discoveries on the role of these repeated elements in genomic evolution of microalgae.


September 22, 2019

Comparative genomics analysis of plasmid pPV989-94 from a clinical isolate of Pantoea vagans PV989.

Pantoea vagans, a gram-negative bacterium from the genus Pantoea and family Enterobacteriaceae, is present in various natural environments and considered to be plant endophytes. We isolated the Pantoea vagans PV989 strain from the clinic and sequenced its whole genome. Besides a chromosome DNA molecule, it also harboured three large plasmids. A comparative genomics analysis was performed for the smallest plasmid, pPV989-94. It can be divided into four regions, including three conservative regions related to replication (R1), transfer conjugation (R2), and transfer leading (R3), and one variable region (R4). Further analysis showed that pPV989-94 is most similar to plasmids LA637P2 and pEA68 of Erwinia amylovora strains isolated from fruit trees. These three plasmids share three conservative regions (R1, R2, and R3). Interestingly, a fragment (R4′) in R4, mediated by phage integrase and phage integrase family site-specific recombinase and encoding 9 genes related to glycometabolism, resistance, and DNA repair, was unique in pPV989-94. Homologues of R4′ were found in other plasmids or chromosomes, suggesting that horizontal gene transfer (HGT) occurred among different bacteria of various species or genera. The acquired functional genes may play important roles in the adaptation of bacteria to different hosts or environmental conditions.


September 22, 2019

Genus-wide assessment of lignocellulose utilization in the extremely thermophilic Caldicellulosiruptor by genomic, pan-genomic and metagenomic analysis

Metagenomic data from Obsidian Pool (Yellowstone National Park, USA) and 13 genome sequences were used to reassess genus-wide biodiversity for the extremely thermophilic Caldicellulosiruptor The updated core genome contains 1,401 ortholog groups (average genome size for 13 species = 2,516 genes). The pangenome, which remains open with a revised total of 3,493 ortholog groups, encodes a variety of multidomain glycoside hydrolases (GHs). These include three cellulases with GH48 domains that are colocated in the glucan degradation locus (GDL) and are specific determinants for microcrystalline cellulose utilization. Three recently sequenced species, Caldicellulosiruptor sp. strain Rt8.B8 (renamed here Caldicellulosiruptor morganii), Thermoanaerobacter cellulolyticus strain NA10 (renamed here Caldicellulosiruptor naganoensis), and Caldicellulosiruptor sp. strain Wai35.B1 (renamed here Caldicellulosiruptor danielii), degraded Avicel and lignocellulose (switchgrass). C. morganii was more efficient than Caldicellulosiruptor bescii in this regard and differed from the other 12 species examined, both based on genome content and organization and in the specific domain features of conserved GHs. Metagenomic analysis of lignocellulose-enriched samples from Obsidian Pool revealed limited new information on genus biodiversity. Enrichments yielded genomic signatures closely related to that of Caldicellulosiruptor obsidiansis, but there was also evidence for other thermophilic fermentative anaerobes (Caldanaerobacter, Fervidobacterium, Caloramator, and Clostridium). One enrichment, containing 89.8% Caldicellulosiruptor and 9.7% Caloramator, had a capacity for switchgrass solubilization comparable to that of C. bescii These results refine the known biodiversity of Caldicellulosiruptor and indicate that microcrystalline cellulose degradation at temperatures above 70°C, based on current information, is limited to certain members of this genus that produce GH48 domain-containing enzymes.IMPORTANCE The genus Caldicellulosiruptor contains the most thermophilic bacteria capable of lignocellulose deconstruction, which are promising candidates for consolidated bioprocessing for the production of biofuels and bio-based chemicals. The focus here is on the extant capability of this genus for plant biomass degradation and the extent to which this can be inferred from the core and pangenomes, based on analysis of 13 species and metagenomic sequence information from environmental samples. Key to microcrystalline hydrolysis is the content of the glucan degradation locus (GDL), a set of genes encoding glycoside hydrolases (GHs), several of which have GH48 and family 3 carbohydrate binding module domains, that function as primary cellulases. Resolving the relationship between the GDL and lignocellulose degradation will inform efforts to identify more prolific members of the genus and to develop metabolic engineering strategies to improve this characteristic. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Comparative genomic analysis of Geosporobacter ferrireducens and its versatility of anaerobic energy metabolism.

Members of the family Clostridiaceae within phylum Firmicutes are ubiquitous in various iron-reducing environments. However, genomic data on iron-reducing bacteria of the family Clostridiaceae, particularly regarding their environmental distribution, are limited. Here, we report the analysis and comparison of the genomic properties of Geosporobacter ferrireducens IRF9, a strict anaerobe that ferments sugars and degrades toluene under iron-reducing conditions, with those of the closely related species, Geosporobacter subterraneus DSM 17957. Putative alkyl succinate synthase-encoding genes were observed in the genome of strain IRF9 instead of the typical benzyl succinate synthase-encoding genes. Canonical genes associated with iron reduction were not observed in either genome. The genomes of strains IRF9 and DMS 17957 harbored genes for acetogenesis, that encode two types of Rnf complexes mediating the translocation of H+ and Na+ ions, respectively. Strain IRF9 harbored two different types of ATPases (Na+-dependent F-type ATPase and H+-dependent V-type ATPase), which enable full exploitation of ion gradients. The versatile energy conservation potential of strain IRF9 promotes its survival in various environmental conditions.


September 22, 2019

Discovery of the first germline-restricted gene by subtractive transcriptomic analysis in the zebra finch, Taeniopygia guttata.

Developmentally programmed genome rearrangements are rare in vertebrates, but have been reported in scattered lineages including the bandicoot, hagfish, lamprey, and zebra finch (Taeniopygia guttata) [1]. In the finch, a well-studied animal model for neuroendocrinology and vocal learning [2], one such programmed genome rearrangement involves a germline-restricted chromosome, or GRC, which is found in germlines of both sexes but eliminated from mature sperm [3, 4]. Transmitted only through the oocyte, it displays uniparental female-driven inheritance, and early in embryonic development is apparently eliminated from all somatic tissue in both sexes [3, 4]. The GRC comprises the longest finch chromosome at over 120 million base pairs [3], and previously the only known GRC-derived sequence was repetitive and non-coding [5]. Because the zebra finch genome project was sourced from male muscle (somatic) tissue [6], the remaining genomic sequence and protein-coding content of the GRC remain unknown. Here we report the first protein-coding gene from the GRC: a member of the a-soluble N-ethylmaleimide sensitive fusion protein (NSF) attachment protein (a-SNAP) family hitherto missing from zebra finch gene annotations. In addition to the GRC-encoded a-SNAP, we find an additional paralogous a-SNAP residing in the somatic genome (a somatolog)-making the zebra finch the first example in which a-SNAP is not a single-copy gene. We show divergent, sex-biased expression for the paralogs and also that positive selection is detectable across the bird a-SNAP lineage, including the GRC-encoded a-SNAP. This study presents the identification and evolutionary characterization of the first protein-coding GRC gene in any organism. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

Structure and biosynthesis of mayamycin B, a new polyketide with antibacterial activity from Streptomyces sp. 120454.

Mayamycin B, a new antibacterial type II polyketide, together with its known congener mayamycin A, were isolated from Streptomyces sp. 120454. The structure of new compound was elucidated by extensive spectroscopic analysis and comparison with literature data. Sequencing and bioinformatics analysis revealed the biosynthetic gene cluster for mayamycins A and B.


September 22, 2019

Insect symbionts as valuable grist for the biotechnological mill: an alkaliphilic silkworm gut bacterium for efficient lactic acid production.

Insects constitute the most abundant and diverse animal class and act as hosts to an extraordinary variety of symbiotic microorganisms. These microbes living inside the insects play critical roles in host biology and are also valuable bioresources. Enterococcus mundtii EMB156, isolated from the larval gut (gut pH >10) of the model organism Bombyx mori (Lepidoptera: Bombycidae), efficiently produces lactic acid, an important metabolite for industrial production of bioplastic materials. E. mundtii EMB156 grows well under alkaline conditions and stably converts various carbon sources into lactic acid, offering advantages in downstream fermentative processes. High-yield lactic acid production can be achieved by the strain EMB156 from renewable biomass substrates under alkaline pretreatments. Single-molecule real-time (SMRT) sequencing technology revealed its 3.01 Mbp whole genome sequence. A total of 2956 protein-coding sequences, 65 tRNA genes, and 6 rRNA operons were predicted in the EMB156 chromosome. Remarkable genomic features responsible for lactic acid fermentation included key enzymes involved in the pentose phosphate (PP)/glycolytic pathway, and an alpha amylase and xylose isomerase were characterized in EMB156. This genomic information coincides with the phenotype of E. mundtii EMB156, reflecting its metabolic flexibility in efficient lactate fermentation, and established a foundation for future biotechnological application. Interestingly, enzyme activities of amylase were quite stable in high-pH broths, indicating a possible mechanism for strong EMB156 growth in an alkaline environment, thereby facilitating lactic acid production. Together, these findings implied that valuable lactic acid-producing bacteria can be discovered efficiently by screening under the extremely alkaline conditions, as exemplified by gut microbial symbionts of Lepidoptera insects.


September 22, 2019

Long-read sequencing data analysis for yeasts.

Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes ~41 h to generate a complete and well-annotated genome from ~100× Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.


September 22, 2019

The genome of Artemisia annua provides insight into the evolution of Asteraceae family and artemisinin biosynthesis.

Artemisia annua, commonly known as sweet wormwood or Qinghao, is a shrub native to China and has long been used for medicinal purposes. A. annua is now cultivated globally as the only natural source of a potent anti-malarial compound, artemisinin. Here, we report a high-quality draft assembly of the 1.74-gigabase genome of A. annua, which is highly heterozygous, rich in repetitive sequences, and contains 63 226 protein-coding genes, one of the largest numbers among the sequenced plant species. We found that, as one of a few sequenced genomes in the Asteraceae, the A. annua genome contains a large number of genes specific to this large angiosperm clade. Notably, the expansion and functional diversification of genes encoding enzymes involved in terpene biosynthesis are consistent with the evolution of the artemisinin biosynthetic pathway. We further revealed by transcriptome profiling that A. annua has evolved the sophisticated transcriptional regulatory networks underlying artemisinin biosynthesis. Based on comprehensive genomic and transcriptomic analyses we generated transgenic A. annua lines producing high levels of artemisinin, which are now ready for large-scale production and thereby will help meet the challenge of increasing global demand of artemisinin. Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.