Menu
September 22, 2019

Horizontal transfer of BovB and L1 retrotransposons in eukaryotes.

Transposable elements (TEs) are mobile DNA sequences, colloquially known as jumping genes because of their ability to replicate to new genomic locations. TEs can jump between organisms or species when given a vector of transfer, such as a tick or virus, in a process known as horizontal transfer. Here, we propose that LINE-1 (L1) and Bovine-B (BovB), the two most abundant TE families in mammals, were initially introduced as foreign DNA via ancient horizontal transfer events.Using analyses of 759 plant, fungal and animal genomes, we identify multiple possible L1 horizontal transfer events in eukaryotic species, primarily involving Tx-like L1s in marine eukaryotes. We also extend the BovB paradigm by increasing the number of estimated transfer events compared to previous studies, finding new parasite vectors of transfer such as bed bug, leech and locust, and BovB occurrences in new lineages such as bat and frog. Given that these transposable elements have colonised more than half of the genome sequence in today’s mammals, our results support a role for horizontal transfer in causing long-term genomic change in new host organisms.We describe extensive horizontal transfer of BovB retrotransposons and provide the first evidence that L1 elements can also undergo horizontal transfer. With the advancement of genome sequencing technologies and bioinformatics tools, we anticipate our study to be a valuable resource for inferring horizontal transfer from large-scale genomic data.


September 22, 2019

Comparative genomics of Pseudomonas sp. strain SI-3 associated with macroalga Ulva prolifera, the causative species for green tide in the Yellow Sea.

Algae-bacteria associations occurred widely in marine habitats, however, contributions of bacteria to macroalgal blooming were almost unknown. In this study, a potential endophytic strain SI-3 was isolated from Ulva prolifera, the causative species for the world’s largest green tide in the Yellow Sea, following a strict bleaching treatment to eliminate epiphytes. The genomic sequence of SI-3 was determined in size of 4.8 Mb and SI-3 was found to be mostly closed to Pseudomonas stutzeri. To evaluate the characteristics of SI-3 as a potential endophyte, the genomes of SI-3 and other 20 P. stutzeri strains were compared. We found that SI-3 had more strain-specific genes than most of the 20 P. stutzeri strains. Clusters of Orthologous Groups (COGs) analysis revealed that SI-3 had a higher proportion of genes assigned to transcriptional regulation and signal transduction compared with the 20 P. stutzeri strains, including four rhizosphere bacteria, indicating a complicated interaction network between SI-3 and its host. P. stutzeri is renowned for its metabolic versatility in aromatic compounds degradation. However, significant gene loss was observed in several aromatic compounds degradation pathways in SI-3, which may be an evolutional adaptation that developed upon association with its host. KEGG analysis revealed that dissimilatory nitrate reduction to ammonium (DNRA) and denitrification, two competing dissimilatory nitrate reduction pathways, co-occurred in the genome of SI-3, like most of the other 20 P. stutzeri strains. We speculated that DNRA of SI-3 may contribute a competitive advantage in nitrogen acquisition of U. prolifera by conserving nitrogen in NH4+ form, as in the case of microalgae bloom. Collectively, these data suggest that Pseudomonas sp. strain SI-3 was a suitable candidate for investigation of the algae-bacteria interaction with U. prolifera and the ecological impacts on algal blooming.


September 22, 2019

A chromosome scale assembly of the model desiccation tolerant grass Oropetium thomaeum

Oropetium thomaeum is an emerging model for desiccation tolerance and genome size evolution in grasses. A high-quality draft genome of Oropetium was recently sequenced, but the lack of a chromosome scale assembly has hindered comparative analyses and downstream functional genomics. Here, we reassembled Oropetium, and anchored the genome into ten chromosomes using Hi-C based chromatin interactions. A combination of high-resolution RNAseq data and homology-based gene prediction identified thousands of new, conserved gene models that were absent from the V1 assembly. This includes thousands of new genes with high expression across a desiccation timecourse. The sorghum and Oropetium genomes have a surprising degree of chromosome-level collinearity, and several chromosome pairs have near perfect synteny. Other chromosomes are collinear in the gene rich chromosome arms but have experienced pericentric translocations. Together, these resources will be useful for the grass comparative genomic community and further establish Oropetium as a model resurrection plant.


September 22, 2019

Linking genotype and phenotype in an economically viable propionic acid biosynthesis process

Propionic acid (PA) is used as a food preservative and increasingly, as a precursor for the synthesis of monomers. PA is produced mainly through hydrocarboxylation of ethylene, also known as the `oxo-process’; however, Propionibacterium species are promising biological PA producers natively producing PA as their main fermentation product. However, for fermentation to be competitive, a PA yield of at least 0.6 g/g is required.


September 22, 2019

Optical and physical mapping with local finishing enables megabase-scale resolution of agronomically important regions in the wheat genome.

Numerous scaffold-level sequences for wheat are now being released and, in this context, we report on a strategy for improving the overall assembly to a level comparable to that of the human genome.Using chromosome 7A of wheat as a model, sequence-finished megabase-scale sections of this chromosome were established by combining a new independent assembly using a bacterial artificial chromosome (BAC)-based physical map, BAC pool paired-end sequencing, chromosome-arm-specific mate-pair sequencing and Bionano optical mapping with the International Wheat Genome Sequencing Consortium RefSeq v1.0 sequence and its underlying raw data. The combined assembly results in 18 super-scaffolds across the chromosome. The value of finished genome regions is demonstrated for two approximately 2.5 Mb regions associated with yield and the grain quality phenotype of fructan carbohydrate grain levels. In addition, the 50 Mb centromere region analysis incorporates cytological data highlighting the importance of non-sequence data in the assembly of this complex genome region.Sufficient genome sequence information is shown to now be available for the wheat community to produce sequence-finished releases of each chromosome of the reference genome. The high-level completion identified that an array of seven fructosyl transferase genes underpins grain quality and that yield attributes are affected by five F-box-only-protein-ubiquitin ligase domain and four root-specific lipid transfer domain genes. The completed sequence also includes the centromere.


September 22, 2019

Whole-genome resequencing and pan-transcriptome reconstruction highlight the impact of genomic structural Variation on secondary metabolite gene clusters in the grapevine Esca pathogen Phaeoacremonium minimum.

The Ascomycete fungus Phaeoacremonium minimum is one of the primary causal agents of Esca, a widespread and damaging grapevine trunk disease. Variation in virulence among Pm. minimum isolates has been reported, but the underlying genetic basis of the phenotypic variability remains unknown. The goal of this study was to characterize intraspecific genetic diversity and explore its potential impact on virulence functions associated with secondary metabolism, cellular transport, and cell wall decomposition. We generated a chromosome-scale genome assembly, using single molecule real-time sequencing, and resequenced the genomes and transcriptomes of multiple isolates to identify sequence and structural polymorphisms. Numerous insertion and deletion events were found for a total of about 1 Mbp in each isolate. Structural variation in this extremely gene dense genome frequently caused presence/absence polymorphisms of multiple adjacent genes, mostly belonging to biosynthetic clusters associated with secondary metabolism. Because of the observed intraspecific diversity in gene content due to structural variation we concluded that a transcriptome reference developed from a single isolate is insufficient to represent the virulence factor repertoire of the species. We therefore compiled a pan-transcriptome reference of Pm. minimum comprising a non-redundant set of 15,245 protein-coding sequences. Using naturally infected field samples expressing Esca symptoms, we demonstrated that mapping of meta-transcriptomics data on a multi-species reference that included the Pm. minimum pan-transcriptome allows the profiling of an expanded set of virulence factors, including variable genes associated with secondary metabolism and cellular transport.


September 22, 2019

Orphan legumes growing in dry environments: Marama bean as a case study.

Plants have developed morphological, physiological, biochemical, cellular, and molecular mechanisms to survive in drought-stricken environments with little or no water caused by below-average precipitation. In this mini-review, we highlight the characteristics that allows marama bean [Tylosema esculentum (Burchell) Schreiber], an example of an orphan legume native to arid regions of southwestern Southern Africa, to flourish under an inhospitable climate and dry soil conditions where no other agricultural crop competes in this agro-ecological zone. Orphan legumes are often better suited to withstand such harsh growth environments due to development of survival strategies using a combination of different traits and responses. Recent findings on questions on marama bean speciation, hybridization, population dynamics, and the evolutionary history of the bean and mechanisms by which the bean is able to extract and conserve water and nutrients from its environment as well as aspects of morphological and physiological adaptation will be reviewed. The importance of the soil microbiome and the genetic diversity in this species, and their interplay, as a reservoir for improvement will also be considered. In particular, the application of the newly established marama bean genome sequence will facilitate both the identification of important genes involved in the interaction with the soil microbiome and the identification of the diversity within the wild germplasm for genes involved drought tolerance. Since predicted future changes in climatic conditions, with less water availability for plant growth, will severely affect agricultural productivity, an understanding of the mechanisms of unique adaptations in marama bean to such conditions may also provide insights as to how to improve the performance of the major crops.


September 22, 2019

Discovery of new genes involved in curli production by a uropathogenic Escherichia coli strain from the highly virulent O45:K1:H7 lineage.

Curli are bacterial surface-associated amyloid fibers that bind to the dye Congo red (CR) and facilitate uropathogenic Escherichia coli (UPEC) biofilm formation and protection against host innate defenses. Here we sequenced the genome of the curli-producing UPEC pyelonephritis strain MS7163 and showed it belongs to the highly virulent O45:K1:H7 neonatal meningitis-associated clone. MS7163 produced curli at human physiological temperature, and this correlated with biofilm growth, resistance of sessile cells to the human cationic peptide cathelicidin, and enhanced colonization of the mouse bladder. We devised a forward genetic screen using CR staining as a proxy for curli production and identified 41 genes that were required for optimal CR binding, of which 19 genes were essential for curli synthesis. Ten of these genes were novel or poorly characterized with respect to curli synthesis and included genes involved in purine de novo biosynthesis, a regulator that controls the Rcs phosphorelay system, and a novel repressor of curli production (referred to as rcpA). The involvement of these genes in curli production was confirmed by the construction of defined mutants and their complementation. The mutants did not express the curli major subunit CsgA and failed to produce curli based on CR binding. Mutation of purF (the first gene in the purine biosynthesis pathway) and rcpA also led to attenuated colonization of the mouse bladder. Overall, this work has provided new insight into the regulation of curli and the role of these amyloid fibers in UPEC biofilm formation and pathogenesis.IMPORTANCE Uropathogenic Escherichia coli (UPEC) strains are the most common cause of urinary tract infection, a disease increasingly associated with escalating antibiotic resistance. UPEC strains possess multiple surface-associated factors that enable their colonization of the urinary tract, including fimbriae, curli, and autotransporters. Curli are extracellular amyloid fibers that enhance UPEC virulence and promote biofilm formation. Here we examined the function and regulation of curli in a UPEC pyelonephritis strain belonging to the highly virulent O45:K1:H7 neonatal meningitis-associated clone. Curli expression at human physiological temperature led to increased biofilm formation, resistance of sessile cells to the human cationic peptide LL-37, and enhanced bladder colonization. Using a comprehensive genetic screen, we identified multiple genes involved in curli production, including several that were novel or poorly characterized with respect to curli synthesis. In total, this study demonstrates an important role for curli as a UPEC virulence factor that promotes biofilm formation, resistance, and pathogenesis. Copyright © 2018 Nhu et al.


September 22, 2019

The hpRNA/RNAi pathway is essential to resolve intragenomic conflict in the Drosophila male germline.

Intragenomic conflicts are fueled by rapidly evolving selfish genetic elements, which induce selective pressures to innovate opposing repressive mechanisms. This is patently manifest in sex-ratio (SR) meiotic drive systems, in which distorter and suppressor factors bias and restore equal transmission of X and Y sperm. Here, we reveal that multiple SR suppressors in Drosophila simulans (Nmy and Tmy) encode related hairpin RNAs (hpRNAs), which generate endo-siRNAs that repress the paralogous distorters Dox and MDox. All components in this drive network are recently evolved and largely testis restricted. To connect SR hpRNA function to the RNAi pathway, we generated D. simulans null mutants of Dcr-2 and AGO2. Strikingly, these core RNAi knockouts massively derepress Dox and MDox and are in fact completely male sterile and exhibit highly defective spermatogenesis. Altogether, our data reveal how the adaptive capacity of hpRNAs is critically deployed to restrict selfish gonadal genetic systems that can exterminate a species. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

Genomic analysis of multidrug-resistant Escherichia coli ST58 causing urosepsis.

Sequence type 58 (ST58) phylogroup B1 Escherichia coli have been isolated from a wide variety of mammalian and avian hosts but are not noted for their ability to cause serious disease in humans or animals. Here we determined the genome sequences of two multidrug-resistant E. coli ST58 strains from urine and blood of one patient using a combination of Illumina and Single Molecule, Real-Time (SMRT) sequencing. Both ST58 strains were clonal and were characterised as serotype O8:H25, phylogroup B1 and carried a complex resistance locus/loci (CRL) that featured an atypical class 1 integron with a dfrA5 (trimethoprim resistance) gene cassette followed by only 24 bp of the 3′-CS. CRL that carry this particular integron have been described previously in E. coli from cattle, pigs and humans in Australia. The integron abuts a copy of Tn6029, an IS26-flanked composite transposon encoding blaTEM, sul2 and strAB genes that confer resistance to ampicillin, sulfathiazole and streptomycin, respectively. The CRL resides within a novel Tn2610-like hybrid Tn1721/Tn21 transposon on an IncF, ColV plasmid (pSDJ2009-52F) of 138 553 bp that encodes virulence associated genes implicated in life-threatening extraintestinal pathogenic E. coli (ExPEC) infections. Notably, pSDJ2009-52F shares high sequence identity with pSF-088-1, a plasmid reported in an E. coli ST95 strain from a patient with blood sepsis from a hospital in San Francisco. These data suggest that extraintestinal infections caused by E. coli carrying ColV-like plasmids, irrespective of their phylogroup or ST, may pose a potential threat to human health, particularly to the elderly and immunocompromised. Copyright © 2018. Published by Elsevier B.V.


September 22, 2019

Draft genome assembly of the invasive cane toad, Rhinella marina.

The cane toad (Rhinella marina formerly Bufo marinus) is a species native to Central and South America that has spread across many regions of the globe. Cane toads are known for their rapid adaptation and deleterious impacts on native fauna in invaded regions. However, despite an iconic status, there are major gaps in our understanding of cane toad genetics. The availability of a genome would help to close these gaps and accelerate cane toad research.We report a draft genome assembly for R. marina, the first of its kind for the Bufonidae family. We used a combination of long-read Pacific Biosciences RS II and short-read Illumina HiSeq X sequencing to generate 359.5 Gb of raw sequence data. The final hybrid assembly of 31,392 scaffolds was 2.55 Gb in length with a scaffold N50 of 168 kb. BUSCO analysis revealed that the assembly included full length or partial fragments of 90.6% of tetrapod universal single-copy orthologs (n = 3950), illustrating that the gene-containing regions have been well assembled. Annotation predicted 25,846 protein coding genes with similarity to known proteins in Swiss-Prot. Repeat sequences were estimated to account for 63.9% of the assembly.The R. marina draft genome assembly will be an invaluable resource that can be used to further probe the biology of this invasive species. Future analysis of the genome will provide insights into cane toad evolution and enrich our understanding of their interplay with the ecosystem at large.


September 22, 2019

Evolutionary history of human Plasmodium vivax revealed by genome-wide analyses of related ape parasites.

Wild-living African apes are endemically infected with parasites that are closely related to human Plasmodium vivax, a leading cause of malaria outside Africa. This finding suggests that the origin of P. vivax was in Africa, even though the parasite is now rare in humans there. To elucidate the emergence of human P. vivax and its relationship to the ape parasites, we analyzed genome sequence data of P. vivax strains infecting six chimpanzees and one gorilla from Cameroon, Gabon, and Côte d’Ivoire. We found that ape and human parasites share nearly identical core genomes, differing by only 2% of coding sequences. However, compared with the ape parasites, human strains of P. vivax exhibit about 10-fold less diversity and have a relative excess of nonsynonymous nucleotide polymorphisms, with site-frequency spectra suggesting they are subject to greatly relaxed purifying selection. These data suggest that human P. vivax has undergone an extreme bottleneck, followed by rapid population expansion. Investigating potential host-specificity determinants, we found that ape P. vivax parasites encode intact orthologs of three reticulocyte-binding protein genes (rbp2d, rbp2e, and rbp3), which are pseudogenes in all human P. vivax strains. However, binding studies of recombinant RBP2e and RBP3 proteins to human, chimpanzee, and gorilla erythrocytes revealed no evidence of host-specific barriers to red blood cell invasion. These data suggest that, from an ancient stock of P. vivax parasites capable of infecting both humans and apes, a severely bottlenecked lineage emerged out of Africa and underwent rapid population growth as it spread globally. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

Genomic insights into host adaptation between the wheat stripe rust pathogen (Puccinia striiformis f. sp. tritici) and the barley stripe rust pathogen (Puccinia striiformis f. sp. hordei).

Plant fungal pathogens can rapidly evolve and adapt to new environmental conditions in response to sudden changes of host populations in agro-ecosystems. However, the genomic basis of their host adaptation, especially at the forma specialis level, remains unclear.We sequenced two isolates each representing Puccinia striiformis f. sp. tritici (Pst) and P. striiformis f. sp. hordei (Psh), different formae speciales of the stripe rust fungus P. striiformis highly adapted to wheat and barley, respectively. The divergence of Pst and Psh, estimated to start 8.12 million years ago, has been driven by high nucleotide mutation rates. The high genomic variation within dikaryotic urediniospores of P. striiformis has provided raw genetic materials for genome evolution. No specific gene families have enriched in either isolate, but extensive gene loss events have occurred in both Pst and Psh after the divergence from their most recent common ancestor. A large number of isolate-specific genes were identified, with unique genomic features compared to the conserved genes, including 1) significantly shorter in length; 2) significantly less expressed; 3) significantly closer to transposable elements; and 4) redundant in pathways. The presence of specific genes in one isolate (or forma specialis) was resulted from the loss of the homologues in the other isolate (or forma specialis) by the replacements of transposable elements or losses of genomic fragments. In addition, different patterns and numbers of telomeric repeats were observed between the isolates.Host adaptation of P. striiformis at the forma specialis level is a complex pathogenic trait, involving not only virulence-related genes but also other genes. Gene loss, which might be adaptive and driven by transposable element activities, provides genomic basis for host adaptation of different formae speciales of P. striiformis.


September 22, 2019

Comparative genome analysis of jujube witches’-broom Phytoplasma, an obligate pathogen that causes jujube witches’-broom disease.

JWB phytoplasma is a kind of insect-transmitted and uncultivable bacterial plant pathogen causeing a destructive Jujube disease. To date, no genome information about JWB phytoplasma has been published, which hindered its characterization at genomic level. To understand its pathogenicity and ecology, the genome of a JWB phytoplasma isolate jwb-nky was sequenced and compared with other phytoplasmas enabled us to explore the mechanisms of genomic rearrangement.The complete genome sequence of JWB phytoplasma (jwb-nky) was determined, which consisting of one circular chromosome of 750,803 bp with a GC content of 23.3%. 694 protein-encoding genes, 2 operons for rRNA genes and 31 tRNA genes as well as 4 potential mobile units (PMUs) containing clusters of DNA repeats were identified. Based on PHIbaes analysis, a large number of genes were genome-specific and approximately 13% of JWB phytoplasma genes were predicted to be associated with virulence. Although transporters for maltose, dipeptides/oligopeptides, spermidine/putrescine, cobalt, Mn/Zn and methionine were identified, KEGG pathway analysis revealed the reduced metabolic capabilities of JWB phytoplasma. Comparative genome analyses between JWB phytoplasma and other phytoplasmas shows the occurrence of large-scale gene rearrangements. The low synteny with other phytoplasmas indicated that the expansion of multiple gene families/duplication probably occurred separately after differentiation.In this study, the complete genome sequence of a JWB phytoplasma isolate jwb-nky that causing JWB disease was reported for the first time and a number of species-specific genes were identified in the genome. The study enhanced our understandings about genomic basis and the pathogenicity mechanism of this pathogen, which will aid in the development of improved strategies for efficient management of JWB diseases.


September 22, 2019

B chromosomes of the Asian seabass (Lates calcarifer) contribute to genome variations at the level of individuals and populations.

The Asian seabass (Lates calcarifer) is a bony fish from the Latidae family, which is widely distributed in the tropical Indo-West Pacific region. The karyotype of the Asian seabass contains 24 pairs of A chromosomes and a variable number of AT- and GC-rich B chromosomes (Bchrs or Bs). Dot-like shaped and nucleolus-associated AT-rich Bs were microdissected and sequenced earlier. Here we analyzed DNA fragments from Bs to determine their repeat and gene contents using the Asian seabass genome as a reference. Fragments of 75 genes, including an 18S rRNA gene, were found in the Bs; repeats represented 2% of the Bchr assembly. The 18S rDNA of the standard genome and Bs were similar and enriched with fragments of transposable elements. A higher nuclei DNA content in the male gonad and somatic tissue, compared to the female gonad, was demonstrated by flow cytometry. This variation in DNA content could be associated with the intra-individual variation in the number of Bs. A comparison between the copy number variation among the B-related fragments from whole genome resequencing data of Asian seabass individuals identified similar profiles between those from the South-East Asian/Philippines and Indian region but not the Australian ones. Our results suggest that Bs might cause variations in the genome among the individuals and populations of Asian seabass. A personalized copy number approach for segmental duplication detection offers a suitable tool for population-level analysis across specimens with low coverage genome sequencing.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.