April 21, 2020

TaF: a web platform for taxonomic profile-based fungal gene prediction.

The accurate prediction and annotation of gene structures from the genome sequence of an organism enable genome-wide functional analyses to obtain insight into the biological properties of an organism. We recently developed a highly accurate filamentous fungal gene prediction pipeline and web platform called TaF. TaF is a homology-based gene predictor employing large-scale taxonomic profiling to search for close relatives in genome queries. The TaF pipeline consists of four processing steps: (1) taxonomic profiling to search for close relatives of the query, (2) generation of hints for determining exon-intron boundaries from orthologous protein sequence data of the profiled species, (3) gene prediction by a combination of ab initio and evidence-based prediction methods, and (4) homology search for gene models. TaF generates extrinsic evidence that suggests possible exon-intron boundaries based on orthologous protein sequence data, thus reducing false-positive predictions of gene structure based on data from distantly related orthologs. In particular, the gene prediction method using taxonomic profiling shows very high accuracy, including high sensitivity and specificity for gene models, suggesting a new approach for homology-based gene prediction from newly sequenced or uncharacterized fungal genomes, with the potential to improve the quality of gene prediction. TaF will be a useful tool for fungal genome-wide analyses, including the identification of targeted genes associated with a trait, transcriptome profiling, comparative genomics, and evolutionary analysis.


April 21, 2020

Genome Sequence of Jaltomata Addresses Rapid Reproductive Trait Evolution and Enhances Comparative Genomics in the Hyper-Diverse Solanaceae.

Within the economically important plant family Solanaceae, Jaltomata is a rapidly evolving genus that has extensive diversity in flower size and shape, as well as fruit and nectar color, among its ~80 species. Here, we report the whole-genome sequencing, assembly, and annotation of one representative species (Jaltomata sinuosa) from this genus. Combining PacBio long reads (25×) and Illumina short reads (148×) achieved an assembly of ~1.45 Gb, spanning ~96% of the estimated genome. Ninety-six percent of curated single-copy orthologs in plants were detected in the assembly, supporting a high level of completeness of the genome. Similar to other Solanaceous species, repetitive elements made up a large fraction (~80%) of the genome, with the most recently active element, Gypsy, expanding across the genome in the last 1-2 Myr. Computational gene prediction, in conjunction with a merged transcriptome data set from 11 tissues, identified 34,725 protein-coding genes. Comparative phylogenetic analyses with six other sequenced Solanaceae species determined that Jaltomata is most likely sister to Solanum, although a large fraction of gene trees supported a conflicting bipartition consistent with substantial introgression between Jaltomata and Capsicum after these species split. We also identified gene family dynamics specific to Jaltomata, including expansion of gene families potentially involved in novel reproductive trait development, and loss of gene families that accompanied the loss of self-incompatibility. This high-quality genome will facilitate studies of phenotypic diversification in this rapidly radiating group and provide a new point of comparison for broader analyses of genomic evolution across the Solanaceae.


April 21, 2020

Heterochromatin-enriched assemblies reveal the sequence and organization of the Drosophila melanogaster Y chromosome.

Heterochromatic regions of the genome are repeat-rich and poor in protein-coding genes, and are therefore underrepresented in even the best genome assemblies. Sex-limited chromosomes are among the most difficult regions of the genome to assemble. The Drosophila melanogaster Y chromosome is entirely heterochromatic, yet has wide-ranging effects on male fertility, fitness, and genome-wide gene expression. The genetic basis of this phenotypic variation is difficult to study, in part because we do not know the detailed organization of the Y chromosome. To study Y chromosome organization in D. melanogaster, we developed an assembly strategy involving the in silico enrichment of heterochromatic long single-molecule reads and used these reads to create targeted de novo assemblies of heterochromatic sequences. We assigned contigs to the Y chromosome using Illumina reads to identify male-specific sequences. Our pipeline extends the D. melanogaster reference genome by 11.9 Mb, closes 43.8% of the gaps, and improves overall contiguity. The addition of 10.6 Mb of Y-linked sequence permitted us to study the organization of repeats and genes along the Y chromosome. We detected a high rate of duplication to the pericentric regions of the Y chromosome from other regions in the genome. Most of these duplicated genes exist in multiple copies. We detail the evolutionary history of one sex-linked gene family, crystal-Stellate. While the Y chromosome does not undergo crossing over, we observed high gene conversion rates within and between members of the crystal-Stellate gene family, Su(Ste), and PCKR, compared to genome-wide estimates. Our results suggest that gene conversion and gene duplication play an important role in the evolution of Y-linked genes. Copyright © 2019 Chang and Larracuente.


April 21, 2020

Effective approaches to study the plant-root knot nematode interaction.

Plant-parasitic nematodes cause major agricultural losses worldwide. Examining the molecular mechanisms underlying plant-nematode interactions and how plants respond to different invading pathogens is attracting major attention to reduce the expanding gap between agricultural production and the needs of the growing world population. This review summarizes the most recent developments in plant-nematode interactions and the diverse approaches used to improve plant resistance against root knot nematode (RKN). We will emphasize the recent rapid advances in genome sequencing technologies, small interfering RNA techniques (RNAi) and targeted genome editing which are contributing to the significant progress in understanding the plant-nematode interaction mechanisms. Also, molecular approaches to improve plant resistance against nematodes are considered. Copyright © 2019 Elsevier Masson SAS. All rights reserved.


April 21, 2020

Short communication: Identification of the pseudoautosomal region in the Hereford bovine reference genome assembly ARS-UCD1.2.

In cattle, the X chromosome accounts for approximately 3 and 6% of the genome in bulls and cows, respectively. In spite of the large size of this chromosome, very few studies report analysis of the X chromosome in genome-wide association studies and genomic selection. This lack of genetic interrogation is likely due to the complexities of undertaking these studies given the hemizygous state of some, but not all, of the X chromosome in males. The first step in facilitating analysis of this gene-rich chromosome is to accurately identify coordinates for the pseudoautosomal boundary (PAB) to split the chromosome into a region that may be treated as autosomal sequence (pseudoautosomal region) and a region that requires more complex statistical models. With the recent release of ARS-UCD1.2, a more complete and accurate assembly of the cattle genome than was previously available, it is timely to fine map the PAB for the first time. Here we report the use of SNP chip genotypes, short-read sequences, and long-read sequences to fine map the PAB (X chromosome:133,300,518) and simultaneously determine the neighboring regions of reduced homology and true pseudoautosomal region. These results greatly facilitate the inclusion of the X chromosome in genome-wide association studies, genomic selection, and other genetic analysis undertaken on this reference genome. The Authors. Published by FASS Inc. and Elsevier Inc. on behalf of the American Dairy Science Association®. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).


April 21, 2020

Analyses of four new Caulobacter Phicbkviruses indicate independent lineages.

Bacteriophages with genomes larger than 200 kbp are considered giant phages, and the giant Phicbkviruses are the most frequently isolated Caulobacter crescentus phages. In this study, we compare six bacteriophage genomes that differ from the genomes of the majority of Phicbkviruses. Four of these genomes are much larger than those of the rest of the Phicbkviruses, with genome sizes that are more than 250 kbp. A comparison of 16 Phicbkvirus genomes identified a ‘core genome’ of 69 genes that is present in all of these Phicbkvirus genomes, as well as shared accessory genes and genes that are unique for each phage. Most of the core genes are clustered into the regions coding for structural proteins or those involved in DNA replication. A phylogenetic analysis indicated that these 16 Caulobacter Phicbkvirus genomes are related, but they represent four distinct branches of the Phicbkvirus genomic tree with distantly related branches sharing little nucleotide homology. In contrast, pairwise comparisons within each branch of the phylogenetic tree showed that more than 80% of the entire genome is shared among phages within a group. This conservation of the genomes within each branch indicates that horizontal gene transfer events between the groups are rare. Therefore, the Phicbkvirus genus consists of at least four different phylogenetic branches that are evolving independently from one another. One of these branches contains a 27-gene inversion relative to the other three branches. Also, an analysis of the tRNA genes showed that they are relatively mobile within the Phicbkvirus genus.
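The split into core, shared accessory, and unique genes described in this abstract can be illustrated with a minimal sketch. It assumes each genome has already been reduced to a set of gene-family labels. The function name `partition_pangenome`, the phage names, and the gene labels are all hypothetical toy values, not data or methods from the study.

```python
from collections import Counter


def partition_pangenome(gene_sets):
    """Split gene families into core, accessory, and unique sets.

    gene_sets maps a genome name to the set of gene-family labels it carries
    (hypothetical input format, assumed for this sketch).
    """
    n_genomes = len(gene_sets)
    # Count in how many genomes each gene family occurs.
    counts = Counter(g for genes in gene_sets.values() for g in set(genes))
    core = {g for g, c in counts.items() if c == n_genomes}  # present in every genome
    unique = {g for g, c in counts.items() if c == 1}        # present in exactly one
    accessory = set(counts) - core - unique                  # shared by some, not all
    return core, accessory, unique


# Toy example with three made-up phage genomes.
phages = {
    "phageA": {"capsid", "polymerase", "tRNA_x"},
    "phageB": {"capsid", "polymerase", "tail_fiber"},
    "phageC": {"capsid", "polymerase", "tail_fiber", "lysin"},
}
core, accessory, unique = partition_pangenome(phages)
print(sorted(core), sorted(accessory), sorted(unique))
```

The sketch expects at least two genomes; with a single genome every gene would count as both core and unique. Real analyses also need an ortholog-clustering step to produce the gene-family labels, which is outside this example.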


April 21, 2020

Combined Genome and Transcriptome (G&T) Sequencing of Single Cells.

The simultaneous examination of a single cell’s genome and transcriptome presents scientists with a powerful tool to study genetic variability and its effect on gene expression. In this chapter, we describe the library generation method for combined genome and transcriptome sequencing (G&T-seq) originally described by Macaulay et al. (Nat Protoc 11(11):2081-2103, 2016; Nat Methods 12(6):519-522, 2015). This includes some alterations we made to improve robustness of this process for both the novice user and laboratories that want to deploy this method at scale. Using this method, genomic DNA and full-length mRNA from single cells are separated, amplified, and converted into Illumina sequencer-compatible sequencing libraries.


April 21, 2020

Two large reciprocal translocations characterized in the disease resistance-rich burmannica genetic group of Musa acuminata.

Banana cultivars are derived from hybridizations involving Musa acuminata subspecies. The latter diverged following geographical isolation in distinct South-east Asian continental regions and islands. Observation of chromosome pairing irregularities in meiosis of hybrids between these subspecies suggested the presence of large chromosomal structural variations. The aim of this study was to characterize such rearrangements. Marker (single nucleotide polymorphism) segregation in a self-progeny of the ‘Calcutta 4’ accession and mate-pair sequencing were used to search for chromosomal rearrangements in comparison with the M. acuminata ssp. malaccensis genome reference sequence. Signature segment junctions of the revealed chromosome structures were identified and searched in whole-genome sequencing data from 123 wild and cultivated Musa accessions. Two large reciprocal translocations were characterized in the seedy banana M. acuminata ssp. burmannicoides ‘Calcutta 4’ accession. One consisted of an exchange of a 240 kb distal region of chromosome 2 with a 7.2 Mb distal region of chromosome 8. The other involved an exchange of a 20.8 Mb distal region of chromosome 1 with an 11.6 Mb distal region of chromosome 9. Both translocations were found only in wild accessions belonging to the burmannicoides/burmannica/siamea subspecies. Only two of the 87 cultivars analysed displayed the 2/8 translocation, while none displayed the 1/9 translocation. Two large reciprocal translocations were identified that probably originated in the burmannica genetic group. Accurate characterization of these translocations should enhance the use of this disease resistance-rich burmannica group in breeding programmes. © The Author(s) 2019. Published by Oxford University Press on behalf of the Annals of Botany Company.


April 21, 2020

The conservation of polyol transporter proteins and their involvement in lichenized Ascomycota.

In lichen symbiosis, polyol transfer from green algae is important for acquiring the fungal carbon source. However, the existence of polyol transporter genes and their correlation with lichenization remain unclear. Here, we report candidate polyol transporter genes selected from the genome of the lichen-forming fungus (LFF) Ramalina conduplicans. A phylogenetic analysis using characterized polyol and monosaccharide transporter proteins and hypothetical polyol transporter proteins of R. conduplicans and various ascomycetous fungi suggested that the characterized yeast polyol transporters form multiple clades with the polyol transporter-like proteins selected from the diverse ascomycetous taxa. Thus, polyol transporter genes are widely conserved among Ascomycota, regardless of lichen-forming status. In addition, the phylogenetic clusters suggested that LFFs belonging to Lecanoromycetes have duplicated proteins in each cluster. Consequently, the number of sequences similar to characterized yeast polyol transporters was evaluated using the genomes of 472 species or strains of Ascomycota. Among these, LFFs belonging to Lecanoromycetes had greater numbers of deduced polyol transporter proteins. Thus, various polyol transporters are conserved in Ascomycota and polyol transporter genes appear to have expanded during the evolution of Lecanoromycetes. Copyright © 2019 British Mycological Society. Published by Elsevier Ltd. All rights reserved.


April 21, 2020

A Rigorous Interlaboratory Examination of the Need to Confirm Next-Generation Sequencing-Detected Variants with an Orthogonal Method in Clinical Genetic Testing.

Orthogonal confirmation of next-generation sequencing (NGS)-detected germline variants is standard practice, although published studies have suggested that confirmation of the highest-quality calls may not always be necessary. The key question is how laboratories can establish criteria that consistently identify those NGS calls that require confirmation. Most prior studies addressing this question have had limitations: they have been generally of small scale, omitted statistical justification, and explored limited aspects of underlying data. The rigorous definition of criteria that separate high-accuracy NGS calls from those that may or may not be true remains a crucial issue. We analyzed five reference samples and over 80,000 patient specimens from two laboratories. Quality metrics were examined for approximately 200,000 NGS calls with orthogonal data, including 1662 false positives. A classification algorithm used these data to identify a battery of criteria that flag 100% of false positives as requiring confirmation (CI lower bound, 98.5% to 99.8%, depending on variant type) while minimizing the number of flagged true positives. These criteria identify false positives that the previously published criteria miss. Sampling analysis showed that smaller data sets resulted in less effective criteria. Our methodology for determining test- and laboratory-specific criteria can be generalized into a practical approach that can be used by laboratories to reduce the cost and time burdens of confirmation without affecting clinical accuracy. Copyright © 2019 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
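To see why a criterion that flags 100% of observed false positives still carries a confidence-interval lower bound below 100%, and why smaller data sets give weaker guarantees, consider the following minimal sketch. It assumes a one-sided exact (Clopper-Pearson) binomial bound, which may not be the interval the authors used. The function name and the counts 200 and 1500 are hypothetical illustrations, not figures from the study.

```python
def lower_bound_all_flagged(n, confidence=0.95):
    """One-sided exact lower bound on the flag rate when n of n false positives are flagged.

    With every one of n observations flagged, the exact (Clopper-Pearson) bound
    is the smallest rate p satisfying p**n >= alpha, i.e. alpha ** (1 / n).
    """
    if n <= 0:
        raise ValueError("n must be a positive integer")
    alpha = 1.0 - confidence
    return alpha ** (1.0 / n)


# Hypothetical false-positive counts for two variant types (assumed values).
for n in (200, 1500):
    print(n, round(lower_bound_all_flagged(n), 4))
```

The bound tightens as the number of observed false positives grows, which is consistent with the abstract's observation that smaller data sets resulted in less effective criteria.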


April 21, 2020

Snf2 controls pulcherriminic acid biosynthesis and antifungal activity of the biocontrol yeast Metschnikowia pulcherrima.

Metschnikowia pulcherrima synthesises the pigment pulcherrimin from cyclodileucine (cyclo(Leu-Leu)) as a precursor and exhibits strong antifungal activity against notorious plant pathogenic fungi. This yeast therefore has great potential for biocontrol applications against fungal diseases, particularly in the phyllosphere where this species is frequently found. To elucidate the molecular basis of the antifungal activity of M. pulcherrima, we compared a wild-type strain with a spontaneously occurring, pigmentless, weakly antagonistic mutant derivative. Whole genome sequencing of the wild-type and mutant strains identified a point mutation that creates a premature stop codon in the transcriptional regulator gene SNF2 in the mutant. Complementation of the mutant strain with the wild-type SNF2 gene restored pigmentation and recovered the strong antifungal activity. Mass spectrometry (UPLC HR HESI-MS) proved the presence of the pulcherrimin precursors cyclo(Leu-Leu) and pulcherriminic acid and identified new precursor and degradation products of pulcherriminic acid and/or pulcherrimin. All of these compounds were identified in the wild-type and complemented strain, but were undetectable in the pigmentless snf2 mutant strain. These results thus identify Snf2 as a regulator of antifungal activity and pulcherriminic acid biosynthesis in M. pulcherrima and provide a starting point for deciphering the molecular functions underlying the antagonistic activity of this yeast. © 2019 The Authors. Molecular Microbiology Published by John Wiley & Sons Ltd.


April 21, 2020

Genomic investigation of Staphylococcus aureus recovered from Gambian women and newborns following an oral dose of intra-partum azithromycin.

Oral azithromycin given during labour reduces carriage of bacteria responsible for neonatal sepsis, including Staphylococcus aureus. However, there is concern that this may promote drug resistance. Here, we combine genomic and epidemiological data on S. aureus isolated from mothers and babies in a randomized intra-partum azithromycin trial (PregnAnZI) to describe bacterial population dynamics and resistance mechanisms. Participants from both arms of the trial, who carried S. aureus in day 3 and day 28 samples post-intervention, were included. Sixty-six S. aureus isolates (from 7 mothers and 10 babies) underwent comparative genome analyses and the data were then combined with epidemiological data. Trial registration (main trial): ClinicalTrials.gov Identifier NCT01800942. Seven S. aureus STs were identified, with ST5 dominant (n = 40, 61.0%), followed by ST15 (n = 11, 17.0%). ST5 predominated in the placebo arm (73.0% versus 49.0%, P = 0.039) and ST15 in the azithromycin arm (27.0% versus 6.0%, P = 0.022). In azithromycin-resistant isolates, msr(A) was the main macrolide resistance gene (n = 36, 80%). Ten study participants, from both trial arms, acquired azithromycin-resistant S. aureus after initially harbouring a susceptible isolate. In nine (90%) of these cases, the acquired clone was an msr(A)-containing ST5 S. aureus. Long-read sequencing demonstrated that in ST5, msr(A) was found on an MDR plasmid. Our data reveal in this Gambian population the presence of a dominant clone of S. aureus harbouring plasmid-encoded azithromycin resistance, which was acquired by participants in both arms of the study. Understanding these resistance dynamics is crucial to defining the public health drug resistance impacts of azithromycin prophylaxis given during labour in Africa. © The Author(s) 2019. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy.


April 21, 2020

Identification of potential tobramycin-resistant mutagenesis of Escherichia coli strains after spaceflight.

This study aimed to explore potential tobramycin-resistant mutagenesis of Escherichia coli strains after spaceflight. Spaceflight-induced mutagenesis of a multidrug-resistant E. coli strain (T1_13) was carried out in outer space for 64 days (ST5), alongside a ground laboratory control under the same conditions (GT5). Both whole-genome sequencing and RNA-sequencing were performed. A total of 75 single nucleotide polymorphisms and 20 InDels were found to be associated with the resistance mechanism. Compared with T1_13, 1242 genes were differentially expressed in more than 20 of 38 tobramycin-resistant E. coli isolates while not in GT5. Functional annotation of the genes related to these single nucleotide polymorphisms/InDels and of the differentially expressed genes was performed. This study provided clues for potential tobramycin-resistant spaceflight-induced mutagenesis of E. coli.


April 21, 2020

One Aeromonas salmonicida subsp. salmonicida isolate with a pAsa5 variant bearing antibiotic resistance and a pRAS3 variant making a link with a swine pathogen.

The Gram-negative bacterium Aeromonas salmonicida subsp. salmonicida is an aquatic pathogen that causes furunculosis in salmonids, especially in fish farms. The emergence of strains of this bacterium exhibiting antibiotic resistance is increasing, limiting the effectiveness of antibiotherapy as a treatment against this worldwide disease. In the present study, we discovered an isolate of A. salmonicida subsp. salmonicida that harbors two novel plasmid variants carrying antibiotic resistance genes. The use of long-read sequencing (PacBio) allowed us to fully characterize those variants, named pAsa5-3432 and pRAS3-3432, which both differ from their classic counterparts in their content of mobile genetic elements. The plasmid pAsa5-3432 carries a new multidrug region composed of multiple mobile genetic elements, including a Class 1 integron similar to an integrated element of Salmonella enterica. With this new region, probably acquired through plasmid recombination, pAsa5-3432 is the first reported plasmid of this bacterium that bears both an essential virulence factor (the type three secretion system) and multiple antibiotic resistance genes. As for pRAS3-3432, compared to the classic pRAS3, it carries a new mobile element that has only been identified in Chlamydia suis. Hence, with the identification of those two novel plasmids harboring mobile genetic elements that are normally encountered in other bacterial species, the present study emphasizes the important impact of mobile genetic elements on the genomic plasticity of A. salmonicida subsp. salmonicida and suggests that this aquatic bacterium could be an important reservoir of antibiotic resistance genes that can be exchanged with other bacteria, including human and animal pathogens. Copyright © 2019 Elsevier B.V. All rights reserved.


April 21, 2020

The interplay between microRNA and alternative splicing of linear and circular RNAs in eleven plant species.

MicroRNA (miRNA) and alternative splicing (AS)-mediated post-transcriptional regulation has been extensively studied in most eukaryotes. However, the interplay between AS and miRNAs has not been explored in plants. To our knowledge, the overall profile of miRNA target sites in circular RNAs (circRNA) generated by alternative back splicing has never been reported previously. To address the challenge, we identified miRNA target sites located in alternatively spliced regions of the linear and circular splice isoforms using the up-to-date single-molecule real-time (SMRT) isoform sequencing (Iso-Seq) and Illumina sequencing data in eleven plant species. In total, we identified 399,401 and 114,574 AS events from linear and circular RNAs, respectively. Among them, there were 64,781 and 41,146 miRNA target sites located in linear and circular AS regions, respectively. In addition, we found 38,913 circRNAs to be overlapping with 45,648 AS events of their own parent isoforms, suggesting circRNA regulation of AS of linear RNAs by forming R-loops with the genomic locus. Here, we present a comprehensive database of miRNA targets in alternatively spliced linear and circRNAs (ASmiR) and a web server for deposition and identification of miRNA target sites located in the alternatively spliced region of linear and circular RNAs. This database is accompanied by an easy-to-use web query interface for meaningful downstream analysis. The plant research community can submit user-defined datasets to the web service to search AS regions harboring small RNA target sites. In conclusion, this study provides an unprecedented resource to understand regulatory relationships between miRNAs and AS in both gymnosperms and angiosperms. The readily accessible database and web-based tools are available at http://forestry.fafu.edu.cn/bioinfor/db/ASmiR. Supplementary data are available at Bioinformatics online. © The Author(s) 2019. Published by Oxford University Press. All rights reserved.

