Menu
July 7, 2019  |  

Development of molecular markers linked to powdery mildew resistance GenePm4bby combining SNP discovery from transcriptome sequencing data with bulked segregant analysis (BSR-Seq) in wheat.

Powdery mildew resistance genePm4b, originating fromTriticum persicum, is effective against the prevalentBlumeria graminisf. sp.tritici(Bgt) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification ofPm4bduring the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F2:3mapping population (237 families) derived from a pair of isogenic lines VPM1/7*Bainong 3217 F4(carryingPm4b) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F2:3families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. FourPm4b-linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking thePm4bgene. Three SSR markers,Xics13,Xics43, andXics76, were incorporated in the new genetic linkage map, which locatedPm4bin a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship withBrachypodium distachyonchromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with disease resistance were predicted in this collinear genomic region, which included C2 domain protein, peroxidase activity protein, protein kinases of PKc_like super family, Mlo family protein, and catalytic domain of the serine/threonine kinases (STKc_IRAK like super family). The markers developed in the present study facilitate identification ofPm4bduring its MAS practice.


July 7, 2019  |  

Identification and expression analysis of wheat TaGF14 genes.

The 14-3-3 gene family members play key roles in various cellular processes. However, little is known about the numbers and roles of 14-3-3 genes in wheat. The aims of this study were to identify TaGF14 numbers in wheat by searching its whole genome through blast, to study the phylogenetic relationships with other plant species and to discuss the functions of TaGF14s. The results showed that common wheat harbored 20 TaGF14 genes, located on wheat chromosome groups 2, 3, 4, and 7. Out of them, eighteen TaGF14s are non-e proteins, and two wheat TaGF14 genes, TaGF14i and TaGF14f, are e proteins. Phylogenetic analysis indicated that these genes were divided into six clusters: cluster 1 (TaGF14d, TaGF14g, TaGF14j, TaGF14h, TaGF14c, and TaGF14n); cluster 2 (TaGF14k); cluster 3 (TaGF14b, TaGF14l, TaGF14m, and TaGF14s); cluster 4 (TaGF14a, TaGF14e, and TaGF14r); cluster 5 (TaGF14i and TaGF14f); and cluster 6 (TaGF14o, TaGF14p, TaGF14q, and TaGF14t). Tissue-specific gene expressions suggested that all TaGF14s were likely constitutively expressed, except two genes, i.e., TaGF14p and TaGF14f. And the highest amount of TaGF14 transcripts were observed in developing grains at 20 days post anthesis (DPA), especially for TaGF14j and TaGF14l. After drought stress, five genes, i.e., TaGF14c, TaGF14d, TaGF14g, TaGF14h, and TaGF14j, were up-regulated expression under drought stress for both 1 and 6 h, suggesting these genes played vital role in combating against drought stress. However, all the TaGF14s were down-regulated expression under heat stress for both 1 and 6 h, indicating TaGF14s may be negatively associated with heat stress by reducing the expression to combat heat stress or through other pathways. These results suggested that cluster 1, e.g., TaGF14j, may participate in the whole wheat developing stages, e.g., grain-filling (starch biosynthesis) and may also participate in combating against drought stress. Subsequently, a homolog of TaGF14j, TaGF14-JM22, were cloned by RACE and used to validate its function. Immunoblotting results showed that TaGF14-JM22 protein, closely related to TaGF14d, TaGF14g, and TaGF14j, can interact with AGP-L, SSI, SSII, SBEIIa, and SBEIIb in developing grains, suggesting that TaGF14s located on group 4 may be involved in starch biosynthesis. Therefore, it is possible to develop starch-rich wheat cultivars by modifying TaGF14s.


July 7, 2019  |  

Hercules: a profile HMM-based hybrid error correction algorithm for long reads.

Choosing whether to use second or third generation sequencing platforms can lead to trade-offs between accuracy and read length. Several types of studies require long and accurate reads. In such cases researchers often combine both technologies and the erroneous long reads are corrected using the short reads. Current approaches rely on various graph or alignment based techniques and do not take the error profile of the underlying technology into account. Efficient machine learning algorithms that address these shortcomings have the potential to achieve more accurate integration of these two technologies. We propose Hercules, the first machine learning-based long read error correction algorithm. Hercules models every long read as a profile Hidden Markov Model with respect to the underlying platform’s error profile. The algorithm learns a posterior transition/emission probability distribution for each long read to correct errors in these reads. We show on two DNA-seq BAC clones (CH17-157L1 and CH17-227A2) that Hercules-corrected reads have the highest mapping rate among all competing algorithms and have the highest accuracy when the breadth of coverage is high. On a large human CHM1 cell line WGS data set, Hercules is one of the few scalable algorithms; and among those, it achieves the highest accuracy.


July 7, 2019  |  

Satellite DNA evolution: old ideas, new approaches.

A substantial portion of the genomes of most multicellular eukaryotes consists of large arrays of tandemly repeated sequence, collectively called satellite DNA. The processes generating and maintaining different satellite DNA abundances across lineages are important to understand as satellites have been linked to chromosome mis-segregation, disease phenotypes, and reproductive isolation between species. While much theory has been developed to describe satellite evolution, empirical tests of these models have fallen short because of the challenges in assessing satellite repeat regions of the genome. Advances in computational tools and sequencing technologies now enable identification and quantification of satellite sequences genome-wide. Here, we describe some of these tools and how their applications are furthering our knowledge of satellite evolution and function. Copyright © 2018 Elsevier Ltd. All rights reserved.


July 7, 2019  |  

To B or not to B: a tale of unorthodox chromosomes.

Highlights • B chromosomes are dispensable parts of the karyotype of many eukaryotes. • Deemed genome parasites in plants and animals, provide advantage to pathogenic fungi. • Often enriched in repeats and in fast evolving pathogenicity-related genes. • B chromosomes are not a uniform class, share certain features with core chromosomes.


July 7, 2019  |  

The case for not masking away repetitive DNA

In the course of analyzing whole-genome data, it is common practice to mask or filter out repetitive regions of a genome, such as transposable elements and endogenous retroviruses, in order to focus only on genes and thus simplify the results. This Commentary is a plea from one member of the Mobile DNA community to all gene-centric researchers: please do not ignore the repetitive fraction of the genome. Please stop narrowing your findings by only analyzing a minority of the genome, and instead broaden your analyses to include the rich biology of repetitive and mobile DNA. In this article, I present four arguments supporting a case for retaining repetitive DNA in your genome-wide analysis.


July 7, 2019  |  

Genome sequence resources for the wheat stripe rust pathogen (Puccinia striiformis f. sp. tritici) and the barley stripe rust pathogen (Puccinia striiformis f. sp. hordei)

Puccinia striiformis f. sp. tritici causes devastating stripe (yellow) rust on wheat and P. striiformis f. sp. hordei causes stripe rust on barley. Several P. striiformis f. sp. tritici genomes are available, but no P. striiformis f. sp. hordei genome is available. More genomes of P. striiformis f. sp. tritici and P. striiformis f. sp. hordei are needed to understand the genome evolution and molecular mechanisms of their pathogenicity. We sequenced P. striiformis f. sp. tritici isolate 93-210 and P. striiformis f. sp. hordei isolate 93TX-2, using PacBio and Illumina technologies and RNA sequencing. Their genomic sequences were assembled to contigs with high continuity and showed significant structural differences. The circular mitochondria genomes of both were complete. These genomes provide high-quality resources for deciphering the genomic basis of rapid evolution and host adaptation, identifying genes for avirulence and other important traits, and studying host-pathogen interactions.


July 7, 2019  |  

ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers.

The long-range sequencing information captured by linked reads, such as those available from 10× Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-based mapping approach. The kmer mapping strategy has several advantages over read alignment methods, including better usability and faster processing, as it precludes the need for input sequence formatting and draft sequence assembly indexing. The reliance on kmers instead of read alignments for pairing sequences relaxes the workflow requirements, and drastically reduces the run time.Here, we show how linked reads, when used in conjunction with Hi-C data for scaffolding, improve a draft human genome assembly of PacBio long-read data five-fold (baseline vs. ARKS NG50?=?4.6 vs. 23.1 Mbp, respectively). We also demonstrate how the method provides further improvements of a megabase-scale Supernova human genome assembly (NG50?=?14.74 Mbp vs. 25.94 Mbp before and after ARKS), which itself exclusively uses linked read data for assembly, with an execution speed six to nine times faster than competitive linked read scaffolders (~?10.5 h compared to 75.7 h, on average). Following ARKS scaffolding of a human genome 10xG Supernova assembly (of cell line NA12878), fewer than 9 scaffolds cover each chromosome, except the largest (chromosome 1, n?=?13).ARKS uses a kmer mapping strategy instead of linked read alignments to record and associate the barcode information needed to order and orient draft assembly sequences. The simplified workflow, when compared to that of our initial implementation, ARCS, markedly improves run time performances on experimental human genome datasets. Furthermore, the novel distance estimator in ARKS utilizes barcoding information from linked reads to estimate gap sizes. It accomplishes this by modeling the relationship between known distances of a region within contigs and calculating associated Jaccard indices. ARKS has the potential to provide correct, chromosome-scale genome assemblies, promptly. We expect ARKS to have broad utility in helping refine draft genomes.


July 7, 2019  |  

The challenge of analyzing the sugarcane genome.

Reference genome sequences have become key platforms for genetics and breeding of the major crop species. Sugarcane is probably the largest crop produced in the world (in weight of crop harvested) but lacks a reference genome sequence. Sugarcane has one of the most complex genomes in crop plants due to the extreme level of polyploidy. The genome of modern sugarcane hybrids includes sub-genomes from two progenitors Saccharum officinarum and S. spontaneum with some chromosomes resulting from recombination between these sub-genomes. Advancing DNA sequencing technologies and strategies for genome assembly are making the sugarcane genome more tractable. Advances in long read sequencing have allowed the generation of a more complete set of sugarcane gene transcripts. This is supporting transcript profiling in genetic research. The progenitor genomes are being sequenced. A monoploid coverage of the hybrid genome has been obtained by sequencing BAC clones that cover the gene space of the closely related sorghum genome. The complete polyploid genome is now being sequenced and assembled. The emerging genome will allow comparison of related genomes and increase understanding of the functioning of this polyploidy system. Sugarcane breeding for traditional sugar and new energy and biomaterial uses will be enhanced by the availability of these genomic resources.


July 7, 2019  |  

Modular traits of the Rhizobiales root microbiota and their evolutionary relationship with symbiotic Rhizobia.

Rhizobia are a paraphyletic group of soil-borne bacteria that induce nodule organogenesis in legume roots and fix atmospheric nitrogen for plant growth. In non-leguminous plants, species from the Rhizobiales order define a core lineage of the plant microbiota, suggesting additional functional interactions with plant hosts. In this work, genome analyses of 1,314 Rhizobiales isolates along with amplicon studies of the root microbiota reveal the evolutionary history of nitrogen-fixing symbiosis in this bacterial order. Key symbiosis genes were acquired multiple times, and the most recent common ancestor could colonize roots of a broad host range. In addition, root growth promotion is a characteristic trait of Rhizobiales in Arabidopsis thaliana, whereas interference with plant immunity constitutes a separate, strain-specific phenotype of root commensal Alphaproteobacteria. Additional studies with a tripartite gnotobiotic plant system reveal that these traits operate in a modular fashion and thus might be relevant to microbial homeostasis in healthy roots. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.


July 7, 2019  |  

Tracing the de novo origin of protein-coding genes in yeast.

De novo genes are very important for evolutionary innovation. However, how these genes originate and spread remains largely unknown. To better understand this, we rigorously searched for de novo genes in Saccharomyces cerevisiae S288C and examined their spread and fixation in the population. Here, we identified 84 de novo genes in S. cerevisiae S288C since the divergence with their sister groups. Transcriptome and ribosome profiling data revealed at least 8 (10%) and 28 (33%) de novo genes being expressed and translated only under specific conditions, respectively. DNA microarray data, based on 2-fold change, showed that 87% of the de novo genes are regulated during various biological processes, such as nutrient utilization and sporulation. Our comparative and evolutionary analyses further revealed that some factors, including single nucleotide polymorphism (SNP)/indel mutation, high GC content, and DNA shuffling, contribute to the birth of de novo genes, while domestication and natural selection drive the spread and fixation of these genes. Finally, we also provide evidence suggesting the possible parallel origin of a de novo gene between S. cerevisiae and Saccharomyces paradoxus Together, our study provides several new insights into the origin and spread of de novo genes.IMPORTANCE Emergence of de novo genes has occurred in many lineages during evolution, but the birth, spread, and function of these genes remain unresolved. Here we have searched for de novo genes from Saccharomyces cerevisiae S288C using rigorous methods, which reduced the effects of bad annotation and genomic gaps on the identification of de novo genes. Through this analysis, we have found 84 new genes originating de novo from previously noncoding regions, 87% of which are very likely involved in various biological processes. We noticed that 10% and 33% of de novo genes were only expressed and translated under specific conditions, therefore, verification of de novo genes through transcriptome and ribosome profiling, especially from limited expression data, may underestimate the number of bona fide new genes. We further show that SNP/indel mutation, high GC content, and DNA shuffling could be involved in the birth of de novo genes, while domestication and natural selection drive the spread and fixation of these genes. Finally, we provide evidence suggesting the possible parallel origin of a new gene. Copyright © 2018 Wu and Knudson.


July 7, 2019  |  

Low-level antimicrobials in the medicinal leech select for resistant pathogens that spread to patients.

Fluoroquinolones (FQs) and ciprofloxacin (Cp) are important antimicrobials that pollute the environment in trace amounts. Although Cp has been recommended as prophylaxis for patients undergoing leech therapy to prevent infections by the leech gut symbiont Aeromonas, a puzzling rise in Cp-resistant (Cpr) Aeromonas infections has been reported. We report on the effects of subtherapeutic FQ concentrations on bacteria in an environmental reservoir, the medicinal leech, and describe the presence of multiple antibiotic resistance mutations and a gain-of-function resistance gene. We link the rise of CprAeromonas isolates to exposure of the leech microbiota to very low levels of Cp (0.01 to 0.04 µg/ml), <1/100 of the clinical resistance breakpoint for Aeromonas Using competition experiments and comparative genomics of 37 strains, we determined the mechanisms of resistance in clinical and leech-derived Aeromonas isolates, traced their origin, and determined that the presence of merely 0.01 µg/ml Cp provides a strong competitive advantage for Cpr strains. Deep-sequencing the Cpr-conferring region of gyrA enabled tracing of the mutation-harboring Aeromonas population in archived gut samples, and an increase in the frequency of the Cpr-conferring mutation in 2011 coincides with the initial reports of CprAeromonas infections in patients receiving leech therapy.IMPORTANCE The role of subtherapeutic antimicrobial contamination in selecting for resistant strains has received increasing attention and is an important clinical matter. This study describes the relationship of resistant bacteria from the medicinal leech, Hirudo verbana, with patient infections following leech therapy. While our results highlight the need for alternative antibiotic therapies, the rise of Cpr bacteria demonstrates the importance of restricting the exposure of animals to antibiotics approved for veterinary use. The shift to a more resistant community and the dispersion of Cpr-conferring mechanisms via mobile elements occurred in a natural setting due to the presence of very low levels of fluoroquinolones, revealing the challenges of controlling the spread of antibiotic-resistant bacteria and highlighting the importance of a holistic approach in the management of antibiotic use. Copyright © 2018 Beka et al.


July 7, 2019  |  

Identification of different putative outer membrane electron conduits necessary for Fe(III) citrate, Fe(III) oxide, Mn(IV) oxide, or electrode reduction by Geobacter sulfurreducens.

At least five gene clusters in the Geobacter sulfurreducens genome encode putative “electron conduits” implicated in electron transfer across the outer membrane, each containing a periplasmic multiheme c-type cytochrome, integral outer membrane anchor, and outer membrane redox lipoprotein(s). Markerless single-gene-cluster deletions and all possible multiple-deletion combinations were constructed and grown with soluble Fe(III) citrate, Fe(III) and Mn(IV) oxides, and graphite electrodes poised at +0.24 V and -0.1 V versus the standard hydrogen electrode (SHE). Different gene clusters were necessary for reduction of each electron acceptor. During metal oxide reduction, deletion of the previously described omcBC cluster caused defects, but deletion of additional components in an ?omcBC background, such as extEFG, were needed to produce defects greater than 50% compared to findings with the wild type. Deletion of all five gene clusters abolished all metal reduction. During electrode reduction, only the ?extABCD mutant had a severe growth defect at both redox potentials, while this mutation did not affect Fe(III) oxide, Mn(IV) oxide, or Fe(III) citrate reduction. Some mutants containing only one cluster were able to reduce particular terminal electron acceptors better than the wild type, suggesting routes for improvement by targeting specific electron transfer pathways. Transcriptomic comparisons between fumarate and electrode-based growth conditions showed all of these ext clusters to be constitutive, and transcriptional analysis of the triple-deletion strain containing only extABCD detected no significant changes in expression of genes encoding known redox proteins or pilus components. These genetic experiments reveal new outer membrane conduit complexes necessary for growth of G. sulfurreducens, depending on the available extracellular electron acceptor.IMPORTANCE Gram-negative metal-reducing bacteria utilize electron conduits, chains of redox proteins spanning the outer membrane, to transfer electrons to the extracellular surface. Only one pathway for electron transfer across the outer membrane of Geobacter sulfurreducens has been linked to Fe(III) reduction. However, G. sulfurreducens is able to respire a wide array of extracellular substrates. Here we present the first combinatorial genetic analysis of five different electron conduits via creation of new markerless deletion strains and complementation vectors. Multiple conduit gene clusters appear to have overlapping roles, including two that have never been linked to metal reduction. Another recently described cluster (ExtABCD) was the only electron conduit essential during electrode reduction, a substrate of special importance to biotechnological applications of this organism. Copyright © 2018 Jiménez Otero et al.


July 7, 2019  |  

Meeting report: mobile genetic elements and genome plasticity 2018

The Mobile Genetic Elements and Genome Plasticity conference was hosted by Keystone Symposia in Santa Fe, NM USA, February 11–15, 2018. The organizers were Marlene Belfort, Evan Eichler, Henry Levin and Lynn Maquat. The goal of this conference was to bring together scientists from around the world to discuss the function of transposable elements and their impact on host species. Central themes of the meeting included recent innovations in genome analysis and the role of mobile DNA in disease and evolution. The conference included 200 scientists who participated in poster presentations, short talks selected from abstracts, and invited talks. A total of 58 talks were organized into eight sessions and two workshops. The topics varied from mechanisms of mobilization, to the structure of genomes and their defense strategies to protect against transposable elements.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.