Menu
September 22, 2019  |  

Dynamic evolution of a-gliadin prolamin gene family in homeologous genomes of hexaploid wheat.

Wheat Gli-2 loci encode complex groups of a-gliadin prolamins that are important for breadmaking, but also major triggers of celiac disease (CD). Elucidation of a-gliadin evolution provides knowledge to produce wheat with better end-use properties and reduced immunogenic potential. The Gli-2 loci contain a large number of tandemly duplicated genes and highly repetitive DNA, making sequence assembly of their genomic regions challenging. Here, we constructed high-quality sequences spanning the three wheat homeologous a-gliadin loci by aligning PacBio-based sequence contigs with BioNano genome maps. A total of 47 a-gliadin genes were identified with only 26 encoding intact full-length protein products. Analyses of a-gliadin loci and phylogenetic tree reconstruction indicate significant duplications of a-gliadin genes in the last ~2.5 million years after the divergence of the A, B and D genomes, supporting its rapid lineage-independent expansion in different Triticeae genomes. We showed that dramatic divergence in expression of a-gliadin genes could not be attributed to sequence variations in the promoter regions. The study also provided insights into the evolution of CD epitopes and identified a single indel event in the hexaploid wheat D genome that likely resulted in the generation of the highly toxic 33-mer CD epitope.


September 22, 2019  |  

The N6-adenine methylation in yeast genome profiled by single-molecule technology.

The most common and abundant DNA modification is 5-meth- ylcytosine (5mC), which has been well-established as an epigenetic mark regulating gene expression in eukaryotes (Jones, 2012). Another DNA modification N6-methyldeoxyadenosine (6mA), pre- viously reported as a widespread DNA methylation in prokaryotes, plays an important role in gene expression, DNA replication, DNA repair, cell cycle progression and host-pathogen interaction (Messer and Noyer-Weidner, 1988; Lu et al., 1994; Collier et al., 2007). The knowledge of 6mA in eukaryotes has been very limited until the recent development of high-throughput sequencing and high-sensitive mass spectrometry technologies, which have greatly contributed to the investigation of 6mA in fungi, animals and plants (Fu et al., 2015; Greer et al., 2015; Zhang et al., 2015; Koziol et al., 2016; Liu et al., 2016; Wu et al., 2016; Liang et al., 2017; Mondo et al., 2017). Recent studies revealed that 6mA abundance is vari- able, and it is relative higher in Chlamydomonas and early- diverging fungi species than other eukaryotes. The distribution pat- terns of 6mA and their functions are not quite conserved among or- ganisms. 6mA was found enriched near the transcription start sites (TSS) in Chlamydomonas (Fu et al., 2015) and at the repeats in Drosophila, Mus musculus and Danio rerio (Zhang et al., 2015; Liu et al., 2016; Wu et al., 2016), and commonly depleted from gene exons in Xenopus laevis and M. musculus (Koziol et al., 2016). In several species, 6mA was associated with transcriptionally active genes (Fu et al., 2015; Mondo et al., 2017), and it was also found correlated with gene silencing in mammalian embryonic stem cells (Wu et al., 2016).


September 22, 2019  |  

The genome of the marine medaka Oryzias melastigma.

Marine medaka (Oryzias melastigma) is considered to be a useful fish model for marine and estuarine ecotoxicology studies and has good potential for field-based population genomics because of its geographical distribution in Asian estuarine and coastal areas. In this study, we present the first whole-genome draft of O. melastigma. The genome assembly consists of 8,602 scaffolds (N50 = 23.737 Mb) and a total genome length of 779.4 Mb. A total of 23,528 genes were predicted, and 12,670 gene families shared with three teleost species (Japanese medaka, mangrove killifish and zebrafish) were identified. Genome analyses revealed that the O. melastigma genome is highly heterozygous and contains a large number of repeat sequences. This assembly represents a useful genomic resource for fish scientists.© 2018 John Wiley & Sons Ltd.


September 22, 2019  |  

New genomic data and analyses challenge the traditional vision of animal epithelium evolution.

The emergence of epithelia was the foundation of metazoan expansion. Epithelial tissues are a hallmark of metazoans deeply rooted in the evolution of their complex developmental morphogenesis processes. However, studies on the epithelial features of non-bilaterians are still sparse and it remains unclear whether the last common metazoan ancestor possessed a fully functional epithelial toolkit or if it was acquired later during metazoan evolution.To investigate the early evolution of animal epithelia, we sequenced the genome and transcriptomes of two new sponge species to characterize epithelial markers such as the E-cadherin complex and the polarity complexes for all classes (Calcarea, Demospongiae, Hexactinellida, Homoscleromorpha) of sponges (phylum Porifera) and compare them with their homologues in Placozoa and in Ctenophora. We found that Placozoa and most sponges possess orthologues of all essential genes encoding proteins characteristic of bilaterian epithelial cells, as well as their conserved interaction domains. In stark contrast, we found that ctenophores lack several major polarity complex components such as the Crumbs complex and Scribble. Furthermore, the E-cadherin ctenophore orthologue exhibits a divergent cytoplasmic domain making it unlikely to interact with its canonical cytoplasmic partners.These unexpected findings challenge the current evolutionary paradigm on the emergence of epithelia. Altogether, our results raise doubt on the homology of protein complexes and structures involved in cell polarity and adhesive-type junctions between Ctenophora and Bilateria epithelia.


September 22, 2019  |  

Gene duplication and evolution dynamics in the homeologous regions harboring multiple prolamin and resistance gene families in hexaploid wheat.

Improving end-use quality and disease resistance are important goals in wheat breeding. The genetic loci controlling these traits are highly complex, consisting of large families of prolamin and resistance genes with members present in all three homeologous A, B, and D genomes in hexaploid bread wheat. Here, orthologous regions harboring both prolamin and resistance gene loci were reconstructed and compared to understand gene duplication and evolution in different wheat genomes. Comparison of the two orthologous D regions from the hexaploid wheat Chinese Spring and the diploid progenitor Aegilops tauschii revealed their considerable difference due to the presence of five large structural variations with sizes ranging from 100 kb to 2 Mb. As a result, 44% of the Ae. tauschii and 71% of the Chinese Spring sequences in the analyzed regions, including 79 genes, are not shared. Gene rearrangement events, including differential gene duplication and deletion in the A, B, and D regions, have resulted in considerable erosion of gene collinearity in the analyzed regions, suggesting rapid evolution of prolamin and resistance gene families after the separation of the three wheat genomes. We hypothesize that this fast evolution is attributed to the co-evolution of the two gene families dispersed within a high recombination region. The identification of a full set of prolamin genes facilitated transcriptome profiling and revealed that the A genome contributes the least to prolamin expression because of its smaller number of expressed intact genes and their low expression levels, while the B and D genomes contribute similarly.


September 22, 2019  |  

Genome of an allotetraploid wild peanut Arachis monticola: a de novo assembly.

Arachis monticola (2n = 4x = 40) is the only allotetraploid wild peanut within the Arachis genus and section, with an AABB-type genome of ~2.7 Gb in size. The AA-type subgenome is derived from diploid wild peanut Arachis duranensis, and the BB-type subgenome is derived from diploid wild peanut Arachis ipaensis. A. monticola is regarded either as the direct progenitor of the cultivated peanut or as an introgressive derivative between the cultivated peanut and wild species. The large polyploidy genome structure and enormous nearly identical regions of the genome make the assembly of chromosomal pseudomolecules very challenging. Here we report the first reference quality assembly of the A. monticola genome, using a series of advanced technologies. The final whole genome of A. monticola is ~2.62 Gb and has a contig N50 and scaffold N50 of 106.66 Kb and 124.92 Mb, respectively. The vast majority (91.83%) of the assembled sequence was anchored onto the 20 pseudo-chromosomes, and 96.07% of assemblies were accurately separated into AA- and BB- subgenomes. We demonstrated efficiency of the current state of the strategy for de novo assembly of the highly complex allotetraploid species, wild peanut (A. monticola), based on whole-genome shotgun sequencing, single molecule real-time sequencing, high-throughput chromosome conformation capture technology, and BioNano optical genome maps. These combined technologies produced reference-quality genome of the allotetraploid wild peanut, which is valuable for understanding the peanut domestication and evolution within the Arachis genus and among legume crops.


September 22, 2019  |  

Assembly of chromosome-scale contigs by efficiently resolving repetitive sequences with long reads

Due to the large number of repetitive sequences in complex eukaryotic genomes, fragmented and incompletely assembled genomes lose value as reference sequences, often due to short contigs that cannot be anchored or mispositioned onto chromosomes. Here we report a novel method Highly Efficient Repeat Assembly (HERA), which includes a new concept called a connection graph as well as algorithms for constructing the graph. HERA resolves repeats at high efficiency with single-molecule sequencing data, and enables the assembly of chromosome-scale contigs by further integrating genome maps and Hi-C data. We tested HERA with the genomes of rice R498, maize B73, human HX1 and Tartary buckwheat Pinku1. HERA can correctly assemble most of the tandemly repetitive sequences in rice using single-molecule sequencing data only. Using the same maize and human sequencing data published by Jiao et al. (2017) and Shi et al. (2016), respectively, we dramatically improved on the sequence contiguity compared with the published assemblies, increasing the contig N50 from 1.3 Mb to 61.2 Mb in maize B73 assembly and from 8.3 Mb to 54.4 Mb in human HX1 assembly with HERA. We provided a high-quality maize reference genome with 96.9% of the gaps filled (only 76 gaps left) and several incorrectly positioned sequences fixed compared with the B73 RefGen_v4 assembly. Comparisons between the HERA assembly of HX1 and the human GRCh38 reference genome showed that many gaps in GRCh38 could be filled, and that GRCh38 contained some potential errors that could be fixed. We assembled the Pinku1 genome into 12 scaffolds with a contig N50 size of 27.85 Mb. HERA serves as a new genome assembly/phasing method to generate high quality sequences for complex genomes and as a curation tool to improve the contiguity and completeness of existing reference genomes, including the correction of assembly errors in repetitive regions.


September 22, 2019  |  

De novo genome assembly of Oryza granulata reveals rapid genome expansion and adaptive evolution

The wild relatives of rice have adapted to different ecological environments and constitute a useful reservoir of agronomic traits for genetic improvement. Here we present the ~777?Mb de novo assembled genome sequence of Oryza granulata. Recent bursts of long-terminal repeat retrotransposons, especially RIRE2, led to a rapid twofold increase in genome size after O. granulata speciation. Universal centromeric tandem repeats are absent within its centromeres, while gypsy-type LTRs constitute the main centromere-specific repetitive elements. A total of 40,116 protein-coding genes were predicted in O. granulata, which is close to that of Oryza sativa. Both the copy number and function of genes involved in photosynthesis and energy production have undergone positive selection during the evolution of O. granulata, which might have facilitated its adaptation to the low light habitats. Together, our findings reveal the rapid genome expansion, distinctive centromere organization, and adaptive evolution of O. granulata.


September 22, 2019  |  

The Chara genome: Secondary complexity and implications for plant terrestrialization.

Land plants evolved from charophytic algae, among which Charophyceae possess the most complex body plans. We present the genome of Chara braunii; comparison of the genome to those of land plants identified evolutionary novelties for plant terrestrialization and land plant heritage genes. C. braunii employs unique xylan synthases for cell wall biosynthesis, a phragmoplast (cell separation) mechanism similar to that of land plants, and many phytohormones. C. braunii plastids are controlled via land-plant-like retrograde signaling, and transcriptional regulation is more elaborate than in other algae. The morphological complexity of this organism may result from expanded gene families, with three cases of particular note: genes effecting tolerance to reactive oxygen species (ROS), LysM receptor-like kinases, and transcription factors (TFs). Transcriptomic analysis of sexual reproductive structures reveals intricate control by TFs, activity of the ROS gene network, and the ancestral use of plant-like storage and stress protection proteins in the zygote. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019  |  

Analysis of the Gli-D2 locus identifies a genetic target for simultaneously improving the breadmaking and health-related traits of common wheat.

Gliadins are a major component of wheat seed proteins. However, the complex homoeologous Gli-2 loci (Gli-A2, -B2 and -D2) that encode the a-gliadins in commercial wheat are still poorly understood. Here we analyzed the Gli-D2 locus of Xiaoyan 81 (Xy81), a winter wheat cultivar. A total of 421.091 kb of the Gli-D2 sequence was assembled from sequencing multiple bacterial artificial clones, and 10 a-gliadin genes were annotated. Comparative genomic analysis showed that Xy81 carried only eight of the a-gliadin genes of the D genome donor Aegilops tauschii, with two of them each experiencing a tandem duplication. A mutant line lacking Gli-D2 (DLGliD2) consistently exhibited better breadmaking quality and dough functionalities than its progenitor Xy81, but without penalties in other agronomic traits. It also had an elevated lysine content in the grains. Transcriptome analysis verified the lack of Gli-D2 a-gliadin gene expression in DLGliD2. Furthermore, the transcript and protein levels of protein disulfide isomerase were both upregulated in DLGliD2 grains. Consistent with this finding, DLGliD2 had increased disulfide content in the flour. Our work sheds light on the structure and function of Gli-D2 in commercial wheat, and suggests that the removal of Gli-D2 and the gliadins specified by it is likely to be useful for simultaneously enhancing the end-use and health-related traits of common wheat. Because gliadins and homologous proteins are widely present in grass species, the strategy and information reported here may be broadly useful for improving the quality traits of diverse cereal crops.© 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.


September 22, 2019  |  

Discovery of new genes involved in curli production by a uropathogenic Escherichia coli strain from the highly virulent O45:K1:H7 lineage.

Curli are bacterial surface-associated amyloid fibers that bind to the dye Congo red (CR) and facilitate uropathogenic Escherichia coli (UPEC) biofilm formation and protection against host innate defenses. Here we sequenced the genome of the curli-producing UPEC pyelonephritis strain MS7163 and showed it belongs to the highly virulent O45:K1:H7 neonatal meningitis-associated clone. MS7163 produced curli at human physiological temperature, and this correlated with biofilm growth, resistance of sessile cells to the human cationic peptide cathelicidin, and enhanced colonization of the mouse bladder. We devised a forward genetic screen using CR staining as a proxy for curli production and identified 41 genes that were required for optimal CR binding, of which 19 genes were essential for curli synthesis. Ten of these genes were novel or poorly characterized with respect to curli synthesis and included genes involved in purine de novo biosynthesis, a regulator that controls the Rcs phosphorelay system, and a novel repressor of curli production (referred to as rcpA). The involvement of these genes in curli production was confirmed by the construction of defined mutants and their complementation. The mutants did not express the curli major subunit CsgA and failed to produce curli based on CR binding. Mutation of purF (the first gene in the purine biosynthesis pathway) and rcpA also led to attenuated colonization of the mouse bladder. Overall, this work has provided new insight into the regulation of curli and the role of these amyloid fibers in UPEC biofilm formation and pathogenesis.IMPORTANCE Uropathogenic Escherichia coli (UPEC) strains are the most common cause of urinary tract infection, a disease increasingly associated with escalating antibiotic resistance. UPEC strains possess multiple surface-associated factors that enable their colonization of the urinary tract, including fimbriae, curli, and autotransporters. Curli are extracellular amyloid fibers that enhance UPEC virulence and promote biofilm formation. Here we examined the function and regulation of curli in a UPEC pyelonephritis strain belonging to the highly virulent O45:K1:H7 neonatal meningitis-associated clone. Curli expression at human physiological temperature led to increased biofilm formation, resistance of sessile cells to the human cationic peptide LL-37, and enhanced bladder colonization. Using a comprehensive genetic screen, we identified multiple genes involved in curli production, including several that were novel or poorly characterized with respect to curli synthesis. In total, this study demonstrates an important role for curli as a UPEC virulence factor that promotes biofilm formation, resistance, and pathogenesis. Copyright © 2018 Nhu et al.


September 22, 2019  |  

Population genomics of Culiseta melanura, the principal vector of Eastern equine encephalitis virus in the United States.

Eastern Equine Encephalitis (EEE) (Togaviridae, Alphavirus) is a highly pathogenic mosquito-borne arbovirus that circulates in an enzootic cycle involving Culiseta melanura mosquitoes and wild Passeriformes birds in freshwater swamp habitats. Recently, the northeastern United States has experienced an intensification of virus activity with increased human involvement and northward expansion into new regions. In addition to its principal role in enzootic transmission of EEE virus among avian hosts, recent studies on the blood-feeding behavior of Cs. melanura throughout its geographic range suggest that this mosquito may also be involved in epizootic / epidemic transmission to equines and humans in certain locales. Variations in blood feeding behavior may be a function of host availability, environmental factors, and/or underlying genetic differences among regional populations. Despite the importance of Cs. melanura in transmission and maintenance of EEE virus, the genetics of this species remains largely unexplored.To investigate the occurrence of genetic variation in Cs. melanura, the genome of this mosquito vector was sequenced resulting in a draft genome assembly of 1.28 gigabases with a contig N50 of 93.36 kilobases. Populations of Cs. melanura from 10 EEE virus foci in the eastern North America were genotyped with double-digest RAD-seq. Following alignment of reads to the reference genome, variant calling, and filtering, 40,384 SNPs were retained for downstream analyses. Subsequent analyses revealed genetic differentiation between northern and southern populations of this mosquito species. Moreover, limited fine-scale population structure was detected throughout northeastern North America, suggesting local differentiation of populations but also a history of ancestral polymorphism or contemporary gene flow. Additionally, a genetically distinct cluster was identified predominantly at two northern sites.This study elucidates the first evidence of fine-scale population structure in Cs. melanura throughout its eastern range and detects evidence of gene flow between populations in northeastern North America. This investigation provides the groundwork for examining the consequences of genetic variations in the populations of this mosquito species that could influence vector-host interactions and the risk of human and equine infection with EEE virus.


September 22, 2019  |  

The hpRNA/RNAi pathway is essential to resolve intragenomic conflict in the Drosophila male germline.

Intragenomic conflicts are fueled by rapidly evolving selfish genetic elements, which induce selective pressures to innovate opposing repressive mechanisms. This is patently manifest in sex-ratio (SR) meiotic drive systems, in which distorter and suppressor factors bias and restore equal transmission of X and Y sperm. Here, we reveal that multiple SR suppressors in Drosophila simulans (Nmy and Tmy) encode related hairpin RNAs (hpRNAs), which generate endo-siRNAs that repress the paralogous distorters Dox and MDox. All components in this drive network are recently evolved and largely testis restricted. To connect SR hpRNA function to the RNAi pathway, we generated D. simulans null mutants of Dcr-2 and AGO2. Strikingly, these core RNAi knockouts massively derepress Dox and MDox and are in fact completely male sterile and exhibit highly defective spermatogenesis. Altogether, our data reveal how the adaptive capacity of hpRNAs is critically deployed to restrict selfish gonadal genetic systems that can exterminate a species. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019  |  

PacBio-based mitochondrial genome assembly of Leucaena trichandra (Leguminosae) and an intrageneric assessment of mitochondrial RNA editing.

Reconstructions of vascular plant mitochondrial genomes (mt-genomes) are notoriously complicated by rampant recombination that has resulted in comparatively few plant mt-genomes being available. The dearth of plant mitochondrial resources has limited our understanding of mt-genome structural diversity, complex patterns of RNA editing, and the origins of novel mt-genome elements. Here, we use an efficient long read (PacBio) iterative assembly pipeline to generate mt-genome assemblies for Leucaena trichandra (Leguminosae: Caesalpinioideae: mimosoid clade), providing the first assessment of non-papilionoid legume mt-genome content and structure to date. The efficiency of the assembly approach facilitated the exploration of alternative structures that are common place among plant mitochondrial genomes. A compact version (729 kbp) of the recovered assemblies was used to investigate sources of mt-genome size variation among legumes and mt-genome sequence similarity to the legume associated root holoparasite Lophophytum. The genome and an associated suite of transcriptome data from select species of Leucaena permitted an in-depth exploration of RNA editing in a diverse clade of closely related species that includes hybrid lineages. RNA editing in the allotetraploid, Leucaena leucocephala, is consistent with co-option of nearly equal maternal and paternal C-to-U edit components, generating novel combinations of RNA edited sites. A preliminary investigation of L. leucocephala C-to-U edit frequencies identified the potential for a hybrid to generate unique pools of alleles from parental variation through edit frequencies shared with one parental lineage, those intermediate between parents, and transgressive patterns.


September 22, 2019  |  

Repeated inversions within a pannier intron drive diversification of intraspecific colour patterns of ladybird beetles.

How genetic information is modified to generate phenotypic variation within a species is one of the central questions in evolutionary biology. Here we focus on the striking intraspecific diversity of >200 aposematic elytral (forewing) colour patterns of the multicoloured Asian ladybird beetle, Harmonia axyridis, which is regulated by a tightly linked genetic locus h. Our loss-of-function analyses, genetic association studies, de novo genome assemblies, and gene expression data reveal that the GATA transcription factor gene pannier is the major regulatory gene located at the h locus, and suggest that repeated inversions and cis-regulatory modifications at pannier led to the expansion of colour pattern variation in H. axyridis. Moreover, we show that the colour-patterning function of pannier is conserved in the seven-spotted ladybird beetle, Coccinella septempunctata, suggesting that H. axyridis’ extraordinary intraspecific variation may have arisen from ancient modifications in conserved elytral colour-patterning mechanisms in ladybird beetles.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.