Menu
September 22, 2019  |  

A draft genome assembly of the Chinese sillago (Sillago sinica), the first reference genome for Sillaginidae fishes.

Sillaginidae, also known as smelt-whitings, is a family of benthic coastal marine fishes in the Indo-West Pacific that have high ecological and economic importance. Many Sillaginidae species, including the Chinese sillago (Sillago sinica), have been recently described in China, providing valuable material to analyze genetic diversification of the family Sillaginidae. Here, we constructed a reference genome for the Chinese sillago, with the aim to set up a platform for comparative analysis of all species in this family.Using the single-molecule real-time DNA sequencing platform Pacific Biosciences (PacBio) Sequel, we generated ~27.3 Gb genomic DNA sequences for the Chinese sillago. We reconstructed a genome assembly of 534 Mb using a strategy that takes advantage of complementary strengths of two genome assembly programs, Canu and FALCON. The genome size was consistent with the estimated genome size based on k-mer analysis. The assembled genome consisted of 802 contigs with a contig N50 length of 2.6 Mb. We annotated 22,122 protein-coding genes in the Chinese sillago genomes using a de novo method as well as RNA sequencing data and homologies to other teleosts. According to the phylogenetic analysis using protein-coding genes, the Chinese sillago is closely related to Larimichthys crocea and Dicentrarchus labrax and diverged from their ancestor around 69.5-82.6 million years ago.Using long reads generated with PacBio sequencing technology, we have built a draft genome assembly for the Chinese sillago, which is the first reference genome for Sillaginidae species. This genome assembly sets a stage for comparative analysis of the diversification and adaptation of fishes in Sillaginidae.


September 22, 2019  |  

B chromosomes of the Asian seabass (Lates calcarifer) contribute to genome variations at the level of individuals and populations.

The Asian seabass (Lates calcarifer) is a bony fish from the Latidae family, which is widely distributed in the tropical Indo-West Pacific region. The karyotype of the Asian seabass contains 24 pairs of A chromosomes and a variable number of AT- and GC-rich B chromosomes (Bchrs or Bs). Dot-like shaped and nucleolus-associated AT-rich Bs were microdissected and sequenced earlier. Here we analyzed DNA fragments from Bs to determine their repeat and gene contents using the Asian seabass genome as a reference. Fragments of 75 genes, including an 18S rRNA gene, were found in the Bs; repeats represented 2% of the Bchr assembly. The 18S rDNA of the standard genome and Bs were similar and enriched with fragments of transposable elements. A higher nuclei DNA content in the male gonad and somatic tissue, compared to the female gonad, was demonstrated by flow cytometry. This variation in DNA content could be associated with the intra-individual variation in the number of Bs. A comparison between the copy number variation among the B-related fragments from whole genome resequencing data of Asian seabass individuals identified similar profiles between those from the South-East Asian/Philippines and Indian region but not the Australian ones. Our results suggest that Bs might cause variations in the genome among the individuals and populations of Asian seabass. A personalized copy number approach for segmental duplication detection offers a suitable tool for population-level analysis across specimens with low coverage genome sequencing.


September 22, 2019  |  

A PECTIN METHYLESTERASE gene at the maize Ga1 locus confers male function in unilateral cross-incompatibility.

Unilateral cross-incompatibility (UCI) is a unidirectional inter/intra-population reproductive barrier when both parents are self-compatible. Maize Gametophyte factor1 (Ga1) is an intraspecific UCI system and has been utilized in breeding. However, the mechanism underlying maize UCI specificity has remained mysterious for decades. Here, we report the cloning of ZmGa1P, a pollen-expressed PECTIN METHYLESTERASE (PME) gene at the Ga1 locus that can confer the male function in the maize UCI system. Homozygous transgenic plants expressing ZmGa1P in a ga1 background can fertilize Ga1-S plants and can be fertilized by pollen of ga1 plants. ZmGa1P protein is predominantly localized to the apex of growing pollen tubes and may interact with another pollen-specific PME protein, ZmPME10-1, to maintain the state of pectin methylesterification required for pollen tube growth in Ga1-S silks. Our study discloses a PME-mediated UCI mechanism and provides a tool to manipulate hybrid breeding.


September 22, 2019  |  

The Arctic charr (Salvelinus alpinus) genome and transcriptome assembly.

Arctic charr have a circumpolar distribution, persevere under extreme environmental conditions, and reach ages unknown to most other salmonids. The Salvelinus genus is primarily composed of species with genomes that are structured more like the ancestral salmonid genome than most Oncorhynchus and Salmo species of sister genera. It is thought that this aspect of the genome may be important for local adaptation (due to increased recombination) and anadromy (the migration of fish from saltwater to freshwater). In this study, we describe the generation of a new genetic map, the sequencing and assembly of the Arctic charr genome (GenBank accession: GCF_002910315.2) using the newly created genetic map and a previous genetic map, and present several analyses of the Arctic charr genes and genome assembly. The newly generated genetic map consists of 8,574 unique genetic markers and is similar to previous genetic maps with the exception of three major structural differences. The N50, identified BUSCOs, repetitive DNA content, and total size of the Arctic charr assembled genome are all comparable to other assembled salmonid genomes. An analysis to identify orthologous genes revealed that a large number of orthologs could be identified between salmonids and many appear to have highly conserved gene expression profiles between species. Comparing orthologous gene expression profiles may give us a better insight into which genes are more likely to influence species specific phenotypes.


September 22, 2019  |  

De novo assembly, delivery and expression of a 101 kb human gene in mouse cells

Design and large-scale synthesis of DNA has been applied to the functional study of viral and microbial genomes. New and expanded technology development is required to unlock the transformative potential of such bottom-up approaches to the study of larger, mammalian genomes. Two major challenges include assembling and delivering long DNA sequences. Here we describe a pipeline for de novo DNA assembly and delivery that enables functional evaluation of mammalian genes on the length scale of 100 kb. The DNA assembly step is supported by an integrated robotic workcell. We assemble the 101 kb human HPRT1 gene in yeast, deliver it to mouse cells, and show expression of the human protein from its full-length gene. This pipeline provides a framework for producing systematic, designer variants of any mammalian gene locus for functional evaluation in cells.


September 22, 2019  |  

Extraordinary genome instability and widespread chromosome rearrangements during vegetative growth

The haploid genome of the pathogenic fungus Zymoseptoria tritici is contained on “core” and “accessory” chromosomes. While 13 core chromosomes are found in all strains, as many as eight accessory chromosomes show presence/absence variation and rearrangements among field isolates. The factors influencing these presence/absence polymorphisms are so far unknown. We investigated chromosome stability using experimental evolution, karyotyping, and genome sequencing. We report extremely high and variable rates of accessory chromosome loss during mitotic propagation in vitro and in planta Spontaneous chromosome loss was observed in 2 to >50% of cells during 4 weeks of incubation. Similar rates of chromosome loss in the closely related Zymoseptoria ardabiliae suggest that this extreme chromosome dynamic is a conserved phenomenon in the genus. Elevating the incubation temperature greatly increases instability of accessory and even core chromosomes, causing severe rearrangements involving telomere fusion and chromosome breakage. Chromosome losses do not affect the fitness of Zymoseptoria tritici in vitro, but some lead to increased virulence, suggesting an adaptive role of this extraordinary chromosome instability. Copyright © 2018 by the Genetics Society of America.


September 22, 2019  |  

The genome of tapeworm Taenia multiceps sheds light on understanding parasitic mechanism and control of coenurosis disease.

Coenurosis, caused by the larval coenurus of the tapeworm Taenia multiceps, is a fatal central nervous system disease in both sheep and humans. Though treatment and prevention options are available, the control of coenurosis still faces presents great challenges. Here, we present a high-quality genome sequence of T. multiceps in which 240 Mb (96%) of the genome has been successfully assembled using Pacbio single-molecule real-time (SMRT) and Hi-C data with a N50 length of 44.8 Mb. In total, 49.5 Mb (20.6%) repeat sequences and 13, 013 gene models were identified. We found that Taenia spp. have an expansion of transposable elements and recent small-scale gene duplications following the divergence of Taenia from Echinococcus, but not in Echinococcus genomes, and the genes underlying environmental adaptability and dosage effect tend to be over-retained in the T. multiceps genome. Moreover, we identified several genes encoding proteins involved in proglottid formation and interactions with the host central nervous system, which may contribute to the adaption of T. multiceps to its parasitic life style. Our study not only provides insights into the biology and evolution of T. multiceps, but also identifies a set of species-specific gene targets for developing novel treatment and control tools for coenurosis.


September 22, 2019  |  

Convergent evolution of complex genomic rearrangements in two fungal meiotic drive elements.

Meiotic drive is widespread in nature. The conflict it generates is expected to be an important motor for evolutionary change and innovation. In this study, we investigated the genomic consequences of two large multi-gene meiotic drive elements, Sk-2 and Sk-3, found in the filamentous ascomycete Neurospora intermedia. Using long-read sequencing, we generated the first complete and well-annotated genome assemblies of large, highly diverged, non-recombining regions associated with meiotic drive elements. Phylogenetic analysis shows that, even though Sk-2 and Sk-3 are located in the same chromosomal region, they do not form sister clades, suggesting independent origins or at least a long evolutionary separation. We conclude that they have in a convergent manner accumulated similar patterns of tandem inversions and dense repeat clusters, presumably in response to similar needs to create linkage between genes causing drive and resistance.


September 22, 2019  |  

Assembling the genome of the African wild rice Oryza longistaminata by exploiting synteny in closely related Oryza species.

The African wild rice species Oryza longistaminata has several beneficial traits compared to cultivated rice species, such as resistance to biotic stresses, clonal propagation via rhizomes, and increased biomass production. To facilitate breeding efforts and functional genomics studies, we de-novo assembled a high-quality, haploid-phased genome. Here, we present our assembly, with a total length of 351?Mb, of which 92.2% was anchored onto 12 chromosomes. We detected 34,389 genes and 38.1% of the genome consisted of repetitive content. We validated our assembly by a comparative linkage analysis and by examining well-characterized gene families. This genome assembly will be a useful resource to exploit beneficial alleles found in O. longistaminata. Our results also show that it is possible to generate a high-quality, functionally complete rice genome assembly from moderate SMRT read coverage by exploiting synteny in a closely related Oryza species.


September 22, 2019  |  

Recovery of novel association loci in Arabidopsis thaliana and Drosophila melanogaster through leveraging INDELs association and integrated burden test.

Short insertions, deletions (INDELs) and larger structural variants have been increasingly employed in genetic association studies, but few improvements over SNP-based association have been reported. In order to understand why this might be the case, we analysed two publicly available datasets and observed that 63% of INDELs called in A. thaliana and 64% in D. melanogaster populations are misrepresented as multiple alleles with different functional annotations, i.e. where the same underlying variant is represented by inconsistent alignments leading to different variant calls. To address this issue, we have developed the software Irisas to reclassify and re-annotate these variants, which we then used for single-locus tests of association. We also integrated them to predict the functional impact of SNPs, INDELs, and structural variants for burden testing. Using both approaches, we re-analysed the genetic architecture of complex traits in A. thaliana and D. melanogaster. Heritability analysis using SNPs alone explained on average 27% and 19% of phenotypic variance for A. thaliana and D. melanogaster respectively. Our method explained an additional 11% and 3%, respectively. We also identified novel trait loci that previous SNP-based association studies failed to map, and which contain established candidate genes. Our study shows the value of the association test with INDELs and integrating multiple types of variants in association studies in plants and animals.


September 22, 2019  |  

Repeat elements organise 3D genome structure and mediate transcription in the filamentous fungus Epichloë festucae.

Structural features of genomes, including the three-dimensional arrangement of DNA in the nucleus, are increasingly seen as key contributors to the regulation of gene expression. However, studies on how genome structure and nuclear organisation influence transcription have so far been limited to a handful of model species. This narrow focus limits our ability to draw general conclusions about the ways in which three-dimensional structures are encoded, and to integrate information from three-dimensional data to address a broader gamut of biological questions. Here, we generate a complete and gapless genome sequence for the filamentous fungus, Epichloë festucae. We use Hi-C data to examine the three-dimensional organisation of the genome, and RNA-seq data to investigate how Epichloë genome structure contributes to the suite of transcriptional changes needed to maintain symbiotic relationships with the grass host. Our results reveal a genome in which very repeat-rich blocks of DNA with discrete boundaries are interspersed by gene-rich sequences that are almost repeat-free. In contrast to other species reported to date, the three-dimensional structure of the genome is anchored by these repeat blocks, which act to isolate transcription in neighbouring gene-rich regions. Genes that are differentially expressed in planta are enriched near the boundaries of these repeat-rich blocks, suggesting that their three-dimensional orientation partly encodes and regulates the symbiotic relationship formed by this organism.


September 22, 2019  |  

A complete Cannabis chromosome assembly and adaptive admixture for elevated cannabidiol (CBD) content

Cannabis has been cultivated for millennia with distinct cultivars providing either fiber and grain or tetrahydrocannabinol. Recent demand for cannabidiol rather than tetrahydrocannabinol has favored the breeding of admixed cultivars with extremely high cannabidiol content. Despite several draft Cannabis genomes, the genomic structure of cannabinoid synthase loci has remained elusive. A genetic map derived from a tetrahydrocannabinol/cannabidiol segregating population and a complete chromosome assembly from a high-cannabidiol cultivar together resolve the linkage of cannabidiolic and tetrahydrocannabinolic acid synthase gene clusters which are associated with transposable elements. High-cannabidiol cultivars appear to have been generated by integrating hemp-type cannabidiolic acid synthase gene clusters into a background of marijuana-type cannabis. Quantitative trait locus mapping suggests that overall drug potency, however, is associated with other genomic regions needing additional study.


September 22, 2019  |  

Physiological genomics of dietary adaptation in a marine herbivorous fish

Adopting a new diet is a significant evolutionary change and can profoundly affect an animaltextquoterights physiology, biochemistry, ecology, and its genome. To study this evolutionary transition, we investigated the physiology and genomics of digestion of a derived herbivorous fish, the monkeyface prickleback (Cebidichthys violaceus). We sequenced and assembled its genome and digestive transcriptome and revealed the molecular changes related to important dietary enzymes, finding abundant evidence for adaptation at the molecular level. In this species, two gene families experienced expansion in copy number and adaptive amino acid substitutions. These families, amylase, and bile salt activated lipase, are involved digestion of carbohydrates and lipids, respectively. Both show elevated levels of gene expression and increased enzyme activity. Because carbohydrates are abundant in the pricklebacktextquoterights diet and lipids are rare, these findings suggest that such dietary specialization involves both exploiting abundant resources and scavenging rare ones, especially essential nutrients, like essential fatty acids.


September 22, 2019  |  

How complete are “complete” genome assemblies?-An avian perspective.

The genomics revolution has led to the sequencing of a large variety of nonmodel organisms often referred to as “whole” or “complete” genome assemblies. But how complete are these, really? Here, we use birds as an example for nonmodel vertebrates and find that, although suitable in principle for genomic studies, the current standard of short-read assemblies misses a significant proportion of the expected genome size (7% to 42%; mean 20 ± 9%). In particular, regions with strongly deviating nucleotide composition (e.g., guanine-cytosine-[GC]-rich) and regions highly enriched in repetitive DNA (e.g., transposable elements and satellite DNA) are usually underrepresented in assemblies. However, long-read sequencing technologies successfully characterize many of these underrepresented GC-rich or repeat-rich regions in several bird genomes. For instance, only ~2% of the expected total base pairs are missing in the last chicken reference (galGal5). These assemblies still contain thousands of gaps (i.e., fragmented sequences) because some chromosomal structures (e.g., centromeres) likely contain arrays of repetitive DNA that are too long to bridge with currently available technologies. We discuss how to minimize the number of assembly gaps by combining the latest available technologies with complementary strengths. At last, we emphasize the importance of knowing the location, size and potential content of assembly gaps when making population genetic inferences about adjacent genomic regions.© 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.


September 22, 2019  |  

Genomic characterization reveals significant divergence within Chlorella sorokiniana (Chlorellales, Trebouxiophyceae)

Selection of highly productive algal strains is crucial for establishing economically viable biomass and biopro- duct cultivation systems. Characterization of algal genomes, including understanding strain-specific differences in genome content and architecture is a critical step in this process. Using genomic analyses, we demonstrate significant differences between three strains of Chlorella sorokiniana (strain 1228, UTEX 1230, and DOE1412). We found that unique, strain-specific genes comprise a substantial proportion of each genome, and genomic regions with> 80% local nucleotide identity constitute <15% of each genome among the strains, indicating substantial strain specific evolution. Furthermore, cataloging of meiosis and other sex-related genes in C. sor- okiniana strains suggests strategic breeding could be utilized to improve biomass and bioproduct yields if a sexual cycle can be characterized. Finally, preliminary investigation of epigenetic machinery suggests the pre- sence of potentially unique transcriptional regulation in each strain. Our data demonstrate that these three C. sorokiniana strains represent significantly different genomic content. Based on these findings, we propose in- dividualized assessment of each strain for potential performance in cultivation systems.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.