Menu
September 22, 2019

Draft genome sequence of Camellia sinensis var. sinensis provides insights into the evolution of the tea genome and tea quality.

Tea, one of the world’s most important beverage crops, provides numerous secondary metabolites that account for its rich taste and health benefits. Here we present a high-quality sequence of the genome of tea, Camellia sinensis var. sinensis (CSS), using both Illumina and PacBio sequencing technologies. At least 64% of the 3.1-Gb genome assembly consists of repetitive sequences, and the rest yields 33,932 high-confidence predictions of encoded proteins. Divergence between two major lineages, CSS and Camellia sinensis var. assamica (CSA), is calculated to ~0.38 to 1.54 million years ago (Mya). Analysis of genic collinearity reveals that the tea genome is the product of two rounds of whole-genome duplications (WGDs) that occurred ~30 to 40 and ~90 to 100 Mya. We provide evidence that these WGD events, and subsequent paralogous duplications, had major impacts on the copy numbers of secondary metabolite genes, particularly genes critical to producing three key quality compounds: catechins, theanine, and caffeine. Analyses of transcriptome and phytochemistry data show that amplification and transcriptional divergence of genes encoding a large acyltransferase family and leucoanthocyanidin reductases are associated with the characteristic young leaf accumulation of monomeric galloylated catechins in tea, while functional divergence of a single member of the glutamine synthetase gene family yielded theanine synthetase. This genome sequence will facilitate understanding of tea genome evolution and tea metabolite pathways, and will promote germplasm utilization for breeding improved tea varieties. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

The Egyptian rousette genome reveals unexpected features of bat antiviral immunity.

Bats harbor many viruses asymptomatically, including several notorious for causing extreme virulence in humans. To identify differences between antiviral mechanisms in humans and bats, we sequenced, assembled, and analyzed the genome of Rousettus aegyptiacus, a natural reservoir of Marburg virus and the only known reservoir for any filovirus. We found an expanded and diversified KLRC/KLRD family of natural killer cell receptors, MHC class I genes, and type I interferons, which dramatically differ from their functional counterparts in other mammals. Such concerted evolution of key components of bat immunity is strongly suggestive of novel modes of antiviral defense. An evaluation of the theoretical function of these genes suggests that an inhibitory immune state may exist in bats. Based on our findings, we hypothesize that tolerance of viral infection, rather than enhanced potency of antiviral defenses, may be a key mechanism by which bats asymptomatically host viruses that are pathogenic in humans. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

Insights into platypus population structure and history from whole-genome sequencing.

The platypus is an egg-laying mammal which, alongside the echidna, occupies a unique place in the mammalian phylogenetic tree. Despite widespread interest in its unusual biology, little is known about its population structure or recent evolutionary history. To provide new insights into the dispersal and demographic history of this iconic species, we sequenced the genomes of 57 platypuses from across the whole species range in eastern mainland Australia and Tasmania. Using a highly improved reference genome, we called over 6.7?M SNPs, providing an informative genetic data set for population analyses. Our results show very strong population structure in the platypus, with our sampling locations corresponding to discrete groupings between which there is no evidence for recent gene flow. Genome-wide data allowed us to establish that 28 of the 57 sampled individuals had at least a third-degree relative among other samples from the same river, often taken at different times. Taking advantage of a sampled family quartet, we estimated the de novo mutation rate in the platypus at 7.0?×?10-9/bp/generation (95% CI 4.1?×?10-9-1.2?×?10-8/bp/generation). We estimated effective population sizes of ancestral populations and haplotype sharing between current groupings, and found evidence for bottlenecks and long-term population decline in multiple regions, and early divergence between populations in different regions. This study demonstrates the power of whole-genome sequencing for studying natural populations of an evolutionarily important species.


September 22, 2019

Early life stages of Northern shrimp (Pandalus borealis) are sensitive to fish feed containing the anti-parasitic drug diflubenzuron.

Increasing use of fish feed containing the chitin synthesis inhibiting anti-parasitic drug diflubenzuron (DFB) in salmon aquaculture has raised concerns over its impact on coastal ecosystems. Larvae of Northern shrimp (Pandalus borealis) were exposed to DFB medicated feed under Control conditions (7.0?°C, pH 8.0) and under Ocean Acidification and Warming conditions (OAW, 9.5?°C and pH 7.6). Two weeks’ exposure to DFB medicated feed caused significantly increased mortality. The effect of OAW and DFB on mortality of shrimp larvae was additive; 10% mortality in Control, 35% in OAW, 66% in DFB and 92% in OAW?+?DFB. In OAW?+?DFB feeding and swimming activity were reduced for stage II larvae and none of the surviving larvae developed to stage IV. Two genes involved in feeding (GAPDH and PRLP) and one gene involved in moulting (DD9B) were significantly downregulated in larvae exposed to OAW?+?DFB relative to the Control. Due to a shorter intermoult period under OAW conditions, the OAW?+?DFB larvae were exposed throughout two instead of one critical pre-moult period. This may explain the more serious sub-lethal effects for OAW?+?DFB than DFB larvae. A single day exposure at 4?days after hatching did not affect DFB larvae, but high mortality was observed for OAW?+?DFB larvae, possibly because they were exposed closer to moulting. High mortality of shrimp larvae exposed to DFB medicated feed, indicates that the use of DFB in salmon aquaculture is a threat to crustacean zooplankton. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.


September 22, 2019

De novo genome assembly of the red silk cotton tree (Bombax ceiba).

Bombax ceiba L. (the red silk cotton tree) is a large deciduous tree that is distributed in tropical and sub-tropical Asia as well as northern Australia. It has great economic and ecological importance, with several applications in industry and traditional medicine in many Asian countries. To facilitate further utilization of this plant resource, we present here the draft genome sequence for B. ceiba.We assembled a relatively intact genome of B. ceiba by using PacBio single-molecule sequencing and BioNano optical mapping technologies. The final draft genome is approximately 895 Mb long, with contig and scaffold N50 sizes of 1.0 Mb and 2.06 Mb, respectively.The high-quality draft genome assembly of B. ceiba will be a valuable resource enabling further genetic improvement and more effective use of this tree species.


September 22, 2019

Double insertion of transposable elements provides a substrate for the evolution of satellite DNA.

Eukaryotic genomes are replete with repeated sequences in the form of transposable elements (TEs) dispersed across the genome or as satellite arrays, large stretches of tandemly repeated sequences. Many satellites clearly originated as TEs, but it is unclear how mobile genetic parasites can transform into megabase-sized tandem arrays. Comprehensive population genomic sampling is needed to determine the frequency and generative mechanisms of tandem TEs, at all stages from their initial formation to their subsequent expansion and maintenance as satellites. The best available population resources, short-read DNA sequences, are often considered to be of limited utility for analyzing repetitive DNA due to the challenge of mapping individual repeats to unique genomic locations. Here we develop a new pipeline called ConTExt that demonstrates that paired-end Illumina data can be successfully leveraged to identify a wide range of structural variation within repetitive sequence, including tandem elements. By analyzing 85 genomes from five populations of Drosophila melanogaster, we discover that TEs commonly form tandem dimers. Our results further suggest that insertion site preference is the major mechanism by which dimers arise and that, consequently, dimers form rapidly during periods of active transposition. This abundance of TE dimers has the potential to provide source material for future expansion into satellite arrays, and we discover one such copy number expansion of the DNA transposon hobo to approximately 16 tandem copies in a single line. The very process that defines TEs-transposition-thus regularly generates sequences from which new satellites can arise.© 2018 McGurk and Barbash; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

NextSV: a meta-caller for structural variants from low-coverage long-read sequencing data.

Structural variants (SVs) in human genomes are implicated in a variety of human diseases. Long-read sequencing delivers much longer read lengths than short-read sequencing and may greatly improve SV detection. However, due to the relatively high cost of long-read sequencing, it is unclear what coverage is needed and how to optimally use the aligners and SV callers.In this study, we developed NextSV, a meta-caller to perform SV calling from low coverage long-read sequencing data. NextSV integrates three aligners and three SV callers and generates two integrated call sets (sensitive/stringent) for different analysis purposes. We evaluated SV calling performance of NextSV under different PacBio coverages on two personal genomes, NA12878 and HX1. Our results showed that, compared with running any single SV caller, NextSV stringent call set had higher precision and balanced accuracy (F1 score) while NextSV sensitive call set had a higher recall. At 10X coverage, the recall of NextSV sensitive call set was 93.5 to 94.1% for deletions and 87.9 to 93.2% for insertions, indicating that ~10X coverage might be an optimal coverage to use in practice, considering the balance between the sequencing costs and the recall rates. We further evaluated the Mendelian errors on an Ashkenazi Jewish trio dataset.Our results provide useful guidelines for SV detection from low coverage whole-genome PacBio data and we expect that NextSV will facilitate the analysis of SVs on long-read sequencing data.


September 22, 2019

Phenotypic diversification by enhanced genome restructuring after induction of multiple DNA double-strand breaks.

DNA double-strand break (DSB)-mediated genome rearrangements are assumed to provide diverse raw genetic materials enabling accelerated adaptive evolution; however, it remains unclear about the consequences of massive simultaneous DSB formation in cells and their resulting phenotypic impact. Here, we establish an artificial genome-restructuring technology by conditionally introducing multiple genomic DSBs in vivo using a temperature-dependent endonuclease TaqI. Application in yeast and Arabidopsis thaliana generates strains with phenotypes, including improved ethanol production from xylose at higher temperature and increased plant biomass, that are stably inherited to offspring after multiple passages. High-throughput genome resequencing revealed that these strains harbor diverse rearrangements, including copy number variations, translocations in retrotransposons, and direct end-joinings at TaqI-cleavage sites. Furthermore, large-scale rearrangements occur frequently in diploid yeasts (28.1%) and tetraploid plants (46.3%), whereas haploid yeasts and diploid plants undergo minimal rearrangement. This genome-restructuring system (TAQing system) will enable rapid genome breeding and aid genome-evolution studies.


September 22, 2019

Novel haloarchaeon Natrinema thermophila having the highest growth temperature among haloarchaea with a large genome size.

Environmental temperature is one of the most important factors for the growth and survival of microorganisms. Here we describe a novel extremely halophilic archaeon (haloarchaea) designated as strain CBA1119T isolated from solar salt. Strain CBA1119T had the highest maximum and optimal growth temperatures (66?°C and 55?°C, respectively) and one of the largest genome sizes among haloarchaea (5.1?Mb). It also had the largest number of strain-specific pan-genome orthologous groups and unique pathways among members of the genus Natrinema in the class Halobacteria. A dendrogram based on the presence/absence of genes and a phylogenetic tree constructed based on OrthoANI values highlighted the particularities of strain CBA1119T as compared to other Natrinema species and other haloarchaea members. The large genome of strain CBA1119T may provide information on genes that confer tolerance to extreme environmental conditions, which may lead to the discovery of other thermophilic strains with potential applications in industrial biotechnology.


September 22, 2019

Genomic analyses of unique carbohydrate and phytohormone metabolism in the macroalga Gracilariopsis lemaneiformis (Rhodophyta).

Red algae are economically valuable for food and in industry. However, their genomic information is limited, and the genomic data of only a few species of red algae have been sequenced and deposited recently. In this study, we annotated a draft genome of the macroalga Gracilariopsis lemaneiformis (Gracilariales, Rhodophyta).The entire 88.98 Mb genome of Gp. lemaneiformis 981 was generated from 13,825 scaffolds (=500 bp) with an N50 length of 30,590 bp, accounting for approximately 91% of this algal genome. A total of 38.73 Mb of scaffold sequences were repetitive, and 9281 protein-coding genes were predicted. A phylogenomic analysis of 20 genomes revealed the relationship among the Chromalveolata, Rhodophyta, Chlorophyta and higher plants. Homology analysis indicated phylogenetic proximity between Gp. lemaneiformis and Chondrus crispus. The number of enzymes related to the metabolism of carbohydrates, including agar, glycoside hydrolases, glycosyltransferases, was abundant. In addition, signaling pathways associated with phytohormones such as auxin, salicylic acid and jasmonates are reported for the first time for this alga.We sequenced and analyzed a draft genome of the red alga Gp. lemaneiformis, and revealed its carbohydrate metabolism and phytohormone signaling characteristics. This work will be helpful in research on the functional and comparative genomics of the order Gracilariales and will enrich the genomic information on marine algae.


September 22, 2019

A whole genome assembly of the horn fly, Haematobia irritans, and prediction of genes with roles in metabolism and sex determination.

Haematobia irritans, commonly known as the horn fly, is a globally distributed blood-feeding pest of cattle that is responsible for significant economic losses to cattle producers. Chemical insecticides are the primary means for controlling this pest but problems with insecticide resistance have become common in the horn fly. To provide a foundation for identification of genomic loci for insecticide resistance and for discovery of new control technology, we report the sequencing, assembly, and annotation of the horn fly genome. The assembled genome is 1.14 Gb, comprising 76,616 scaffolds with N50 scaffold length of 23 Kb. Using RNA-Seq data, we have predicted 34,413 gene models of which 19,185 have been assigned functional annotations. Comparative genomics analysis with the Dipteran flies Musca domestica L., Drosophila melanogaster, and Lucilia cuprina, show that the horn fly is most closely related to M. domestica, sharing 8,748 orthologous clusters followed by D. melanogaster and L. cuprina, sharing 7,582 and 7,490 orthologous clusters respectively. We also identified a gene locus for the sodium channel protein in which mutations have been previously reported that confers target site resistance to the most common class of pesticides used in fly control. Additionally, we identified 276 genomic loci encoding members of metabolic enzyme gene families such as cytochrome P450s, esterases and glutathione S-transferases, and several genes orthologous to sex determination pathway genes in other Dipteran species. Copyright © 2018 Konganti et al.


September 22, 2019

Programmed DNA elimination: Keeping germline genes in their place.

Each of our cells contains a full set of instructions needed to make an entire human: the genome. But a few special species buck this trend. A new study now identifies the first germline-specific gene in zebra finch, one of a small number of vertebrates that are known to undergo developmentally programmed DNA elimination. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

Genome-wide analysis of the NAC transcription factor family and their expression during the development and ripening of the Fragaria × ananassa fruits.

NAC proteins are a family of transcription factors which have a variety of important regulatory roles in plants. They present a very well conserved group of NAC subdomains in the N-terminal region and a highly variable domain at the C-terminus. Currently, knowledge concerning NAC family in the strawberry plant remains very limited. In this work, we analyzed the NAC family of Fragaria vesca, and a total of 112 NAC proteins were identified after we curated the annotations from the version 4.0.a1 genome. They were placed into the ligation groups (pseudo-chromosomes) and described its physicochemical and genetic features. A microarray transcriptomic analysis showed six of them expressed during the development and ripening of the Fragaria x ananassa fruit. Their expression patterns were studied in fruit (receptacle and achenes) in different stages of development and in vegetative tissues. Also, the expression level under different hormonal treatments (auxins, ABA) and drought stress was investigated. In addition, they were clustered with other NAC transcription factor with known function related to growth and development, senescence, fruit ripening, stress response, and secondary cell wall and vascular development. Our results indicate that these six strawberry NAC proteins could play different important regulatory roles in the process of development and ripening of the fruit, providing the basis for further functional studies and the selection for NAC candidates suitable for biotechnological applications.


September 22, 2019

Inpactor, integrated and parallel analyzer and classifier of LTR retrotransposons and its application for pineapple LTR retrotransposons diversity and dynamics.

One particular class of Transposable Elements (TEs), called Long Terminal Repeats (LTRs), retrotransposons, comprises the most abundant mobile elements in plant genomes. Their copy number can vary from several hundreds to up to a few million copies per genome, deeply affecting genome organization and function. The detailed classification of LTR retrotransposons is an essential step to precisely understand their effect at the genome level, but remains challenging in large-sized genomes, requiring the use of optimized bioinformatics tools that can take advantage of supercomputers. Here, we propose a new tool: Inpactor, a parallel and scalable pipeline designed to classify LTR retrotransposons, to identify autonomous and non-autonomous elements, to perform RT-based phylogenetic trees and to analyze their insertion times using High Performance Computing (HPC) techniques. Inpactor was tested on the classification and annotation of LTR retrotransposons in pineapple, a recently-sequenced genome. The pineapple genome assembly comprises 44% of transposable elements, of which 23% were classified as LTR retrotransposons. Exceptionally, 16.4% of the pineapple genome assembly corresponded to only one lineage of the Gypsy superfamily: Del, suggesting that this particular lineage has undergone a significant increase in its copy numbers. As demonstrated for the pineapple genome, Inpactor provides comprehensive data of LTR retrotransposons’ classification and dynamics, allowing a fine understanding of their contribution to genome structure and evolution. Inpactor is available at https://github.com/simonorozcoarias/Inpactor.


September 22, 2019

A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.

Transposable elements (TEs) are mobile DNA sequences known as drivers of genome evolution. Their impacts have been widely studied in animals, plants and insects, but little is known about them in microalgae. In a previous study, we compared the genetic polymorphisms between strains of the haptophyte microalga Tisochrysis lutea and suggested the involvement of active autonomous TEs in their genome evolution.To identify potentially autonomous TEs, we designed a pipeline named PiRATE (Pipeline to Retrieve and Annotate Transposable Elements, download: https://doi.org/10.17882/51795 ), and conducted an accurate TE annotation on a new genome assembly of T. lutea. PiRATE is composed of detection, classification and annotation steps. Its detection step combines multiple, existing analysis packages representing all major approaches for TE detection and its classification step was optimized for microalgal genomes. The efficiency of the detection and classification steps was evaluated with data on the model species Arabidopsis thaliana. PiRATE detected 81% of the TE families of A. thaliana and correctly classified 75% of them. We applied PiRATE to T. lutea genomic data and established that its genome contains 15.89% Class I and 4.95% Class II TEs. In these, 3.79 and 17.05% correspond to potentially autonomous and non-autonomous TEs, respectively. Annotation data was combined with transcriptomic and proteomic data to identify potentially active autonomous TEs. We identified 17 expressed TE families and, among these, a TIR/Mariner and a TIR/hAT family were able to synthesize their transposase. Both these TE families were among the three highest expressed genes in a previous transcriptomic study and are composed of highly similar copies throughout the genome of T. lutea. This sum of evidence reveals that both these TE families could be capable of transposing or triggering the transposition of potential related MITE elements.This manuscript provides an example of a de novo transposable element annotation of a non-model organism characterized by a fragmented genome assembly and belonging to a poorly studied phylum at genomic level. Integration of multi-omics data enabled the discovery of potential mobile TEs and opens the way for new discoveries on the role of these repeated elements in genomic evolution of microalgae.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.