Menu
July 7, 2019

Genomic insights into date palm origins.

With the development of next-generation sequencing technology, the amount of date palm (Phoenix dactylifera L.) genomic data has grown rapidly and yielded new insights into this species and its origins. Here, we review advances in understanding of the evolutionary history of the date palm, with a particular emphasis on what has been learned from the analysis of genomic data. We first record current genomic resources available for date palm including genome assemblies and resequencing data. We discuss new insights into its domestication and diversification history based on these improved genomic resources. We further report recent discoveries such as the existence of wild ancestral populations in remote locations of Oman and high differentiation between African and Middle Eastern populations. While genomic data are consistent with the view that domestication took place in the Gulf region, they suggest that the process was more complex involving multiple gene pools and possibly a secondary domestication. Many questions remain unanswered, especially regarding the genetic architecture of domestication and diversification. We provide a road map to future studies that will further clarify the domestication history of this iconic crop.


July 7, 2019

Picky comprehensively detects high-resolution structural variants in nanopore long reads.

Acquired genomic structural variants (SVs) are major hallmarks of cancer genomes, but they are challenging to reconstruct from short-read sequencing data. Here we exploited the long reads of the nanopore platform using our customized pipeline, Picky ( https://github.com/TheJacksonLaboratory/Picky ), to reveal SVs of diverse architecture in a breast cancer model. We identified the full spectrum of SVs with superior specificity and sensitivity relative to short-read analyses, and uncovered repetitive DNA as the major source of variation. Examination of genome-wide breakpoints at nucleotide resolution uncovered micro-insertions as the common structural features associated with SVs. Breakpoint density across the genome is associated with the propensity for interchromosomal connectivity and was found to be enriched in promoters and transcribed regions of the genome. Furthermore, we observed an over-representation of reciprocal translocations from chromosomal double-crossovers through phased SVs. We demonstrate that Picky analysis is an effective tool for comprehensive detection of SVs in cancer genomes from long-read data.


July 7, 2019

An investigation of Y chromosome incorporations in 400 species of Drosophila and related genera.

Y chromosomes are widely believed to evolve from a normal autosome through a process of massive gene loss (with preservation of some male genes), shaped by sex-antagonistic selection and complemented by occasional gains of male-related genes. The net result of these processes is a male-specialized chromosome. This might be expected to be an irreversible process, but it was found in 2005 that the Drosophila pseudoobscura Y chromosome was incorporated into an autosome. Y chromosome incorporations have important consequences: a formerly male-restricted chromosome reverts to autosomal inheritance, and the species may shift from an XY/XX to X0/XX sex-chromosome system. In order to assess the frequency and causes of this phenomenon we searched for Y chromosome incorporations in 400 species from Drosophila and related genera. We found one additional large scale event of Y chromosome incorporation, affecting the whole montium subgroup (40 species in our sample); overall 13% of the sampled species (52/400) have Y incorporations. While previous data indicated that after the Y incorporation the ancestral Y disappeared as a free chromosome, the much larger data set analyzed here indicates that a copy of the Y survived as a free chromosome both in montium and pseudoobscura species, and that the current Y of the pseudoobscura lineage results from a fusion between this free Y and the neoY. The 400 species sample also showed that the previously suggested causal connection between X-autosome fusions and Y incorporations is, at best, weak: the new case of Y incorporation (montium) does not have X-autosome fusion, whereas nine independent cases of X-autosome fusions were not followed by Y incorporations. Y incorporation is an underappreciated mechanism affecting Y chromosome evolution; our results show that at least in Drosophila it plays a relevant role and highlight the need of similar studies in other groups.


July 7, 2019

Genome-wide characterization and phylogenetic analysis of GSK gene family in three species of cotton: evidence for a role of some GSKs in fiber development and responses to stress

Background: The glycogen synthase kinase 3/shaggy kinase (GSK3) is a serine/threonine kinase with important roles in animals. Although GSK3 genes have been studied for more than 30years, plant GSK genes have been studied only since the last decade. Previous research has confirmed that plant GSK genes are involved in diverse processes, including floral development, brassinosteroid signaling, and responses to abiotic stresses. Result: In this study, 20, 15 (including 5 different transcripts) and 10 GSK genes were identified in G. hirsutum, G. raimondii and G. arboreum, respectively. A total of 65 genes from Arabidopsis, rice, and cotton were classified into 4 clades. High similarities were found in GSK3 protein sequences, conserved motifs, and gene structures, as well as good concordance in gene pairwise comparisons (G. hirsutum vs. G. arboreum, G. hirsutum vs. G. raimondii, and G. arboreum vs. G. raimondii) were observed. Whole genome duplication (WGD) within At and Dt sub-genomes has been central to the expansion of the GSK gene family. Furthermore, GhSK genes showed diverse expression patterns in various tissues. Additionally, the expression profiles of GhSKs under different stress treatments demonstrated that many are stress-responsive genes. However, none were induced by brassinolide treatment. Finally, nine co-expression sub- networks were observed for GhSKs and the functional annotations of these genes suggested that some GhSKs might be involved in cotton fiber development. Conclusion: In this present work, we identified 45 GSK genes from three cotton species, which were divided into four clades. The gene features, muti-alignment, conversed motifs, and syntenic blocks indicate that they have been highly conserved during evolution. Whole genome duplication was determined to be the dominant factor for GSK gene family expansion. The analysis of co-expressed sub-networks and tissue-specific expression profiles suggested functions of GhSKs during fiber development. Moreover, their different responses to various abiotic stresses indicated great functional diversity amongst the GhSKs. Briefly, data presented herein may serve as the basis for future functional studies of GhSKs.


July 7, 2019

Overview of the germline and expressed repertoires of the TRB genes in Sus scrofa.

The a/ß T cell receptor (TR) is a complex heterodimer that recognizes antigenic peptides and binds to major histocompatibility complex (MH) molecules. Both a and ß chains are encoded by different genes localized on two distinct chromosomal loci: TRA and TRB. The present study employed the recent release of the swine genome assembly to define the genomic organization of the TRB locus. According to the sequencing data, the pig TRB locus spans approximately 400 kb of genomic DNA and consists of 38 TRBV genes belonging to 24 subgroups located upstream of three in tandem TRBD-J-C clusters, which are followed by a TRBV gene in an inverted transcriptional orientation. Comparative analysis confirms that the general organization of the TRB locus is similar among mammalian species, but the number of germline TRBV genes varies greatly even between species belonging to the same order, determining the diversity and specificity of the immune response. However, sequence analysis of the TRB locus also suggests the presence of blocks of conserved homology in the genomic region across mammals. Furthermore, by analysing a public cDNA collection, we identified the usage pattern of the TRBV, TRBD, and TRBJ genes in the adult pig TRB repertoire, and we noted that the expressed TRBV repertoire seems to be broader and more diverse than the germline repertoire, in line with the presence of a high level of TRBV gene polymorphisms. Because the nucleotide differences seems to be principally concentrated in the CDR2 region, it is reasonable to presume that most T cell ß-chain diversity can be related to polymorphisms in pig MH molecules. Domestic pigs represent a valuable animal model as they are even more anatomically, genetically and physiologically similar to humans than are mice. Therefore, present knowledge on the genomic organization of the pig TRB locus allows the collection of increased information on the basic aspects of the porcine immune system and contributes to filling the gaps left by rodent models.


July 7, 2019

Signatures of selection and environmental adaptation across the goat genome post-domestication.

Since goat was domesticated 10,000 years ago, many factors have contributed to the differentiation of goat breeds and these are classified mainly into two types: (i) adaptation to different breeding systems and/or purposes and (ii) adaptation to different environments. As a result, approximately 600 goat breeds have developed worldwide; they differ considerably from one another in terms of phenotypic characteristics and are adapted to a wide range of climatic conditions. In this work, we analyzed the AdaptMap goat dataset, which is composed of data from more than 3000 animals collected worldwide and genotyped with the CaprineSNP50 BeadChip. These animals were partitioned into groups based on geographical area, production uses, available records on solid coat color and environmental variables including the sampling geographical coordinates, to investigate the role of natural and/or artificial selection in shaping the genome of goat breeds.Several signatures of selection on different chromosomal regions were detected across the different breeds, sub-geographical clusters, phenotypic and climatic groups. These regions contain genes that are involved in important biological processes, such as milk-, meat- or fiber-related production, coat color, glucose pathway, oxidative stress response, size, and circadian clock differences. Our results confirm previous findings in other species on adaptation to extreme environments and human purposes and provide new genes that could explain some of the differences between goat breeds according to their geographical distribution and adaptation to different environments.These analyses of signatures of selection provide a comprehensive first picture of the global domestication process and adaptation of goat breeds and highlight possible genes that may have contributed to the differentiation of this species worldwide.


July 7, 2019

Synaptogyrin-2 influences replication of Porcine circovirus 2.

Porcine circovirus 2 (PCV2) is a circular single-stranded DNA virus responsible for a group of diseases collectively known as PCV2 Associated Diseases (PCVAD). Variation in the incidence and severity of PCVAD exists between pigs suggesting a host genetic component involved in pathogenesis. A large-scale genome-wide association study of experimentally infected pigs (n = 974), provided evidence of a host genetic role in PCV2 viremia, immune response and growth during challenge. Host genotype explained 64% of the phenotypic variation for overall viral load, with two major Quantitative Trait Loci (QTL) identified on chromosome 7 (SSC7) near the swine leukocyte antigen complex class II locus and on the proximal end of chromosome 12 (SSC12). The SNP having the strongest association, ALGA0110477 (SSC12), explained 9.3% of the genetic and 6.2% of the phenotypic variance for viral load. Dissection of the SSC12 QTL based on gene annotation, genomic and RNA-sequencing, suggested that a missense mutation in the SYNGR2 (SYNGR2 p.Arg63Cys) gene is potentially responsible for the variation in viremia. This polymorphism, located within a protein domain conserved across mammals, results in an amino acid variant SYNGR2 p.63Cys only observed in swine. PCV2 titer in PK15 cells decreased when the expression of SYNGR2 was silenced by specific-siRNA, indicating a role of SYNGR2 in viral replication. Additionally, a PK15 edited clone generated by CRISPR-Cas9, carrying a partial deletion of the second exon that harbors a key domain and the SYNGR2 p.Arg63Cys, was associated with a lower viral titer compared to wildtype PK15 cells (>24 hpi) and supernatant (>48hpi)(P < 0.05). Identification of a non-conservative substitution in this key domain of SYNGR2 suggests that the SYNGR2 p.Arg63Cys variant may underlie the observed genetic effect on viral load.


July 7, 2019

Pilot satellitome analysis of the model plant, Physcomitrellapatens, revealed a transcribed and high-copy IGS related tandem repeat.

Satellite DNA (satDNA) constitutes a substantial part of eukaryotic genomes. In the last decade, it has been shown that satDNA is not an inert part of the genome and its function extends beyond the nuclear membrane. However, the number of model plant species suitable for studying the novel horizons of satDNA functionality is low. Here, we explored the satellitome of the model “basal” plant, Physcomitrellapatens (Hedwig, 1801) Bruch & Schimper, 1849 (moss), which has a number of advantages for deep functional and evolutionary research. Using a newly developed pyTanFinder pipeline (https://github.com/Kirovez/pyTanFinder) coupled with fluorescence in situ hybridization (FISH), we identified five high copy number tandem repeats (TRs) occupying a long DNA array in the moss genome. The nuclear organization study revealed that two TRs had distinct locations in the moss genome, concentrating in the heterochromatin and knob-rDNA like chromatin bodies. Further genomic, epigenetic and transcriptomic analysis showed that one TR, named PpNATR76, was located in the intergenic spacer (IGS) region and transcribed into long non-coding RNAs (lncRNAs). Several specific features of PpNATR76 lncRNAs make them very similar with the recently discovered human lncRNAs, raising a number of questions for future studies. This work provides new resources for functional studies of satellitome in plants using the model organism P.patens, and describes a list of tandem repeats for further analysis.


July 7, 2019

iMGEins: detecting novel mobile genetic elements inserted in individual genomes.

Recent advances in sequencing technology have allowed us to investigate personal genomes to find structural variations, which have been studied extensively to identify their association with the physiology of diseases such as cancer. In particular, mobile genetic elements (MGEs) are one of the major constituents of the human genomes, and cause genome instability by insertion, mutation, and rearrangement.We have developed a new program, iMGEins, to identify such novel MGEs by using sequencing reads of individual genomes, and to explore the breakpoints with the supporting reads and MGEs detected. iMGEins is the first MGE detection program that integrates three algorithmic components: discordant read-pair mapping, split-read mapping, and insertion sequence assembly. Our evaluation results showed its outstanding performance in detecting novel MGEs from simulated genomes, as well as real personal genomes. In detail, the average recall and precision rates of iMGEins are 96.67 and 100%, respectively, which are the highest among the programs compared. In the testing with real human genomes of the NA12878 sample, iMGEins shows the highest accuracy in detecting MGEs within 20?bp proximity of the breakpoints annotated.In order to study the dynamics of MGEs in individual genomes, iMGEins was developed to accurately detect breakpoints and report inserted MGEs. Compared with other programs, iMGEins has valuable features of identifying novel MGEs and assembling the MGEs inserted.


July 7, 2019

Bridging gaps in transposable element research with single-molecule and single-cell technologies

More than half of the genomic landscape in humans and many other organisms is composed of repetitive DNA, which mostly derives from transposable elements (TEs) and viruses. Recent technological advances permit improved assessment of the repetitive content across genomes and newly developed molecular assays have revealed important roles of TEs and viruses in host genome evolution and organization. To update on our current understanding of TE biology and to promote new interdisciplinary strategies for the TE research community, leading experts gathered for the 2nd Uppsala Transposon Symposium on October 4–5, 2018 in Uppsala, Sweden. Using cutting-edge single-molecule and single-cell approaches, research on TEs and other repeats has entered a new era in biological and biomedical research.


July 7, 2019

Alignment-free genome comparison enables accurate geographic sourcing of white oak DNA.

The application of genomic data and bioinformatics for the identification of restricted or illegally-sourced natural products is urgently needed. The taxonomic identity and geographic provenance of raw and processed materials have implications in sustainable-use commercial practices, and relevance to the enforcement of laws that regulate or restrict illegally harvested materials, such as timber. Improvements in genomics make it possible to capture and sequence partial-to-complete genomes from challenging tissues, such as wood and wood products.In this paper, we report the success of an alignment-free genome comparison method, [Formula: see text] that differentiates different geographic sources of white oak (Quercus) species with a high level of accuracy with very small amount of genomic data. The method is robust to sequencing errors, different sequencing laboratories and sequencing platforms.This method offers an approach based on genome-scale data, rather than panels of pre-selected markers for specific taxa. The method provides a generalizable platform for the identification and sourcing of materials using a unified next generation sequencing and analysis framework.


July 7, 2019

Hardwood tree genomics: Unlocking woody plant biology.

Woody perennial angiosperms (i.e., hardwood trees) are polyphyletic in origin and occur in most angiosperm orders. Despite their independent origins, hardwoods have shared physiological, anatomical, and life history traits distinct from their herbaceous relatives. New high-throughput DNA sequencing platforms have provided access to numerous woody plant genomes beyond the early reference genomes of Populus and Eucalyptus, references that now include willow and oak, with pecan and chestnut soon to follow. Genomic studies within these diverse and undomesticated species have successfully linked genes to ecological, physiological, and developmental traits directly. Moreover, comparative genomic approaches are providing insights into speciation events while large-scale DNA resequencing of native collections is identifying population-level genetic diversity responsible for variation in key woody plant biology across and within species. Current research is focused on developing genomic prediction models for breeding, defining speciation and local adaptation, detecting and characterizing somatic mutations, revealing the mechanisms of gender determination and flowering, and application of systems biology approaches to model complex regulatory networks underlying quantitative traits. Emerging technologies such as single-molecule, long-read sequencing is being employed as additional woody plant species, and genotypes within species, are sequenced, thus enabling a comparative (“evo-devo”) approach to understanding the unique biology of large woody plants. Resource availability, current genomic and genetic applications, new discoveries and predicted future developments are illustrated and discussed for poplar, eucalyptus, willow, oak, chestnut, and pecan.


July 7, 2019

Whole-Genome and Expression Analyses of Bamboo Aquaporin Genes Reveal Their Functions Involved in Maintaining Diurnal Water Balance in Bamboo Shoots.

Water supply is essential for maintaining normal physiological function during the rapid growth of bamboo. Aquaporins (AQPs) play crucial roles in water transport for plant growth and development. Although 26 PeAQPs in bamboo have been reported, the aquaporin-led mechanism of maintaining diurnal water balance in bamboo shoots remains unclear. In this study, a total of 63 PeAQPs were identified, based on the updated genome of moso bamboo (Phyllostachys edulis), including 22 PePIPs, 20 PeTIPs, 17 PeNIPs, and 4 PeSIPs. All of the PeAQPs were differently expressed in 26 different tissues of moso bamboo, based on RNA sequencing (RNA-seq) data. The root pressure in shoots showed circadian rhythm changes, with positive values at night and negative values in the daytime. The quantitative real-time PCR (qRT-PCR) result showed that 25 PeAQPs were detected in the base part of the shoots, and most of them demonstrated diurnal rhythm changes. The expression levels of some PeAQPs were significantly correlated with the root pressure. Of the 86 sugar transport genes, 33 had positive co-expression relationships with 27 PeAQPs. Two root pressure-correlated PeAQPs, PeTIP4;1 and PeTIP4;2, were confirmed to be highly expressed in the parenchyma and epidermal cells of bamboo culm, and in the epidermis, pith, and primary xylem of bamboo roots by in situ hybridization. The authors’ findings provide new insights and a possible aquaporin-led mechanism for bamboo fast growth.


July 7, 2019

The Draft Genome of the MD-2 Pineapple

The main challenge in assembling plant genome is its ploidy level, repeats content, and polymorphism. The second-generation sequencing delivered the throughput and the accuracy that is crucial to whole-genome sequencing but insufficient and remained challenging for some plant species. It is known that genomes produced by next-gen- eration sequencing produced small contigs that would inflate the number of annotated genes (Varshney et al. 2011) and missed on the transposable elements that are abun- dant in plant genome due to their repetitive nature (Michael and Jackson 2013).


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.