Menu
September 22, 2019

The chromosome-level genome assemblies of two rattans (Calamus simplicifolius and Daemonorops jenkinsiana).

Calamus simplicifolius and Daemonorops jenkinsiana are two representative rattans, the most significant material sources for the rattan industry. However, the lack of reference genome sequences is a major obstacle for basic and applied biology on rattan.We produced two chromosome-level genome assemblies of C. simplicifolius and D. jenkinsiana using Illumina, Pacific Biosciences, and Hi-C sequencing data. A total of ~730 Gb and ~682 Gb of raw data covered the predicted genome lengths (~1.98 Gb of C. simplicifolius and ~1.61 Gb of D. jenkinsiana) to ~372 × and ~426 × read depths, respectively. The two de novo genome assemblies, ~1.94 Gb and ~1.58 Gb, were generated with scaffold N50s of ~160 Mb and ~119 Mb in C. simplicifolius and D. jenkinsiana, respectively. The C. simplicifolius and D. jenkinsiana genomes were predicted to harbor ?51,235 and ?53,342 intact protein-coding gene models, respectively. Benchmarking Universal Single-Copy Orthologs evaluation demonstrated that genome completeness reached 96.4% and 91.3% in the C. simplicifolius and D. jenkinsiana genomes, respectively. Genome evolution showed that four Arecaceae plants clustered together, and the divergence time between the two rattans was ~19.3 million years ago. Additionally, we identified 193 and 172 genes involved in the lignin biosynthesis pathway in the C. simplicifolius and D. jenkinsiana genomes, respectively.We present the first de novo assemblies of two rattan genomes (C. simplicifolius and D. jenkinsiana). These data will not only provide a fundamental resource for functional genomics, particularly in promoting germplasm utilization for breeding, but also serve as reference genomes for comparative studies between and among different species.


September 22, 2019

Evolutionary history of human Plasmodium vivax revealed by genome-wide analyses of related ape parasites.

Wild-living African apes are endemically infected with parasites that are closely related to human Plasmodium vivax, a leading cause of malaria outside Africa. This finding suggests that the origin of P. vivax was in Africa, even though the parasite is now rare in humans there. To elucidate the emergence of human P. vivax and its relationship to the ape parasites, we analyzed genome sequence data of P. vivax strains infecting six chimpanzees and one gorilla from Cameroon, Gabon, and Côte d’Ivoire. We found that ape and human parasites share nearly identical core genomes, differing by only 2% of coding sequences. However, compared with the ape parasites, human strains of P. vivax exhibit about 10-fold less diversity and have a relative excess of nonsynonymous nucleotide polymorphisms, with site-frequency spectra suggesting they are subject to greatly relaxed purifying selection. These data suggest that human P. vivax has undergone an extreme bottleneck, followed by rapid population expansion. Investigating potential host-specificity determinants, we found that ape P. vivax parasites encode intact orthologs of three reticulocyte-binding protein genes (rbp2d, rbp2e, and rbp3), which are pseudogenes in all human P. vivax strains. However, binding studies of recombinant RBP2e and RBP3 proteins to human, chimpanzee, and gorilla erythrocytes revealed no evidence of host-specific barriers to red blood cell invasion. These data suggest that, from an ancient stock of P. vivax parasites capable of infecting both humans and apes, a severely bottlenecked lineage emerged out of Africa and underwent rapid population growth as it spread globally. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

Genomic insights into host adaptation between the wheat stripe rust pathogen (Puccinia striiformis f. sp. tritici) and the barley stripe rust pathogen (Puccinia striiformis f. sp. hordei).

Plant fungal pathogens can rapidly evolve and adapt to new environmental conditions in response to sudden changes of host populations in agro-ecosystems. However, the genomic basis of their host adaptation, especially at the forma specialis level, remains unclear.We sequenced two isolates each representing Puccinia striiformis f. sp. tritici (Pst) and P. striiformis f. sp. hordei (Psh), different formae speciales of the stripe rust fungus P. striiformis highly adapted to wheat and barley, respectively. The divergence of Pst and Psh, estimated to start 8.12 million years ago, has been driven by high nucleotide mutation rates. The high genomic variation within dikaryotic urediniospores of P. striiformis has provided raw genetic materials for genome evolution. No specific gene families have enriched in either isolate, but extensive gene loss events have occurred in both Pst and Psh after the divergence from their most recent common ancestor. A large number of isolate-specific genes were identified, with unique genomic features compared to the conserved genes, including 1) significantly shorter in length; 2) significantly less expressed; 3) significantly closer to transposable elements; and 4) redundant in pathways. The presence of specific genes in one isolate (or forma specialis) was resulted from the loss of the homologues in the other isolate (or forma specialis) by the replacements of transposable elements or losses of genomic fragments. In addition, different patterns and numbers of telomeric repeats were observed between the isolates.Host adaptation of P. striiformis at the forma specialis level is a complex pathogenic trait, involving not only virulence-related genes but also other genes. Gene loss, which might be adaptive and driven by transposable element activities, provides genomic basis for host adaptation of different formae speciales of P. striiformis.


September 22, 2019

Draft genome sequence of wild Prunus yedoensis reveals massive inter-specific hybridization between sympatric flowering cherries.

Hybridization is an important evolutionary process that results in increased plant diversity. Flowering Prunus includes popular cherry species that are appreciated worldwide for their flowers. The ornamental characteristics were acquired both naturally and through artificially hybridizing species with heterozygous genomes. Therefore, the genome of hybrid flowering Prunus presents important challenges both in plant genomics and evolutionary biology.We use long reads to sequence and analyze the highly heterozygous genome of wild Prunus yedoensis. The genome assembly covers >?93% of the gene space; annotation identified 41,294 protein-coding genes. Comparative analysis of the genome with 16 accessions of six related taxa shows that 41% of the genes were assigned into the maternal or paternal state. This indicates that wild P. yedoensis is an F1 hybrid originating from a cross between maternal P. pendula f. ascendens and paternal P. jamasakura, and it can be clearly distinguished from its confusing taxon, Yoshino cherry. A focused analysis of the S-locus haplotypes of closely related taxa distributed in a sympatric natural habitat suggests that reduced restriction of inter-specific hybridization due to strong gametophytic self-incompatibility is likely to promote complex hybridization of wild Prunus species and the development of a hybrid swarm.We report the draft genome assembly of a natural hybrid Prunus species using long-read sequencing and sequence phasing. Based on a comprehensive comparative genome analysis with related taxa, it appears that cross-species hybridization in sympatric habitats is an ongoing process that facilitates the diversification of flowering Prunus.


September 22, 2019

Genomic approaches for studying crop evolution.

Understanding how crop plants evolved from their wild relatives and spread around the world can inform about the origins of agriculture. Here, we review how the rapid development of genomic resources and tools has made it possible to conduct genetic mapping and population genetic studies to unravel the molecular underpinnings of domestication and crop evolution in diverse crop species. We propose three future avenues for the study of crop evolution: establishment of high-quality reference genomes for crops and their wild relatives; genomic characterization of germplasm collections; and the adoption of novel methodologies such as archaeogenetics, epigenomics, and genome editing.


September 22, 2019

Genus-wide sequencing supports a two-locus model for sex-determination in Phoenix.

The date palm tree is a commercially important member of the genus Phoenix whose 14 species are dioecious with separate male and female individuals. To identify sex determining genes we sequenced the genomes of 15 female and 13 male Phoenix trees representing all 14 species. We identified male-specific sequences and extended them using phased single-molecule sequencing or BAC clones. We observed that only four genes contained sequences conserved in all analyzed Phoenix males. Most of these sequences showed similarity to a single genomic locus in the closely related monoecious oil palm. CYP703 and GPAT3, two single copy genes present in males and critical for male flower development in other monocots, were absent in females. A LOG-like gene appears translocated into the Y-linked region and is suggested to play a role in suppressing female flowers. Our data are consistent with a two-mutation model for the evolution of dioecy in Phoenix.


September 22, 2019

Repeated inversions within a pannier intron drive diversification of intraspecific colour patterns of ladybird beetles.

How genetic information is modified to generate phenotypic variation within a species is one of the central questions in evolutionary biology. Here we focus on the striking intraspecific diversity of >200 aposematic elytral (forewing) colour patterns of the multicoloured Asian ladybird beetle, Harmonia axyridis, which is regulated by a tightly linked genetic locus h. Our loss-of-function analyses, genetic association studies, de novo genome assemblies, and gene expression data reveal that the GATA transcription factor gene pannier is the major regulatory gene located at the h locus, and suggest that repeated inversions and cis-regulatory modifications at pannier led to the expansion of colour pattern variation in H. axyridis. Moreover, we show that the colour-patterning function of pannier is conserved in the seven-spotted ladybird beetle, Coccinella septempunctata, suggesting that H. axyridis’ extraordinary intraspecific variation may have arisen from ancient modifications in conserved elytral colour-patterning mechanisms in ladybird beetles.


September 22, 2019

Asymmetric processing of DNA ends at a double-strand break leads to unconstrained dynamics and ectopic translocation.

Multiple pathways regulate the repair of double-strand breaks (DSBs) to suppress potentially dangerous ectopic recombination. Both sequence and chromatin context are thought to influence pathway choice between non-homologous end-joining (NHEJ) and homology-driven recombination. To test the effect of repetitive sequences on break processing, we have inserted TG-rich repeats on one side of an inducible DSB at the budding yeast MAT locus on chromosome III. Five clustered Rap1 sites within a break-proximal TG repeat are sufficient to block Mre11-Rad50-Xrs2 recruitment, impair resection, and favor elongation by telomerase. The two sides of the break lose end-to-end tethering and show enhanced, uncoordinated movement. Only the TG-free side is resected and shifts to the nuclear periphery. In contrast to persistent DSBs without TG repeats that are repaired by imprecise NHEJ, nearly all survivors of repeat-proximal DSBs repair the break by a homology-driven, non-reciprocal translocation from ChrIII-R to ChrVII-L. This suppression of imprecise NHEJ at TG-repeat-flanked DSBs requires the Uls1 translocase activity. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

B chromosomes of the Asian seabass (Lates calcarifer) contribute to genome variations at the level of individuals and populations.

The Asian seabass (Lates calcarifer) is a bony fish from the Latidae family, which is widely distributed in the tropical Indo-West Pacific region. The karyotype of the Asian seabass contains 24 pairs of A chromosomes and a variable number of AT- and GC-rich B chromosomes (Bchrs or Bs). Dot-like shaped and nucleolus-associated AT-rich Bs were microdissected and sequenced earlier. Here we analyzed DNA fragments from Bs to determine their repeat and gene contents using the Asian seabass genome as a reference. Fragments of 75 genes, including an 18S rRNA gene, were found in the Bs; repeats represented 2% of the Bchr assembly. The 18S rDNA of the standard genome and Bs were similar and enriched with fragments of transposable elements. A higher nuclei DNA content in the male gonad and somatic tissue, compared to the female gonad, was demonstrated by flow cytometry. This variation in DNA content could be associated with the intra-individual variation in the number of Bs. A comparison between the copy number variation among the B-related fragments from whole genome resequencing data of Asian seabass individuals identified similar profiles between those from the South-East Asian/Philippines and Indian region but not the Australian ones. Our results suggest that Bs might cause variations in the genome among the individuals and populations of Asian seabass. A personalized copy number approach for segmental duplication detection offers a suitable tool for population-level analysis across specimens with low coverage genome sequencing.


September 22, 2019

Antagonistic pleiotropy in the bifunctional surface protein FadL (OmpP1) during adaptation of Haemophilus influenzae to chronic lung infection associated with chronic obstructive pulmonary disease.

Tracking bacterial evolution during chronic infection provides insights into how host selection pressures shape bacterial genomes. The human-restricted opportunistic pathogen nontypeable Haemophilus influenzae (NTHi) infects the lower airways of patients suffering chronic obstructive pulmonary disease (COPD) and contributes to disease progression. To identify bacterial genetic variation associated with bacterial adaptation to the COPD lung, we sequenced the genomes of 92 isolates collected from the sputum of 13 COPD patients over 1 to 9?years. Individuals were colonized by distinct clonal types (CTs) over time, but the same CT was often reisolated at a later time or found in different patients. Although genomes from the same CT were nearly identical, intra-CT variation due to mutation and recombination occurred. Recurrent mutations in several genes were likely involved in COPD lung adaptation. Notably, nearly a third of CTs were polymorphic for null alleles of ompP1 (also called fadL), which encodes a bifunctional membrane protein that both binds the human carcinoembryonic antigen-related cell adhesion molecule 1 (hCEACAM1) receptor and imports long-chain fatty acids (LCFAs). Our computational studies provide plausible three-dimensional models for FadL’s interaction with hCEACAM1 and LCFA binding. We show that recurrent fadL mutations are likely a case of antagonistic pleiotropy, since loss of FadL reduces NTHi’s ability to infect epithelia but also increases its resistance to bactericidal LCFAs enriched within the COPD lung. Supporting this interpretation, truncated fadL alleles are common in publicly available NTHi genomes isolated from the lower airway tract but rare in others. These results shed light on molecular mechanisms of bacterial pathoadaptation and guide future research toward developing novel COPD therapeutics.IMPORTANCE Nontypeable Haemophilus influenzae is an important pathogen in patients with chronic obstructive pulmonary disease (COPD). To elucidate the bacterial pathways undergoing in vivo evolutionary adaptation, we compared bacterial genomes collected over time from 13 COPD patients and identified recurrent genetic changes arising in independent bacterial lineages colonizing different patients. Besides finding changes in phase-variable genes, we found recurrent loss-of-function mutations in the ompP1 (fadL) gene. We show that loss of OmpP1/FadL function reduces this bacterium’s ability to infect cells via the hCEACAM1 epithelial receptor but also increases its resistance to bactericidal fatty acids enriched within the COPD lung, suggesting a case of antagonistic pleiotropy that restricts ?fadL strains’ niche. These results show how H. influenzae adapts to host-generated inflammatory mediators in the COPD airways. Copyright © 2018 Moleres et al.


September 22, 2019

Structural variants exhibit allelic heterogeneity and shape variation in complex traits

Despite extensive effort to reveal the genetic basis of complex phenotypic variation, studies typically explain only a fraction of trait heritability. It has been hypothesized that individually rare hidden structural variants (SVs) could account for a significant fraction of variation in complex traits. To investigate this hypothesis, we assembled 14 Drosophila melanogaster genomes and systematically identified more than 20,000 euchromatic SVs, of which ~40% are invisible to high specificity short read genotyping approaches. SVs are common in Drosophila genes, with almost one third of diploid individuals harboring an SV in genes larger than 5kb, and nearly a quarter harboring multiple SVs in genes larger than 10kb. We show that SV alleles are rarer than amino acid polymorphisms, implying that they are more strongly deleterious. A number of functionally important genes harbor previously hidden structural variants that likely affect complex phenotypes (e.g., Cyp6g1, Drsl5, Cyp28d1&2, InR, and Gss1&2). Furthermore, SVs are overrepresented in quantitative trait locus candidate genes from eight Drosophila Synthetic Population Resource (DSPR) mapping experiments. We conclude that SVs are pervasive in genomes, are frequently present as heterogeneous allelic series, and can act as rare alleles of large effect.


September 22, 2019

Identification of the KPC plasmid pCT-KPC334: New insights on the evolutionary pathway of epidemic plasmids harboring fosA3-blaKPC-2 genes.

A novel, non-conjugative plasmid pKP1034 isolated from a fosfomycin-resistant, carbapenemase-producing Klebsiella pneumonia strain KP1034 was recently reported to carry fosA3, blaKPC-2, blaCTX-M-65, blaSHV-12 and rmtB genes, and was hypothesized to evolve from several recombination events of two closely related plasmids, pHN7A8 and pKPC-LK30 [1]. In this study, a plasmid pCT-KPC334 carrying fosA3, blaKPC-2, blaCTX-M-65, blaSHV-12, blaTEM-1, and rmtB genes was identified, providing evidence on the evolutionary pathway of plasmids harboring fosA3-blaKPC-2 genes.


September 22, 2019

Generic accelerated sequence alignment in SeqAn using vectorization and multi-threading.

Pairwise sequence alignment is undoubtedly a central tool in many bioinformatics analyses. In this paper, we present a generically accelerated module for pairwise sequence alignments applicable for a broad range of applications. In our module, we unified the standard dynamic programming kernel used for pairwise sequence alignments and extended it with a generalized inter-sequence vectorization layout, such that many alignments can be computed simultaneously by exploiting SIMD (single instruction multiple data) instructions of modern processors. We then extended the module by adding two layers of thread-level parallelization, where we (a) distribute many independent alignments on multiple threads and (b) inherently parallelize a single alignment computation using a work stealing approach producing a dynamic wavefront progressing along the minor diagonal.We evaluated our alignment vectorization and parallelization on different processors, including the newest Intel® Xeon® (Skylake) and Intel® Xeon PhiTM (KNL) processors, and use cases. The instruction set AVX512-BW (Byte and Word), available on Skylake processors, can genuinely improve the performance of vectorized alignments. We could run single alignments 1600 times faster on the Xeon PhiTM and 1400 times faster on the Xeon® than executing them with our previous sequential alignment module.The module is programmed in C++?using the SeqAn (Reinert et al., 2017) library and distributed with version 2.4 under the BSD license. We support SSE4, AVX2, AVX512 instructions and included UME: SIMD, a SIMD-instruction wrapper library, to extend our module for further instruction sets. We thoroughly test all alignment components with all major C++?compilers on various platforms.Supplementary data are available at Bioinformatics online.


September 22, 2019

Development and validation of 58K SNP-array and high-density linkage map in Nile tilapia (O. niloticus).

Despite being the second most important aquaculture species in the world accounting for 7.4% of global production in 2015, tilapia aquaculture has lacked genomic tools like SNP-arrays and high-density linkage maps to improve selection accuracy and accelerate genetic progress. In this paper, we describe the development of a genotyping array containing more than 58,000 SNPs for Nile tilapia (Oreochromis niloticus). SNPs were identified from whole genome resequencing of 32 individuals from the commercial population of the Genomar strain, and were selected for the SNP-array based on polymorphic information content and physical distribution across the genome using the Orenil1.1 genome assembly as reference sequence. SNP-performance was evaluated by genotyping 4991 individuals, including 689 offspring belonging to 41 full-sib families, which revealed high-quality genotype data for 43,588 SNPs. A preliminary genetic linkage map was constructed using Lepmap2 which in turn was integrated with information from the O_niloticus_UMD1 genome assembly to produce an integrated physical and genetic linkage map comprising 40,186 SNPs distributed across 22 linkage groups (LGs). Around one-third of the LGs showed a different recombination rate between sexes, with the female being greater than the male map by a factor of 1.2 (1632.9 to 1359.6 cM, respectively), with most LGs displaying a sigmoid recombination profile. Finally, the sex-determining locus was mapped to position 40.53 cM on LG23, in the vicinity of the anti-Müllerian hormone (amh) gene. These new resources has the potential to greatly influence and improve the genetic gain when applying genomic selection and surpass the difficulties of efficient selection for invasively measured traits in Nile tilapia.


September 22, 2019

A homeobox gene, BarH-1, underlies a female alternative life-history strategy

Colias butterflies (the “clouded sulphurs”) often occur in mixed populations where females exhibit two color morphs, yellow/orange or white. White females, known as the Alba morph, reallocate resources from the synthesis of costly colored pigments to reproductive and somatic development 1. Due to this tradeoff Alba females develop faster and have higher fecundity than orange females 2. However orange females, that have instead invested in pigments, are preferred by males who in turn provide a nutrient rich spermatophore during mating 2,3,4. Thus the wing color morphs represent alternative life history strategies (ALHS) that are female-limited, wherein tradeoffs, due to divergent resource investment, result in distinct phenotypes with associated fitness consequences. Here we map the genetic basis of Alba in Colias crocea to a transposable element insertion downstream of the Colias homolog of BarH-1. To investigate the phenotypic effects of this insertion we use CRISPR/Cas9 to validate BarH-1’s functional role in the wing color switch and antibody staining to confirm expression differences in the scale building cells of pupal wings. We then use scanning electron microscopy to determine that BarH-1 expression in the wings causes a reduction in pigment granules within wing scales, and thereby gives rise to the white color. Finally, lipid and transcriptome analyses reveal additional physiological differences that arise due to Alba, suggesting pleiotropic effects beyond wing color. Together these findings provide the first well documented mechanism for a female ALHS and support an alternative view of color polymorphism as indicative of pleiotropic effects with life history consequences.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.