Menu
September 22, 2019

Large-scale gene losses underlie the genome evolution of parasitic plant Cuscuta australis.

Dodders (Cuscuta spp., Convolvulaceae) are root- and leafless parasitic plants. The physiology, ecology, and evolution of these obligate parasites are poorly understood. A high-quality reference genome of Cuscuta australis was assembled. Our analyses reveal that Cuscuta experienced accelerated molecular evolution, and Cuscuta and the convolvulaceous morning glory (Ipomoea) shared a common whole-genome triplication event before their divergence. C. australis genome harbors 19,671 protein-coding genes, and importantly, 11.7% of the conserved orthologs in autotrophic plants are lost in C. australis. Many of these gene loss events likely result from its parasitic lifestyle and the massive changes of its body plan. Moreover, comparison of the gene expression patterns in Cuscuta prehaustoria/haustoria and various tissues of closely related autotrophic plants suggests that Cuscuta haustorium formation requires mostly genes normally involved in root development. The C. australis genome provides important resources for studying the evolution of parasitism, regressive evolution, and evo-devo in plant parasites.


September 22, 2019

Evidence of non-tandemly repeated rDNAs and their intragenomic heterogeneity in Rhizophagus irregularis

Arbuscular mycorrhizal fungus (AMF) species are some of the most widespread symbionts of land plants. Our much improved reference genome assembly of a model AMF, Rhizophagus irregularis DAOM-181602 (total contigs?=?210), facilitated a discovery of repetitive elements with unusual characteristics. R. irregularis has only ten or 11 copies of complete 45S rDNAs, whereas the general eukaryotic genome has tens to thousands of rDNA copies. R. irregularis rDNAs are highly heterogeneous and lack a tandem repeat structure. These findings provide evidence for the hypothesis that rDNA heterogeneity depends on the lack of tandem repeat structures. RNA-Seq analysis confirmed that all rDNA variants are actively transcribed. Observed rDNA/rRNA polymorphisms may modulate translation by using different ribosomes depending on biotic and abiotic interactions. The non-tandem repeat structure and intragenomic heterogeneity of AMF rDNA/rRNA may facilitate successful adaptation to various environmental conditions, increasing host compatibility of these symbiotic fungi.


September 22, 2019

Genotype-Corrector: improved genotype calls for genetic mapping in F2 and RIL populations.

F2 and recombinant inbred lines (RILs) populations are very commonly used in plant genetic mapping studies. Although genome-wide genetic markers like single nucleotide polymorphisms (SNPs) can be readily identified by a wide array of methods, accurate genotype calling remains challenging, especially for heterozygous loci and missing data due to low sequencing coverage per individual. Therefore, we developed Genotype-Corrector, a program that corrects genotype calls and imputes missing data to improve the accuracy of genetic mapping. Genotype-Corrector can be applied in a wide variety of genetic mapping studies that are based on low coverage whole genome sequencing (WGS) or Genotyping-by-Sequencing (GBS) related techniques. Our results show that Genotype-Corrector achieves high accuracy when applied to both synthetic and real genotype data. Compared with using raw or only imputed genotype calls, the linkage groups built by corrected genotype data show much less noise and significant distortions can be corrected. Additionally, Genotype-Corrector compares favorably to the popular imputation software LinkImpute and Beagle in both F2 and RIL populations. Genotype-Corrector is publicly available on GitHub at https://github.com/freemao/Genotype-Corrector .


September 22, 2019

A mosaic monoploid reference sequence for the highly complex genome of sugarcane.

Sugarcane (Saccharum spp.) is a major crop for sugar and bioenergy production. Its highly polyploid, aneuploid, heterozygous, and interspecific genome poses major challenges for producing a reference sequence. We exploited colinearity with sorghum to produce a BAC-based monoploid genome sequence of sugarcane. A minimum tiling path of 4660 sugarcane BAC that best covers the gene-rich part of the sorghum genome was selected based on whole-genome profiling, sequenced, and assembled in a 382-Mb single tiling path of a high-quality sequence. A total of 25,316 protein-coding gene models are predicted, 17% of which display no colinearity with their sorghum orthologs. We show that the two species, S. officinarum and S. spontaneum, involved in modern cultivars differ by their transposable elements and by a few large chromosomal rearrangements, explaining their distinct genome size and distinct basic chromosome numbers while also suggesting that polyploidization arose in both lineages after their divergence.


September 22, 2019

npInv: accurate detection and genotyping of inversions using long read sub-alignment.

Detection of genomic inversions remains challenging. Many existing methods primarily target inzversions with a non repetitive breakpoint, leaving inverted repeat (IR) mediated non-allelic homologous recombination (NAHR) inversions largely unexplored.We present npInv, a novel tool specifically for detecting and genotyping NAHR inversion using long read sub-alignment of long read sequencing data. We benchmark npInv with other tools in both simulation and real data. We use npInv to generate a whole-genome inversion map for NA12878 consisting of 30 NAHR inversions (of which 15 are novel), including all previously known NAHR mediated inversions in NA12878 with flanking IR less than 7kb. Our genotyping accuracy on this dataset was 94%. We used PCR to confirm the presence of two of these novel inversions. We show that there is a near linear relationship between the length of flanking IR and the minimum inversion size, without inverted repeats.The application of npInv shows high accuracy in both simulation and real data. The results give deeper insight into understanding inversion.


September 22, 2019

Molecular characterization of invasive meningococcal isolates in Burkina Faso as the relative importance of serogroups X and W increases, 2008-2012.

Neisseria meningitidis serogroup A disease in Burkina Faso has greatly decreased following introduction of a meningococcal A conjugate vaccine in 2010, yet other serogroups continue to pose a risk of life-threatening disease. Capsule switching among epidemic-associated serogroup A N. meningitidis strains could allow these lineages to persist despite vaccination. The introduction of new strains at the national or sub-national levels could affect the epidemiology of disease.Isolates collected from invasive meningococcal disease in Burkina Faso between 2008 and 2012 were characterized by serogrouping and molecular typing. Genome sequences from a subset of isolates were used to infer phylogenetic relationships.The ST-5 clonal complex (CC5) was identified only among serogroup A isolates, which were rare after 2010. CC181 and CC11 were the most common clonal complexes after 2010, having serogroup X and W isolates, respectively. Whole-genome phylogenetic analysis showed that the CC181 isolates collected during and after the epidemic of 2010 formed a single clade that was closely related to isolates collected in Niger during 2005 and Burkina Faso during 2007. Geographic population structure was identified among the CC181 isolates, where pairs of isolates collected from the same region of Burkina Faso within a single year had less phylogenetic diversity than the CC181 isolate collection as a whole. However, the reduction of phylogenetic diversity within a region did not extend across multiple years. Instead, CC181 isolates collected during the same year had lower than average diversity, even when collected from different regions, indicating geographic mixing of strains across years. The CC11 isolates were primarily collected during the epidemic of 2012, with sparse sampling during 2011. These isolates belong to a clade that includes previously described isolates collected in Burkina Faso, Mali, and Niger from 2011 to 2015. Similar to CC181, reduced phylogenetic diversity was observed among CC11 isolate pairs collected from the same regions during a single year.The population of disease-associated N. meningitidis strains within Burkina Faso was highly dynamic between 2008 and 2012, reflecting both vaccine-imposed selection against serogroup A strains and potentially complex clonal waves of serogroup X and serogroup W strains.


September 22, 2019

Genome analysis of the ancient tracheophyte Selaginella tamariscina reveals evolutionary features relevant to the acquisition of desiccation tolerance.

Resurrection plants, which are the “gifts” of natural evolution, are ideal models for studying the genetic basis of plant desiccation tolerance. Here, we report a high-quality genome assembly of 301 Mb for the diploid spike moss Selaginella tamariscina, a primitive vascular resurrection plant. We predicated 27 761 protein-coding genes from the assembled S. tamariscina genome, 11.38% (2363) of which showed significant expression changes in response to desiccation. Approximately 60.58% of the S. tamariscina genome was annotated as repetitive DNA, which is an almost 2-fold increase of that in the genome of desiccation-sensitive Selaginella moellendorffii. Genomic and transcriptomic analyses highlight the unique evolution and complex regulations of the desiccation response in S. tamariscina, including species-specific expansion of the oleosin and pentatricopeptide repeat gene families, unique genes and pathways for reactive oxygen species generation and scavenging, and enhanced abscisic acid (ABA) biosynthesis and potentially distinct regulation of ABA signaling and response. Comparative analysis of chloroplast genomes of several Selaginella species revealed a unique structural rearrangement and the complete loss of chloroplast NAD(P)H dehydrogenase (NDH) genes in S. tamariscina, suggesting a link between the absence of the NDH complex and desiccation tolerance. Taken together, our comparative genomic and transcriptomic analyses reveal common and species-specific desiccation tolerance strategies in S. tamariscina, providing significant insights into the desiccation tolerance mechanism and the evolution of resurrection plants. Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Hotspots of independent and multiple rounds of LTR-retrotransposon bursts in Brassica species

Long terminal repeat retrotransposons (LTR-RTs) are a predominant group of plant transposable elements (TEs) that are an important component of plant genomes. A large number of LTR-RTs have been annotated in the genomes of the agronomically important oil and vegetable crops of the genus Brassica. Herein, full-length LTR-RTs in the genomes of Brassica and other closely related species were systematically analyzed. The full-length LTR-RT content varied greatly (from 0.43% to 23.4%) between different species, with Gypsy-like LTR-RTs constituting a primary group across these genomes. More importantly, many annotated LTR-RTs (from 10.03% to 33.25% of all detected LTR-RTs) were found to be enriched in localized hotspot regions. Furthermore, all of the analyzed species showed evidence of having experienced at least one round of a LTR-RT burst, with Raphanus sativus experiencing three or more. Moreover, these relatively ancient LTR-RT amplifications exhibited a clear expansion at specific time points. To gain a further understanding of this timing, Brassica rapa, B. oleracea, and R. sativus were examined for the presence of syntenic regions, but none were present. These findings indicate that these LTR-RT burst events were not inherited from a common ancestor, but instead were species-specific bursts that occurred after the divergence of Brassica species. This study further exemplifies the complexities of TE amplifications during the evolution of plant genomes and suggests that these LTR-RT bursts play an important role in genome expansion and divergence in Brassica species.


September 22, 2019

Strand-seq enables reliable separation of long reads by chromosome via expectation maximization.

Current sequencing technologies are able to produce reads orders of magnitude longer than ever possible before. Such long reads have sparked a new interest in de novo genome assembly, which removes reference biases inherent to re-sequencing approaches and allows for a direct characterization of complex genomic variants. However, even with latest algorithmic advances, assembling a mammalian genome from long error-prone reads incurs a significant computational burden and does not preclude occasional misassemblies. Both problems could potentially be mitigated if assembly could commence for each chromosome separately.To address this, we show how single-cell template strand sequencing (Strand-seq) data can be leveraged for this purpose. We introduce a novel latent variable model and a corresponding Expectation Maximization algorithm, termed SaaRclust, and demonstrates its ability to reliably cluster long reads by chromosome. For each long read, this approach produces a posterior probability distribution over all chromosomes of origin and read directionalities. In this way, it allows to assess the amount of uncertainty inherent to sparse Strand-seq data on the level of individual reads. Among the reads that our algorithm confidently assigns to a chromosome, we observed more than 99% correct assignments on a subset of Pacific Bioscience reads with 30.1×?coverage. To our knowledge, SaaRclust is the first approach for the in silico separation of long reads by chromosome prior to assembly.https://github.com/daewoooo/SaaRclust.


September 22, 2019

Genomic variation among and within six Juglans species.

Genomic analysis in Juglans (walnuts) is expected to transform the breeding and agricultural production of both nuts and lumber. To that end, we report here the determination of reference sequences for six additional relatives of Juglans regia: Juglans sigillata (also from section Dioscaryon), Juglans nigra, Juglans microcarpa, Juglans hindsii (from section Rhysocaryon), Juglans cathayensis (from section Cardiocaryon), and the closely related Pterocarya stenoptera While these are ‘draft’ genomes, ranging in size between 640Mbp and 990Mbp, their contiguities and accuracies can support powerful annotations of genomic variation that are often the foundation of new avenues of research and breeding. We annotated nucleotide divergence and synteny by creating complete pairwise alignments of each reference genome to the remaining six. In addition, we have re-sequenced a sample of accessions from four Juglans species (including regia). The variation discovered in these surveys comprises a critical resource for experimentation and breeding, as well as a solid complementary annotation. To demonstrate the potential of these resources the structural and sequence variation in and around the polyphenol oxidase loci, PPO1 and PPO2 were investigated. As reported for other seed crops variation in this gene is implicated in the domestication of walnuts. The apparently Juglandaceae specific PPO1 duplicate shows accelerated divergence and an excess of amino acid replacement on the lineage leading to accessions of the domesticated nut crop species, Juglans regia and sigillata. Copyright © 2018 Stevens et al.


September 22, 2019

The genome assembly of the fungal pathogen Pyrenochaeta lycopersici from Single-Molecule Real-Time sequencing sheds new light on its biological complexity.

The first draft genome sequencing of the non-model fungal pathogen Pyrenochaeta lycopersici showed an expansion of gene families associated with heterokaryon incompatibility and lacking of mating-type genes, providing insights into the genetic basis of this “imperfect” fungus which lost the ability to produce the sexual stage. However, due to the Illumina short-read technology, the draft genome was too fragmented to allow a comprehensive characterization of the genome, especially of the repetitive sequence fraction. In this work, the sequencing of another P. lycopersici isolate using long-read Single Molecule Real-Time sequencing technology was performed with the aim of obtaining a gapless genome. Indeed, a gapless genome assembly of 62.7 Mb was obtained, with a fraction of repetitive sequences representing 30% of the total bases. The gene content of the two P. lycopersici isolates was very similar, and the large difference in genome size (about 8 Mb) might be attributable to the high fraction of repetitive sequences detected for the new sequenced isolate. The role of repetitive elements, including transposable elements, in modulating virulence effectors is well established in fungal plant pathogens. Moreover, transposable elements are of fundamental importance in creating and re-modelling genes, especially in imperfect fungi. Their abundance in P. lycopersici, together with the large expansion of heterokaryon incompatibility genes in both sequenced isolates, suggest the presence of possible mechanisms alternative to gene re-assorting mediated by sexual recombination. A quite large fraction (~9%) of repetitive elements in P. lycopersici, has no homology with known classes, strengthening this hypothesis. The availability of a gapless genome of P. lycopersici allowed the in-depth analysis of its genome content, by annotating functional genes and TEs. This goal will be an important resource for shedding light on the evolution of the reproductive and pathogenic behaviour of this soilborne pathogen and the onset of a possible speciation within this species.


September 22, 2019

Comparative genomic analysis of Bacillus thuringiensis reveals molecular adaptation to copper tolerance

Bacillus thuringiensis is a type of Gram positive and rod shaped bacterium that is found in a wide range of habitats. Despite the intensive studies conducted on this bacterium, most of the information available are related to its pathogenic characteristics, with only a limited number of publications mentioning its ability to survive in extreme environments. Recently, a B. thuringiensis MCMY1 strain was successfully isolated from a copper contaminated site in Mamut Copper Mine, Sabah. This study aimed to conduct a comparative genomic analysis by using the genome sequence of MCMY1 strain published in GenBank (PRJNA374601) as a target genome for comparison with other available B. thuringiensis genomes at the GenBank. Whole genome alignment, Fragment all-against-all comparison analysis, phylogenetic reconstruction and specific copper genes comparison were applied to all forty-five B. thuringiensis genomes to reveal the molecular adaptation to copper tolerance. The comparative results indicated that B. thuringiensis MCMY1 strain is closely related to strain Bt407 and strain IS5056. This strain harbors almost all available copper genes annotated from the forty-five B. thuringiensis genomes, except for the gene for Magnesium and cobalt efflux protein (CorC) which plays an indirect role in reducing the oxidative stress that caused by copper and other metal ions. Furthermore, the findings also showed that the Copper resistance gene family, CopABCDZ and its repressor (CsoR) are conserved in almost all sequenced genomes but the presence of the genes for Cytoplasmic copper homeostasis protein (CutC) and CorC across the sample genomes are highly inconsonant. The variation of these genes across the B. thuringiensis genomes suggests that each strain may have adapted to their specific ecological niche. However, further investigations will be need to support this preliminary hypothesis.


September 22, 2019

Genomic characterization of extensively drug-resistant Acinetobacter baumannii strain, KAB03 belonging to ST451 from Korea.

Extensively drug-resistant (XDR) Acinetobacter baumannii strains have emerged rapidly worldwide. The antibiotic resistance characteristics of XDR A. baumannii strains show regional differences; therefore, it is necessary to analyze both genomic and proteomic characteristics of emerging XDR A. baumannii clinical strains isolated in Korea to elucidate their multidrug resistance. Here, we isolated new sequence type of XDR A. baumannii clinical strain (KAB03) from Korean hospitals and performed comprehensive genome analyses. The strain belongs to new sequence type, ST451. Single nucleotide polymorphism (SNP) analysis with other types of A. baumannii strains revealed that KAB03 has unique SNP pattern in the regions of gyrB and gpi of MLST profiles. A. baumannii KAB03 harbours three antibiotic resistance islands (AbGRI1, 2, and 3). AbGRI1 harbours two copies of Tn2006 containing blaOXA-23, which play an important role in antibiotic resistance. AbGRI2 possesses aminoglycoside resistant gene aph(3′)-Ic and class A ß-lactamase blaTEM. AbGIR3 has macrolide resistant genes and aminoglycoside resistant gene armA. A. baumannii KAB03 harbours mutations in pmrB and pmrC, which are believed to confer colistin resistance. In addition, proteomic and transcriptional analysis of KAB03 confirmed that ß-lactamases (ADC-73 and OXA-23), Ade efflux pumps (AdeIJK), outer membrane proteins (OmpA and OmpW), and colistin resistance genes (PmrCAB) were major proteins responsible for antibiotic resistance. Our proteogenomic results provide valuable information for multi-drug resistance in emerging XDR A. baumannii strains belonging to ST451. Copyright © 2018. Published by Elsevier B.V.


September 22, 2019

The Chara genome: Secondary complexity and implications for plant terrestrialization.

Land plants evolved from charophytic algae, among which Charophyceae possess the most complex body plans. We present the genome of Chara braunii; comparison of the genome to those of land plants identified evolutionary novelties for plant terrestrialization and land plant heritage genes. C. braunii employs unique xylan synthases for cell wall biosynthesis, a phragmoplast (cell separation) mechanism similar to that of land plants, and many phytohormones. C. braunii plastids are controlled via land-plant-like retrograde signaling, and transcriptional regulation is more elaborate than in other algae. The morphological complexity of this organism may result from expanded gene families, with three cases of particular note: genes effecting tolerance to reactive oxygen species (ROS), LysM receptor-like kinases, and transcription factors (TFs). Transcriptomic analysis of sexual reproductive structures reveals intricate control by TFs, activity of the ROS gene network, and the ancestral use of plant-like storage and stress protection proteins in the zygote. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

Genome survey of the freshwater mussel Venustaconcha ellipsiformis (Bivalvia: Unionida) using a hybrid de novo assembly approach.

Freshwater mussels (Bivalvia: Unionida) serve an important role as aquatic ecosystem engineers but are one of the most critically imperilled groups of animals. Here, we used a combination of sequencing strategies to assemble and annotate a draft genome of Venustaconcha ellipsiformis, which will serve as a valuable genomic resource given the ecological value and unique “doubly uniparental inheritance” mode of mitochondrial DNA transmission of freshwater mussels. The genome described here was obtained by combining high-coverage short reads (65× genome coverage of Illumina paired-end and 11× genome coverage of mate-pairs sequences) with low-coverage Pacific Biosciences long reads (0.3× genome coverage). Briefly, the final scaffold assembly accounted for a total size of 1.54?Gb (366,926 scaffolds, N50?=?6.5 kb, with 2.3% of “N” nucleotides), representing 86% of the predicted genome size of 1.80?Gb, while over one third of the genome (37.5%) consisted of repeated elements and >85% of the core eukaryotic genes were recovered. Given the repeated genetic bottlenecks of V. ellipsiformis populations as a result of glaciations events, heterozygosity was also found to be remarkably low (0.6%), in contrast to most other sequenced bivalve species. Finally, we reassembled the full mitochondrial genome and found six polymorphic sites with respect to the previously published reference. This resource opens the way to comparative genomics studies to identify genes related to the unique adaptations of freshwater mussels and their distinctive mitochondrial inheritance mechanism.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.