Menu
July 19, 2019

The Rosa genome provides new insights into the domestication of modern roses.

Roses have high cultural and economic importance as ornamental plants and in the perfume industry. We report the rose whole-genome sequencing and assembly and resequencing of major genotypes that contributed to rose domestication. We generated a homozygous genotype from a heterozygous diploid modern rose progenitor, Rosa chinensis ‘Old Blush’. Using single-molecule real-time sequencing and a meta-assembly approach, we obtained one of the most comprehensive plant genomes to date. Diversity analyses highlighted the mosaic origin of ‘La France’, one of the first hybrids combining the growth vigor of European species and the recurrent blooming of Chinese species. Genomic segments of Chinese ancestry identified new candidate genes for recurrent blooming. Reconstructing regulatory and secondary metabolism pathways allowed us to propose a model of interconnected regulation of scent and flower color. This genome provides a foundation for understanding the mechanisms governing rose traits and should accelerate improvement in roses, Rosaceae and ornamentals.


July 19, 2019

Genomic variation in 3,010 diverse accessions of Asian cultivated rice.

Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within- and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence-absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.


July 19, 2019

Unexpected diversity in the mobilome of a Pseudomonas aeruginosa strain isolated from a dental unit waterline revealed by SMRT Sequencing.

The Gram-negative bacterium Pseudomonas aeruginosa is found in several habitats, both natural and human-made, and is particularly known for its recurrent presence as a pathogen in the lungs of patients suffering from cystic fibrosis, a genetic disease. Given its clinical importance, several major studies have investigated the genomic adaptation of P. aeruginosa in lungs and its transition as acute infections become chronic. However, our knowledge about the diversity and adaptation of the P. aeruginosa genome to non-clinical environments is still fragmentary, in part due to the lack of accurate reference genomes of strains from the numerous environments colonized by the bacterium. Here, we used PacBio long-read technology to sequence the genome of PPF-1, a strain of P. aeruginosa isolated from a dental unit waterline. Generating this closed genome was an opportunity to investigate genomic features that are difficult to accurately study in a draft genome (contigs state). It was possible to shed light on putative genomic islands, some shared with other reference genomes, new prophages, and the complete content of insertion sequences. In addition, four different group II introns were also found, including two characterized here and not listed in the specialized group II intron database.


July 19, 2019

The highly heterogeneous methylated genomes and diverse restriction-modification systems of bloom-forming Microcystis.

The occurrence of harmful Microcystis blooms is increasing in frequency in a myriad of freshwater ecosystems. Despite considerable research pertaining to the cause and nature of these blooms, the molecular mechanisms behind the cosmopolitan distribution and phenotypic diversity in Microcystis are still unclear. We compared the patterns and extent of DNA methylation in three strains of Microcystis, PCC 7806SL, NIES-2549 and FACHB-1757, using Single Molecule Real-Time (SMRT) sequencing technology. Intact restriction-modification (R-M) systems were identified from the genomes of these strains, and from two previously sequenced strains of Microcystis, NIES-843 and TAIHU98. A large number of methylation motifs and R-M genes were identified in these strains, which differ substantially among different strains. Of the 35 motifs identified, eighteen had not previously been reported. Strain NIES-843 contains a larger number of total putative methyltransferase genes than have been reported previously from any bacterial genome. Genomic comparisons reveal that methyltransferases (some partial) may have been acquired from the environment through horizontal gene transfer. Copyright © 2018 Elsevier B.V. All rights reserved.


July 19, 2019

The genomic floral language of rose

Roses have held an attraction for people all over the world as ornamental plants. Now genome sequencing of the highly heterozygous Rosa chinensis and resequencing of major genotypes open the door to a greater understanding of rose evolutionary history and the regulatory mechanisms determining rose flower color and scent.


July 19, 2019

Long read assemblies of geographically dispersed Plasmodium falciparum isolates reveal highly structured subtelomeres.

Background: Although thousands of clinical isolates of Plasmodium falciparum are being sequenced and analysed by short read technology, the data do not resolve the highly variable subtelomeric regions of the genomes that contain polymorphic gene families involved in immune evasion and pathogenesis. There is also no current standard definition of the boundaries of these variable subtelomeric regions. Methods: Using long-read sequence data (Pacific Biosciences SMRT technology), we assembled and annotated the genomes of 15 P. falciparum isolates, ten of which are newly cultured clinical isolates. We performed comparative analysis of the entire genome with particular emphasis on the subtelomeric regions and the internal var genes clusters.   Results: The nearly complete sequence of these 15 isolates has enabled us to define a highly conserved core genome, to delineate the boundaries of the subtelomeric regions, and to compare these across isolates. We found highly structured variable regions in the genome. Some exported gene families purportedly involved in release of merozoites show copy number variation. As an example of ongoing genome evolution, we found a novel CLAG gene in six isolates.  We also found a novel gene that was relatively enriched in the South East Asian isolates compared to those from Africa. Conclusions: These 15 manually curated new reference genome sequences with their nearly complete subtelomeric regions and fully assembled genes are an important new resource for the malaria research community. We report the overall conserved structure and pattern of important gene families and the more clearly defined subtelomeric regions.


July 19, 2019

High-Throughput Single-Cell Sequencing of both TCR-ß Alleles.

Allelic exclusion is a vital mechanism for the generation of monospecificity to foreign Ags in B and T lymphocytes. In this study, we developed a high-throughput barcoded method to simultaneously analyze the VDJ recombination status of both mouse TCR-ß alleles in hundreds of single cells using next-generation sequencing. Copyright © 2018 by The American Association of Immunologists, Inc.


July 19, 2019

Antigenic variation in the lyme spirochete: Insights into recombinational switching with a suggested role for error-prone repair.

The Lyme disease spirochete, Borrelia burgdorferi, uses antigenic variation as a strategy to evade the host’s acquired immune response. New variants of surface-localized VlsE are generated efficiently by unidirectional recombination from 15 unexpressed vls cassettes into the vlsE locus. Using algorithms to analyze switching from vlsE sequencing data, we characterize a population of over 45,000 inferred recombination events generated during mouse infection. We present evidence for clustering of these recombination events within the population and along the vlsE gene, a role for the direct repeats flanking the variable region in vlsE, and the importance of sequence homology in determining the location of recombination, despite RecA’s dispensability. Finally, we report that non-templated sequence variation is strongly associated with recombinational switching and occurs predominantly at the 5′ end of conversion tracts. This likely results from an error-prone repair mechanism operational during recombinational switching that elevates the mutation rate > 5,000-fold in switched regions. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.


July 19, 2019

Introduction: The host-associated microbiome: Pattern, process and function.

An explosion of studies in recent years has established the ubiquity of host-associated microbes and their centrality to host biology (McFall-Ngai et al., 2013; Russell, Dubilier, & Rudgers, 2014). Microbes aid in digestion, modulate development, contribute to host immunity, mediate abiotic stress and more. While relationships with host-associated microbes are ubiquitous and important, they are cer- tainly not monolithic. Characterizing the microbial diversity associ- ated with an ever-broadening array of hosts (diverse animals, plants, algae and protists) has shown that essential functions can be per- formed by microbes that are integrated with the host to varying degrees, ranging from embedded endosymbionts to a variable cast of transient microbes acquired from the environment. The maturing host–microbiome field is now developing a mechanistic understand- ing of host/microbe relationships across this spectrum and the cross- talk mediating these interactions. Similarly, studies across systems are illuminating the ecological and evolutionary factors that shape host–microbe interactions today and providing hints into the origins of specific relationships.


July 19, 2019

Detailed analysis of HTT repeat elements in human blood using targeted amplification-free long-read sequencing.

Amplification of DNA is required as a mandatory step during library preparation in most targeted sequencing protocols. This can be a critical limitation when targeting regions that are highly repetitive or with extreme guanine-cytosine (GC) content, including repeat expansions associated with human disease. Here, we used an amplification-free protocol for targeted enrichment utilizing the CRISPR/Cas9 system (No-Amp Targeted sequencing) in combination with single molecule, real-time (SMRT) sequencing for studying repeat elements in the huntingtin (HTT) gene, where an expanded CAG repeat is causative for Huntington disease. We also developed a robust data analysis pipeline for repeat element analysis that is independent of alignment of reads to a reference genome. The method was applied to 11 diagnostic blood samples, and for all 22 alleles the resulting CAG repeat count agreed with previous results based on fragment analysis. The amplification-free protocol also allowed for studying somatic variability of repeat elements in our samples, without the interference of PCR stutter. In summary, with No-Amp Targeted sequencing in combination with our analysis pipeline, we could accurately study repeat elements that are difficult to investigate using PCR-based methods.© 2018 The Authors. Human Mutation published by Wiley Periodicals, Inc.


July 19, 2019

Long-read sequencing and de novo genome assembly of Ammopiptanthus nanus, a desert shrub.

Ammopiptanthus nanus is a rare broad-leaved shrub that is found in the desert and arid regions of Central Asia. This plant species exhibits extremely high tolerance to drought and freezing and has been used in abiotic tolerance research in plants. As a relic of the tertiary period, A. nanus is of great significance to plant biogeographic research in the ancient Mediterranean region. Here, we report a draft genome assembly using the Pacific Biosciences (PacBio) platform and gene annotation for A. nanus.A total of 64.72 Gb of raw PacBio sequel reads were generated from four 20-kb libraries. After filtering, 64.53 Gb of clean reads were obtained, giving 72.59× coverage depth. Assembly using Canu gave an assembly length of 823.74 Mb, with a contig N50 of 2.76 Mb. The final size of the assembled A. nanus genome was close to the 889 Mb estimated by k-mer analysis. The gene annotation completeness was evaluated using Benchmarking Universal Single-Copy Orthologs; 1,327 of the 1,440 conserved genes (92.15%) could be found in the A. nanus assembly. Genome annotation revealed that 74.08% of the A. nanus genome is composed of repetitive elements and 53.44% is composed of long terminal repeat elements. We predicted ?37,188 protein-coding genes, of which 96.53% were functionally annotated.The genomic sequences of A. nanus could be a valuable source for comparative genomic analysis in the legume family and will be useful for understanding the phylogenetic relationships of the Thermopsideae and the evolutionary response of plant species to the Qinghai Tibetan Plateau uplift.


July 19, 2019

De novo repeat interruptions are associated with reduced somatic instability and mild or absent clinical features in myotonic dystrophy type 1.

Myotonic dystrophy type 1 (DM1) is a multisystem disorder, caused by expansion of a CTG trinucleotide repeat in the 3′-untranslated region of the DMPK gene. The repeat expansion is somatically unstable and tends to increase in length with time, contributing to disease progression. In some individuals, the repeat array is interrupted by variant repeats such as CCG and CGG, stabilising the expansion and often leading to milder symptoms. We have characterised three families, each including one person with variant repeats that had arisen de novo on paternal transmission of the repeat expansion. Two individuals were identified for screening due to an unusual result in the laboratory diagnostic test, and the third due to exceptionally mild symptoms. The presence of variant repeats in all three expanded alleles was confirmed by restriction digestion of small pool PCR products, and allele structures were determined by PacBio sequencing. Each was different, but all contained CCG repeats close to the 3′-end of the repeat expansion. All other family members had inherited pure CTG repeats. The variant repeat-containing alleles were more stable in the blood than pure alleles of similar length, which may in part account for the mild symptoms observed in all three individuals. This emphasises the importance of somatic instability as a disease mechanism in DM1. Further, since patients with variant repeats may have unusually mild symptoms, identification of these individuals has important implications for genetic counselling and for patient stratification in DM1 clinical trials.


July 19, 2019

A Borrelia burgdorferi mini-vls system that undergoes antigenic switching in mice: investigation of the role of plasmid topology and the long inverted repeat.

Borrelia burgdorferi evades the host immune system by switching the surface antigen. VlsE, in a process known as antigenic variation. The DNA mechanisms and genetic elements present on the vls locus that participate in the switching process remain to be elucidated. Manipulating the vls locus has been difficult due to its instability on Escherichia coli plasmids. In this study, we generated for the first time a mini-vls system composed of a single silent vlsE variable region (silent cassette 2) through the vlsE gene by performing some cloning steps directly in a highly transformable B. burgdorferi strain. Variants of the mini system were constructed with or without the long inverted repeat (IR) located upstream of vlsE and on both circular and linear plasmids to investigate the importance of the IR and plasmid topology on recombinational switching at vlsE. Amplicon sequencing using PacBio long read technology and analysis of the data with our recently reported pipeline and VAST software showed that the system undergoes switching in mice in both linear and circular versions and that the presence of the hairpin does not seem to be crucial in the linear version, however it is required when the topology is circular.© 2018 John Wiley & Sons Ltd.


July 19, 2019

Comparison between complete genomes of an isolate of Pseudomonas syringae pv. actinidiae from Japan and a New Zealand isolate of the pandemic.

The modern pandemic of the bacterial kiwifruit pathogen Pseudomonas syringae pv actinidiae (Psa) is caused by a particular Psa lineage. To better understand the genetic basis of the virulence of this lineage, we compare the completely assembled genome of a pandemic New Zealand strain with that of the Psa type strain first isolated in Japan in 1983. Aligning the two genomes shows numerous translocations, constrained so as to retain the appropriate orientation of the Architecture Imparting Sequences (AIMs). There are several large horizontally acquired regions, some of which include Type I, Type II or Type III restriction systems. The activity of these systems is reflected in the methylation patterns of the two strains. The pandemic strain carries an Integrative Conjugative Element (ICE) located at a tRNA-Lys site. Two other complex elements are also present at tRNA-Lys sites in the genome. These elements are derived from ICE but have now acquired some alternative secretion function. There are numerous types of mobile element in the two genomes. Analysis of these elements reveals no evidence of recombination between the two Psa lineages.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.