September 22, 2019

Mosaic structure as the main feature of Mycobacterium bovis BCG genomes

Background: The genome stability of the attenuated live BCG vaccine, which prevents the acute forms of childhood tuberculosis, is an important aspect of vaccine production. The purpose of our study was a whole genome comparative analysis of BCG sub-strains and identification of potential triggers of sub-strains’ transition. Results: Genomes of three BCG Russia seed lots (1963, 1982, and 2006) have been sequenced, and the stability of vaccine sub-strain genomes has been confirmed. A comparative genome analysis of nine Mycobacterium bovis BCG and three M. bovis strains revealed their specific genome features associated with prophage profiles. A number of prophage-coded homologs to Caudovirales ORFs were common to all BCG genomes. Prophage profiles of BCG Tice and BCG Montreal genomes were unique and coded homologs to herpes viruses ORFs. The data of phylogenetic analysis of BCG sub-strain groups based on whole genome sequences and genome restriction maps were in congruence with prophage profiles. Only fragmentary similarity of specific prophage sequences of BCG Tice, BCG Montreal, and BCG Russia 368 was observed in pair-wise alignments, suggesting the impact of prophages on the mosaic structure of genomes. Conclusions: The whole genome sequencing approach is essential for genomes with mosaic structure, harboring numerous prophage sequences. Tools for prophage search are effective instruments in this analysis.


September 22, 2019

Long-read whole genome sequencing and comparative analysis of six strains of the human pathogen Orientia tsutsugamushi.

Orientia tsutsugamushi is a clinically important but neglected obligate intracellular bacterial pathogen of the Rickettsiaceae family that causes the potentially life-threatening human disease scrub typhus. In contrast to the genome reduction seen in many obligate intracellular bacteria, early genetic studies of Orientia have revealed one of the most repetitive bacterial genomes sequenced to date. The dramatic expansion of mobile elements has hampered efforts to generate complete genome sequences using short read sequencing methodologies, and consequently there have been few studies of the comparative genomics of this neglected species. We report new high-quality genomes of O. tsutsugamushi, generated using PacBio single molecule long read sequencing, for six strains: Karp, Kato, Gilliam, TA686, UT76 and UT176. In comparative genomics analyses of these strains together with existing reference genomes from Ikeda and Boryong strains, we identify a relatively small core genome of 657 genes, grouped into core gene islands and separated by repeat regions, and use the core genes to infer the first whole-genome phylogeny of Orientia. Complete assemblies of multiple Orientia genomes verify initial suggestions that these are remarkable organisms. They have larger genomes compared with most other Rickettsiaceae, with widespread amplification of repeat elements and massive chromosomal rearrangements between strains. At the gene level, Orientia has a relatively small set of universally conserved genes, similar to other obligate intracellular bacteria, and the relative expansion in genome size can be accounted for by gene duplication and repeat amplification. Our study demonstrates the utility of long read sequencing to investigate complex bacterial genomes and characterise genomic variation.


September 22, 2019

Improved de novo genome assembly and analysis of the Chinese cucurbit Siraitia grosvenorii, also known as monk fruit or luo-han-guo.

Luo-han-guo (Siraitia grosvenorii), also called monk fruit, is a member of the Cucurbitaceae family. Monk fruit has become an important area for research because of the pharmacological and economic potential of its noncaloric, extremely sweet components (mogrosides). It is also commonly used in traditional Chinese medicine for the treatment of lung congestion, sore throat, and constipation. Recently, a single reference genome became available for monk fruit, assembled from 36.9x genome coverage reads via Illumina sequencing platforms. This genome assembly has a relatively short (34.2 kb) contig N50 length and lacks integrated annotations. These drawbacks make it difficult to use as a reference in assembling transcriptomes and discovering novel functional genes. Here, we offer a new high-quality draft of the S. grosvenorii genome assembled using 31 Gb (~73.8x) long single molecule real time sequencing reads and polished with ~50 Gb Illumina paired-end reads. The final genome assembly is approximately 469.5 Mb, with a contig N50 length of 432,384 bp, representing a 12.6-fold improvement. We further annotated 237.3 Mb of repetitive sequence and 30,565 consensus protein coding genes with combined evidence. Phylogenetic analysis showed that S. grosvenorii diverged from members of the Cucurbitaceae family approximately 40.9 million years ago. With comprehensive transcriptomic analysis and differential expression testing, we identified 4,606 up-regulated genes in the early fruit compared to the leaf, a number of which were linked to metabolic pathways regulating fruit development and ripening. The availability of this new monk fruit genome assembly, as well as the annotations, will facilitate the discovery of new functional genes and the genetic improvement of monk fruit.


September 22, 2019

Diversity of hepatitis E virus genotype 3

Summary: Hepatitis E virus genotype 3 (HEV-3) can lead to chronic infection in immunocompromised patients, and ribavirin is the treatment of choice. Recently, mutations in the polymerase gene have been associated with ribavirin failure but their frequency before treatment according to HEV-3 subtypes has not been studied on a large data set. We used single-molecule real-time sequencing technology to sequence 115 new complete genomes of HEV-3 infecting French patients. We analyzed phylogenetic relationships, the length of the polyproline region, and mutations in the HEV polymerase gene. Eighty-five (74%) were in the clade HEV-3efg, 28 (24%) in HEV-3chi clade, and 2 (2%) in HEV-3ra clade. Using automated partitioning of maximum likelihood phylogenetic trees, complete genomes were classified into subtypes. Polyproline region length differs within HEV-3 clades (from 189 to 315 nt). Investigating mutations in the polymerase gene, distinct polymorphisms between HEV-3 subtypes were found (G1634R in 95% of HEV-3e, G1634K in 56% of HEV-3ra, and V1479I in all HEV-3efg, clade HEV-3ra, and HEV-3k strains). Subtype-specific polymorphisms in the HEV-3 polymerase have been identified. Our study provides new complete genome sequences of HEV-3 that could be useful for comparing strains circulating in humans and the animal reservoir.


September 22, 2019

De novo genome assembly of Oryza granulata reveals rapid genome expansion and adaptive evolution

The wild relatives of rice have adapted to different ecological environments and constitute a useful reservoir of agronomic traits for genetic improvement. Here we present the ~777 Mb de novo assembled genome sequence of Oryza granulata. Recent bursts of long-terminal repeat retrotransposons, especially RIRE2, led to a rapid twofold increase in genome size after O. granulata speciation. Universal centromeric tandem repeats are absent within its centromeres, while gypsy-type LTRs constitute the main centromere-specific repetitive elements. A total of 40,116 protein-coding genes were predicted in O. granulata, which is close to that of Oryza sativa. Both the copy number and function of genes involved in photosynthesis and energy production have undergone positive selection during the evolution of O. granulata, which might have facilitated its adaptation to the low light habitats. Together, our findings reveal the rapid genome expansion, distinctive centromere organization, and adaptive evolution of O. granulata.


September 22, 2019

Comparative genomics of Spiraeoideae-infecting Erwinia amylovora strains provides novel insight to genetic diversity and identifies the genetic basis of a low-virulence strain.

Erwinia amylovora is the causal agent of fire blight, one of the most devastating diseases of apple and pear. Erwinia amylovora is thought to have originated in North America and has now spread to at least 50 countries worldwide. An understanding of the diversity of the pathogen population and the transmission to different geographical regions is important for the future mitigation of this disease. In this research, we performed an expanded comparative genomic study of the Spiraeoideae-infecting (SI) E. amylovora population in North America and Europe. We discovered that, although still highly homogeneous, the genetic diversity of 30 E. amylovora genomes examined was about 30 times higher than previously determined. These isolates belong to four distinct clades, three of which display geographical clustering and one of which contains strains from various geographical locations (‘Widely Prevalent’ clade). Furthermore, we revealed that strains from the Widely Prevalent clade displayed a higher level of recombination with strains from a clade strictly from the eastern USA, which suggests that the Widely Prevalent clade probably originated from the eastern USA before it spread to other locations. Finally, we detected variations in virulence in the SI E. amylovora strains on immature pear, and identified the genetic basis of one of the low-virulence strains as being caused by a single nucleotide polymorphism in hfq, a gene encoding an important virulence regulator. Our results provide insights into the population structure, distribution and evolution of SI E. amylovora in North America and Europe. © 2017 BSPP AND JOHN WILEY & SONS LTD.


September 22, 2019

RAD sequencing and a hybrid Antarctic fur seal genome assembly reveal rapidly decaying linkage disequilibrium, global population structure and evidence for inbreeding.

Recent advances in high throughput sequencing have transformed the study of wild organisms by facilitating the generation of high quality genome assemblies and dense genetic marker datasets. These resources have the potential to significantly advance our understanding of diverse phenomena at the level of species, populations and individuals, ranging from patterns of synteny through rates of linkage disequilibrium (LD) decay and population structure to individual inbreeding. Consequently, we used PacBio sequencing to refine an existing Antarctic fur seal (Arctocephalus gazella) genome assembly and genotyped 83 individuals from six populations using restriction site associated DNA (RAD) sequencing. The resulting hybrid genome comprised 6,169 scaffolds with an N50 of 6.21 Mb and provided clear evidence for the conservation of large chromosomal segments between the fur seal and dog (Canis lupus familiaris). Focusing on the most extensively sampled population of South Georgia, we found that LD decayed rapidly, reaching the background level by around 400 kb, consistent with other vertebrates but at odds with the notion that fur seals experienced a strong historical bottleneck. We also found evidence for population structuring, with four main Antarctic island groups being resolved. Finally, appreciable variance in individual inbreeding could be detected, reflecting the strong polygyny and site fidelity of the species. Overall, our study contributes important resources for future genomic studies of fur seals and other pinnipeds while also providing a clear example of how high throughput sequencing can generate diverse biological insights at multiple levels of organization. Copyright © 2018 Humble et al.


September 22, 2019

Two ancestral genes shaped the Xanthomonas campestris TAL effector gene repertoire.

Xanthomonas transcription activator-like effectors (TALEs) are injected inside plant cells to promote host susceptibility by enhancing transcription of host susceptibility genes. TALE-encoding (tal) genes were thought to be absent from Brassicaceae-infecting Xanthomonas campestris (Xc) genomes based on four reference genomic sequences. We discovered tal genes in 26 of 49 Xc strains isolated worldwide and used a combination of single molecule real time (SMRT) and tal amplicon sequencing to yield a near-complete description of the TALEs found in Xc (Xc TALome). The 53 sequenced tal genes encode 21 distinct DNA binding domains that sort into seven major DNA binding specificities. In silico analysis of the Brassica rapa promoterome identified a repertoire of predicted TALE targets, five of which were experimentally validated using quantitative reverse transcription polymerase chain reaction. The Xc TALome shows multiple signs of DNA rearrangements that probably drove its evolution from two ancestral tal genes. We discovered that Tal12a and Tal15a of Xcc strain Xca5 contribute together in the development of disease symptoms on susceptible B. oleracea var. botrytis cv Clovis. This large and polymorphic repertoire of TALEs opens novel perspectives for elucidating TALE-mediated susceptibility of Brassicaceae to black rot disease and for understanding the molecular processes underlying TALE evolution. © 2018 The Authors New Phytologist © 2018 New Phytologist Trust.


September 22, 2019

Redefinition and unification of the SXT/R391 family of integrative and conjugative elements.

Integrative and conjugative elements (ICEs) of the SXT/R391 family are key drivers of the spread of antibiotic resistance in Vibrio cholerae, the infectious agent of cholera, and other pathogenic bacteria. The SXT/R391 family of ICEs was defined based on the conservation of a core set of 52 genes and site-specific integration into the 5′ end of the chromosomal gene prfC. Hence, the integrase gene int has been intensively used as a marker to detect SXT/R391 ICEs in clinical isolates. ICEs sharing most core genes but differing by their integration site and integrase gene have been recently reported and excluded from the SXT/R391 family. Here we explored the prevalence and diversity of atypical ICEs in GenBank databases and their relationship with typical SXT/R391 ICEs. We found atypical ICEs in V. cholerae isolates that predate the emergence and expansion of typical SXT/R391 ICEs in the mid-1980s in seventh-pandemic toxigenic V. cholerae strains O1 and O139. Our analyses revealed that while atypical ICEs are not associated with antibiotic resistance genes, they often carry cation efflux pumps, suggesting heavy metal resistance. Atypical ICEs constitute a polyphyletic group likely because of occasional recombination events with typical ICEs. Furthermore, we show that the alternative integration and excision genes of atypical ICEs remain under the control of SetCD, the main activator of the conjugative functions of SXT/R391 ICEs. Together, these observations indicate that substitution of the integration/excision module and change of specificity of integration do not preclude atypical ICEs from inclusion into the SXT/R391 family. IMPORTANCE: Vibrio cholerae is the causative agent of cholera, an acute intestinal infection that remains to this day a world public health threat. Integrative and conjugative elements (ICEs) of the SXT/R391 family have played a major role in spreading antimicrobial resistance in seventh-pandemic V. cholerae but also in several species of Enterobacteriaceae. Most epidemiological surveys use the integrase gene as a marker to screen for SXT/R391 ICEs in clinical or environmental strains. With the recent reports of closely related elements that carry an alternative integrase gene, it became urgent to investigate whether ICEs that have been left out of the family are a liability for the accuracy of such screenings. In this study, based on comparative genomics, we broaden the SXT/R391 family of ICEs to include atypical ICEs that are often associated with heavy metal resistance. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Biosynthesis of abscisic acid in fungi: identification of a sesquiterpene cyclase as the key enzyme in Botrytis cinerea.

While abscisic acid (ABA) is known as a hormone produced by plants through the carotenoid pathway, a small number of phytopathogenic fungi are also able to produce this sesquiterpene but they use a distinct pathway that starts with the cyclization of farnesyl diphosphate (FPP) into 2Z,4E-α-ionylideneethane which is then subjected to several oxidation steps. To identify the sesquiterpene cyclase (STC) responsible for the biosynthesis of ABA in fungi, we conducted a genomic approach in Botrytis cinerea. The genome of the ABA-overproducing strain ATCC58025 was fully sequenced and five STC-coding genes were identified. Among them, Bcstc5 exhibits an expression profile concomitant with ABA production. Gene inactivation, complementation and chemical analysis demonstrated that BcStc5/BcAba5 is the key enzyme responsible for the key step of ABA biosynthesis in fungi. Unlike what is observed for most of the fungal secondary metabolism genes, the key enzyme-coding gene Bcstc5/Bcaba5 is not clustered with the other biosynthetic genes, i.e., Bcaba1 to Bcaba4 that are responsible for the oxidative transformation of 2Z,4E-α-ionylideneethane. Finally, our study revealed that the presence of the Bcaba genes among Botrytis species is rare and that the majority of them do not possess the ability to produce ABA. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019

A molecular window into the biology and epidemiology of Pneumocystis spp.

Pneumocystis, a unique atypical fungus with an elusive lifestyle, has had an important medical history. It came to prominence as an opportunistic pathogen that not only can cause life-threatening pneumonia in patients with HIV infection and other immunodeficiencies but also can colonize the lungs of healthy individuals from a very early age. The genus Pneumocystis includes a group of closely related but heterogeneous organisms that have a worldwide distribution, have been detected in multiple mammalian species, are highly host species specific, inhabit the lungs almost exclusively, and have never convincingly been cultured in vitro, making Pneumocystis a fascinating but difficult-to-study organism. Improved molecular biologic methodologies have opened a new window into the biology and epidemiology of Pneumocystis. Advances include an improved taxonomic classification, identification of an extremely reduced genome and concomitant inability to metabolize and grow independent of the host lungs, insights into its transmission mode, recognition of its widespread colonization in both immunocompetent and immunodeficient hosts, and utilization of strain variation to study drug resistance, epidemiology, and outbreaks of infection among transplant patients. This review summarizes these advances and also identifies some major questions and challenges that need to be addressed to better understand Pneumocystis biology and its relevance to clinical care. Copyright © 2018 American Society for Microbiology.


September 22, 2019

A high-quality genome sequence of Rosa chinensis to elucidate ornamental traits.

Rose is the world’s most important ornamental plant, with economic, cultural and symbolic value. Roses are cultivated worldwide and sold as garden roses, cut flowers and potted plants. Roses are outbred and can have various ploidy levels. Our objectives were to develop a high-quality reference genome sequence for the genus Rosa by sequencing a doubled haploid, combining long and short reads, and anchoring to a high-density genetic map, and to study the genome structure and genetic basis of major ornamental traits. We produced a doubled haploid rose line (‘HapOB’) from Rosa chinensis ‘Old Blush’ and generated a rose genome assembly anchored to seven pseudo-chromosomes (512 Mb with N50 of 3.4 Mb and 564 contigs). The length of 512 Mb represents 90.1-96.1% of the estimated haploid genome size of rose. Of the assembly, 95% is contained in only 196 contigs. The anchoring was validated using high-density diploid and tetraploid genetic maps. We delineated hallmark chromosomal features, including the pericentromeric regions, through annotation of transposable element families and positioned centromeric repeats using fluorescent in situ hybridization. The rose genome displays extensive synteny with the Fragaria vesca genome, and we delineated only two major rearrangements. Genetic diversity was analysed using resequencing data of seven diploid and one tetraploid Rosa species selected from various sections of the genus. Combining genetic and genomic approaches, we identified potential genetic regulators of key ornamental traits, including prickle density and the number of flower petals. A rose APETALA2/TOE homologue is proposed to be the major regulator of petal number in rose. This reference sequence is an important resource for studying polyploidization, meiosis and developmental processes, as we demonstrated for flower and prickle development. It will also accelerate breeding through the development of molecular markers linked to traits, the identification of the genes underlying them and the exploitation of synteny across Rosaceae.


September 22, 2019

High-quality assembly of the reference genome for scarlet sage, Salvia splendens, an economically important ornamental plant.

Salvia splendens Ker-Gawler, scarlet or tropical sage, is a tender herbaceous perennial widely introduced and seen in public gardens all over the world. With few molecular resources, breeding is still restricted to traditional phenotypic selection, and the genetic mechanisms underlying phenotypic variation remain unknown. Hence, a high-quality reference genome will be very valuable for marker-assisted breeding, genome editing, and molecular genetics. We generated 66 Gb and 37 Gb of raw DNA sequences, respectively, from whole-genome sequencing of a largely homozygous scarlet sage inbred line using Pacific Biosciences (PacBio) single-molecule real-time and Illumina HiSeq sequencing platforms. The PacBio de novo assembly yielded a final genome with a scaffold N50 size of 3.12 Mb and a total length of 808 Mb. The repetitive sequences identified accounted for 57.52% of the genome sequence, and ~54,008 protein-coding genes were predicted collectively with ab initio and homology-based gene prediction from the masked genome. The divergence time between S. splendens and Salvia miltiorrhiza was estimated at 28.21 million years ago (Mya). Moreover, 3,797 species-specific genes and 1,187 expanded gene families were identified for the scarlet sage genome. We provide the first genome sequence and gene annotation for the scarlet sage. The availability of these resources will be of great importance for further breeding strategies, genome editing, and comparative genomics among related species.


September 22, 2019

Evidence of non-tandemly repeated rDNAs and their intragenomic heterogeneity in Rhizophagus irregularis

Arbuscular mycorrhizal fungus (AMF) species are some of the most widespread symbionts of land plants. Our much improved reference genome assembly of a model AMF, Rhizophagus irregularis DAOM-181602 (total contigs = 210), facilitated a discovery of repetitive elements with unusual characteristics. R. irregularis has only ten or 11 copies of complete 45S rDNAs, whereas the general eukaryotic genome has tens to thousands of rDNA copies. R. irregularis rDNAs are highly heterogeneous and lack a tandem repeat structure. These findings provide evidence for the hypothesis that rDNA heterogeneity depends on the lack of tandem repeat structures. RNA-Seq analysis confirmed that all rDNA variants are actively transcribed. Observed rDNA/rRNA polymorphisms may modulate translation by using different ribosomes depending on biotic and abiotic interactions. The non-tandem repeat structure and intragenomic heterogeneity of AMF rDNA/rRNA may facilitate successful adaptation to various environmental conditions, increasing host compatibility of these symbiotic fungi.


September 22, 2019

Molecular characterization of invasive meningococcal isolates in Burkina Faso as the relative importance of serogroups X and W increases, 2008-2012.

Neisseria meningitidis serogroup A disease in Burkina Faso has greatly decreased following introduction of a meningococcal A conjugate vaccine in 2010, yet other serogroups continue to pose a risk of life-threatening disease. Capsule switching among epidemic-associated serogroup A N. meningitidis strains could allow these lineages to persist despite vaccination. The introduction of new strains at the national or sub-national levels could affect the epidemiology of disease. Isolates collected from invasive meningococcal disease in Burkina Faso between 2008 and 2012 were characterized by serogrouping and molecular typing. Genome sequences from a subset of isolates were used to infer phylogenetic relationships. The ST-5 clonal complex (CC5) was identified only among serogroup A isolates, which were rare after 2010. CC181 and CC11 were the most common clonal complexes after 2010, having serogroup X and W isolates, respectively. Whole-genome phylogenetic analysis showed that the CC181 isolates collected during and after the epidemic of 2010 formed a single clade that was closely related to isolates collected in Niger during 2005 and Burkina Faso during 2007. Geographic population structure was identified among the CC181 isolates, where pairs of isolates collected from the same region of Burkina Faso within a single year had less phylogenetic diversity than the CC181 isolate collection as a whole. However, the reduction of phylogenetic diversity within a region did not extend across multiple years. Instead, CC181 isolates collected during the same year had lower than average diversity, even when collected from different regions, indicating geographic mixing of strains across years. The CC11 isolates were primarily collected during the epidemic of 2012, with sparse sampling during 2011. These isolates belong to a clade that includes previously described isolates collected in Burkina Faso, Mali, and Niger from 2011 to 2015. Similar to CC181, reduced phylogenetic diversity was observed among CC11 isolate pairs collected from the same regions during a single year. The population of disease-associated N. meningitidis strains within Burkina Faso was highly dynamic between 2008 and 2012, reflecting both vaccine-imposed selection against serogroup A strains and potentially complex clonal waves of serogroup X and serogroup W strains.

