Menu
April 21, 2020

Whole Genome Sequencing of the Mutamouse Model Reveals Strain- and Colony-Level Variation, and Genomic Features of the Transgene Integration Site.

The MutaMouse transgenic rodent model is widely used for assessing in vivo mutagenicity. Here, we report the characterization of MutaMouse’s whole genome sequence and its genetic variants compared to the C57BL/6 reference genome. High coverage (>50X) next-generation sequencing (NGS) of whole genomes from multiple MutaMouse animals from the Health Canada (HC) colony showed ~5 million SNVs per genome, ~20% of which are putatively novel. Sequencing of two animals from a geographically separated colony at Covance indicated that, over the course of 23 years, each colony accumulated 47,847 (HC) and 17,677 (Covance) non-parental homozygous single nucleotide variants. We found no novel nonsense or missense mutations that impair the MutaMouse response to genotoxic agents. Pairing sequencing data with array comparative genomic hybridization (aCGH) improved the accuracy and resolution of copy number variants (CNVs) calls and identified 300 genomic regions with CNVs. We also used long-read sequence technology (PacBio) to show that the transgene integration site involved a large deletion event with multiple inversions and rearrangements near a retrotransposon. The MutaMouse genome gives important genetic context to studies using this model, offers insight on the mechanisms of structural variant formation, and contributes a framework to analyze aCGH results alongside NGS data.


April 21, 2020

Genome analysis and genetic transformation of a water surface-floating microalga Chlorococcum sp. FFG039.

Microalgal harvesting and dewatering are the main bottlenecks that need to be overcome to tap the potential of microalgae for production of valuable compounds. Water surface-floating microalgae form robust biofilms, float on the water surface along with gas bubbles entrapped under the biofilms, and have great potential to overcome these bottlenecks. However, little is known about the molecular mechanisms involved in the water surface-floating phenotype. In the present study, we analysed the genome sequence of a water surface-floating microalga Chlorococcum sp. FFG039, with a next generation sequencing technique to elucidate the underlying mechanisms. Comparative genomics study with Chlorococcum sp. FFG039 and other non-floating green microalgae revealed some of the unique gene families belonging to this floating microalga, which may be involved in biofilm formation. Furthermore, genetic transformation of this microalga was achieved with an electroporation method. The genome information and transformation techniques presented in this study will be useful to obtain molecular insights into the water surface-floating phenotype of Chlorococcum sp. FFG039.


April 21, 2020

Genetic basis of functional variability in adhesion G protein-coupled receptors.

The enormous sizes of adhesion G protein-coupled receptors (aGPCRs) go along with complex genomic exon-intron architectures giving rise to multiple mRNA variants. There is a need for a comprehensive catalog of aGPCR variants for proper evaluation of the complex functions of aGPCRs found in structural, in vitro and animal model studies. We used an established bioinformatics pipeline to extract, quantify and visualize mRNA variants of aGPCRs from deeply sequenced transcriptomes. Data analysis showed that aGPCRs have multiple transcription start sites even within introns and that tissue-specific splicing is frequent. On average, 19 significantly expressed transcript variants are derived from a given aGPCR gene. The domain architecture of the N terminus encoded by transcript variants often differs and N termini without or with an incomplete seven-helix transmembrane anchor as well as separate seven-helix transmembrane domains are frequently derived from aGPCR genes. Experimental analyses of selected aGPCR transcript variants revealed marked functional differences. Our analysis has an impact on a rational design of aGPCR constructs for structural analyses and gene-deficient mouse lines and provides new support for independent functions of both, the large N terminus and the transmembrane domain of aGPCRs.


April 21, 2020

The genome sequence of Streptomyces rochei 7434AN4, which carries a linear chromosome and three characteristic linear plasmids.

Streptomyces rochei 7434AN4 produces two structurally unrelated polyketide antibiotics, lankacidin and lankamycin, and carries three linear plasmids, pSLA2-L (211?kb), -M (113?kb), and -S (18?kb), whose nucleotide sequences were previously reported. The complete nucleotide sequence of the S. rochei chromosome has now been determined using the long-read PacBio RS-II sequencing together with short-read Illumina Genome Analyzer IIx sequencing and Roche 454 pyrosequencing techniques. The assembled sequence revealed an 8,364,802-bp linear chromosome with a high G?+?C content of 71.7% and 7,568 protein-coding ORFs. Thus, the gross genome size of S. rochei 7434AN4 was confirmed to be 8,706,406?bp including the three linear plasmids. Consistent with our previous study, a tap-tpg gene pair, which is essential for the maintenance of a linear topology of Streptomyces genomes, was not found on the chromosome. Remarkably, the S. rochei chromosome contains seven ribosomal RNA (rrn) operons (16S-23S-5S), although Streptomyces species generally contain six rrn operons. Based on 2ndFind and antiSMASH platforms, the S. rochei chromosome harbors at least 35 secondary metabolite biosynthetic gene clusters, including those for the 28-membered polyene macrolide pentamycin and the azoxyalkene compound KA57-A.


April 21, 2020

Genome and plasmid diversity of Extended-Spectrum ß-Lactamase-producing Escherichia coli ST131 – tracking phylogenetic trajectories with Bayesian inference.

Clonal lineages of ESBL (Extended-Spectrum ß-Lactamase)-producing E. coli belonging to sequence type 131 (ST131) have disseminated globally during the last 30 years, leading to an increased prevalence of resistance to fluoroquinolones and extended-spectrum cephalosporins in clinical isolates of E. coli. We aimed to study if Swedish ESBL-producing ST131 isolates originated from single or multiple introductions to the population by assessing the amount of genetic variation, on chromosomal and plasmid level, between Swedish and international E. coli ST131. Bayesian inference of Swedish E. coli ST131 isolates (n?=?29), sequenced using PacBio RSII, together with an international ST131 dataset showed that the Swedish isolates were part of the international ST131 A, C1 and C2 clades. Highly conserved plasmids were identified in three clusters although they were separated by several years, which indicates a strong co-evolution between some ST131 lineages and specific plasmids. In conclusion, the tight clonal relationship observed within the ST131 clades, together with highly conserved plasmids, challenges investigation of strain transmission events. A combination of few SNPs on a genome-wide scale and an epidemiological temporospatial link, are needed to track the spread of the ST131 subclones.


April 21, 2020

SMRT sequencing reveals differential patterns of methylation in two O111:H- STEC isolates from a hemolytic uremic syndrome outbreak in Australia.

In 1995 a severe haemolytic-uremic syndrome (HUS) outbreak in Adelaide occurred. A recent genomic analysis of Shiga toxigenic Escherichia coli (STEC) O111:H- strains 95JB1 and 95NR1 from this outbreak found that the more virulent isolate, 95NR1, harboured two additional copies of the Shiga toxin 2 (Stx2) genes encoded within prophage regions. The structure of the Stx2-converting prophages could not be fully resolved using short-read sequence data alone and it was not clear if there were other genomic differences between 95JB1 and 95NR1. In this study we have used Pacific Biosciences (PacBio) single molecule real-time (SMRT) sequencing to characterise the genome and methylome of 95JB1 and 95NR1. We completely resolved the structure of all prophages including two, tandemly inserted, Stx2-converting prophages in 95NR1 that were absent from 95JB1. Furthermore we defined all insertion sequences and found an additional IS1203 element in the chromosome of 95JB1. Our analysis of the methylome of 95NR1 and 95JB1 identified hemi-methylation of a novel motif (5′-CTGCm6AG-3′) in more than 4000 sites in the 95NR1 genome. These sites were entirely unmethylated in the 95JB1 genome, and included at least 177 potential promoter regions that could contribute to regulatory differences between the strains. IS1203 mediated deactivation of a novel type IIG methyltransferase in 95JB1 is the likely cause of the observed differential patterns of methylation between 95NR1 and 95JB1. This study demonstrates the capability of PacBio SMRT sequencing to resolve complex prophage regions and reveal the genetic and epigenetic heterogeneity within a clonal population of bacteria.


April 21, 2020

Differences in resource use lead to coexistence of seed-transmitted microbial populations.

Seeds are involved in the vertical transmission of microorganisms in plants and act as reservoirs for the plant microbiome. They could serve as carriers of pathogens, making the study of microbial interactions on seeds important in the emergence of plant diseases. We studied the influence of biological disturbances caused by seed transmission of two phytopathogenic agents, Alternaria brassicicola Abra43 (Abra43) and Xanthomonas campestris pv. campestris 8004 (Xcc8004), on the structure and function of radish seed microbial assemblages, as well as the nutritional overlap between Xcc8004 and the seed microbiome, to find seed microbial residents capable of outcompeting this pathogen. According to taxonomic and functional inference performed on metagenomics reads, no shift in structure and function of the seed microbiome was observed following Abra43 and Xcc8004 transmission. This lack of impact derives from a limited overlap in nutritional resources between Xcc8004 and the major bacterial populations of radish seeds. However, two native seed-associated bacterial strains belonging to Stenotrophomonas rhizophila displayed a high overlap with Xcc8004 regarding the use of resources; they might therefore limit its transmission. The strategy we used may serve as a foundation for the selection of seed indigenous bacterial strains that could limit seed transmission of pathogens.


April 21, 2020

Insight into the microbial world of Bemisia tabaci cryptic species complex and its relationships with its host.

The 37 currently recognized Bemisia tabaci cryptic species are economically important species and contain both primary and secondary endosymbionts, but their diversity has never been mapped systematically across the group. To achieve this, PacBio sequencing of full-length bacterial 16S rRNA gene amplicons was carried out on 21 globally collected species in the B. tabaci complex, and two samples from B. afer were used here as outgroups. The microbial diversity was first explored across the major lineages of the whole group and 15 new putative bacterial sequences were observed. Extensive comparison of our results with previous endosymbiont diversity surveys which used PCR or multiplex 454 pyrosequencing platforms showed that the bacterial diversity was underestimated. To validate these new putative bacteria, one of them (Halomonas) was first confirmed to be present in MED B. tabaci using Hiseq2500 and FISH technologies. These results confirmed PacBio is a reliable and informative venue to reveal the bacterial diversity of insects. In addition, many new secondary endosymbiotic strains of Rickettsia and Arsenophonus were found, increasing the known diversity in these groups. For the previously described primary endosymbionts, one Portiera Operational Taxonomic Units (OTU) was shared by all B. tabaci species. The congruence of the B. tabaci-host and Portiera phylogenetic trees provides strong support for the hypothesis that primary endosymbionts co-speciated with their hosts. Likewise, a comparison of bacterial alpha diversities, Principal Coordinate Analysis, indistinct endosymbiotic communities harbored by different species and the co-divergence analyses suggest a lack of association between overall microbial diversity with cryptic species, further indicate that the secondary endosymbiont-mediated speciation is unlikely to have occurred in the B. tabaci species group.


April 21, 2020

Complete assembly of the Leishmania donovani (HU3 strain) genome and transcriptome annotation.

Leishmania donovani is a unicellular parasite that causes visceral leishmaniasis, a fatal disease in humans. In this study, a complete assembly of the genome of L. donovani is provided. Apart from being the first published genome of this strain (HU3), this constitutes the best assembly for an L. donovani genome attained to date. The use of a combination of sequencing platforms enabled to assemble, without any sequence gap, the 36 chromosomes for this species. Additionally, based on this assembly and using RNA-seq reads derived from poly-A?+?RNA, the transcriptome for this species, not yet available, was delineated. Alternative SL addition sites and heterogeneity in the poly-A addition sites were commonly observed for most of the genes. After a complete annotation of the transcriptome, 2,410 novel transcripts were defined. Additionally, the relative expression for all transcripts present in the promastigote stage was determined. Events of cis-splicing have been documented to occur during the maturation of the transcripts derived from genes LDHU3_07.0430 and LDHU3_29.3990. The complete genome assembly and the availability of the gene models (including annotation of untranslated regions) are important pieces to understand how differential gene expression occurs in this pathogen, and to decipher phenotypic peculiarities like tissue tropism, clinical disease, and drug susceptibility.


April 21, 2020

High quality reference genomes for toxigenic and non-toxigenic Vibrio cholerae serogroup O139.

Toxigenic Vibrio cholerae of the O139 serogroup have been responsible for several large cholera epidemics in South Asia, and continue to be of clinical and historical significance today. This serogroup was initially feared to represent a new, emerging V. cholerae clone that would lead to an eighth cholera pandemic. However, these concerns were ultimately unfounded. The majority of clinically relevant V. cholerae O139 isolates are closely related to serogroup O1, biotype El Tor V. cholerae, and comprise a single sublineage of the seventh pandemic El Tor lineage. Although related, these V. cholerae serogroups differ in several fundamental ways, in terms of their O-antigen, capsulation phenotype, and the genomic islands found on their chromosomes. Here, we present four complete, high-quality genomes for V. cholerae O139, obtained using long-read sequencing. Three of these sequences are from toxigenic V. cholerae, and one is from a bacterium which, although classified serologically as V. cholerae O139, lacks the CTXf bacteriophage and the ability to produce cholera toxin. We highlight fundamental genomic differences between these isolates, the V. cholerae O1 reference strain N16961, and the prototypical O139 strain MO10. These sequences are an important resource for the scientific community, and will improve greatly our ability to perform genomic analyses of non-O1 V. cholerae in the future. These genomes also offer new insights into the biology of a V. cholerae serogroup that, from a genomic perspective, is poorly understood.


April 21, 2020

An integrated whole genome analysis of Mycobacterium tuberculosis reveals insights into relationship between its genome, transcriptome and methylome.

Human tuberculosis disease (TB), caused by Mycobacterium tuberculosis (Mtb), is a complex disease, with a spectrum of outcomes. Genomic, transcriptomic and methylation studies have revealed differences between Mtb lineages, likely to impact on transmission, virulence and drug resistance. However, so far no studies have integrated sequence-based genomic, transcriptomic and methylation characterisation across a common set of samples, which is critical to understand how DNA sequence and methylation affect RNA expression and, ultimately, Mtb pathogenesis. Here we perform such an integrated analysis across 22?M. tuberculosis clinical isolates, representing ancient (lineage 1) and modern (lineages 2 and 4) strains. The results confirm the presence of lineage-specific differential gene expression, linked to specific SNP-based expression quantitative trait loci: with 10 eQTLs involving SNPs in promoter regions or transcriptional start sites; and 12 involving potential functional impairment of transcriptional regulators. Methylation status was also found to have a role in transcription, with evidence of differential expression in 50 genes across lineage 4 samples. Lack of methylation was associated with three novel variants in mamA, likely to cause loss of function of this enzyme. Overall, our work shows the relationship of DNA sequence and methylation to RNA expression, and differences between ancient and modern lineages. Further studies are needed to verify the functional consequences of the identified mechanisms of gene expression regulation.


April 21, 2020

Characteristics and homogeneity of N6-methylation in human genomes.

A novel DNA modification, N-6 methylated deoxyadenosine (m6dA), has recently been discovered in eukaryotic genomes. Despite its low abundance in eukaryotes, m6dA is implicated in human diseases such as cancer. It is therefore important to precisely identify and characterize m6dA in the human genome. Here, we identify m6dA sites at nucleotide level, in different human cells, genome wide. We compare m6dA features between distinct human cells and identify m6dA characteristics in human genomes. Our data demonstrates for the first time that despite low m6dA abundance, the m6dA mark does often occur consistently at the same genomic location within a given human cell type, demonstrating m6dA homogeneity. We further show, for the first time, higher levels of m6dA homogeneity within one chromosome. Most m6dA are found on a single chromosome from a diploid sample, suggesting inheritance. Our transcriptome analysis not only indicates that human genes with m6dA are associated with higher RNA transcript levels but identifies allele-specific gene transcripts showing haplotype-specific m6dA methylation, which are implicated in different biological functions. Our analyses demonstrate the precision and consistency by which the m6dA mark occurs within the human genome, suggesting that m6dA marks are precisely inherited in humans.


April 21, 2020

Insight into the genome and brackish water adaptation strategies of toxic and bloom-forming Baltic Sea Dolichospermum sp. UHCC 0315.

The Baltic Sea is a shallow basin of brackish water in which the spatial salinity gradient is one of the most important factors contributing to species distribution. The Baltic Sea is infamous for its annual cyanobacterial blooms comprised of Nodularia spumigena, Aphanizomenon spp., and Dolichospermum spp. that cause harm, especially for recreational users. To broaden our knowledge of the cyanobacterial adaptation strategies for brackish water environments, we sequenced the entire genome of Dolichospermum sp. UHCC 0315, a species occurring not only in freshwater environments but also in brackish water. Comparative genomics analyses revealed a close association with Dolichospermum sp. UHCC 0090 isolated from a lake in Finland. The genome closure of Dolichospermum sp. UHCC 0315 unraveled a mixture of two subtypes in the original culture, and subtypes exhibited distinct buoyancy phenotypes. Salinity less than 3?g?L-1 NaCl enabled proper growth of Dolichospermum sp. UHCC 0315, whereas growth was arrested at moderate salinity (6?g?L-1 NaCl). The concentrations of toxins, microcystins, increased at moderate salinity, whereas RNA sequencing data implied that Dolichospermum remodeled its primary metabolism in unfavorable high salinity. Based on our results, the predicted salinity decrease in the Baltic Sea may favor toxic blooms of Dolichospermum spp.


April 21, 2020

An African Salmonella Typhimurium ST313 sublineage with extensive drug-resistance and signatures of host adaptation.

Bloodstream infections by Salmonella enterica serovar Typhimurium constitute a major health burden in sub-Saharan Africa (SSA). These invasive non-typhoidal (iNTS) infections are dominated by isolates of the antibiotic resistance-associated sequence type (ST) 313. Here, we report emergence of ST313 sublineage II.1 in the Democratic Republic of the Congo. Sublineage II.1 exhibits extensive drug resistance, involving a combination of multidrug resistance, extended spectrum ß-lactamase production and azithromycin resistance. ST313 lineage II.1 isolates harbour an IncHI2 plasmid we name pSTm-ST313-II.1, with one isolate also exhibiting decreased ciprofloxacin susceptibility. Whole genome sequencing reveals that ST313 II.1 isolates have accumulated genetic signatures potentially associated with altered pathogenicity and host adaptation, related to changes observed in biofilm formation and metabolic capacity. Sublineage II.1 emerged at the beginning of the 21st century and is involved in on-going outbreaks. Our data provide evidence of further evolution within the ST313 clade associated with iNTS in SSA.


April 21, 2020

A chromosome-level genome assembly of Cydia pomonella provides insights into chemical ecology and insecticide resistance.

The codling moth Cydia pomonella, a major invasive pest of pome fruit, has spread around the globe in the last half century. We generated a chromosome-level scaffold assembly including the Z chromosome and a portion of the W chromosome. This assembly reveals the duplication of an olfactory receptor gene (OR3), which we demonstrate enhances the ability of C. pomonella to exploit kairomones and pheromones in locating both host plants and mates. Genome-wide association studies contrasting insecticide-resistant and susceptible strains identify hundreds of single nucleotide polymorphisms (SNPs) potentially associated with insecticide resistance, including three SNPs found in the promoter of CYP6B2. RNAi knockdown of CYP6B2 increases C. pomonella sensitivity to two insecticides, deltamethrin and azinphos methyl. The high-quality genome assembly of C. pomonella informs the genetic basis of its invasiveness, suggesting the codling moth has distinctive capabilities and adaptive potential that may explain its worldwide expansion.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.