July 7, 2019

Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing.

Microsatellite expansion, such as trinucleotide repeat expansion (TRE), is known to cause a number of genetic diseases. Sanger sequencing and next-generation short-read sequencing are unable to interrogate TRE reliably. We developed a novel algorithm called RepeatHMM to estimate repeat counts from long-read sequencing data. Evaluation on simulation data, real amplicon sequencing data on two repeat expansion disorders, and whole-genome sequencing data generated by PacBio and Oxford Nanopore technologies showed superior performance over competing approaches. We concluded that long-read sequencing coupled with RepeatHMM can estimate repeat counts on microsatellites and can interrogate the “unsequenceable” genomic trinucleotide repeat disorders.
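To make concrete what a "repeat count" is, here is a minimal sketch that finds the longest uninterrupted run of a motif in a read by exact matching. This is not the RepeatHMM algorithm, which the abstract does not describe in detail; the function name `longest_repeat_run` and the toy sequences are hypothetical.

```python
import re


def longest_repeat_run(sequence, motif="CAG"):
    """Return the number of motif copies in the longest exact tandem run.

    Naive illustration only: it requires perfect copies of the motif,
    so a single base error splits a run in two.
    """
    pattern = re.compile(f"(?:{re.escape(motif.upper())})+")
    best = 0
    for match in pattern.finditer(sequence.upper()):
        best = max(best, len(match.group(0)) // len(motif))
    return best


# Hypothetical toy reads, not real data.
clean_read = "TTCAGCAGCAGCAGCAGTT"   # five perfect CAG copies
noisy_read = "TTCAGCAGCTGCAGCAGTT"   # same locus with one substitution (CAG -> CTG)

print(longest_repeat_run(clean_read))
print(longest_repeat_run(noisy_read))
```

On the noisy read the exact matcher reports a shorter run than on the clean one, because the substitution breaks the tract. Error-prone reads therefore call for an approach that tolerates mismatches and indels rather than exact matching.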


July 7, 2019

Hunting structural variants: Population by population

Until recently, most population-scale genome sequencing studies have focused on identifying single nucleotide variants (SNVs) to explore genetic differences between individuals. Like so many SNV-based genome-wide association studies, however, these efforts have had difficulty identifying causative genetic mechanisms underlying most complex functions. More and more, the genomics community has realised that structural variation is likely responsible for many of the traits and phenotypes that scientists have not been able to attribute to SNVs. This class of variants, defined as genetic differences of 50 bp or larger, accounts for most of the DNA sequence differences between any two people. Structural variants (SVs) are also already known to cause many common and rare diseases including ALS, schizophrenia, leukemia, Carney complex, and Huntington’s disease. Despite the importance of SVs, these larger variants have been understudied and underreported compared to their single-nucleotide counterparts. One reason is that they remain difficult to detect. Their length often means they cannot be fully spanned using short sequencing reads. They also often occur in highly repetitive or GC-rich regions of the genome, making them challenging targets. As such, this class of human genetic variation has remained vastly under-explored in global populations and is now ripe for discovery.


July 7, 2019

Convergence of plasmid architectures drives emergence of multi-drug resistance in a clonally diverse Escherichia coli population from a veterinary clinical care setting.

The purpose of this study was to determine the plasmid architecture and context of resistance genes in multi-drug resistant (MDR) Escherichia coli strains isolated from urinary tract infections in dogs. Illumina and single-molecule real-time (SMRT) sequencing were applied to assemble the complete genomes of E. coli strains associated with clinical urinary tract infections, which were either phenotypically MDR or drug susceptible. This revealed that multiple distinct families of plasmids were associated with building an MDR phenotype. Plasmid-mediated AmpC (CMY-2) beta-lactamase resistance was associated with a clonal group of IncI1 plasmids that has remained stable in isolates collected up to a decade apart. Other plasmids, in particular those with an IncF replicon type, contained other resistance gene markers, so that the emergence of these MDR strains was driven by the accumulation of multiple plasmids, up to 5 replicons in specific cases. This study indicates that vulnerable patients, often with complex clinical histories, provide a setting leading to the emergence of MDR E. coli strains in clonally distinct commensal backgrounds. While it is known that horizontally-transferred resistance supplements uropathogenic strains of E. coli such as ST131, our study demonstrates that the selection of an MDR phenotype in commensal E. coli strains can result in opportunistic infections in vulnerable patient populations. These strains provide a reservoir for the onward transfer of resistance alleles into more typically pathogenic strains and provide opportunities for the coalition of resistance and virulence determinants on plasmids as evidenced by the IncF replicons characterised in this study. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of Pseudomonas corrugata strain RM1-1-4, a stress protecting agent from the rhizosphere of an oilseed rape bait plant

Pseudomonas corrugata strain RM1-1-4 is a rhizosphere colonizer of oilseed rape. A previous study has shown that this motile, Gram-negative, non-sporulating bacterium is an effective stress protecting and biocontrol agent, which protects its hosts against abiotic and biotic stresses. Here, we announce and describe the complete genome sequence of P. corrugata RM1-1-4 consisting of a single 6.1 Mb circular chromosome that encodes 5189 protein coding genes and 85 RNA-only encoding genes. Genome analysis revealed genes predicting functions such as detoxifying mechanisms, stress inhibitors, exoproteases, lipoproteins or volatile components as well as rhizobactin siderophores and spermidine. Further analysis of its genome will help to identify traits promising for stress protection, biocontrol and plant growth promotion properties.


July 7, 2019

Hybrid de novo genome assembly and centromere characterization of the gray mouse lemur (Microcebus murinus).

The de novo assembly of repeat-rich mammalian genomes using only high-throughput short read sequencing data typically results in highly fragmented genome assemblies that limit downstream applications. Here, we present an iterative approach to hybrid de novo genome assembly that incorporates datasets stemming from multiple genomic technologies and methods. We used this approach to improve the gray mouse lemur (Microcebus murinus) genome from early draft status to a near chromosome-scale assembly. We used a combination of advanced genomic technologies to iteratively resolve conflicts and super-scaffold the M. murinus genome. We improved the M. murinus genome assembly to a scaffold N50 of 93.32 Mb. Whole genome alignments between our primary super-scaffolds and 23 human chromosomes revealed patterns that are congruent with historical comparative cytogenetic data, thus demonstrating the accuracy of our de novo scaffolding approach and allowing assignment of scaffolds to M. murinus chromosomes. Moreover, we utilized our independent datasets to discover and characterize sequences associated with centromeres across the mouse lemur genome. Quality assessment of the final assembly found 96% of mouse lemur canonical transcripts nearly complete, comparable to other published high-quality reference genome assemblies. We describe a new assembly of the gray mouse lemur (Microcebus murinus) genome with chromosome-scale scaffolds produced using a hybrid bioinformatic and sequencing approach. The approach is cost effective and produces superior results based on metrics of contiguity and completeness. Our results show that emerging genomic technologies can be used in combination to characterize centromeres of non-model species and to produce accurate de novo chromosome-scale genome assemblies of complex mammalian genomes.
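The scaffold N50 quoted above is a contiguity metric: the length L such that scaffolds of length L or longer contain at least half of the assembled bases. A minimal sketch of the standard calculation follows; the function name `n50` and the toy lengths are illustrative and are not taken from the mouse lemur assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths.

    Scaffolds are sorted from longest to shortest and accumulated until
    the running sum reaches half of the total assembly size; the length
    of the scaffold that crosses that threshold is the N50.
    """
    if not lengths:
        raise ValueError("need at least one scaffold length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly (arbitrary units), total size 300.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # 80 + 70 = 150 reaches half of 300, so this prints 70
```

A higher N50 means fewer, longer scaffolds carry the bulk of the sequence, which is why the metric is commonly reported alongside completeness measures.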


July 7, 2019

Complete genome sequence of Clostridium perfringens LLY_N11, a necrotic enteritis-inducing strain isolated from a healthy chicken intestine.

Clostridium perfringens strain LLY_N11, a commensal bacterium, which previously induced necrotic enteritis in an experimental study, was isolated from the intestine of a young healthy chicken. Here, we present the complete genome sequence of this strain, which may provide a better understanding of the molecular mechanisms involved in necrotic enteritis pathogenesis.


July 7, 2019

Evaluation of oritavancin dosing strategies against vancomycin-resistant Enterococcus faecium isolates with or without reduced susceptibility to daptomycin in an in vitro pharmacokinetic/pharmacodynamic model.

Clinical development of nonsusceptibility to the lipopeptide antibiotic daptomycin remains a serious concern during therapy for infections caused by vancomycin-resistant Enterococcus faecium (VREfm). The long-acting lipoglycopeptide oritavancin exhibits potent in vitro activity against VREfm although its safety and efficacy in treating clinical VREfm infections have not been established. In this study, novel dosing regimens of daptomycin and oritavancin were assessed against both VREfm and daptomycin-nonsusceptible VREfm isolates in an in vitro pharmacokinetic/pharmacodynamic model. Copyright © 2017 American Society for Microbiology.


July 7, 2019

Mechanisms of surface antigenic variation in the human pathogenic fungus Pneumocystis jirovecii.

Microbial pathogens commonly escape the human immune system by varying surface proteins. We investigated the mechanisms used for that purpose by Pneumocystis jirovecii. This uncultivable fungus is an obligate pulmonary pathogen that in immunocompromised individuals causes pneumonia, a major life-threatening infection. Long-read PacBio sequencing was used to assemble a core of subtelomeres of a single P. jirovecii strain from a bronchoalveolar lavage fluid specimen from a single patient. A total of 113 genes encoding surface proteins were identified, including 28 pseudogenes. These genes formed a subtelomeric gene superfamily, which included five families encoding adhesive glycosylphosphatidylinositol (GPI)-anchored glycoproteins and one family encoding excreted glycoproteins. Numerical analyses suggested that diversification of the glycoproteins relies on mosaic genes created by ectopic recombination and occurs only within each family. DNA motifs suggested that all genes are expressed independently, except those of the family encoding the most abundant surface glycoproteins, which are subject to mutually exclusive expression. PCR analyses showed that exchange of the expressed gene of the latter family occurs frequently, possibly favored by the location of the genes proximal to the telomere because this allows concomitant telomere exchange. Our observations suggest that (i) the P. jirovecii cell surface is made of a complex mixture of different surface proteins, with a majority of a single isoform of the most abundant glycoprotein, (ii) genetic mosaicism within each family ensures variation of the glycoproteins, and (iii) the strategy of the fungus consists of the continuous production of new subpopulations composed of cells that are antigenically different.

IMPORTANCE Pneumocystis jirovecii is a fungus causing severe pneumonia in immunocompromised individuals. It is the second most frequent life-threatening invasive fungal infection. We have studied the mechanisms of antigenic variation used by this pathogen to escape the human immune system, a strategy commonly used by pathogenic microorganisms. Using a new DNA sequencing technology generating long reads, we could characterize the highly repetitive gene families encoding the proteins that are present on the cellular surface of this pest. These gene families are localized in the regions close to the ends of all chromosomes, the subtelomeres. Such chromosomal localization was found to favor genetic recombinations between members of each gene family and to allow diversification of these proteins continuously over time. This pathogen seems to use a strategy of antigenic variation consisting of the continuous production of new subpopulations composed of cells that are antigenically different. Such a strategy is unique among human pathogens. Copyright © 2017 Schmid-Siegert et al.


July 7, 2019

Complete genome sequence of Streptococcus thermophilus strain B59671, which naturally produces the broad-spectrum bacteriocin thermophilin 110.

Streptococcus thermophilus strain B59671 is a Gram-positive lactic acid bacterium that naturally produces a broad-spectrum bacteriocin, thermophilin 110, and is capable of producing gamma-aminobutyric acid (GABA). The complete genome sequence for this strain contains 1,821,173 nucleotides, 1,936 predicted genes, and an average G+C content of 39.1%.
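The average G+C content reported above is the percentage of bases in the sequence that are G or C. A minimal sketch of that calculation is shown below, assuming ambiguous bases such as N are excluded from the denominator (conventions differ between tools). The function name `gc_content` and the toy sequence are hypothetical, not the B59671 genome.

```python
def gc_content(sequence):
    """Return the G+C percentage of a DNA sequence.

    Assumption: only unambiguous bases (A, C, G, T) are counted, so
    characters such as N do not affect the result.
    """
    counted = [base for base in sequence.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Hypothetical toy sequence: 4 of the 10 unambiguous bases are G or C.
print(gc_content("ATGCATGCATNN"))
```

For a complete genome the same calculation would be run over the concatenated replicon sequences, for example after reading them from a FASTA file.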


July 7, 2019

Complete genome sequence of indigo-producing bacterium Celeribacter sp. strain TSPH2.

Celeribacter sp. strain TSPH2, a novel producer of indigo, was isolated from oil-contaminated sediment. We present here its genome sequence consisting of one circular chromosome (4 Mb) and one plasmid (0.15 Mb), with an overall G+C content of 60.9%. This strain contains oxygenase genes involved in indigo synthesis, such as flavin-containing monooxygenase. Copyright © 2017 Kim et al.

