September 22, 2019

The structure of a conserved telomeric region associated with variant antigen loci in the blood parasite Trypanosoma congolense

African trypanosomiasis is a vector-borne disease of humans and livestock caused by African trypanosomes (Trypanosoma spp.). Survival in the vertebrate bloodstream depends on antigenic variation of Variant Surface Glycoproteins (VSGs) coating the parasite surface. In T. brucei, a model for antigenic variation, monoallelic VSG expression originates from dedicated VSG expression sites (VES). Trypanosoma brucei VES have a conserved structure consisting of a telomeric VSG locus downstream of unique, repeat sequences, and an independent promoter. Additional protein-coding sequences, known as “Expression Site Associated Genes (ESAGs)”, are also often present and are implicated in diverse, bloodstream-stage functions. Trypanosoma congolense is a related veterinary pathogen, also displaying VSG-mediated antigenic variation. A T. congolense VES has not been described, making it unclear if regulation of VSG expression is conserved between species. Here, we describe a conserved telomeric region associated with VSG loci from long-read DNA sequencing of two T. congolense strains, which consists of a distal repeat, conserved noncoding elements and other genes besides the VSG; although these are not orthologous to T. brucei ESAGs. Most conserved telomeric regions are associated with accessory minichromosomes, but the same structure may also be associated with megabase chromosomes. We propose that this region represents the T. congolense VES, and through comparison with T. brucei, we discuss the parallel evolution of antigenic switching mechanisms, and unique adaptation of the T. brucei VES for developmental regulation of bloodstream-stage genes. Hence, we provide a basis for understanding antigenic switching in T. congolense and the origins of the African trypanosome VES.


September 22, 2019

Comparative genome analysis of jujube witches’-broom Phytoplasma, an obligate pathogen that causes jujube witches’-broom disease.

JWB phytoplasma is an insect-transmitted, uncultivable bacterial plant pathogen that causes a destructive jujube disease. To date, no genome information about JWB phytoplasma has been published, which has hindered its characterization at the genomic level. To understand its pathogenicity and ecology, the genome of a JWB phytoplasma isolate, jwb-nky, was sequenced and compared with those of other phytoplasmas, which enabled us to explore the mechanisms of genomic rearrangement. The complete genome sequence of JWB phytoplasma (jwb-nky) was determined; it consists of one circular chromosome of 750,803 bp with a GC content of 23.3%. A total of 694 protein-encoding genes, 2 operons for rRNA genes and 31 tRNA genes, as well as 4 potential mobile units (PMUs) containing clusters of DNA repeats, were identified. Based on PHIbase analysis, a large number of genes were genome-specific and approximately 13% of JWB phytoplasma genes were predicted to be associated with virulence. Although transporters for maltose, dipeptides/oligopeptides, spermidine/putrescine, cobalt, Mn/Zn and methionine were identified, KEGG pathway analysis revealed the reduced metabolic capabilities of JWB phytoplasma. Comparative genome analyses between JWB phytoplasma and other phytoplasmas show the occurrence of large-scale gene rearrangements. The low synteny with other phytoplasmas indicated that the expansion of multiple gene families/duplication probably occurred separately after differentiation. In this study, the complete genome sequence of a JWB phytoplasma isolate, jwb-nky, that causes JWB disease was reported for the first time, and a number of species-specific genes were identified in the genome. The study enhanced our understanding of the genomic basis and pathogenicity mechanism of this pathogen, which will aid in the development of improved strategies for efficient management of JWB diseases.


September 22, 2019

Sharing of human milk oligosaccharides degradants within bifidobacterial communities in faecal cultures supplemented with Bifidobacterium bifidum.

Gut microbiota of breast-fed infants are generally rich in bifidobacteria. Recent studies show that infant gut-associated bifidobacteria can assimilate human milk oligosaccharides (HMOs) specifically among the gut microbes. Nonetheless, little is known about how bifidobacterial-rich communities are shaped in the gut. Interestingly, HMOs assimilation ability is not related to the dominance of each species. Bifidobacterium longum subsp. longum and Bifidobacterium breve are commonly found as the dominant species in infant stools; however, they show limited HMOs assimilation ability in vitro. In contrast, avid in vitro HMOs consumers, Bifidobacterium bifidum and Bifidobacterium longum subsp. infantis, are less abundant in infant stools. In this study, we observed altruistic behaviour by B. bifidum when incubated in HMOs-containing faecal cultures. Four B. bifidum strains, all of which contained complete sets of HMO-degrading genes, commonly left HMOs degradants unconsumed during in vitro growth. These strains stimulated the growth of other Bifidobacterium species when added to faecal cultures supplemented with HMOs, thereby increasing the prevalence of bifidobacteria in faecal communities. Enhanced HMOs consumption by B. bifidum-supplemented cultures was also observed. We also determined the complete genome sequences of B. bifidum strains JCM7004 and TMC3115. Our results suggest B. bifidum-mediated cross-feeding of HMOs degradants within bifidobacterial communities.


September 22, 2019

Ring synthetic chromosome V SCRaMbLE.

Structural variations (SVs) exert important functional impacts on biological phenotypic diversity. Here we show a ring synthetic yeast chromosome V (ring_synV) can be used to continuously generate complex genomic variations and improve the production of prodeoxyviolacein (PDV) by applying Synthetic Chromosome Recombination and Modification by LoxP-mediated Evolution (SCRaMbLE) in haploid yeast cells. The SCRaMbLE of ring_synV generates aneuploid yeast strains with increased PDV productivity, and we identify aneuploid chromosome I, III, VI, XII, XIII, and ring_synV. The neochromosome of SCRaMbLEd ring_synV generated more unbalanced forms of variations, including duplication, insertions, and balanced forms of translocations and inversions than its linear form. Furthermore, of the 29 novel SVs detected, 11 prompted the PDV biosynthesis; and the deletion of uncharacterized gene YER182W is related to the improvement of the PDV. Overall, the SCRaMbLEing ring_synV embraces the evolution of the genome by modifying the chromosome number, structure, and organization, identifying targets for phenotypic comprehension.


September 22, 2019

Novel clade C-I Clostridium difficile strains escape diagnostic tests, differ in pathogenicity potential and carry toxins on extrachromosomal elements.

The population structure of Clostridium difficile currently comprises eight major genomic clades. For the highly divergent C-I clade, only two toxigenic strains have been reported, which lack the tcdA and tcdC genes and carry a complete locus for the binary toxin (CDT) next to an atypical TcdB monotoxin pathogenicity locus (PaLoc). As part of a routine surveillance of C. difficile in stool samples from diarrheic human patients, we discovered three isolates that consistently gave negative results in a PCR-based screening for tcdC. Through phenotypic assays, whole-genome sequencing, experiments in cell cultures, and infection biomodels we show that these three isolates (i) escape common laboratory diagnostic procedures, (ii) represent new ribotypes, PFGE-types, and sequence types within the Clade C-I, (iii) carry chromosomal or plasmidal TcdBs that induce classical or variant cytopathic effects (CPE), and (iv) cause different levels of cytotoxicity and hamster mortality rates. These results show that new strains of C. difficile can be detected by more refined techniques and raise questions on the origin, evolution, and distribution of the toxin loci of C. difficile and the mechanisms by which this emerging pathogen causes disease.


September 22, 2019

Asymmetric processing of DNA ends at a double-strand break leads to unconstrained dynamics and ectopic translocation.

Multiple pathways regulate the repair of double-strand breaks (DSBs) to suppress potentially dangerous ectopic recombination. Both sequence and chromatin context are thought to influence pathway choice between non-homologous end-joining (NHEJ) and homology-driven recombination. To test the effect of repetitive sequences on break processing, we have inserted TG-rich repeats on one side of an inducible DSB at the budding yeast MAT locus on chromosome III. Five clustered Rap1 sites within a break-proximal TG repeat are sufficient to block Mre11-Rad50-Xrs2 recruitment, impair resection, and favor elongation by telomerase. The two sides of the break lose end-to-end tethering and show enhanced, uncoordinated movement. Only the TG-free side is resected and shifts to the nuclear periphery. In contrast to persistent DSBs without TG repeats that are repaired by imprecise NHEJ, nearly all survivors of repeat-proximal DSBs repair the break by a homology-driven, non-reciprocal translocation from ChrIII-R to ChrVII-L. This suppression of imprecise NHEJ at TG-repeat-flanked DSBs requires the Uls1 translocase activity. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Insights into the evolution of multicellularity from the sea lettuce genome.

We report here the 98.5 Mbp haploid genome (12,924 protein coding genes) of Ulva mutabilis, a ubiquitous and iconic representative of the Ulvophyceae or green seaweeds. Ulva’s rapid and abundant growth makes it a key contributor to coastal biogeochemical cycles; its role in marine sulfur cycles is particularly important because it produces high levels of dimethylsulfoniopropionate (DMSP), the main precursor of volatile dimethyl sulfide (DMS). Rapid growth makes Ulva attractive biomass feedstock but also increasingly a driver of nuisance “green tides.” Ulvophytes are key to understanding the evolution of multicellularity in the green lineage, and Ulva morphogenesis is dependent on bacterial signals, making it an important species with which to study cross-kingdom communication. Our sequenced genome informs these aspects of ulvophyte cell biology, physiology, and ecology. Gene family expansions associated with multicellularity are distinct from those of freshwater algae. Candidate genes, including some that arose following horizontal gene transfer from chromalveolates, are present for the transport and metabolism of DMSP. The Ulva genome offers, therefore, new opportunities to understand coastal and marine ecosystems and the fundamental evolution of the green lineage. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

Complete genome sequence and characterization of a protein-glutaminase producing strain, Chryseobacterium proteolyticum QSH1265.

Recently, an enzyme named protein-glutaminase (PG) has been identified as a new type of enzyme with significant potential for deamidation of food proteins. The enzyme is shown to be expressed as a pre-pro-protein with a putative signal peptide of 21 amino acids, a pro-sequence of 114 amino acids, and a mature PG of 185 amino acids. The microbial enzyme PG specifically catalyzes deamidation of proteins without protein hydrolysis pretreatment and only reacts with glutamine residues in the side-chains of proteins or long peptides. All these attributes suggest that it has a great potential for food industrial applications. However, until recently, there have been relatively few studies of the PG-producing strains. A strain named Chryseobacterium proteolyticum QSH1265, which can produce PG, was isolated from a soil sample collected in Songjiang, Shanghai, China. Its enzyme activity was about 0.34 ± 0.01 U/mL when using carboxybenzoxy-Gln-Gly as a substrate. The strain can produce acid from D-glucose, maltose, L-arabinose, sucrose, glycerol, and mannitol but not fructose, and it is also positive for indole production and urease. Here we describe the complete genome sequence of this strain via PacBio RSII sequencing. The C. proteolyticum QSH1265 genome consists of a circular chromosome with a total length of 4,849,803 bp without any plasmids. A total of 4563 genes were predicted, including 4459 protein-coding genes and 104 RNA-related genes, with an average G+C content of 36.16%. The KEGG and COG annotation provide information for the specific function of proteins encoded in the genome, such as proteases, chromoproteins, stress proteins, antiporters, etc. A highly conserved hypothetical protein shares a promoter with the gene encoding the protein-glutaminase enzyme. The genome sequence and preliminary annotation provide valuable genetic information for further study of C. proteolyticum.


September 22, 2019

Comparison of the mitochondrial genome sequences of six Annulohypoxylon stygium isolates suggests short fragment insertions as a potential factor leading to larger genomic size.

Mitochondrial DNA (mtDNA) is a core non-nuclear genetic material found in all eukaryotic organisms, the size of which varies extensively in the eumycota, even within species. In this study, mitochondrial genomes of six isolates of Annulohypoxylon stygium (Lév.) were assembled from raw reads from PacBio and Illumina sequencing. The diversity of genomic structures, conserved genes, intergenic regions and introns were analyzed and compared. Genome sizes ranged from 132 to 147 kb and contained the same sets of conserved protein-coding, tRNA and rRNA genes and shared the same gene arrangements and orientation. In addition, most intergenic regions were homogeneous and had similar sizes except for the region between cytochrome b (cob) and cytochrome c oxidase I (cox1) genes which ranged from 2,998 to 8,039 bp among the six isolates. Sixty-five intron insertion sites and 99 different introns were detected in these genomes. Each genome contained 45 or more introns, which varied in distribution and content. Introns from homologous insertion sites also showed high diversity in size, type and content. Comparison of introns at the same loci showed some complex introns, such as twintrons and ORF-less introns. There were 44 short fragment insertions detected within introns, intergenic regions, or as introns, some of them located at conserved domain regions of homing endonuclease genes. Insertions of short fragments such as small inverted repeats might affect or hinder the movement of introns, and these allowed for intron accumulation in the mitochondrial genomes analyzed, and enlarged their size. This study showed that the evolution of fungal mitochondrial introns is complex, and the results suggest short fragment insertions as a potential factor leading to larger mitochondrial genomes in A. stygium.


September 22, 2019

De novo assembly, delivery and expression of a 101 kb human gene in mouse cells

Design and large-scale synthesis of DNA has been applied to the functional study of viral and microbial genomes. New and expanded technology development is required to unlock the transformative potential of such bottom-up approaches to the study of larger, mammalian genomes. Two major challenges include assembling and delivering long DNA sequences. Here we describe a pipeline for de novo DNA assembly and delivery that enables functional evaluation of mammalian genes on the length scale of 100 kb. The DNA assembly step is supported by an integrated robotic workcell. We assemble the 101 kb human HPRT1 gene in yeast, deliver it to mouse cells, and show expression of the human protein from its full-length gene. This pipeline provides a framework for producing systematic, designer variants of any mammalian gene locus for functional evaluation in cells.


September 22, 2019

Parliament2: Fast structural variant calling using optimized combinations of callers

Here we present Parliament2: a structural variant caller which combines multiple best-in-class structural variant callers to create a highly accurate callset. This captures more events than the individual callers achieve independently. Parliament2 uses a call-overlap-genotype approach that is highly extensible to new methods and presents users the choice to run some or all of Breakdancer, Breakseq, CNVnator, Delly, Lumpy, and Manta. Parliament2 applies an additional parallelization framework to speed certain callers and executes these in parallel, taking advantage of the different resource requirements to complete structural variant calling much faster than running the programs individually. Parliament2 is available as a Docker container, which pre-installs all required dependencies. This allows users to run any caller with easy installation and execution. This Docker container can easily be deployed in cloud or local environments and is available as an app on DNAnexus.
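
The "overlap" step of a call-overlap-genotype approach can be illustrated with a small, hypothetical sketch. It is not Parliament2's actual code: the `Call` record, the `merge_calls` function, and the 50% reciprocal-overlap threshold are all assumptions chosen for illustration. It groups same-type calls from different callers that overlap each other sufficiently and records which callers support each merged event.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Call:
    """One structural variant call (hypothetical record, not Parliament2's format)."""
    caller: str
    chrom: str
    start: int
    end: int
    svtype: str


def reciprocal_overlap(a, b):
    """Fraction of the shorter-covered call that is shared, or 0.0 if incompatible."""
    if a.chrom != b.chrom or a.svtype != b.svtype:
        return 0.0
    shared = min(a.end, b.end) - max(a.start, b.start)
    if shared <= 0:
        return 0.0
    return min(shared / (a.end - a.start), shared / (b.end - b.start))


def merge_calls(calls, min_overlap=0.5):
    """Greedily cluster calls against the first member of each existing cluster."""
    clusters = []
    for call in sorted(calls, key=lambda c: (c.chrom, c.start, c.end)):
        for cluster in clusters:
            if reciprocal_overlap(cluster[0], call) >= min_overlap:
                cluster.append(call)
                break
        else:
            clusters.append([call])
    return [
        {
            "chrom": c[0].chrom,
            "start": min(x.start for x in c),
            "end": max(x.end for x in c),
            "svtype": c[0].svtype,
            "support": sorted({x.caller for x in c}),
        }
        for c in clusters
    ]


example_calls = [
    Call("manta", "chr1", 100, 200, "DEL"),
    Call("lumpy", "chr1", 110, 210, "DEL"),
    Call("delly", "chr1", 1000, 1100, "DUP"),
]
merged = merge_calls(example_calls)
```

In this toy input, the two deletion calls overlap by 90% of each other's length and collapse into one event supported by two callers, while the duplication remains a single-caller event. A real pipeline would then genotype the merged events; that step is omitted here.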


September 22, 2019

Forward genetics by genome sequencing uncovers the central role of the Aspergillus niger goxB locus in hydrogen peroxide induced glucose oxidase expression.

Aspergillus niger is an industrially important source for gluconic acid and glucose oxidase (GOx), a secreted commercially important flavoprotein which catalyses the oxidation of β-D-glucose by molecular oxygen to D-glucolactone and hydrogen peroxide. Expression of goxC, the GOx encoding gene, and the concomitant two-step conversion of glucose to gluconic acid requires oxygen and the presence of significant amounts of glucose in the medium and is optimally induced at pH 5.5. The molecular mechanisms underlying regulation of goxC expression are, however, still enigmatic. Genetic studies aimed at understanding GOx induction have indicated the involvement of at least seven complementation groups, for none of which the molecular basis has been resolved. In this study, a mapping-by-sequencing forward genetics approach was used to uncover the molecular role of the goxB locus in goxC expression. Using the Illumina and PacBio sequencing platforms, a hybrid high quality draft genome assembly of laboratory strain N402 was obtained and used as a reference for mapping of genomic reads obtained from the derivative NW103:goxB mutant strain. The goxB locus encodes a thioredoxin reductase. A deletion of the encoding gene in the N402 parent strain led to a high constitutive expression level of the GOx and the lactonase encoding genes required for the two-step conversion of glucose to gluconic acid and of the catR gene encoding catalase R. This high constitutive level of expression was observed to be irrespective of the carbon source and oxidative stress applied. A model clarifying the role of GoxB in the regulation of the expression of goxC involving hydrogen peroxide as second messenger is presented.


September 22, 2019

Extraordinary genome instability and widespread chromosome rearrangements during vegetative growth

The haploid genome of the pathogenic fungus Zymoseptoria tritici is contained on “core” and “accessory” chromosomes. While 13 core chromosomes are found in all strains, as many as eight accessory chromosomes show presence/absence variation and rearrangements among field isolates. The factors influencing these presence/absence polymorphisms are so far unknown. We investigated chromosome stability using experimental evolution, karyotyping, and genome sequencing. We report extremely high and variable rates of accessory chromosome loss during mitotic propagation in vitro and in planta. Spontaneous chromosome loss was observed in 2 to >50% of cells during 4 weeks of incubation. Similar rates of chromosome loss in the closely related Zymoseptoria ardabiliae suggest that this extreme chromosome dynamic is a conserved phenomenon in the genus. Elevating the incubation temperature greatly increases instability of accessory and even core chromosomes, causing severe rearrangements involving telomere fusion and chromosome breakage. Chromosome losses do not affect the fitness of Zymoseptoria tritici in vitro, but some lead to increased virulence, suggesting an adaptive role of this extraordinary chromosome instability. Copyright © 2018 by the Genetics Society of America.


September 22, 2019

Amycomicin is a potent and specific antibiotic discovered with a targeted interaction screen.

The rapid emergence of antibiotic-resistant pathogenic bacteria has accelerated the search for new antibiotics. Many clinically used antibacterials were discovered through culturing a single microbial species under nutrient-rich conditions, but in the environment, bacteria constantly encounter poor nutrient conditions and interact with neighboring microbial species. In an effort to recapitulate this environment, we generated a nine-strain actinomycete community and used 16S rDNA sequencing to deconvolute the stochastic production of antimicrobial activity that was not observed from any of the axenic cultures. We subsequently simplified the community to just two strains and identified Amycolatopsis sp. AA4 as the producing strain and Streptomyces coelicolor M145 as an inducing strain. Bioassay-guided isolation identified amycomicin (AMY), a highly modified fatty acid containing an epoxide isonitrile warhead, as a potent and specific inhibitor of Staphylococcus aureus. Amycomicin targets an essential enzyme (FabH) in fatty acid biosynthesis and reduces S. aureus infection in a mouse skin-infection model. The discovery of AMY demonstrates the utility of screening complex communities against specific targets to discover small-molecule antibiotics.


September 22, 2019

Variation graph toolkit improves read mapping by representing genetic variation in the reference.

Reference genomes guide our interpretation of DNA sequence data. However, conventional linear references represent only one version of each locus, ignoring variation in the population. Poor representation of an individual’s genome sequence impacts read mapping and introduces bias. Variation graphs are bidirected DNA sequence graphs that compactly represent genetic variation across a population, including large-scale structural variation such as inversions and duplications. Previous graph genome software implementations have been limited by scalability or topological constraints. Here we present vg, a toolkit of computational methods for creating, manipulating, and using these structures as references at the scale of the human genome. vg provides an efficient approach to mapping reads onto arbitrary variation graphs using generalized compressed suffix arrays, with improved accuracy over alignment to a linear reference, and effectively removing reference bias. These capabilities make using variation graphs as references for DNA sequencing practical at a gigabase scale, or at the topological complexity of de novo assemblies.
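
The idea of a graph that compactly represents variation can be made concrete with a toy example. The sketch below is not vg's data model or API: it is a simplified, acyclic, forward-strand-only graph (real variation graphs are bidirected), and the node IDs, sequences, and the `spell_paths` helper are hypothetical. Two alternative nodes encode a single-base variant (T or C), and each walk through the graph spells one possible haplotype sequence.

```python
# Toy sequence graph: node ID -> DNA sequence carried by that node.
nodes = {1: "ACG", 2: "T", 3: "C", 4: "GGA"}

# Directed edges: node ID -> IDs of nodes that may follow it.
# Nodes 2 and 3 are alternative alleles at the same site.
edges = {1: [2, 3], 2: [4], 3: [4], 4: []}


def spell_paths(nodes, edges, start):
    """Yield the sequence spelled by every walk from `start` to a sink node.

    Assumes the graph is acyclic; a cycle would make this loop forever.
    """
    stack = [(start, nodes[start])]
    while stack:
        node, seq = stack.pop()
        successors = edges.get(node, [])
        if not successors:
            yield seq
        for nxt in successors:
            stack.append((nxt, seq + nodes[nxt]))


haplotypes = sorted(spell_paths(nodes, edges, 1))
```

Here the graph stores 8 bases yet spells two 7-base haplotypes, whereas a linear reference would hold only one of them. That gap is what leads to the reference bias described above when reads carrying the other allele are mapped.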

