Menu
September 22, 2019

Full-length extension of HLA allele sequences by HLA allele-specific hemizygous Sanger sequencing (SSBT).

The gold standard for typing at the allele level of the highly polymorphic Human Leucocyte Antigen (HLA) gene system is sequence based typing. Since sequencing strategies have mainly focused on identification of the peptide binding groove, full-length sequence information is lacking for >90% of the HLA alleles. One of the goals of the 17th IHIWS workshop is to establish full-length sequences for as many HLA alleles as possible. In our component “Extension of HLA sequences by full-length HLA allele-specific hemizygous Sanger sequencing” we have used full-length hemizygous Sanger Sequence Based Typing to achieve this goal. We selected samples of which full length sequences were not available in the IPD-IMGT/HLA database. In total we have generated the full-length sequences of 48 HLA-A, 45 -B and 31 -C alleles. For HLA-A extended alleles, 39/48 showed no intron differences compared to the first allele of the corresponding allele group, for HLA-B this was 26/45 and for HLA-C 20/31. Comparing the intron sequences to other alleles of the same allele group revealed that in 5/48 HLA-A, 16/45 HLA-B and 8/31 HLA-C alleles the intron sequence was identical to another allele of the same allele group. In the remaining 10 cases, the sequence either showed polymorphism at a conserved nucleotide or was the result of a gene conversion event. Elucidation of the full-length sequence gives insight in the polymorphic content of the alleles and facilitates the identification of its evolutionary origin. Copyright © 2018 American Society for Histocompatibility and Immunogenetics. All rights reserved.


September 22, 2019

The genomic architecture and molecular evolution of ant odorant receptors.

The massive expansions of odorant receptor (OR) genes in ant genomes are notable examples of rapid genome evolution and adaptive gene duplication. However, the molecular mechanisms leading to gene family expansion remain poorly understood, partly because available ant genomes are fragmentary. Here, we present a highly contiguous, chromosome-level assembly of the clonal raider ant genome, revealing the largest known OR repertoire in an insect. While most ant ORs originate via local tandem duplication, we also observe several cases of dispersed duplication followed by tandem duplication in the most rapidly evolving OR clades. We found that areas of unusually high transposable element density (TE islands) were depauperate in ORs in the clonal raider ant, and found no evidence for retrotransposition of ORs. However, OR loci were enriched for transposons relative to the genome as a whole, potentially facilitating tandem duplication by unequal crossing over. We also found that ant OR genes are highly AT-rich compared to other genes. In contrast, in flies, OR genes are dispersed and largely isolated within the genome, and we find that fly ORs are not AT-rich. The genomic architecture and composition of ant ORs thus show convergence with the unrelated vertebrate ORs rather than the related fly ORs. This might be related to the greater gene numbers and/or potential similarities in gene regulation between ants and vertebrates as compared to flies.© 2018 McKenzie and Kronauer; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Genomic discovery of the hypsin gene and biosynthetic pathways for terpenoids in Hypsizygus marmoreus.

Hypsizygus marmoreus (Beech mushroom) is a popular ingredient in Asian cuisine. The medicinal effects of its bioactive compounds such as hypsin and hypsiziprenol have been reported, but the genetic basis or biosynthesis of these components is unknown.In this study, we sequenced a reference strain of H. marmoreus (Haemi 51,987-8). We evaluated various assembly strategies, and as a result the Allpaths and PBJelly produced the best assembly. The resulting genome was 42.7 Mbp in length and annotated with 16,627 gene models. A putative gene (Hypma_04324) encoding the antifungal and antiproliferative hypsin protein with 75% sequence identity with the previously known N-terminal sequence was identified. Carbohydrate active enzyme analysis displayed the typical feature of white-rot fungi where auxiliary activity and carbohydrate-binding modules were enriched. The genome annotation revealed four terpene synthase genes responsible for terpenoid biosynthesis. From the gene tree analysis, we identified that terpene synthase genes can be classified into six clades. Four terpene synthase genes of H. marmoreus belonged to four different groups that implies they may be involved in the synthesis of different structures of terpenes. A terpene synthase gene cluster was well-conserved in Agaricomycetes genomes, which contained known biosynthesis and regulatory genes.Genome sequence analysis of this mushroom led to the discovery of the hypsin gene. Comparative genome analysis revealed the conserved gene cluster for terpenoid biosynthesis in the genome. These discoveries will further our understanding of the biosynthesis of medicinal bioactive molecules in this edible mushroom.


September 22, 2019

A complete Leishmania donovani reference genome identifies novel genetic variations associated with virulence.

Leishmania donovani is responsible for visceral leishmaniasis, a neglected and lethal parasitic disease with limited treatment options and no vaccine. The study of L. donovani has been hindered by the lack of a high-quality reference genome and this can impact experimental outcomes including the identification of virulence genes, drug targets and vaccine development. We therefore generated a complete genome assembly by deep sequencing using a combination of second generation (Illumina) and third generation (PacBio) sequencing technologies. Compared to the current L. donovani assembly, the genome assembly reported within resulted in the closure over 2,000 gaps, the extension of several chromosomes up to telomeric repeats and the re-annotation of close to 15% of protein coding genes and the annotation of hundreds of non-coding RNA genes. It was possible to correctly assemble the highly repetitive A2 and Amastin virulence gene clusters. A comparative sequence analysis using the improved reference genome confirmed 70 published and identified 15 novel genomic differences between closely related visceral and atypical cutaneous disease-causing L. donovani strains providing a more complete map of genes associated with virulence and visceral organ tropism. Bioinformatic tools including protein variation effect analyzer and basic local alignment search tool were used to prioritize a list of potential virulence genes based on mutation severity, gene conservation and function. This complete genome assembly and novel information on virulence factors will support the identification of new drug targets and the development of a vaccine for L. donovani.


September 22, 2019

Genomic insights into virulence mechanisms of Leishmania donovani: evidence from an atypical strain.

Leishmaniasis is a neglected tropical disease with diverse clinical phenotypes, determined by parasite, host and vector interactions. Despite the advances in molecular biology and the availability of more Leishmania genome references in recent years, the association between parasite species and distinct clinical phenotypes remains poorly understood. We present a genomic comparison of an atypical variant of Leishmania donovani from a South Asian focus, where it mostly causes cutaneous form of leishmaniasis.Clinical isolates from six cutaneous leishmaniasis patients (CL-SL); 2 of whom were poor responders to antimony (CL-PR), and two visceral leishmaniasis patients (VL-SL) were sequenced on an Illumina MiSeq platform. Chromosome aneuploidy was observed in both groups but was more frequent in CL-SL. 248 genes differed by 2 fold or more in copy number among the two groups. Genes involved in amino acid use (LdBPK_271940) and energy metabolism (LdBPK_271950), predominated the VL-SL group with the same distribution pattern reflected in gene tandem arrays. Genes encoding amastins were present in higher copy numbers in VL-SL and CL-PR as well as being among predicted pseudogenes in CL-SL. Both chromosome and SNP profiles showed CL-SL and VL-SL to form two distinct groups. While expected heterozygosity was much higher in VL-SL, SNP allele frequency patterns did not suggest potential recent recombination breakpoints. The SNP/indel profile obtained using the more recently generated PacBio sequence did not vary markedly from that based on the standard LdBPK282A1 reference. Several genes previously associated with resistance to antimonials were observed in higher copy numbers in the analysis of CL-PR. H-locus amplification was seen in one cutaneous isolate which however did not belong to the CL-PR group.The data presented suggests that intra species variations at chromosome and gene level are more likely to influence differences in tropism as well as response to treatment, and contributes to greater understanding of parasite molecular mechanisms underpinning these differences. These findings should be substantiated with a larger sample number and expression/functional studies.


September 22, 2019

The central exons of the human MUC2 and MUC6 mucins are highly repetitive and variable in sequence between individuals

The DNA sequence of the two human mucin genes MUC2 and MUC6 have not been completely resolved due to the repetitive nature of their central exon coding for Proline, Threonine and Serine rich sequences. The exact nucleotide sequence of these exons has remained unknown for a long time due to limitations in traditional sequencing techniques. These are still very poorly covered in new whole genome sequencing projects with the corresponding protein sequences partly missing. We used a BAC clone containing both these genes and third generation sequencing technology, SMRT sequencing, to obtain the full-length contiguous MUC2 and MUC6 tandem repeat sequences. The new sequences span the entire repeat regions with good coverage revealing their length, variation in repeat sequences and their internal organization. The sequences obtained were used to compare with available sequences from whole genome sequencing projects indicating variation in number of repeats and their internal organization between individuals. The lack of these sequences has limited the association of genetic alterations with disease. The full sequences of these mucins will now allow such studies, which could be of importance for inflammatory bowel diseases for MUC2 and gastric ulcer diseases for MUC6 where deficient mucus protection is assumed to play an important role.


September 22, 2019

Microevolution of Neisseria lactamica during nasopharyngeal colonisation induced by controlled human infection.

Neisseria lactamica is a harmless coloniser of the infant respiratory tract, and has a mutually-excluding relationship with the pathogen Neisseria meningitidis. Here we report controlled human infection with genomically-defined N. lactamica and subsequent bacterial microevolution during 26 weeks of colonisation. We find that most mutations that occur during nasopharyngeal carriage are transient indels within repetitive tracts of putative phase-variable loci associated with host-microbe interactions (pgl and lgt) and iron acquisition (fetA promotor and hpuA). Recurrent polymorphisms occurred in genes associated with energy metabolism (nuoN, rssA) and the CRISPR-associated cas1. A gene encoding a large hypothetical protein was often mutated in 27% of the subjects. In volunteers who were naturally co-colonised with meningococci, recombination altered allelic identity in N. lactamica to resemble meningococcal alleles, including loci associated with metabolism, outer membrane proteins and immune response activators. Our results suggest that phase variable genes are often mutated during carriage-associated microevolution.


September 22, 2019

Spread of the florfenicol resistance floR gene among clinical Klebsiella pneumoniae isolates in China.

Florfenicol is a derivative of chloramphenicol that is used only for the treatment of animal diseases. A key resistance gene for florfenicol, floR, can spread among bacteria of the same and different species or genera through horizontal gene transfer. To analyze the potential transmission of resistance genes between animal and human pathogens, we investigated floR in Klebsiella pneumoniae isolates from patient samples. floR in human pathogens may originate from animal pathogens and would reflect the risk to human health of using antimicrobial agents in animals.PCR was used to identify floR-positive strains. The floR genes were cloned, and the minimum inhibitory concentrations (MICs) were determined to assess the relative resistance levels of the genes and strains. Sequencing and comparative genomics methods were used to analyze floR gene-related sequence structure as well as the molecular mechanism of resistance dissemination.Of the strains evaluated, 20.42% (67/328) were resistant to florfenicol, and 86.96% (20/23) of the floR-positive strains demonstrated high resistance to florfenicol with MICs =512 µg/mL. Conjugation experiments showed that transferrable plasmids carried the floR gene in three isolates. Sequencing analysis of a plasmid approximately 125 kb in size (pKP18-125) indicated that the floR gene was flanked by multiple copies of mobile genetic elements. Comparative genomics analysis of a 9-kb transposon-like fragment of pKP18-125 showed that an approximately 2-kb sequence encoding lysR-floR-virD2 was conserved in the majority (79.01%, 83/105) of floR sequences collected from NCBI nucleotide database. Interestingly, the most similar sequence was a 7-kb fragment of plasmid pEC012 from an Escherichia coli strain isolated from a chicken.Identified on a transferable plasmid in the human pathogen K. pneumoniae, the floR gene may be disseminated through horizontal gene transfer from animal pathogens. Studies on the molecular mechanism of resistance gene dissemination in different bacterial species of animal origin could provide useful information for preventing or controlling the spread of resistance between animal and human pathogens.


September 22, 2019

Leishmania genome dynamics during environmental adaptation reveal strain-specific differences in gene copy number variation, karyotype instability, and telomeric amplification.

Protozoan parasites of the genus Leishmania adapt to environmental change through chromosome and gene copy number variations. Only little is known about external or intrinsic factors that govern Leishmania genomic adaptation. Here, by conducting longitudinal genome analyses of 10 new Leishmania clinical isolates, we uncovered important differences in gene copy number among genetically highly related strains and revealed gain and loss of gene copies as potential drivers of long-term environmental adaptation in the field. In contrast, chromosome rather than gene amplification was associated with short-term environmental adaptation to in vitro culture. Karyotypic solutions were highly reproducible but unique for a given strain, suggesting that chromosome amplification is under positive selection and dependent on species- and strain-specific intrinsic factors. We revealed a progressive increase in read depth towards the chromosome ends for various Leishmania isolates, which may represent a nonclassical mechanism of telomere maintenance that can preserve integrity of chromosome ends during selection for fast in vitro growth. Together our data draw a complex picture of Leishmania genomic adaptation in the field and in culture, which is driven by a combination of intrinsic genetic factors that generate strain-specific phenotypic variations, which are under environmental selection and allow for fitness gain.IMPORTANCE Protozoan parasites of the genus Leishmania cause severe human and veterinary diseases worldwide, termed leishmaniases. A hallmark of Leishmania biology is its capacity to adapt to a variety of unpredictable fluctuations inside its human host, notably pharmacological interventions, thus, causing drug resistance. Here we investigated mechanisms of environmental adaptation using a comparative genomics approach by sequencing 10 new clinical isolates of the L. donovani, L. major, and L. tropica complexes that were sampled across eight distinct geographical regions. Our data provide new evidence that parasites adapt to environmental change in the field and in culture through a combination of chromosome and gene amplification that likely causes phenotypic variation and drives parasite fitness gains in response to environmental constraints. This novel form of gene expression regulation through genomic change compensates for the absence of classical transcriptional control in these early-branching eukaryotes and opens new venues for biomarker discovery. Copyright © 2018 Bussotti et al.


September 22, 2019

Noise-Cancelling Repeat Finder: Uncovering tandem repeats in error-prone long-read sequencing data

Tandem DNA repeats can be sequenced with long-read technologies, but cannot be accurately deciphered due to the lack of computational tools taking high error rates of these technologies into account. Here we introduce Noise-Cancelling Repeat Finder (NCRF) to uncover putative tandem repeats of specified motifs in noisy long reads produced by Pacific Biosciences and Oxford Nanopore sequencers. Using simulations, we validated the use of NCRF to locate tandem repeats with motifs of various lengths and demonstrated its superior performance as compared to two alternative tools. Using real human whole-genome sequencing data, NCRF identified long arrays of the (AATGG)n repeat involved in heat shock stress response.


September 22, 2019

Diagnostic and Therapeutic Strategies for Fluoropyrimidine Treatment of Patients Carrying Multiple DPYD Variants.

DPYD genotyping prior to fluoropyrimidine treatment is increasingly implemented in clinical care. Without phasing information (i.e., allelic location of variants), current genotype-based dosing guidelines cannot be applied to patients carrying multiple DPYD variants. The primary aim of this study is to examine diagnostic and therapeutic strategies for fluoropyrimidine treatment of patients carrying multiple DPYD variants. A case series of patients carrying multiple DPYD variants is presented. Different genotyping techniques were used to determine phasing information. Phenotyping was performed by dihydropyrimidine dehydrogenase (DPD) enzyme activity measurements. Publicly available databases were queried to explore the frequency and phasing of variants of patients carrying multiple DPYD variants. Four out of seven patients carrying multiple DPYD variants received a full dose of fluoropyrimidines and experienced severe toxicity. Phasing information could be retrieved for four patients. In three patients, variants were located on two different alleles, i.e., in trans. Recommended dose reductions based on the phased genotype differed from the phenotype-derived dose reductions in three out of four cases. Data from publicly available databases show that the frequency of patients carrying multiple DPYD variants is low (< 0.2%), but higher than the frequency of the commonly tested DPYD*13 variant (0.1%). Patients carrying multiple DPYD variants are at high risk of developing severe toxicity. Additional analyses are required to determine the correct dose of fluoropyrimidine treatment. In patients carrying multiple DPYD variants, we recommend that a DPD phenotyping assay be carried out to determine a safe starting dose.


September 22, 2019

Molecular characteristics and comparative genomics analysis of a clinical Enterococcus casseliflavus with a resistance plasmid.

The aim of this work was to investigate the molecular characterization of a clinical Enterococcus casseliflavus strain with a resistance plasmid.En. casseliflavus EC369 was isolated from a patient in a hospital in southern China. The minimum inhibitory concentration was found by means of the agar dilution method to determine the antimicrobial susceptibilities of the strains. Whole-genome sequencing and comparative genomics analysis were performed to analyze the mechanism of antibiotic resistance and the horizontal gene transfer of the resistance gene-related mobile genetic elements.En. casseliflavus EC369 showed resistance to erythromycin, kanamycin, and streptomycin, but was susceptible to vancomycin, ampicillin, and streptothricin and other antimicrobials. There were six resistance genes (aph3′, ant6, bla, sat4, and two ermBs) carried by a transposon identified on the plasmid pEC369 and a complete resistance gene cluster of vancomycin and a tet (M) gene encoded on the chromosome. This is the first complete plasmid sequence reported in clinically isolated En. casseliflavus. The plasmid with the greatest sequence identity with pEC369 was the plasmid of Enterococcus sp. FDAARGOS_375, followed by the plasmids of Enterococcus faecium strains F12085 and pRE25, whereas the sequence with the greatest identity to the resistance genes carrying a transposon of pEC369 was on the chromosome of Staphylococcus aureus strain GD1677.The resistance profiles of En. casseliflavus EC369 might contribute to the resistance genes encoded on the plasmid. The fact that the most similar sequence to the transposon carrying resistance genes of pEC369 was encoded in the chromosome of a S. aureus strain provides insights into the mechanism of dissemination of multidrug resistance between bacteria of different species or genera through horizontal gene transfer.


September 22, 2019

Extensive and deep sequencing of the Venter/HuRef genome for developing and benchmarking genome analysis tools.

We produced an extensive collection of deep re-sequencing datasets for the Venter/HuRef genome using the Illumina massively-parallel DNA sequencing platform. The original Venter genome sequence is a very-high quality phased assembly based on Sanger sequencing. Therefore, researchers developing novel computational tools for the analysis of human genome sequence variation for the dominant Illumina sequencing technology can test and hone their algorithms by making variant calls from these Venter/HuRef datasets and then immediately confirm the detected variants in the Sanger assembly, freeing them of the need for further experimental validation. This process also applies to implementing and benchmarking existing genome analysis pipelines. We prepared and sequenced 200?bp and 350?bp short-insert whole-genome sequencing libraries (sequenced to 100x and 40x genomic coverages respectively) as well as 2?kb, 5?kb, and 12?kb mate-pair libraries (49x, 122x, and 145x physical coverages respectively). Lastly, we produced a linked-read library (128x physical coverage) from which we also performed haplotype phasing.


September 22, 2019

DNA Methylation by Restriction Modification Systems Affects the Global Transcriptome Profile in Borrelia burgdorferi.

Prokaryote restriction modification (RM) systems serve to protect bacteria from potentially detrimental foreign DNA. Recent evidence suggests that DNA methylation by the methyltransferase (MTase) components of RM systems can also have effects on transcriptome profiles. The type strain of the causative agent of Lyme disease, Borrelia burgdorferi B31, possesses two RM systems with N6-methyladenosine (m6A) MTase activity, which are encoded by the bbe02 gene located on linear plasmid lp25 and bbq67 on lp56. The specific recognition and/or methylation sequences had not been identified for either of these B. burgdorferi MTases, and it was not previously known whether these RM systems influence transcript levels. In the current study, single-molecule real-time sequencing was utilized to map genome-wide m6A sites and to identify consensus modified motifs in wild-type B. burgdorferi as well as MTase mutants lacking either the bbe02 gene alone or both bbe02 and bbq67 genes. Four novel conserved m6A motifs were identified and were fully attributable to the presence of specific MTases. Whole-genome transcriptome changes were observed in conjunction with the loss of MTase enzymes, indicating that DNA methylation by the RM systems has effects on gene expression. Genes with altered transcription in MTase mutants include those involved in vertebrate host colonization (e.g., rpoS regulon) and acquisition by/transmission from the tick vector (e.g., rrp1 and pdeB). The results of this study provide a comprehensive view of the DNA methylation pattern in B. burgdorferi, and the accompanying gene expression profiles add to the emerging body of research on RM systems and gene regulation in bacteria.IMPORTANCE Lyme disease is the most prevalent vector-borne disease in North America and is classified by the Centers for Disease Control and Prevention (CDC) as an emerging infectious disease with an expanding geographical area of occurrence. Previous studies have shown that the causative bacterium, Borrelia burgdorferi, methylates its genome using restriction modification systems that enable the distinction from foreign DNA. Although much research has focused on the regulation of gene expression in B. burgdorferi, the effect of DNA methylation on gene regulation has not been evaluated. The current study characterizes the patterns of DNA methylation by restriction modification systems in B. burgdorferi and evaluates the resulting effects on gene regulation in this important pathogen. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Discovery of the actinoplanic acid pathway in Streptomyces rapamycinicus reveals a genetically conserved synergism with rapamycin.

Actinobacteria possess a great wealth of pathways for production of bioactive compounds. Following advances in genome mining, dozens of natural product (NP) gene clusters are routinely found in each actinobacterial genome; however, the modus operandi of this large arsenal is poorly understood. During investigations of the secondary metabolome of Streptomyces rapamycinicus, the producer of rapamycin, we observed accumulation of two compounds never before reported from this organism. Structural elucidation revealed actinoplanic acid A and its demethyl analogue. Actinoplanic acids (APLs) are potent inhibitors of Ras farnesyltransferase and therefore represent bioactive compounds of medicinal interest. Supported with the unique structure of these polyketides and using genome mining, we identified a gene cluster responsible for their biosynthesis in S. rapamycinicus Based on experimental evidence and genetic organization of the cluster, we propose a stepwise biosynthesis of APL, the first bacterial example of a pathway incorporating the rare tricarballylic moiety into an NP. Although phylogenetically distant, the pathway shares some of the biosynthetic principles with the mycotoxins fumonisins. Namely, the core polyketide is acylated with the tricarballylate by an atypical nonribosomal peptide synthetase-catalyzed ester formation. Finally, motivated by the conserved colocalization of the rapamycin and APL pathway clusters in S. rapamycinicus and all other rapamycin-producing actinobacteria, we confirmed a strong synergism of these compounds in antifungal assays. Mining for such evolutionarily conserved coharboring of pathways would likely reveal further examples of NP sets, attacking multiple targets on the same foe. These could then serve as a guide for development of new combination therapies.© 2018 Mrak et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.