Variant detection Archives - Page 27 of 65

September 22, 2019

Genomic structural variations within five continental populations of Drosophila melanogaster.

Chromosomal structural variations (SV) including insertions, deletions, inversions, and translocations occur within the genome and can have a significant effect on organismal phenotype. Some of these effects are caused by structural variations containing genes. Large structural variations represent a significant amount of the genetic diversity within a population. We used a global sampling of Drosophila melanogaster (Ithaca, Zimbabwe, Beijing, Tasmania, and Netherlands) to represent diverse populations within the species. We used long-read sequencing and optical mapping technologies to identify SVs in these genomes. Among the five lines examined, we found an average of 2,928 structural variants within these genomes. These structural variations varied greatly in size and location, included many exonic regions, and could impact adaptation and genomic evolution. Copyright © 2018 Long et al.

September 22, 2019

Production of glycine-derived ammonia as a low-cost and long-distance antibiotic strategy by Streptomyces

Soil-inhabiting streptomycetes are Natures medicine makers, producing over half of all known antibiotics and many other bioactive natural products. However, these bacteria also produce many volatile compounds, and research into these molecules and their role in soil ecology is rapidly gaining momentum. Here we show that streptomycetes have the ability to kill bacteria over long distances via air-borne antibiosis. Our research shows that streptomycetes do so by producing surprisingly high amounts of the low-cost volatile antimicrobial ammonia, which travels over long distances and antagonises both Gram-positive and Gram-negative bacteria. Glycine is required as precursor to produce ammonia, and inactivation of the glycine cleavage system annihilated air-borne antibiosis. As a resistance strategy, E. coli cells acquired mutations resulting in reduced expression of the porin master regulator OmpR and its cognate kinase EnvZ, which was just enough to allow them to survive. We further show that ammonia enhances the activity of the more costly canonical antibiotics, suggesting that streptomycetes adopt a low-cost strategy to sensitize competitors for antibiosis over longer distances.

September 22, 2019

Discovery of mcr-1-mediated colistin resistance in a highly virulent Escherichia coli lineage.

Resistance to last-line polymyxins mediated by the plasmid-borne mobile colistin resistance gene (mcr-1) represents a new threat to global human health. Here we present the complete genome sequence of an mcr-1-positive multidrug-resistant Escherichia coli strain (MS8345). We show that MS8345 belongs to serotype O2:K1:H4, has a large 241,164-bp IncHI2 plasmid that carries 15 other antibiotic resistance genes (including the extended-spectrum ß-lactamase blaCTX-M-1) and 3 putative multidrug efflux systems, and contains 14 chromosomally encoded antibiotic resistance genes. MS8345 also carries a large ColV-like virulence plasmid that has been associated with E. coli bacteremia. Whole-genome phylogeny revealed that MS8345 clusters within a discrete clade in the sequence type 95 (ST95) lineage, and MS8345 is very closely related to the highly virulent O45:K1:H4 clone associated with neonatal meningitis. Overall, the acquisition of a plasmid carrying resistance to colistin and multiple other antibiotics in this virulent E. coli lineage is concerning and might herald an era where the empirical treatment of ST95 infections becomes increasingly more difficult.IMPORTANCEEscherichia coli ST95 is a globally disseminated clone frequently associated with bloodstream infections and neonatal meningitis. However, the ST95 lineage is defined by low levels of drug resistance amongst clinical isolates, which normally provides for uncomplicated treatment options. Here, we provide the first detailed genomic analysis of an E. coli ST95 isolate that has both high virulence potential and resistance to multiple antibiotics. Using the genome, we predicted its virulence and antibiotic resistance mechanisms, which include resistance to last-line antibiotics mediated by the plasmid-borne mcr-1 gene. Finding an ST95 isolate resistant to nearly all antibiotics that also has a high virulence potential is of major clinical importance and underscores the need to monitor new and emerging trends in antibiotic resistance development in this important global lineage. Copyright © 2018 Forde et al.

September 22, 2019

The genomic basis of color pattern polymorphism in the Harlequin ladybird.

Many animal species comprise discrete phenotypic forms. A common example in natural populations of insects is the occurrence of different color patterns, which has motivated a rich body of ecological and genetic research [1-6]. The occurrence of dark, i.e., melanic, forms displaying discrete color patterns is found across multiple taxa, but the underlying genomic basis remains poorly characterized. In numerous ladybird species (Coccinellidae), the spatial arrangement of black and red patches on adult elytra varies wildly within species, forming strikingly different complex color patterns [7, 8]. In the harlequin ladybird, Harmonia axyridis, more than 200 distinct color forms have been described, which classic genetic studies suggest result from allelic variation at a single, unknown, locus [9, 10]. Here, we combined whole-genome sequencing, population-based genome-wide association studies, gene expression, and functional analyses to establish that the transcription factor Pannier controls melanic pattern polymorphism in H. axyridis. We show that pannier is necessary for the formation of melanic elements on the elytra. Allelic variation in pannier leads to protein expression in distinct domains on the elytra and thus determines the distinct color patterns in H. axyridis. Recombination between pannier alleles may be reduced by a highly divergent sequence of ~170 kb in the cis-regulatory regions of pannier, with a 50 kb inversion between color forms. This most likely helps maintain the distinct alleles found in natural populations. Thus, we propose that highly variable discrete color forms can arise in natural populations through cis-regulatory allelic variation of a single gene. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.

September 22, 2019

Genome analysis of the yeast M14, an industrial brewing yeast strain widely used in China

The lager brewing yeast M14 is the most widely used yeast strain in the high gravity brewing process in China. To investigate the characteristics of this strain, the genome of the yeast M14 was sequenced and the genome annotation information is presented in this study. The current assembly contained 133 scaffolds and its total size was around 23?Mb with a GC content of 38.98%. The brewing yeast M14 is a hybrid Saccharomyces cerevisiae?×?Saccharomyces uvarum at the genomic level and its genome is comprised of one circular mitochondrial genome originating from S. uvarum. Furthermore, the functions of the 9,796 protein coding genes were annotated and their functions were analyzed using the Swiss-Prot database. Among them, the key genes responsible for typical lager brewing yeast characteristics, such as maltotriose uptake and sulfite production, were annotated and analyzed. Interestingly, nine specific genes present in the brewing yeast M14 were not found in the genome of either S. uvarum CBS 7001 or S. cerevisiae S288C, which are very close to strain M14 in the phylogenetic relationship. These nine genes encoding proteins were melibiase, DNA replication protein, fructose symporter, hypothetical protein, hypothetical protein M773_09155, LIF1, minor spike protein H, ribosomal protein S27, and mitochondrial chaperones, respectively. The genome sequence of the yeast strain M14 provides a new tool to better understand brewing yeast behavior in industrial beer production.

September 22, 2019

How long are long tandem repeats? A challenge for current methods of whole-genome sequence assembly: The case of satellites in Caenorhabditis elegans.

Repetitive genome regions have been difficult to sequence, mainly because of the comparatively small size of the fragments used in assembly. Satellites or tandem repeats are very abundant in nematodes and offer an excellent playground to evaluate different assembly methods. Here, we compare the structure of satellites found in three different assemblies of the Caenorhabditis elegans genome: the original sequence obtained by Sanger sequencing, an assembly based on PacBio technology, and an assembly using Nanopore sequencing reads. In general, satellites were found in equivalent genomic regions, but the new long-read methods (PacBio and Nanopore) tended to result in longer assembled satellites. Important differences exist between the assemblies resulting from the two long-read technologies, such as the sizes of long satellites. Our results also suggest that the lengths of some annotated genes with internal repeats which were assembled using Sanger sequencing are likely to be incorrect.

September 22, 2019

Repeat elements organise 3D genome structure and mediate transcription in the filamentous fungus Epichloë festucae.

Structural features of genomes, including the three-dimensional arrangement of DNA in the nucleus, are increasingly seen as key contributors to the regulation of gene expression. However, studies on how genome structure and nuclear organisation influence transcription have so far been limited to a handful of model species. This narrow focus limits our ability to draw general conclusions about the ways in which three-dimensional structures are encoded, and to integrate information from three-dimensional data to address a broader gamut of biological questions. Here, we generate a complete and gapless genome sequence for the filamentous fungus, Epichloë festucae. We use Hi-C data to examine the three-dimensional organisation of the genome, and RNA-seq data to investigate how Epichloë genome structure contributes to the suite of transcriptional changes needed to maintain symbiotic relationships with the grass host. Our results reveal a genome in which very repeat-rich blocks of DNA with discrete boundaries are interspersed by gene-rich sequences that are almost repeat-free. In contrast to other species reported to date, the three-dimensional structure of the genome is anchored by these repeat blocks, which act to isolate transcription in neighbouring gene-rich regions. Genes that are differentially expressed in planta are enriched near the boundaries of these repeat-rich blocks, suggesting that their three-dimensional orientation partly encodes and regulates the symbiotic relationship formed by this organism.

September 22, 2019

Complete genome sequence and characterization of linezolid-resistant Enterococcus faecalis clinical isolate KUB3006 carrying a cfr(B)-transposon on its chromosome and optrA-plasmid.

Linezolid (LZD) has become one of the most important antimicrobial agents for infections caused by gram-positive bacteria, including those caused by Enterococcus species. LZD-resistant (LR) genetic features include mutations in 23S rRNA/ribosomal proteins, a plasmid-borne 23S rRNA methyltransferase gene cfr, and ribosomal protection genes (optrA and poxtA). Recently, a cfr gene variant, cfr(B), was identified in a Tn6218-like transposon (Tn) in a Clostridioides difficile isolate. Here, we isolated an LR Enterococcus faecalis clinical isolate, KUB3006, from a urine specimen of a patient with urinary tract infection during hospitalization in 2017. Comparative and whole-genome analyses were performed to characterize the genetic features and overall antimicrobial resistance genes in E. faecalis isolate KUB3006. Complete genome sequencing of KUB3006 revealed that it carried cfr(B) on a chromosomal Tn6218-like element. Surprisingly, this Tn6218-like element was almost (99%) identical to that of C. difficile Ox3196, which was isolated from a human in the UK in 2012, and to that of Enterococcus faecium 5_Efcm_HA-NL, which was isolated from a human in the Netherlands in 2012. An additional oxazolidinone and phenicol resistance gene, optrA, was also identified on a plasmid. KUB3006 is sequence type (ST) 729, suggesting that it is a minor ST that has not been reported previously and is unlikely to be a high-risk E. faecalis lineage. In summary, LR E. faecalis KUB3006 possesses a notable Tn6218-like-borne cfr(B) and a plasmid-borne optrA. This finding raises further concerns regarding the potential declining effectiveness of LZD treatment in the future.

September 22, 2019

Loss of bacitracin resistance due to a large genomic deletion among Bacillus anthracis strains.

Bacillus anthracis is a Gram-positive endospore-forming bacterial species that causes anthrax in both humans and animals. In Zambia, anthrax cases are frequently reported in both livestock and wildlife, with occasional transmission to humans, causing serious public health problems in the country. To understand the genetic diversity of B. anthracis strains in Zambia, we sequenced and compared the genomic DNA of B. anthracis strains isolated across the country. Single nucleotide polymorphisms clustered these strains into three groups. Genome sequence comparisons revealed a large deletion in strains belonging to one of the groups, possibly due to unequal crossing over between a pair of rRNA operons. The deleted genomic region included genes conferring resistance to bacitracin, and the strains with the deletion were confirmed with loss of bacitracin resistance. Similar deletions between rRNA operons were also observed in a few B. anthracis strains phylogenetically distant from Zambian strains. The structure of bacitracin resistance genes flanked by rRNA operons was conserved only in members of the Bacillus cereus group. The diversity and genomic characteristics of B. anthracis strains determined in this study would help in the development of genetic markers and treatment of anthrax in Zambia. IMPORTANCE Anthrax is caused by Bacillus anthracis, an endospore-forming soil bacterium. The genetic diversity of B. anthracis is known to be low compared with that of Bacillus species. In this study, we performed whole-genome sequencing of Zambian isolates of B. anthracis to understand the genetic diversity between closely related strains. Comparison of genomic sequences revealed that closely related strains were separated into three groups based on single nucleotide polymorphisms distributed throughout the genome. A large genomic deletion was detected in the region containing a bacitracin resistance gene cluster flanked by rRNA operons, resulting in the loss of bacitracin resistance. The structure of the deleted region, which was also conserved among species of the Bacillus cereus group, has the potential for both deletion and amplification and thus might be enabling the species to flexibly control the level of bacitracin resistance for adaptive evolution.

September 22, 2019

A complete Cannabis chromosome assembly and adaptive admixture for elevated cannabidiol (CBD) content

Cannabis has been cultivated for millennia with distinct cultivars providing either fiber and grain or tetrahydrocannabinol. Recent demand for cannabidiol rather than tetrahydrocannabinol has favored the breeding of admixed cultivars with extremely high cannabidiol content. Despite several draft Cannabis genomes, the genomic structure of cannabinoid synthase loci has remained elusive. A genetic map derived from a tetrahydrocannabinol/cannabidiol segregating population and a complete chromosome assembly from a high-cannabidiol cultivar together resolve the linkage of cannabidiolic and tetrahydrocannabinolic acid synthase gene clusters which are associated with transposable elements. High-cannabidiol cultivars appear to have been generated by integrating hemp-type cannabidiolic acid synthase gene clusters into a background of marijuana-type cannabis. Quantitative trait locus mapping suggests that overall drug potency, however, is associated with other genomic regions needing additional study.

September 22, 2019

Physiological genomics of dietary adaptation in a marine herbivorous fish

Adopting a new diet is a significant evolutionary change and can profoundly affect an animaltextquoterights physiology, biochemistry, ecology, and its genome. To study this evolutionary transition, we investigated the physiology and genomics of digestion of a derived herbivorous fish, the monkeyface prickleback (Cebidichthys violaceus). We sequenced and assembled its genome and digestive transcriptome and revealed the molecular changes related to important dietary enzymes, finding abundant evidence for adaptation at the molecular level. In this species, two gene families experienced expansion in copy number and adaptive amino acid substitutions. These families, amylase, and bile salt activated lipase, are involved digestion of carbohydrates and lipids, respectively. Both show elevated levels of gene expression and increased enzyme activity. Because carbohydrates are abundant in the pricklebacktextquoterights diet and lipids are rare, these findings suggest that such dietary specialization involves both exploiting abundant resources and scavenging rare ones, especially essential nutrients, like essential fatty acids.

September 22, 2019

Targeted genotyping of variable number tandem repeats with adVNTR.

Whole-genome sequencing is increasingly used to identify Mendelian variants in clinical pipelines. These pipelines focus on single-nucleotide variants (SNVs) and also structural variants, while ignoring more complex repeat sequence variants. Here, we consider the problem of genotyping Variable Number Tandem Repeats (VNTRs), composed of inexact tandem duplications of short (6-100 bp) repeating units. VNTRs span 3% of the human genome, are frequently present in coding regions, and have been implicated in multiple Mendelian disorders. Although existing tools recognize VNTR carrying sequence, genotyping VNTRs (determining repeat unit count and sequence variation) from whole-genome sequencing reads remains challenging. We describe a method, adVNTR, that uses hidden Markov models to model each VNTR, count repeat units, and detect sequence variation. adVNTR models can be developed for short-read (Illumina) and single-molecule (Pacific Biosciences [PacBio]) whole-genome and whole-exome sequencing, and show good results on multiple simulated and real data sets.© 2018 Bakhtiari et al.; Published by Cold Spring Harbor Laboratory Press.

September 22, 2019

A continuous genome assembly of the corkwing wrasse (Symphodus melops).

The wrasses (Labridae) are one of the most successful and species-rich families of the Perciformes order of teleost fish. Its members display great morphological diversity, and occupy distinct trophic levels in coastal waters and coral reefs. The cleaning behaviour displayed by some wrasses, such as corkwing wrasse (Symphodus melops), is of particular interest for the salmon aquaculture industry to combat and control sea lice infestation as an alternative to chemicals and pharmaceuticals. There are still few genome assemblies available within this fish family for comparative and functional studies, despite the rapid increase in genome resources generated during the past years. Here, we present a highly continuous genome assembly of the corkwing wrasse using PacBio SMRT sequencing (x28.8) followed by error correction with paired-end Illumina data (x132.9). The present genome assembly consists of 5040 contigs (N50?=?461,652?bp) and a total size of 614 Mbp, of which 8.5% of the genome sequence encode known repeated elements. The genome assembly covers 94.21% of highly conserved genes across ray-finned fish species. We find evidence for increased copy numbers specific for corkwing wrasse possibly highlighting diversification and adaptive processes in gene families including N-linked glycosylation (ST8SIA6) and stress response kinases (HIPK1). By comparative analyses, we discover that de novo repeats, often not properly investigated during genome annotation, encode hundreds of immune-related genes. This new genomic resource, together with the ballan wrasse (Labrus bergylta), will allow for in-depth comparative genomics as well as population genetic analyses for the understudied wrasses. Copyright © 2018 Elsevier Inc. All rights reserved.

September 22, 2019

Combining probabilistic alignments with read pair information improves accuracy of split-alignments.

Split-alignments provide base-pair-resolution evidence of genomic rearrangements. In practice, they are found by first computing high-scoring local alignments, parts of which are then combined into a split-alignment. This approach is challenging when aligning a short read to a large and repetitive reference, as it tends to produce many spurious local alignments leading to ambiguities in identifying the correct split-alignment. This problem is further exacerbated by the fact that rearrangements tend to occur in repeat-rich regions.We propose a split-alignment technique that combats the issue of ambiguous alignments by combining information from probabilistic alignment with positional information from paired-end reads. We demonstrate that our method finds accurate split-alignments, and that this translates into improved performance of variant-calling tools that rely on split-alignments.An open-source implementation is freely available at: https://bitbucket.org/splitpairedend/last-split-pe.Supplementary data are available at Bioinformatics online.

September 22, 2019

Computational tools to unmask transposable elements.

A substantial proportion of the genome of many species is derived from transposable elements (TEs). Moreover, through various self-copying mechanisms, TEs continue to proliferate in the genomes of most species. TEs have contributed numerous regulatory, transcript and protein innovations and have also been linked to disease. However, notwithstanding their demonstrated impact, many genomic studies still exclude them because their repetitive nature results in various analytical complexities. Fortunately, a growing array of methods and software tools are being developed to cater for them. This Review presents a summary of computational resources for TEs and highlights some of the challenges and remaining gaps to perform comprehensive genomic analyses that do not simply ‘mask’ repeats.

Auto Tag: Variant detection

Genomic structural variations within five continental populations of Drosophila melanogaster.

Production of glycine-derived ammonia as a low-cost and long-distance antibiotic strategy by Streptomyces

Discovery of mcr-1-mediated colistin resistance in a highly virulent Escherichia coli lineage.

The genomic basis of color pattern polymorphism in the Harlequin ladybird.

Genome analysis of the yeast M14, an industrial brewing yeast strain widely used in China

How long are long tandem repeats? A challenge for current methods of whole-genome sequence assembly: The case of satellites in Caenorhabditis elegans.

Repeat elements organise 3D genome structure and mediate transcription in the filamentous fungus Epichloë festucae.

Complete genome sequence and characterization of linezolid-resistant Enterococcus faecalis clinical isolate KUB3006 carrying a cfr(B)-transposon on its chromosome and optrA-plasmid.

Loss of bacitracin resistance due to a large genomic deletion among Bacillus anthracis strains.

A complete Cannabis chromosome assembly and adaptive admixture for elevated cannabidiol (CBD) content

Physiological genomics of dietary adaptation in a marine herbivorous fish

Targeted genotyping of variable number tandem repeats with adVNTR.

A continuous genome assembly of the corkwing wrasse (Symphodus melops).

Combining probabilistic alignments with read pair information improves accuracy of split-alignments.

Computational tools to unmask transposable elements.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert