Fragmentation Archives - Page 9 of 10

September 22, 2019

A model for the evolution of prokaryotic DNA restriction-modification systems based upon the structural malleability of Type I restriction-modification enzymes.

Restriction Modification (RM) systems prevent the invasion of foreign genetic material into bacterial cells by restriction and protect the host’s genetic material by methylation. They are therefore important in maintaining the integrity of the host genome. RM systems are currently classified into four types (I to IV) on the basis of differences in composition, target recognition, cofactors and the manner in which they cleave DNA. Comparing the structures of the different types, similarities can be observed suggesting an evolutionary link between these different types. This work describes the ‘deconstruction’ of a large Type I RM enzyme into forms structurally similar to smaller Type II RM enzymes in an effort to elucidate the pathway taken by Nature to form these different RM enzymes. Based upon the ability to engineer new enzymes from the Type I ‘scaffold’, an evolutionary pathway and the evolutionary pressures required to move along the pathway from Type I RM systems to Type II RM systems are proposed. Experiments to test the evolutionary model are discussed.

September 22, 2019

Genomic assemblies of newly sequenced Trypanosoma cruzi strains reveal new genomic expansion and greater complexity.

Chagas disease is a complex illness caused by the protozoan Trypanosoma cruzi displaying highly diverse clinical outcomes. In this sense, the genome sequence elucidation and comparison between strains may lead to disease understanding. Here, two new T. cruzi strains, have been sequenced, Y using Illumina and Bug2148 using PacBio, assembled, analyzed and compared with the T. cruzi annotated genomes available to date. The assembly stats from the new sequences show effective improvement of T. cruzi genome over the actual ones. Such as, the largest contig assembled (1.3?Mb in Bug2148) in de novo attempts and the highest mean assembly coverage (71X for Y). Our analysis reveals a new genomic expansion and greater complexity for those multi-copy gene families related to infection process and disease development, such as Trans-sialidases, Mucins and Mucin Associated Surface Proteins, among others. On one side, we demonstrate that multi-copy gene families are located near telomeric regions of the “chromosome-like” 1.3?Mb contig assembled of Bug2148, where they likely suffer high evolutive pressure. On the other hand, we identified several strain-specific single copy genes that might help to understand the differences in infectivity and physiology among strains. In summary, our results indicate that T. cruzi has a complex genomic architecture that may have promoted its evolution.

September 22, 2019

Comparative analysis of blaKPC-2- and rmtB-carrying IncFII-family pKPC-LK30/pHN7A8 hybrid plasmids from Klebsiella pneumoniae CG258 strains disseminated among multiple Chinese hospitals.

We recently reported the complete sequence of a blaKPC-2- and rmtB-carrying IncFII-family plasmid p675920-1 with the pKPC-LK30/pHN7A8 hybrid structure. Comparative genomics of additional sequenced plasmids with similar hybrid structures and their prevalence in blaKPC-carrying Klebsiella pneumoniae strains from China were investigated in this follow-up study.A total of 51 blaKPC-carrying K. pneumoniae strains were isolated from 2012 to 2016 from five Chinese hospitals and genotyped by multilocus sequence typing. The blaKPC-carrying plasmids from four representative strains were sequenced and compared with p675920-1 and pCT-KPC. Plasmid transfer, carbapenemase activity determination, and bacterial antimicrobial susceptibility test were performed to characterize resistance phenotypes mediated by these plasmids. The prevalence of pCT-KPC-like plasmids in these blaKPC-carrying K. pneumoniae strains was screened by PCR.The six KPC-encoding plasmids p1068-KPC, p20049-KPC, p12139-KPC and p64917-KPC (sequenced in this study) and p675920-1 and pCT-KPC slightly differed from one another due to deletion and acquisition of various backbone and accessory regions. Two major accessory resistance regions, which included the blaKPC-2 region harboring blaKPC-2 (carbapenem resistance) and blaSHV-12 (ß-lactam resistance), and the MDR region carrying rmtB (aminoglycoside resistance), fosA3 (fosfomycin resistance), blaTEM-1B (ß-lactam resistance) and blaCTX-M-65 (ß-lactam resistance), were found in each of these six plasmids and exhibited several parallel evolution routes. The pCT-KPC-like plasmids were present in all the 51 K. pneumoniae isolates, all of which belonged to CG258.There was clonal dissemination of K. pneumoniae CG258 strains, harboring blaKPC-2- and rmtB-carrying IncFII-family pKPC-LK30/pHN7A8 hybrid plasmids, among multiple Chinese hospitals.

September 22, 2019

A continuous genome assembly of the corkwing wrasse (Symphodus melops).

The wrasses (Labridae) are one of the most successful and species-rich families of the Perciformes order of teleost fish. Its members display great morphological diversity, and occupy distinct trophic levels in coastal waters and coral reefs. The cleaning behaviour displayed by some wrasses, such as corkwing wrasse (Symphodus melops), is of particular interest for the salmon aquaculture industry to combat and control sea lice infestation as an alternative to chemicals and pharmaceuticals. There are still few genome assemblies available within this fish family for comparative and functional studies, despite the rapid increase in genome resources generated during the past years. Here, we present a highly continuous genome assembly of the corkwing wrasse using PacBio SMRT sequencing (x28.8) followed by error correction with paired-end Illumina data (x132.9). The present genome assembly consists of 5040 contigs (N50?=?461,652?bp) and a total size of 614 Mbp, of which 8.5% of the genome sequence encode known repeated elements. The genome assembly covers 94.21% of highly conserved genes across ray-finned fish species. We find evidence for increased copy numbers specific for corkwing wrasse possibly highlighting diversification and adaptive processes in gene families including N-linked glycosylation (ST8SIA6) and stress response kinases (HIPK1). By comparative analyses, we discover that de novo repeats, often not properly investigated during genome annotation, encode hundreds of immune-related genes. This new genomic resource, together with the ballan wrasse (Labrus bergylta), will allow for in-depth comparative genomics as well as population genetic analyses for the understudied wrasses. Copyright © 2018 Elsevier Inc. All rights reserved.

September 22, 2019

Full-length extension of HLA allele sequences by HLA allele-specific hemizygous Sanger sequencing (SSBT).

The gold standard for typing at the allele level of the highly polymorphic Human Leucocyte Antigen (HLA) gene system is sequence based typing. Since sequencing strategies have mainly focused on identification of the peptide binding groove, full-length sequence information is lacking for >90% of the HLA alleles. One of the goals of the 17th IHIWS workshop is to establish full-length sequences for as many HLA alleles as possible. In our component “Extension of HLA sequences by full-length HLA allele-specific hemizygous Sanger sequencing” we have used full-length hemizygous Sanger Sequence Based Typing to achieve this goal. We selected samples of which full length sequences were not available in the IPD-IMGT/HLA database. In total we have generated the full-length sequences of 48 HLA-A, 45 -B and 31 -C alleles. For HLA-A extended alleles, 39/48 showed no intron differences compared to the first allele of the corresponding allele group, for HLA-B this was 26/45 and for HLA-C 20/31. Comparing the intron sequences to other alleles of the same allele group revealed that in 5/48 HLA-A, 16/45 HLA-B and 8/31 HLA-C alleles the intron sequence was identical to another allele of the same allele group. In the remaining 10 cases, the sequence either showed polymorphism at a conserved nucleotide or was the result of a gene conversion event. Elucidation of the full-length sequence gives insight in the polymorphic content of the alleles and facilitates the identification of its evolutionary origin. Copyright © 2018 American Society for Histocompatibility and Immunogenetics. All rights reserved.

September 22, 2019

Genome mining of Streptomyces xinghaiensis NRRL B-24674T for the discovery of the gene cluster involved in anticomplement activities and detection of novel xiamycin analogs.

Marine actinobacterium Streptomyces xinghaiensis NRRL B-24674T has been characterized as a novel species, but thus far, its biosynthetic potential remains unexplored. In this study, the high-quality genome sequence of S. xinghaiensis NRRL B-24674T was obtained, and the production of anticomplement agents, xiamycin analogs, and siderophores was investigated by genome mining. Anticomplement compounds are valuable for combating numerous diseases caused by the abnormal activation of the human complement system. The biosynthetic gene cluster (BGC) nrps1 resembles that of complestatins, which are potent microbial-derived anticomplement agents. The identification of the nrps1 BGC revealed a core peptide that differed from that in complestatin; thus, we studied the anticomplement activity of this strain. The culture broth of S. xinghaiensis NRRL B-24674T displayed good anticomplement activity. Subsequently, the disruption of the genes in the nrps1 BGC resulted in the loss of anticomplement activity, confirming the involvement of this BGC in the biosynthesis of anticomplement agents. In addition, the mining of the BGC tep5, which resembles that of the antiviral pentacyclic indolosesquiterpene xiamycin, resulted in the discovery of nine xiamycin analogs, including three novel compounds. In addition to the BGCs responsible for desferrioxamine B, neomycin, ectoine, and carotenoid, 18 BGCs present in the genome are predicted to be novel. The results of this study unveil the potential of S. xinghaiensis as a producer of novel anticomplement agents and provide a basis for further exploration of the biosynthetic potential of S. xinghaiensis NRRL B-24674T for the discovery of novel bioactive compounds by genome mining.

September 22, 2019

A complete Leishmania donovani reference genome identifies novel genetic variations associated with virulence.

Leishmania donovani is responsible for visceral leishmaniasis, a neglected and lethal parasitic disease with limited treatment options and no vaccine. The study of L. donovani has been hindered by the lack of a high-quality reference genome and this can impact experimental outcomes including the identification of virulence genes, drug targets and vaccine development. We therefore generated a complete genome assembly by deep sequencing using a combination of second generation (Illumina) and third generation (PacBio) sequencing technologies. Compared to the current L. donovani assembly, the genome assembly reported within resulted in the closure over 2,000 gaps, the extension of several chromosomes up to telomeric repeats and the re-annotation of close to 15% of protein coding genes and the annotation of hundreds of non-coding RNA genes. It was possible to correctly assemble the highly repetitive A2 and Amastin virulence gene clusters. A comparative sequence analysis using the improved reference genome confirmed 70 published and identified 15 novel genomic differences between closely related visceral and atypical cutaneous disease-causing L. donovani strains providing a more complete map of genes associated with virulence and visceral organ tropism. Bioinformatic tools including protein variation effect analyzer and basic local alignment search tool were used to prioritize a list of potential virulence genes based on mutation severity, gene conservation and function. This complete genome assembly and novel information on virulence factors will support the identification of new drug targets and the development of a vaccine for L. donovani.

September 22, 2019

The central exons of the human MUC2 and MUC6 mucins are highly repetitive and variable in sequence between individuals

The DNA sequence of the two human mucin genes MUC2 and MUC6 have not been completely resolved due to the repetitive nature of their central exon coding for Proline, Threonine and Serine rich sequences. The exact nucleotide sequence of these exons has remained unknown for a long time due to limitations in traditional sequencing techniques. These are still very poorly covered in new whole genome sequencing projects with the corresponding protein sequences partly missing. We used a BAC clone containing both these genes and third generation sequencing technology, SMRT sequencing, to obtain the full-length contiguous MUC2 and MUC6 tandem repeat sequences. The new sequences span the entire repeat regions with good coverage revealing their length, variation in repeat sequences and their internal organization. The sequences obtained were used to compare with available sequences from whole genome sequencing projects indicating variation in number of repeats and their internal organization between individuals. The lack of these sequences has limited the association of genetic alterations with disease. The full sequences of these mucins will now allow such studies, which could be of importance for inflammatory bowel diseases for MUC2 and gastric ulcer diseases for MUC6 where deficient mucus protection is assumed to play an important role.

September 22, 2019

Genome sequence of the potato pathogenic fungus Alternaria solani HWC-168 reveals clues for its conidiation and virulence.

Alternaria solani is a known air-born deuteromycete fungus with a polycyclic life cycle and is the causal agent of early blight that causes significant yield losses of potato worldwide. However, the molecular mechanisms underlying the conidiation and pathogenicity remain largely unknown.We produced a high-quality genome assembly of A. solani HWC-168 that was isolated from a major potato-producing region of Northern China, which facilitated a comprehensive gene annotation, the accurate prediction of genes encoding secreted proteins and identification of conidiation-related genes. The assembled genome of A. solani HWC-168 has a genome size 32.8 Mb and encodes 10,358 predicted genes that are highly similar with related Alternaria species including Alternaria arborescens and Alternaria brassicicola. We identified conidiation-related genes in the genome of A. solani HWC-168 by searching for sporulation-related homologues identified from Aspergillus nidulans. A total of 975 secreted protein-encoding genes, which might act as virulence factors, were identified in the genome of A. solani HWC-168. The predicted secretome of A. solani HWC-168 possesses 261 carbohydrate-active enzymes (CAZy), 119 proteins containing RxLx[EDQ] motif and 27 secreted proteins unique to A. solani.Our findings will facilitate the identification of conidiation- and virulence-related genes in the genome of A. solani. This will permit new insights into understanding the molecular mechanisms underlying the A. solani-potato pathosystem and will add value to the global fungal genome database.

September 22, 2019

Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.

Recent developments in third-gen long read sequencing and diploid-aware assemblers have resulted in the rapid release of numerous reference-quality assemblies for diploid genomes. However, assembly of highly heterozygous genomes is still problematic when regional heterogeneity is so high that haplotype homology is not recognised during assembly. This results in regional duplication rather than consolidation into allelic variants and can cause issues with downstream analysis, for example variant discovery, or haplotype reconstruction using the diploid assembly with unpaired allelic contigs.A new pipeline-Purge Haplotigs-was developed specifically for third-gen sequencing-based assemblies to automate the reassignment of allelic contigs, and to assist in the manual curation of genome assemblies. The pipeline uses a draft haplotype-fused assembly or a diploid assembly, read alignments, and repeat annotations to identify allelic variants in the primary assembly. The pipeline was tested on a simulated dataset and on four recent diploid (phased) de novo assemblies from third-generation long-read sequencing, and compared with a similar tool. After processing with Purge Haplotigs, haploid assemblies were less duplicated with minimal impact on genome completeness, and diploid assemblies had more pairings of allelic contigs.Purge Haplotigs improves the haploid and diploid representations of third-gen sequencing based genome assemblies by identifying and reassigning allelic contigs. The implementation is fast and scales well with large genomes, and it is less likely to over-purge repetitive or paralogous elements compared to alignment-only based methods. The software is available at https://bitbucket.org/mroachawri/purge_haplotigs under a permissive MIT licence.

September 22, 2019

The impact of genome evolution on the allotetraploid Nicotiana rustica – an intriguing story of enhanced alkaloid production.

Nicotiana rustica (Aztec tobacco), like common tobacco (Nicotiana tabacum), is an allotetraploid formed through a recent hybridization event; however, it originated from completely different progenitor species. Here, we report the comparative genome analysis of wild type N. rustica (5 Gb; 2n?=?4x?=?48) with its three putative diploid progenitors (2.3-3 Gb; 2n?=?2x =24), Nicotiana undulata, Nicotiana paniculata and Nicotiana knightiana.In total, 41% of N. rustica genome originated from the paternal donor (N. undulata), while 59% originated from the maternal donor (N. paniculata/N. knightiana). Chloroplast genome and gene analyses indicated that N. knightiana is more closely related to N. rustica than N. paniculata. Gene clustering revealed 14,623 ortholog groups common to other Nicotiana species and 207 unique to N. rustica. Genome sequence analysis indicated that N. knightiana is more closely related to N. rustica than N. paniculata, and that the higher nicotine content of N. rustica leaves is the result of the progenitor genomes combination and of a more active transport of nicotine to the shoot.The availability of four new Nicotiana genome sequences provide insights into how speciation impacts plant metabolism, and in particular alkaloid transport and accumulation, and will contribute to better understanding the evolution of Nicotiana species.

September 22, 2019

Reconstitution of eukaryotic chromosomes and manipulation of DNA N6-methyladenine alters chromatin and gene expression

DNA N6-adenine methylation (6mA) has recently been reported in diverse eukaryotes, spanning unicellular organisms to metazoans. Yet the functional significance of 6mA remains elusive due to its low abundance, difficulty of manipulation within native DNA, and lack of understanding of eukaryotic 6mA writers. Here, we report a novel DNA 6mA methyltransferase in ciliates, termed MTA1. The enzyme contains an MT-A70 domain but is phylogenetically distinct from all known RNA and DNA methyltransferases. Disruption of MTA1 in vivo leads to the genome-wide loss of 6mA in asexually growing cells and abolishment of the consensus ApT dimethylated motif. Genes exhibit subtle changes in chromatin organization or RNA expression upon loss of 6mA, depending on their starting methylation level. Mutants fail to complete the sexual cycle, which normally coincides with a peak of MTA1 expression. Thus, MTA1 functions in a developmental stage-specific manner. We determine the impact of 6mA on chromatin organization in vitro by reconstructing complete, full-length ciliate chromosomes harboring 6mA in native or ectopic positions. Using these synthetic chromosomes, we show that 6mA directly disfavors nucleosomes in vitro in a local, quantitative manner, independent of DNA sequence. Furthermore, the chromatin remodeler ACF can overcome this effect. Our study identifies a novel MT-A70 protein necessary for eukaryotic 6mA methylation and defines the impact of 6mA on chromatin organization using epigenetically defined synthetic chromosomes.

September 22, 2019

Three New Genome Assemblies Support a Rapid Radiation in Musa acuminata (Wild Banana).

Edible bananas result from interspecific hybridization between Musa acuminata and Musa balbisiana, as well as among subspecies in M. acuminata. Four particular M. acuminata subspecies have been proposed as the main contributors of edible bananas, all of which radiated in a short period of time in southeastern Asia. Clarifying the evolution of these lineages at a whole-genome scale is therefore an important step toward understanding the domestication and diversification of this crop. This study reports the de novo genome assembly and gene annotation of a representative genotype from three different subspecies of M. acuminata. These data are combined with the previously published genome of the fourth subspecies to investigate phylogenetic relationships. Analyses of shared and unique gene families reveal that the four subspecies are quite homogenous, with a core genome representing at least 50% of all genes and very few M. acuminata species-specific gene families. Multiple alignments indicate high sequence identity between homologous single copy-genes, supporting the close relationships of these lineages. Interestingly, phylogenomic analyses demonstrate high levels of gene tree discordance, due to both incomplete lineage sorting and introgression. This pattern suggests rapid radiation within Musa acuminata subspecies that occurred after the divergence with M. balbisiana. Introgression between M. a. ssp. malaccensis and M. a. ssp. burmannica was detected across the genome, though multiple approaches to resolve the subspecies tree converged on the same topology. To support evolutionary and functional analyses, we introduce the PanMusa database, which enables researchers to exploration of individual gene families and trees.

September 22, 2019

Discovery of the actinoplanic acid pathway in Streptomyces rapamycinicus reveals a genetically conserved synergism with rapamycin.

Actinobacteria possess a great wealth of pathways for production of bioactive compounds. Following advances in genome mining, dozens of natural product (NP) gene clusters are routinely found in each actinobacterial genome; however, the modus operandi of this large arsenal is poorly understood. During investigations of the secondary metabolome of Streptomyces rapamycinicus, the producer of rapamycin, we observed accumulation of two compounds never before reported from this organism. Structural elucidation revealed actinoplanic acid A and its demethyl analogue. Actinoplanic acids (APLs) are potent inhibitors of Ras farnesyltransferase and therefore represent bioactive compounds of medicinal interest. Supported with the unique structure of these polyketides and using genome mining, we identified a gene cluster responsible for their biosynthesis in S. rapamycinicus Based on experimental evidence and genetic organization of the cluster, we propose a stepwise biosynthesis of APL, the first bacterial example of a pathway incorporating the rare tricarballylic moiety into an NP. Although phylogenetically distant, the pathway shares some of the biosynthetic principles with the mycotoxins fumonisins. Namely, the core polyketide is acylated with the tricarballylate by an atypical nonribosomal peptide synthetase-catalyzed ester formation. Finally, motivated by the conserved colocalization of the rapamycin and APL pathway clusters in S. rapamycinicus and all other rapamycin-producing actinobacteria, we confirmed a strong synergism of these compounds in antifungal assays. Mining for such evolutionarily conserved coharboring of pathways would likely reveal further examples of NP sets, attacking multiple targets on the same foe. These could then serve as a guide for development of new combination therapies.© 2018 Mrak et al.

September 22, 2019

First draft genome for red sea bream of family Sparidae.

Reference genomes for all organisms on earth are now attainable owing to advances in genome sequencing technologies (Goodwin et al., 2016). Generally, species that contribute considerably to the economy or human welfare are sequenced and are considered more important than others. Furthermore, coastal indigenous people mainly depend on marine species for their food sources, which has resulted in the extinction of several marine species (Cisneros-Montemayor et al., 2016). Of these, an extinction risk assessment of marine fishes, mainly for sea breams (Family: Sparidae), has recently been conducted by way of a global extinction risk assessment from the dataset of the International Union for Conservation of Nature’s Red List Process, which mentions that around 25 species are threatened/near-threatened according to their body weight (Comeros-Raynal et al., 2016). Another report clearly showed the benefit of worldwide aquaculture production, which contributed to 47% of total seafood production, and also highlighted the over-fishing of sea breams (FAO, 2018). The Republic of Korea is the fourth largest seafood producer in the world, producing 3.3 million tons in 2015 and exporting seafood worth $1.6 billion in 2016; therefore, aquaculture- associated research is fundamental for Korea. In the present study, the red sea bream (Pagrus major), which belongs to the family Sparidae, which comprises 35 genera, 132 species, and 10 subspecies (de la Herran et al., 2001; NCBI, 2018), was assessed.

Auto Tag: Fragmentation

A model for the evolution of prokaryotic DNA restriction-modification systems based upon the structural malleability of Type I restriction-modification enzymes.

Genomic assemblies of newly sequenced Trypanosoma cruzi strains reveal new genomic expansion and greater complexity.

Comparative analysis of blaKPC-2- and rmtB-carrying IncFII-family pKPC-LK30/pHN7A8 hybrid plasmids from Klebsiella pneumoniae CG258 strains disseminated among multiple Chinese hospitals.

A continuous genome assembly of the corkwing wrasse (Symphodus melops).

Full-length extension of HLA allele sequences by HLA allele-specific hemizygous Sanger sequencing (SSBT).

Genome mining of Streptomyces xinghaiensis NRRL B-24674T for the discovery of the gene cluster involved in anticomplement activities and detection of novel xiamycin analogs.

A complete Leishmania donovani reference genome identifies novel genetic variations associated with virulence.

The central exons of the human MUC2 and MUC6 mucins are highly repetitive and variable in sequence between individuals

Genome sequence of the potato pathogenic fungus Alternaria solani HWC-168 reveals clues for its conidiation and virulence.

Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.

The impact of genome evolution on the allotetraploid Nicotiana rustica – an intriguing story of enhanced alkaloid production.

Reconstitution of eukaryotic chromosomes and manipulation of DNA N6-methyladenine alters chromatin and gene expression

Three New Genome Assemblies Support a Rapid Radiation in Musa acuminata (Wild Banana).

Discovery of the actinoplanic acid pathway in Streptomyces rapamycinicus reveals a genetically conserved synergism with rapamycin.

First draft genome for red sea bream of family Sparidae.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert