Menu
April 21, 2020

The golden death bacillus Chryseobacterium nematophagum is a novel matrix digesting pathogen of nematodes.

Nematodes represent important pathogens of humans and farmed animals and cause significant health and economic impacts. The control of nematodes is primarily carried out by applying a limited number of anthelmintic compounds, for which there is now widespread resistance being reported. There is a current unmet need to develop novel control measures including the identification and characterisation of natural pathogens of nematodes.Nematode killing bacilli were isolated from a rotten fruit in association with wild free-living nematodes. These bacteria belong to the Chryseobacterium genus (golden bacteria) and represent a new species named Chryseobacterium nematophagum. These bacilli are oxidase-positive, flexirubin-pigmented, gram-negative rods that exhibit gelatinase activity. Caenorhabditis elegans are attracted to and eat these bacteria. Within 3 h of ingestion, however, the bacilli have degraded the anterior pharyngeal chitinous lining and entered the body cavity, ultimately killing the host. Within 24?h, the internal contents of the worms are digested followed by the final digestion of the remaining cuticle over a 2-3-day period. These bacteria will also infect and kill bacterivorous free-living (L1-L3) stages of all tested parasitic nematodes including the important veterinary Trichostrongylids such as Haemonchus contortus and Ostertagia ostertagi. The bacteria exhibit potent collagen-digesting properties, and genome sequencing has identified novel metalloprotease, collagenase and chitinase enzymes representing potential virulence factors.Chryseobacterium nematophagum is a newly discovered pathogen of nematodes that rapidly kills environmental stages of a wide range of key nematode parasites. These bacilli exhibit a unique invasion process, entering the body via the anterior pharynx through the specific degradation of extracellular matrices. This bacterial pathogen represents a prospective biological control agent for important nematode parasites.


April 21, 2020

The complete genome and methylome of Helicobacter pylori hpNEAfrica strain HP14039

Background Helicobacter pylori is a Gram-negative bacterium which mainly causes peptic ulcer disease in human, but is also the predominant cause of stomach cancer. It has been coevolving with human since 120,000 years and, according to Multi-locus sequence typing (MLST), H. pylori can be classified into seven major population types, namely, hpAfrica1, hpAfrica2, hpNEAfrica, hpEastAsia, hpAsia2, hpEurope and hpSahul. Helicobacter pylori harbours a large number of restriction-modification (R-M) systems. The methyltransferase (MTase) unit plays a significant role in gene regulation and also possibly modulates pathogenicity. The diversity in MTase can act as geomarkers to correlate strains with the phylogeographic origins. This paper describes the complete genome sequence and methylome of gastric pathogen H. pylori belonging to the population hpNEAfrica. Results In this paper, we present the complete genome sequence and the methylome profile of H. pylori hpNEAfrica strain HP14039, isolated from a patient who was born in Somalia and likely to be infected locally during early childhood prior to migration. The genome of HP14039 consists of 1,678,260 bp with 1574 coding genes and 38.7% GC content. The sequence analysis showed that this strain lacks the cag pathogenicity island. The vacA gene is of S2M2 type. We have also identified 15 methylation motifs, including WCANHNNNNTG and CTANNNNNNNTAYG that were not previously described. Conclusions We have described the complete genome of H. pylori strain HP14039. The information regarding phylo-geography, methylome and associated metadata would help scientific community to study more about hpNEAfrica population type.


April 21, 2020

External memory BWT and LCP computation for sequence collections with applications

Background: Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Bur- rows–Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM. Results: We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modi- fied to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix–prefix overlaps, and the construction of succinct de Bruijn graphs. Conclusions: We prove that our algorithm performs O(nmaxlcp) sequential I/Os, where n is the total length of the collection and maxlcp is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.


April 21, 2020

Origin and recent expansion of an endogenous gammaretroviral lineage in domestic and wild canids.

Vertebrate genomes contain a record of retroviruses that invaded the germlines of ancestral hosts and are passed to offspring as endogenous retroviruses (ERVs). ERVs can impact host function since they contain the necessary sequences for expression within the host. Dogs are an important system for the study of disease and evolution, yet no substantiated reports of infectious retroviruses in dogs exist. Here, we utilized Illumina whole genome sequence data to assess the origin and evolution of a recently active gammaretroviral lineage in domestic and wild canids.We identified numerous recently integrated loci of a canid-specific ERV-Fc sublineage within Canis, including 58 insertions that were absent from the reference assembly. Insertions were found throughout the dog genome including within and near gene models. By comparison of orthologous occupied sites, we characterized element prevalence across 332 genomes including all nine extant canid species, revealing evolutionary patterns of ERV-Fc segregation among species as well as subpopulations.Sequence analysis revealed common disruptive mutations, suggesting a predominant form of ERV-Fc spread by trans complementation of defective proviruses. ERV-Fc activity included multiple circulating variants that infected canid ancestors from the last 20 million to within 1.6 million years, with recent bursts of germline invasion in the sublineage leading to wolves and dogs.


April 21, 2020

The Y chromosome sequence of the channel catfish suggests novel sex determination mechanisms in teleost fish.

Sex determination mechanisms in teleost fish broadly differ from mammals and birds, with sex chromosomes that are far less differentiated and recombination often occurring along the length of the X and Y chromosomes, posing major challenges for the identification of specific sex determination genes. Here, we take an innovative approach of comparative genome analysis of the genomic sequences of the X chromosome and newly sequenced Y chromosome in the channel catfish.Using a YY channel catfish as the sequencing template, we generated, assembled, and annotated the Y genome sequence of channel catfish. The genome sequence assembly had a contig N50 size of 2.7 Mb and a scaffold N50 size of 26.7 Mb. Genetic linkage and GWAS analyses placed the sex determination locus within a genetic distance less than 0.5?cM and physical distance of 8.9?Mb. However, comparison of the channel catfish X and Y chromosome sequences showed no sex-specific genes. Instead, comparative RNA-Seq analysis between females and males revealed exclusive sex-specific expression of an isoform of the breast cancer anti-resistance 1 (BCAR1) gene in the male during early sex differentiation. Experimental knockout of BCAR1 gene converted genetic males (XY) to phenotypic females, suggesting BCAR1 as a putative sex determination gene.We present the first Y chromosome sequence among teleost fish, and one of the few whole Y chromosome sequences among vertebrate species. Comparative analyses suggest that sex-specific isoform expression through alternative splicing may underlie sex determination processes in the channel catfish, and we identify BCAR1 as a potential sex determination gene.


April 21, 2020

MZPAQ: a FASTQ data compression tool.

Due to the technological progress in Next Generation Sequencing (NGS), the amount of genomic data that is produced daily has seen a tremendous increase. This increase has shifted the bottleneck of genomic projects from sequencing to computation and specifically storing, managing and analyzing the large amount of NGS data. Compression tools can reduce the physical storage used to save large amount of genomic data as well as the bandwidth used to transfer this data. Recently, DNA sequence compression has gained much attention among researchers.In this paper, we study different techniques and algorithms used to compress genomic data. Most of these techniques take advantage of some properties that are unique to DNA sequences in order to improve the compression rate, and usually perform better than general-purpose compressors. By exploring the performance of available algorithms, we produce a powerful compression tool for NGS data called MZPAQ. Results show that MZPAQ outperforms state-of-the-art tools on all benchmark datasets obtained from a recent survey in terms of compression ratio. MZPAQ offers the best compression ratios regardless of the sequencing platform or the size of the data.Currently, MZPAQ’s strength is its higher compression ratio as well as its compatibility with all major sequencing platforms. MZPAQ is more suitable when the size of compressed data is crucial, such as long-term storage and data transfer. More efforts will be made in the future to target other aspects such as compression speed and memory utilization.


April 21, 2020

Resource Concentration Modulates the Fate of Dissimilated Nitrogen in a Dual-Pathway Actinobacterium.

Respiratory ammonification and denitrification are two evolutionarily unrelated dissimilatory nitrogen (N) processes central to the global N cycle, the activity of which is thought to be controlled by carbon (C) to nitrate (NO3-) ratio. Here we find that Intrasporangium calvum C5, a novel dual-pathway denitrifier/respiratory ammonifier, disproportionately utilizes ammonification rather than denitrification when grown under low C concentrations, even at low C:NO3- ratios. This finding is in conflict with the paradigm that high C:NO3- ratios promote ammonification and low C:NO3- ratios promote denitrification. We find that the protein atomic composition for denitrification modules (NirK) are significantly cost minimized for C and N compared to ammonification modules (NrfA), indicating that limitation for C and N is a major evolutionary selective pressure imprinted in the architecture of these proteins. The evolutionary precedent for these findings suggests ecological importance for microbial activity as evidenced by higher growth rates when I. calvum grows predominantly using its ammonification pathway and by assimilating its end-product (ammonium) for growth under ammonium-free conditions. Genomic analysis of I. calvum further reveals a versatile ecophysiology to cope with nutrient stress and redox conditions. Metabolite and transcriptional profiles during growth indicate that enzyme modules, NrfAH and NirK, are not constitutively expressed but rather induced by nitrite production via NarG. Mechanistically, our results suggest that pathway selection is driven by intracellular redox potential (redox poise), which may be lowered when resource concentrations are low, thereby decreasing catalytic activity of upstream electron transport steps (i.e., the bc1 complex) needed for denitrification enzymes. Our work advances our understanding of the biogeochemical flexibility of N-cycling organisms, pathway evolution, and ecological food-webs.


April 21, 2020

Comprehensive analysis of full genome sequence and Bd-milRNA/target mRNAs to discover the mechanism of hypovirulence in Botryosphaeria dothidea strains on pear infection with BdCV1 and BdPV1

Pear ring rot disease, mainly caused by Botryosphaeria dothidea, is widespread in most pear and apple-growing regions. Mycoviruses are used for biocontrol, especially in fruit tree disease. BdCV1 (Botryosphaeria dothidea chrysovirus 1) and BdPV1 (Botryosphaeria dothidea partitivirus 1) influence the biological characteristics of B. dothidea strains. BdCV1 is a potential candidate for the control of fungal disease. Therefore, it is vital to explore interactions between B. dothidea and mycovirus to clarify the pathogenic mechanisms of B. dothidea and hypovirulence of B. dothidea in pear. A high-quality full-length genome sequence of the B. dothidea LW-Hubei isolate was obtained using Single Molecule Real-Time sequencing. It has high repeat sequence with 9.3% and DNA methylation existence in the genome. The 46.34?Mb genomes contained 14,091 predicted genes, which of 13,135 were annotated. B. dothidea was predicted to express 3833 secreted proteins. In bioinformatics analysis, 351 CAZy members, 552 transporters, 128 kinases, and 1096 proteins associated with plant-host interaction (PHI) were identified. RNA-silencing components including two endoribonuclease Dicer, four argonaute (Ago) and three RNA-dependent RNA polymerase (RdRp) molecules were identified and expressed in response to mycovirus infection. Horizontal transfer of the LW-C and LW-P strains indicated that BdCV1 induced host gene silencing in LW-C to suppress BdPV1 transmission. To investigate the role of RNA-silencing in B. dothidea defense, we constructed four small RNA libraries and sequenced B. dothidea micro-like RNAs (Bd-milRNAs) produced in response to BdCV1 and BdPV1 infection. Among these, 167 conserved and 68 candidate novel Bd-milRNAs were identified, of which 161 conserved and 20 novel Bd-milRNA were differentially expressed. WEGO analysis revealed involvement of the differentially expressed Bd-milRNA-targeted genes in metabolic process, catalytic activity, cell process and response to stress or stimulus. BdCV1 had a greater effect on the phenotype, virulence, conidiomata, vertical and horizontal transmission ability, and mycelia cellular structure biological characteristics of B. dothidea strains than BdPV1 and virus-free strains. The results obtained in this study indicate that mycovirus regulates biological processes in B. dothidea through the combined interaction of antiviral defense mediated by RNA-silencing and milRNA-mediated regulation of target gene mRNA expression.


October 23, 2019

Altering tropism of rAAV by directed evolution.

Directed evolution represents an attractive approach to derive AAV capsid variants capable of selectively infect specific tissue or cell targets. It involves the generation of an initial library of high complexity followed by cycles of selection during which the library is progressively enriched for target-specific variants. Each selection cycle consists of the following: reconstitution of complete AAV genomes within plasmid molecules; production of virions for which each particular capsid variant is matched with the particular capsid gene encoding it; recovery of capsid gene sequences from target tissue after systemic administration. Prevalent variants are then analyzed and evaluated.


October 23, 2019

Optimized CRISPR-Cas9 genome editing for Leishmania and its use to target a multigene family, induce chromosomal translocation, and study DNA break repair mechanisms.

CRISPR-Cas9-mediated genome editing has recently been adapted for Leishmania spp. parasites, the causative agents of human leishmaniasis. We have optimized this genome-editing tool by selecting for cells with CRISPR-Cas9 activity through cotargeting the miltefosine transporter gene; mutation of this gene leads to miltefosine resistance. This cotargeting strategy integrated into a triple guide RNA (gRNA) expression vector was used to delete all 11 copies of the A2 multigene family; this was not previously possible with the traditional gene-targeting method. We found that the Leishmania donovani rRNA promoter is more efficient than the U6 promoter in driving gRNA expression, and sequential transfections of the oligonucleotide donor significantly eased the isolation of edited mutants. A gRNA and Cas9 coexpression vector was developed that was functional in all tested Leishmania species, including L. donovani, L. major, and L. mexicana. By simultaneously targeting sites from two different chromosomes, all four types of targeted chromosomal translocations were generated, regardless of the polycistronic transcription direction from the parent chromosomes. It was possible to use this CRISPR system to create a single conserved amino acid substitution (A189G) mutation for both alleles of RAD51, a DNA recombinase involved in homology-directed repair. We found that RAD51 is essential for L. donovani survival based on direct observation of the death of mutants with both RAD51 alleles disrupted, further confirming that this CRISPR system can reveal gene essentiality. Evidence is also provided that microhomology-mediated end joining (MMEJ) plays a major role in double-strand DNA break repair in L. donovani. IMPORTANCELeishmania parasites cause human leishmaniasis. To accelerate characterization of Leishmania genes for new drug and vaccine development, we optimized and simplified the CRISPR-Cas9 genome-editing tool for Leishmania. We show that co-CRISPR targeting of the miltefosine transporter gene and serial transfections of an oligonucleotide donor significantly eased isolation of edited mutants. This cotargeting strategy was efficiently used to delete all 11 members of the A2 virulence gene family. This technical advancement is valuable, since there are many gene clusters and supernumerary chromosomes in the various Leishmania species and isolates. We simplified this CRISPR system by developing a gRNA and Cas9 coexpression vector which could be used to delete genes in various Leishmania species. This CRISPR system could also be used to generate specific chromosomal translocations, which will help in the study of Leishmania gene expression and transcription control. This study also provides new information about double-strand DNA break repair mechanisms in Leishmania.


October 23, 2019

ParLECH: Parallel Long-Read Error Correction with Hadoop

Long-read sequencing is emerging as a promising sequencing technology because it can tackle the short length limitation of second-generation sequencing, which has dominated the sequencing market in past years. However, it has substantially higher error rates compared to short-read sequencing (e.g., 13% vs. 0.1%), and its sequencing cost per base is typically more expensive than that of short-read sequencing. To address these limitations, we present a distributed hybrid error correction framework, called ParLECH, that is scalable and cost-efficient for PacBio long reads. For correcting the errors in the long reads, ParLECH utilizes the Illumina short reads that have the low error rate with high coverage at low cost. To efficiently analyze the high-throughput Illumina short reads, ParLECH is equipped with Hadoop and a distributed NoSQL system. To further improve the accuracy, ParLECH utilizes the k-mer coverage information of the Illumina short reads. Specifically, we develop a distributed version of the widest path algorithm, which maximizes the minimum k-mer coverage in a path of the de Bruijn graph constructed from the Illumina short reads. We replace an error region in a long read with its corresponding widest path. Our experimental results show that ParLECH can handle large-scale real-world datasets in a scalable and accurate manner. Using ParLECH, we can process a 312 GB human genome PacBio dataset, with a 452 GB Illumina dataset, on 128 nodes in less than 29 hours.


October 23, 2019

SAPTA: a new design tool for improving TALE nuclease activity.

Transcription activator-like effector nucleases (TALENs) have become a powerful tool for genome editing due to the simple code linking the amino acid sequences of their DNA-binding domains to TALEN nucleotide targets. While the initial TALEN-design guidelines are very useful, user-friendly tools defining optimal TALEN designs for robust genome editing need to be developed. Here we evaluated existing guidelines and developed new design guidelines for TALENs based on 205 TALENs tested, and established the scoring algorithm for predicting TALEN activity (SAPTA) as a new online design tool. For any input gene of interest, SAPTA gives a ranked list of potential TALEN target sites, facilitating the selection of optimal TALEN pairs based on predicted activity. SAPTA-based TALEN designs increased the average intracellular TALEN monomer activity by >3-fold, and resulted in an average endogenous gene-modification frequency of 39% for TALENs containing the repeat variable di-residue NK that favors specificity rather than activity. It is expected that SAPTA will become a useful and flexible tool for designing highly active TALENs for genome-editing applications. SAPTA can be accessed via the website at http://baolab.bme.gatech.edu/Research/BioinformaticTools/TAL_targeter.html.


October 23, 2019

TALENs facilitate targeted genome editing in human cells with high specificity and low cytotoxicity.

Designer nucleases have been successfully employed to modify the genomes of various model organisms and human cell types. While the specificity of zinc-finger nucleases (ZFNs) and RNA-guided endonucleases has been assessed to some extent, little data are available for transcription activator-like effector-based nucleases (TALENs). Here, we have engineered TALEN pairs targeting three human loci (CCR5, AAVS1 and IL2RG) and performed a detailed analysis of their activity, toxicity and specificity. The TALENs showed comparable activity to benchmark ZFNs, with allelic gene disruption frequencies of 15-30% in human cells. Notably, TALEN expression was overall marked by a low cytotoxicity and the absence of cell cycle aberrations. Bioinformatics-based analysis of designer nuclease specificity confirmed partly substantial off-target activity of ZFNs targeting CCR5 and AAVS1 at six known and five novel sites, respectively. In contrast, only marginal off-target cleavage activity was detected at four out of 49 predicted off-target sites for CCR5- and AAVS1-specific TALENs. The rational design of a CCR5-specific TALEN pair decreased off-target activity at the closely related CCR2 locus considerably, consistent with fewer genomic rearrangements between the two loci. In conclusion, our results link nuclease-associated toxicity to off-target cleavage activity and corroborate TALENs as a highly specific platform for future clinical translation. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.


October 23, 2019

AAV-mediated delivery of zinc finger nucleases targeting hepatitis B virus inhibits active replication.

Despite an existing effective vaccine, hepatitis B virus (HBV) remains a major public health concern. There are effective suppressive therapies for HBV, but they remain expensive and inaccessible to many, and not all patients respond well. Furthermore, HBV can persist as genomic covalently closed circular DNA (cccDNA) that remains in hepatocytes even during otherwise effective therapy and facilitates rebound in patients after treatment has stopped. Therefore, the need for an effective treatment that targets active and persistent HBV infections remains. As a novel approach to treat HBV, we have targeted the HBV genome for disruption to prevent viral reactivation and replication. We generated 3 zinc finger nucleases (ZFNs) that target sequences within the HBV polymerase, core and X genes. Upon the formation of ZFN-induced DNA double strand breaks (DSB), imprecise repair by non-homologous end joining leads to mutations that inactivate HBV genes. We delivered HBV-specific ZFNs using self-complementary adeno-associated virus (scAAV) vectors and tested their anti-HBV activity in HepAD38 cells. HBV-ZFNs efficiently disrupted HBV target sites by inducing site-specific mutations. Cytotoxicity was seen with one of the ZFNs. scAAV-mediated delivery of a ZFN targeting HBV polymerase resulted in complete inhibition of HBV DNA replication and production of infectious HBV virions in HepAD38 cells. This effect was sustained for at least 2 weeks following only a single treatment. Furthermore, high specificity was observed for all ZFNs, as negligible off-target cleavage was seen via high-throughput sequencing of 7 closely matched potential off-target sites. These results show that HBV-targeted ZFNs can efficiently inhibit active HBV replication and suppress the cellular template for HBV persistence, making them promising candidates for eradication therapy.


October 23, 2019

Codon swapping of zinc finger nucleases confers expression in primary cells and in vivo from a single lentiviral vector.

Zinc finger nucleases (ZFNs) are promising tools for genome editing for biotechnological as well as therapeutic purposes. Delivery remains a major issue impeding targeted genome modification. Lentiviral vectors are highly efficient for delivering transgenes into cell lines, primary cells and into organs, such as the liver. However, the reverse transcription of lentiviral vectors leads to recombination of homologous sequences, as found between and within ZFN monomers.We used a codon swapping strategy to both drastically disrupt sequence identity between ZFN monomers and to reduce sequence repeats within a monomer sequence. We constructed lentiviral vectors encoding codon-swapped ZFNs or unmodified ZFNs from a single mRNA transcript. Cell lines, primary hepatocytes and newborn rats were used to evaluate the efficacy of integrative-competent (ICLV) and integrative-deficient (IDLV) lentiviral vectors to deliver ZFNs into target cells.We reduced total identity between ZFN monomers from 90.9% to 61.4% and showed that a single ICLV allowed efficient expression of functional ZFNs targeting the rat UGT1A1 gene after codon-swapping, leading to much higher ZFN activity in cell lines (up to 7-fold increase compared to unmodified ZFNs and 60% activity in C6 cells), as compared to plasmid transfection or a single ICLV encoding unmodified ZFN monomers. Off-target analysis located several active sites for the 5-finger UGT1A1-ZFNs. Furthermore, we reported for the first time successful ZFN-induced targeted DNA double-strand breaks in primary cells (hepatocytes) and in vivo (liver) after delivery of a single IDLV encoding two ZFNs.These results demonstrate that a codon-swapping approach allowed a single lentiviral vector to efficiently express ZFNs and should stimulate the use of this viral platform for ZFN-mediated genome editing of primary cells, for both ex vivo or in vivo applications.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.