Menu
September 22, 2019

Targeted genotyping of variable number tandem repeats with adVNTR.

Whole-genome sequencing is increasingly used to identify Mendelian variants in clinical pipelines. These pipelines focus on single-nucleotide variants (SNVs) and also structural variants, while ignoring more complex repeat sequence variants. Here, we consider the problem of genotyping Variable Number Tandem Repeats (VNTRs), composed of inexact tandem duplications of short (6-100 bp) repeating units. VNTRs span 3% of the human genome, are frequently present in coding regions, and have been implicated in multiple Mendelian disorders. Although existing tools recognize VNTR carrying sequence, genotyping VNTRs (determining repeat unit count and sequence variation) from whole-genome sequencing reads remains challenging. We describe a method, adVNTR, that uses hidden Markov models to model each VNTR, count repeat units, and detect sequence variation. adVNTR models can be developed for short-read (Illumina) and single-molecule (Pacific Biosciences [PacBio]) whole-genome and whole-exome sequencing, and show good results on multiple simulated and real data sets.© 2018 Bakhtiari et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

A continuous genome assembly of the corkwing wrasse (Symphodus melops).

The wrasses (Labridae) are one of the most successful and species-rich families of the Perciformes order of teleost fish. Its members display great morphological diversity, and occupy distinct trophic levels in coastal waters and coral reefs. The cleaning behaviour displayed by some wrasses, such as corkwing wrasse (Symphodus melops), is of particular interest for the salmon aquaculture industry to combat and control sea lice infestation as an alternative to chemicals and pharmaceuticals. There are still few genome assemblies available within this fish family for comparative and functional studies, despite the rapid increase in genome resources generated during the past years. Here, we present a highly continuous genome assembly of the corkwing wrasse using PacBio SMRT sequencing (x28.8) followed by error correction with paired-end Illumina data (x132.9). The present genome assembly consists of 5040 contigs (N50?=?461,652?bp) and a total size of 614 Mbp, of which 8.5% of the genome sequence encode known repeated elements. The genome assembly covers 94.21% of highly conserved genes across ray-finned fish species. We find evidence for increased copy numbers specific for corkwing wrasse possibly highlighting diversification and adaptive processes in gene families including N-linked glycosylation (ST8SIA6) and stress response kinases (HIPK1). By comparative analyses, we discover that de novo repeats, often not properly investigated during genome annotation, encode hundreds of immune-related genes. This new genomic resource, together with the ballan wrasse (Labrus bergylta), will allow for in-depth comparative genomics as well as population genetic analyses for the understudied wrasses. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

The Butanol Producing Microbe Clostridium beijerinckii NCIMB 14988 Manipulated Using Forward and Reverse Genetic Tools.

The solventogenic anaerobe Clostridium beijerinckii has potential for use in the sustainable bioconversion of plant-derived carbohydrates into solvents, such as butanol or acetone. However, relatively few strains have been extensively characterised either at the genomic level or through exemplification of a complete genetic toolkit. To remedy this situation, a new strain of C. beijerinckii, NCIMB 14988, is selected from among a total of 55 new clostridial isolates capable of growth on hexose and pentose sugars. Chosen on the basis of its favorable properties, the complete genome sequence of NCIMB 14988 is determined and a high-efficiency plasmid transformation protocol devised. The developed DNA transfer procedure allowed demonstration in NCIMB 14988 of the forward and reverse genetic techniques of transposon mutagenesis and gene knockout, respectively. The latter is accomplished through the successful deployment of both group II intron retargeting (ClosTron) and allelic exchange. In addition to gene inactivation, the developed allelic exchange procedure is used to create point mutations in the chromosome, allowing for the effect of amino acid changes in enzymes involved in primary metabolism to be characterized. ClosTron mediated disruption of the currently unannotated non-coding region between genes LF65_05915 and LF65_05920 is found to result in a non-sporulating phenotype.© 2018 The Authors. Biotechnology Journal Published by Wiley-VCH Verlag GmbH & Co. KGaA.


September 22, 2019

Comparative genomics of Staphylococcus reveals determinants of speciation and diversification of antimicrobial defense.

The bacterial genus Staphylococcus comprises diverse species with most being described as colonizers of human and animal skin. A relational analysis of features that discriminate its species and contribute to niche adaptation and survival remains to be fully described. In this study, an interspecies, whole-genome comparative analysis of 21 Staphylococcus species was performed based on their orthologues. Three well-defined multi-species groups were identified: group A (including aureus/epidermidis); group B (including saprophyticus/xylosus) and group C (including pseudintermedius/delphini). The machine learning algorithm Random Forest was applied to prioritize orthologs that drive formation of the Staphylococcus species groups A-C. Orthologues driving staphylococcal intrageneric diversity comprised regulatory, metabolic and antimicrobial resistance proteins. Notably, the BraSR (NsaRS) two-component system (TCS) and its associated BraDE transporters that regulate antimicrobial resistance showed limited distribution in the genus and their presence was most closely associated with a subset of Staphylococcus species dominated by those that colonize human skin. Divergence of BraSR and GraSR antimicrobial peptide survival TCS and their associated transporters was observed across the staphylococci, likely reflecting niche specific evolution of these TCS/transporters and their specificities for AMPs. Experimental evolution, with selection for resistance to the lantibiotic nisin, revealed multiple routes to resistance and differences in the selection outcomes of the BraSR-positive species S. hominis and S. aureus. Selection supported a role for GraSR in nisin survival responses of the BraSR-negative species S. saprophyticus. Our study reveals diversification of antimicrobial-sensing TCS across the staphylococci and hints at differential relationships between GraSR and BraSR in those species positive for both TCS.


September 22, 2019

Whole-Genome Analysis of an Extensively Drug-Resistant Acinetobacter baumannii Strain XDR-BJ83: Insights into the Mechanisms of Resistance of an ST368 Strain from a Tertiary Care Hospital in China.

Acinetobacter baumannii is an important pathogen of nosocomial infections. Nosocomial outbreaks caused by antibiotic-resistant A. baumannii remain a significant challenge. Understanding the antibiotic resistance mechanism of A. baumannii is critical for clinical treatment. The purpose of this study was to determine the whole-genome sequence (WGS) of an extensively drug-resistant (XDR) A. baumannii strain, XDR-BJ83, which was associated with a nosocomial outbreak in a tertiary care hospital of China, and to investigate the antibiotic resistance mechanism of this strain. The WGS of XDR-BJ83 was performed using single-molecule real-time sequencing. The complete genome of XDR-BJ83 consisted of a 4,011,552-bp chromosome and a 69,069-bp plasmid. The sequence type of XDR-BJ83 was ST368, which belongs to clonal complex 92 (CC92). The chromosome of XDR-BJ83 carried multiple antibiotic resistance genes, antibiotic efflux pump genes, and mobile genetic elements, including insertion sequences, transposons, integrons, and resistance islands. The plasmid of XDR-BJ83 (pBJ83) was a conjugative plasmid carrying type IV secretion system. These results indicate that the presence of multiple antibiotic resistance genes, efflux pumps, and mobile genetic elements is likely associated with resistance to various antibiotics in XDR-BJ83.


September 22, 2019

Combining probabilistic alignments with read pair information improves accuracy of split-alignments.

Split-alignments provide base-pair-resolution evidence of genomic rearrangements. In practice, they are found by first computing high-scoring local alignments, parts of which are then combined into a split-alignment. This approach is challenging when aligning a short read to a large and repetitive reference, as it tends to produce many spurious local alignments leading to ambiguities in identifying the correct split-alignment. This problem is further exacerbated by the fact that rearrangements tend to occur in repeat-rich regions.We propose a split-alignment technique that combats the issue of ambiguous alignments by combining information from probabilistic alignment with positional information from paired-end reads. We demonstrate that our method finds accurate split-alignments, and that this translates into improved performance of variant-calling tools that rely on split-alignments.An open-source implementation is freely available at: https://bitbucket.org/splitpairedend/last-split-pe.Supplementary data are available at Bioinformatics online.


September 22, 2019

Thermosipho spp. immune system differences affect variation in genome size and geographical distributions.

Thermosipho species inhabit thermal environments such as marine hydrothermal vents, petroleum reservoirs, and terrestrial hot springs. A 16S rRNA phylogeny of available Thermosipho spp. sequences suggested habitat specialists adapted to living in hydrothermal vents only, and habitat generalists inhabiting oil reservoirs, hydrothermal vents, and hotsprings. Comparative genomics of 15 Thermosipho genomes separated them into three distinct species with different habitat distributions: The widely distributed T. africanus and the more specialized, T. melanesiensis and T. affectus. Moreover, the species can be differentiated on the basis of genome size (GS), genome content, and immune system composition. For instance, the T. africanus genomes are largest and contained the most carbohydrate metabolism genes, which could explain why these isolates were obtained from ecologically more divergent habitats. Nonetheless, all the Thermosipho genomes, like other Thermotogae genomes, show evidence of genome streamlining. GS differences between the species could further be correlated to differences in defense capacities against foreign DNA, which influence recombination via HGT. The smallest genomes are found in T. affectus that contain both CRISPR-cas Type I and III systems, but no RM system genes. We suggest that this has caused these genomes to be almost devoid of mobile elements, contrasting the two other species genomes that contain a higher abundance of mobile elements combined with different immune system configurations. Taken together, the comparative genomic analyses of Thermosipho spp. revealed genetic variation allowing habitat differentiation within the genus as well as differentiation with respect to invading mobile DNA.


September 22, 2019

Functional genomic analysis of phthalate acid ester (PAE) catabolism genes in the versatile PAE-mineralising bacterium Rhodococcus sp. 2G.

Microbial degradation is considered the most promising method for removing phthalate acid esters (PAEs) from polluted environments; however, a comprehensive genomic understanding of the entire PAE catabolic process is still lacking. In this study, the repertoire of PAE catabolism genes in the metabolically versatile bacterium Rhodococcus sp. 2G was examined using genomic, metabolic, and bioinformatic analyses. A total of 4930 coding genes were identified from the 5.6?Mb genome of the 2G strain, including 337 esterase/hydrolase genes and 48 transferase and decarboxylase genes that were involved in hydrolysing PAEs into phthalate acid (PA) and decarboxylating PA into benzoic acid (BA). One gene cluster (xyl) responsible for transforming BA into catechol and two catechol-catabolism gene clusters controlling the ortho (cat) and meta (xyl &mhp) cleavage pathways were also identified. The proposed PAE catabolism pathway and some key degradation genes were validated by intermediate-utilising tests and real-time quantitative polymerase chain reaction. Our results provide novel insight into the mechanisms of PAE biodegradation at the molecular level and useful information on gene resources for future studies. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019

Prevalence, antimicrobial resistance and phylogenetic characterization of Yersinia enterocolitica in retail poultry meat and swine feces in parts of China

Yersinia enterocolitica is an enteropathogen transmitted by contaminated food. In this study, a total of 500 retail poultry meat samples from 4 provinces and 145 swine feces samples from 12 provinces in China was tested for Y. enterocolitica and 26 isolates were obtained for further bio-serotyping, testing with antimicrobial susceptibility testing to a panel of antimicrobial compounds, and genetically characterization based on the whole genome sequencing. Higher prevalence (4.8%) of Y. enterocolitica contamination in retail poultry meat than that in swine feces (2.76%) was observed. No difference in bio-serotypes, multilocus sequence typing (MLST) and virulence genes distribution between swine and poultry origin were found. All isolates were resistant to ampicillin, amoxicillin/clavulanic acid, and cefazolin and were multi-drug resistant (MDR). The most predominant drug-resistance profile was AMP-CFZ-AMC-FOX (42.31%). A pathogenic isolate with bio-serotype 3/O:3 and ST135 was cultured from retail fresh chicken meat for the first time in China. Based on the whole-genome single nucleotide polymorphisms (SNPs) tree analysis, pathogenic isolates clustered closely, while nonpathogenic isolates exhibited high genetic heterogeneity. These indicated that pathogenic isolates were conserved on genetic level. The whole-genome SNP tree also revealed that Y. enterocolitica of swine, chicken and duck origin may share a common ancestor. The findings highlight the emergence of drug-resistant pathogenic Y. entrocoliticas in retailed poultry meats in China.


September 22, 2019

The unique evolution of the pig LRC, a single KIR but expansion of LILR and a novel Ig receptor family.

The leukocyte receptor complex (LRC) encodes numerous immunoglobulin (Ig)-like receptors involved in innate immunity. These include the killer-cell Ig-like receptors (KIR) and the leukocyte Ig-like receptors (LILR) which can be polymorphic and vary greatly in number between species. Using the recent long-read genome assembly, Sscrofa11.1, we have characterized the porcine LRC on chromosome 6. We identified a ~?197-kb region containing numerous LILR genes that were missing in previous assemblies. Out of 17 such LILR genes and fragments, six encode functional proteins, of which three are inhibitory and three are activating, while the majority of pseudogenes had the potential to encode activating receptors. Elsewhere in the LRC, between FCAR and GP6, we identified a novel gene that encodes two Ig-like domains and a long inhibitory intracellular tail. Comparison with two other porcine assemblies revealed a second, nearly identical, non-functional gene encoding a short intracellular tail with ambiguous function. These novel genes were found in a diverse range of mammalian species, including a pseudogene in humans, and typically consist of a single long-tailed receptor and a variable number of short-tailed receptors. Using porcine transcriptome data, both the novel inhibitory gene and the LILR were highly expressed in peripheral blood, while the single KIR gene, KIR2DL1, was either very poorly expressed or not at all. These observations are a prerequisite for improved understanding of immune cell functions in the pig and other species.


September 22, 2019

Computational tools to unmask transposable elements.

A substantial proportion of the genome of many species is derived from transposable elements (TEs). Moreover, through various self-copying mechanisms, TEs continue to proliferate in the genomes of most species. TEs have contributed numerous regulatory, transcript and protein innovations and have also been linked to disease. However, notwithstanding their demonstrated impact, many genomic studies still exclude them because their repetitive nature results in various analytical complexities. Fortunately, a growing array of methods and software tools are being developed to cater for them. This Review presents a summary of computational resources for TEs and highlights some of the challenges and remaining gaps to perform comprehensive genomic analyses that do not simply ‘mask’ repeats.


September 22, 2019

TranSurVeyor: an improved database-free algorithm for finding non-reference transpositions in high-throughput sequencing data.

Transpositions transfer DNA segments between different loci within a genome; in particular, when a transposition is found in a sample but not in a reference genome, it is called a non-reference transposition. They are important structural variations that have clinical impact. Transpositions can be called by analyzing second generation high-throughput sequencing datasets. Current methods follow either a database-based or a database-free approach. Database-based methods require a database of transposable elements. Some of them have good specificity; however this approach cannot detect novel transpositions, and it requires a good database of transposable elements, which is not yet available for many species. Database-free methods perform de novo calling of transpositions, but their accuracy is low. We observe that this is due to the misalignment of the reads; since reads are short and the human genome has many repeats, false alignments create false positive predictions while missing alignments reduce the true positive rate. This paper proposes new techniques to improve database-free non-reference transposition calling: first, we propose a realignment strategy called one-end remapping that corrects the alignments of reads in interspersed repeats; second, we propose a SNV-aware filter that removes some incorrectly aligned reads. By combining these two techniques and other techniques like clustering and positive-to-negative ratio filter, our proposed transposition caller TranSurVeyor shows at least 3.1-fold improvement in terms of F1-score over existing database-free methods. More importantly, even though TranSurVeyor does not use databases of prior information, its performance is at least as good as existing database-based methods such as MELT, Mobster and Retroseq. We also illustrate that TranSurVeyor can discover transpositions that are not known in the current database.


September 22, 2019

FRI-4 carbapenemase-producing Enterobacter cloacae complex isolated in Tokyo, Japan.

A carbapenem-resistant Enterobacter cloacae complex isolated in Tokyo, Japan, produced a carbapenemase that was detected by a Carba NP test and a modified carbapenem inactivation method, but none of the ‘Big Five’ carbapenemase genes was detected by PCR. This study aimed to identify the carbapenemase.Carbapenemase genes were screened by WGS. Next, we generated a recombinant plasmid in which the carbapenemase gene was inserted. We also extracted the carbapenemase gene-carrying plasmid from the E. cloacae complex. The effects of both plasmids on the antibiotic susceptibility of Escherichia coli were then tested. The carbapenemase gene-carrying plasmid in the E. cloacae complex was completely sequenced.A novel carbapenemase gene, blaFRI-4, encoded an amino acid sequence that was 93.2% identical to French imipenemase (FRI-1). E. coli transformed with blaFRI-4 showed reduced carbapenem susceptibility. A complete sequence of the blaFRI-4-carrying 98?508?bp IncFII/IncR plasmid (pTMTA61661) showed that blaFRI-4 and the surrounding region (18.7?kb) were duplicated.The FRI-4-producing E. cloacae complex was isolated in Japan, whereas all other FRI variants have been found in Europe, suggesting that the spread of FRI carbapenemases is global.


September 22, 2019

Phylogenomics of colistin-susceptible and resistant XDR Acinetobacter baumannii.

Acinetobacter baumannii is a healthcare-associated pathogen with high rates of carbapenem resistance. Colistin is now routinely used for treatment of infections by this pathogen. However, colistin use has been associated with development of resistance to this agent.To elucidate the phylogenomics of colistin-susceptible and -resistant A. baumannii strain pairs from a cohort of hospitalized patients at a tertiary medical centre in the USA.WGS data from 21 pairs of colistin-susceptible and -resistant, XDR clinical strains were obtained and compared using phylogeny of aligned genome sequences, assessment of pairwise SNP differences and gene content.Fourteen patients had colistin-resistant strains that were highly genetically related to their own original susceptible strain with a median pairwise SNP distance of 5.5 (range 1-40 SNPs), while seven other strain pairs were divergent with =84 SNP differences. In addition, several strains from different patients formed distinct clusters on the phylogeny in keeping with closely linked transmission chains. The majority of colistin-resistant strains contained non-synonymous mutations within the pmrAB locus suggesting a central role for pmrAB mutations in colistin resistance. Excellent genotype-phenotype correlation was also observed for carbapenems, aminoglycosides and tetracyclines.The findings suggest that colistin resistance in the clinical setting arises through both in vivo evolution from colistin-susceptible strains and reinfection by unrelated colistin-resistant strains, the latter of which may involve patient-to-patient transmission.


September 22, 2019

An IncX1 plasmid isolated from Salmonella enterica subsp. enterica serovar Pullorum carrying blaTEM-1B, sul2, arsenic resistant operons.

We have identified an IncX1 plasmid named pQJDSal1 from Salmonella enterica subsp. enterica serovar Pullorum (S. Pullorum). The plasmid is 67,685?bp in size and has 72 putative genes. pQJDSal1 harbors a conserved IncX1-type backbone with predicted regions for conjugation, replication and partitioning, as well as a toxin/antitoxin plasmid addiction system. Two regions (A and B) that have not been previously reported in IncX1 plasmids are inserted into the backbone. Region A (10.7?kb), inserted between parA and taxD, consists of a new Tn6168-like transposon containing an arsenic resistant operon arsB2CHR and sulfonamide resistance gene sul2. Region B contains another arsenic resistant operon arsADHR, resistance gene blaTEM-1B and three transposable elements. Conjugation experiments showed that pQJDSal1 could transfer from S. Pullorum to Escherichia coli (E. coli) J53. Statistical analysis of 70 sequenced IncX1 plasmids revealed that IncX1 plasmids harbored various antibiotic resistance genes. The results highlight the importance of IncX1 plasmids in disseminating antibiotic resistance genes.Copyright © 2018. Published by Elsevier Inc.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.