September 22, 2019

Analysis of the Aedes albopictus C6/36 genome provides insight into cell line utility for viral propagation.

The 50-year-old Aedes albopictus C6/36 cell line is a resource for the detection, amplification, and analysis of mosquito-borne viruses including Zika, dengue, and chikungunya. The cell line is derived from an unknown number of larvae from an unspecified strain of Aedes albopictus mosquitoes. Toward improved utility of the cell line for research in virus transmission, we present an annotated assembly of the C6/36 genome. The C6/36 genome assembly has the largest contig N50 (3.3 Mbp) of any mosquito assembly, presents the sequences of both haplotypes for most of the diploid genome, reveals independent null mutations in both alleles of the Dicer locus, and indicates a male-specific genome. Gene annotation was computed with publicly available mosquito transcript sequences. Gene expression data from cell line RNA sequence identified enrichment of growth-related pathways and conspicuous deficiency in aquaporins and inward rectifier K+ channels. As a test of utility, RNA sequence data from Zika-infected cells were mapped to the C6/36 genome and transcriptome assemblies. Host subtraction reduced the data set by 89%, enabling faster characterization of nonhost reads. The C6/36 genome sequence and annotation should enable additional uses of the cell line to study arbovirus vector interactions and interventions aimed at restricting the spread of human disease.
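The contig N50 quoted in this abstract is a standard assembly contiguity statistic: the largest length L such that contigs of length L or longer together cover at least half of the assembled bases. The sketch below illustrates the calculation only; the function name `n50` and the toy contig lengths are assumptions made for this example and are not taken from the C6/36 assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length of the contig at which the running total of
    lengths, taken from longest to shortest, first reaches half of
    the total assembly size.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    # Only reached when no contigs were supplied.
    raise ValueError("lengths must be non-empty")


# Hypothetical contig lengths (arbitrary units), for illustration only.
example_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(example_lengths))
```

With these toy lengths the total is 300, and the two longest contigs (80 and 70) already reach the halfway mark of 150, so the N50 is 70.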


September 22, 2019

Comparative genomics of the Baltic Sea toxic cyanobacteria Nodularia spumigena UHCC 0039 and its response to varying salinity.

Salinity is an important abiotic factor controlling the distribution and abundance of Nodularia spumigena, the dominating diazotrophic and toxic phototroph, in the brackish water cyanobacterial blooms of the Baltic Sea. To expand the available genomic information for brackish water cyanobacteria, we sequenced the isolate Nodularia spumigena UHCC 0039 using an Illumina-SMRT hybrid sequencing approach, revealing a chromosome of 5,294,286 base pairs (bp) and a single plasmid of 92,326 bp. Comparative genomics in Nostocales showed pronounced genetic similarity among Nodularia spumigena strains evidencing their short evolutionary history. The studied Baltic Sea strains share similar sets of CRISPR-Cas cassettes and a higher number of insertion sequence (IS) elements compared to Nodularia spumigena CENA596 isolated from a shrimp production pond in Brazil. Nodularia spumigena UHCC 0039 proliferated similarly at three tested salinities, whereas the lack of salt inhibited its growth and triggered transcriptome remodeling, including the up-regulation of five sigma factors and the down-regulation of two other sigma factors, one of which is specific for strain UHCC 0039. Down-regulated genes additionally included a large genetic region for the synthesis of two yet unidentified natural products. Our results indicate a remarkable plasticity of the Nodularia salinity acclimation, and thus salinity strongly impacts the intensity and distribution of cyanobacterial blooms in the Baltic Sea.


September 22, 2019

Engineering of Halomonas bluephagenesis for low cost production of poly(3-hydroxybutyrate-co-4-hydroxybutyrate) from glucose.

Poly(3-hydroxybutyrate-co-4-hydroxybutyrate) [P(3HB-co-4HB)] is one of the most promising biomaterials expected to be used in a wide range of scenarios. However, its large-scale production is still hindered by the high cost. Here we report the engineering of Halomonas bluephagenesis as a low-cost platform for non-sterile and continuous fermentative production of P(3HB-co-4HB) from glucose. Two interrelated 4-hydroxybutyrate (4HB) biosynthesis pathways were constructed to guarantee 4HB monomer supply for P(3HB-co-4HB) synthesis by working in concert with 3-hydroxybutyrate (3HB) pathway. Interestingly, only 0.17 mol% 4HB in the copolymer was obtained during shake flask studies. Pathway debugging using structurally related carbon source located the failure as insufficient 4HB accumulation. Further whole genome sequencing and comparative genomic analysis identified multiple orthologs of succinate semialdehyde dehydrogenase (gabD) that may compete with 4HB synthesis flux in H. bluephagenesis. Accordingly, combinatory gene-knockout strains were constructed and characterized, through which the molar fraction of 4HB was increased by 24-fold in shake flask studies. The best-performing strain was grown on glucose as the single carbon source for 60 h under non-sterile conditions in a 7-L bioreactor, reaching 26.3 g/L of dry cell mass containing 60.5% P(3HB-co-17.04 mol% 4HB). Besides, 4HB molar fraction in the copolymer can be tuned from 13 mol% to 25 mol% by controlling the residual glucose concentration in the cultures. This is the first study to achieve the production of P(3HB-co-4HB) from only glucose using Halomonas. Copyright © 2018 International Metabolic Engineering Society. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Reproducible integration of multiple sequencing datasets to form high-confidence SNP, indel, and reference calls for five human genome reference materials

Benchmark small variant calls from the Genome in a Bottle Consortium (GIAB) for the CEPH/HapMap genome NA12878 (HG001) have been used extensively for developing, optimizing, and demonstrating performance of sequencing and bioinformatics methods. Here, we develop a reproducible, cloud-based pipeline to integrate multiple sequencing datasets and form benchmark calls, enabling application to arbitrary human genomes. We use these reproducible methods to form high-confidence calls with respect to GRCh37 and GRCh38 for HG001 and 4 additional broadly-consented genomes from the Personal Genome Project that are available as NIST Reference Materials. These new genomes’ broad, open consent with few restrictions on availability of samples and data is enabling a uniquely diverse array of applications. Our new methods produce 17% more high-confidence SNPs, 176% more indels, and 12% larger regions than our previously published calls. To demonstrate that these calls can be used for accurate benchmarking, we compare other high-quality callsets to ours (e.g., Illumina Platinum Genomes), and we demonstrate that the majority of discordant calls are errors in the other callsets. We also highlight challenges in interpreting performance metrics when benchmarking against imperfect high-confidence calls. We show that benchmarking tools from the Global Alliance for Genomics and Health can be used with our calls to stratify performance metrics by variant type and genome context and elucidate strengths and weaknesses of a method.


September 22, 2019

Repeat-driven generation of antigenic diversity in a major human pathogen, Trypanosoma cruzi

Trypanosoma cruzi, a zoonotic kinetoplastid protozoan with a complex genome, is the causative agent of American trypanosomiasis (Chagas disease). The parasite uses a highly diverse repertoire of surface molecules, with roles in cell invasion, immune evasion and pathogenesis. Thus far, the genomic regions containing these genes have been impossible to resolve and it has been impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here present an improved genome assembly of a T. cruzi clade I (TcI) strain using high coverage PacBio single molecule sequencing, together with Illumina sequencing of 34 T. cruzi TcI isolates and clones from different geographic locations, sample sources and clinical outcomes. Resolution of the surface molecule gene structure reveals an unusual duality in the organisation of the parasite genome, a core genomic region syntenous with related protozoa flanked by unique and highly plastic subtelomeric regions encoding surface antigens. The presence of abundant interspersed retrotransposons in the subtelomeres suggests that these elements are involved in a recombination mechanism for the generation of antigenic variation and evasion of the host immune response. The comparative genomic analysis of the cohort of TcI strains revealed multiple cases of such recombination events involving surface molecule genes and has provided new insights into T. cruzi population structure.


September 22, 2019

New Delhi metallo-beta-lactamase-producing Enterobacteriaceae in South Korea between 2010 and 2015.

This study was carried out to investigate the epidemiological time-course of New Delhi metallo-beta-lactamase- (NDM-) mediated carbapenem resistance in Enterobacteriaceae in South Korea. A total of 146 non-duplicate NDM-producing Enterobacteriaceae recovered between 2010 and 2015 were voluntarily collected from 33 general hospitals and confirmed by PCR. The species were identified by sequences of the 16S rDNA. Antimicrobial susceptibility was determined either by the disk diffusion method or by broth microdilution, and the carbapenem MICs were determined by agar dilution. Then, multilocus sequence typing and PCR-based replicon typing were carried out. Co-carried genes for drug resistance were identified by PCR and sequencing. The entire genomes of eight randomly selected NDM producers were sequenced. A total of 69 Klebsiella pneumoniae of 12 sequence types (STs), 34 Escherichia coli of 15 STs, 28 Enterobacter spp. (including one Enterobacter aerogenes), nine Citrobacter freundii, four Raoultella spp., and two Klebsiella oxytoca isolates produced either NDM-1 (n = 126), NDM-5 (n = 18), or NDM-7 (n = 2). The isolates co-produced CTX-M-type ESBL (52.1%), AmpCs (27.4%), additional carbapenemases (7.1%), and/or 16S rRNA methyltransferases (4.8%), resulting in multidrug resistance (47.9%) or extensive drug resistance (52.1%). Among plasmids harboring blaNDM, IncX3 was predominant (77.4%), followed by the IncFII type (5.8%). Genome analysis revealed inter-species and inter-strain horizontal gene transfer of the plasmid. Both clonal dissemination and plasmid transfer contributed to the wide dissemination of NDM producers in South Korea.


September 22, 2019

Expansions of intronic TTTCA and TTTTA repeats in benign adult familial myoclonic epilepsy.

Epilepsy is a common neurological disorder, and mutations in genes encoding ion channels or neurotransmitter receptors are frequent causes of monogenic forms of epilepsy. Here we show that abnormal expansions of TTTCA and TTTTA repeats in intron 4 of SAMD12 cause benign adult familial myoclonic epilepsy (BAFME). Single-molecule, real-time sequencing of BAC clones and nanopore sequencing of genomic DNA identified two repeat configurations in SAMD12. Intriguingly, in two families with a clinical diagnosis of BAFME in which no repeat expansions in SAMD12 were observed, we identified similar expansions of TTTCA and TTTTA repeats in introns of TNRC6A and RAPGEF2, indicating that expansions of the same repeat motifs are involved in the pathogenesis of BAFME regardless of the genes in which the expanded repeats are located. This discovery that expansions of noncoding repeats lead to neuronal dysfunction responsible for myoclonic tremor and epilepsy extends the understanding of diseases with such repeat expansion.


September 22, 2019

Characterization of Lactobacillus amylolyticus L6 as potential probiotics based on genome sequence and corresponding phenotypes

The potential of the newly isolated Lactobacillus amylolyticus L6 as a probiotic was investigated based on its whole genome sequence and corresponding phenotypes. With Lactobacillus acidophilus NCFM as a positive control, several established methods of evaluating potential probiotics were applied to L. amylolyticus L6. The results indicated that L. amylolyticus L6 retained higher viability in the human gastrointestinal (GI) tract and had a strong inhibitory effect on pathogenic bacteria. Meanwhile, the candidate probiotic exhibited an adhesion level similar to that of L. acidophilus NCFM in an in vitro test. As for its carbohydrate utilization profile, L. amylolyticus L6 showed a high ability to utilize raffinose and stachyose, which are known as flatulence factors in soybean products. The strain could also utilize starch. In addition, the mechanisms underlying the probiotic and metabolic properties of L. amylolyticus L6 were further illustrated by identifying related genes through analysis of the genome sequence. We therefore propose that L. amylolyticus L6 has the potential to be used as a probiotic, based on both phenotype and genotype. This study is the first to report the complete genome sequence of L. amylolyticus L6 and the potential of this strain for use as a probiotic.


September 22, 2019

Autologous cell therapy approach for Duchenne muscular dystrophy using PiggyBac transposons and mesoangioblasts.

Duchenne muscular dystrophy (DMD) is a lethal muscle-wasting disease currently without cure. We investigated the use of the PiggyBac transposon for full-length dystrophin expression in murine mesoangioblast (MABs) progenitor cells. DMD murine MABs were transfected with transposable expression vectors for full-length dystrophin and transplanted intramuscularly or intra-arterially into mdx/SCID mice. Intra-arterial delivery indicated that the MABs could migrate to regenerating muscles to mediate dystrophin expression. Intramuscular transplantation yielded dystrophin expression in 11%-44% of myofibers in murine muscles, which remained stable for the assessed period of 5 months. The satellite cells isolated from transplanted muscles comprised a fraction of MAB-derived cells, indicating that the transfected MABs may colonize the satellite stem cell niche. Transposon integration site mapping by whole-genome sequencing indicated that 70% of the integrations were intergenic, while none was observed in an exon. Muscle resistance assessment by atomic force microscopy indicated that 80% of fibers showed elasticity properties restored to those of wild-type muscles. As measured in vivo, transplanted muscles became more resistant to fatigue. This study thus provides a proof-of-principle that PiggyBac transposon vectors may mediate full-length dystrophin expression as well as functional amelioration of the dystrophic muscles within a potential autologous cell-based therapeutic approach of DMD. Copyright © 2018 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Plasmid-mediated quinolone resistance in Shigella flexneri isolated from macaques.

Non-human primates (NHPs) for biomedical research are commonly infected with Shigella spp. that can cause acute dysentery or chronic episodic diarrhea. These animals are often prophylactically and clinically treated with quinolone antibiotics to eradicate these possible infections. However, chromosomally- and plasmid-mediated antibiotic resistance has become an emerging concern for species in the family Enterobacteriaceae. In this study, five individual isolates of multi-drug resistant Shigella flexneri were isolated from the feces of three macaques. Antibiotic susceptibility testing confirmed resistance or decreased susceptibility to ampicillin, amoxicillin-clavulanic acid, cephalosporins, gentamicin, tetracycline, ciprofloxacin, enrofloxacin, levofloxacin, and nalidixic acid. S. flexneri isolates were susceptible to trimethoprim-sulfamethoxazole, and this drug was used to eradicate infection in two of the macaques. Plasmid DNA from all isolates was positive for the plasmid-encoded quinolone resistance gene qnrS, but not qnrA and qnrB. Conjugation and transformation of plasmid DNA from several S. flexneri isolates into antibiotic-susceptible Escherichia coli strains conferred the recipients with resistance or decreased susceptibility to quinolones and beta-lactams. Genome sequencing of two representative S. flexneri isolates identified the qnrS gene on a plasmid-like contig. These contigs showed >99% homology to plasmid sequences previously characterized from quinolone-resistant Shigella flexneri 2a and Salmonella enterica strains. Other antibiotic resistance genes and virulence factor genes were also identified in chromosome and plasmid sequences in these genomes. The findings from this study indicate macaques harbor pathogenic S. flexneri strains with chromosomally- and plasmid-encoded antibiotic resistance genes. To our knowledge, this is the first report of plasmid-mediated quinolone resistance in S. flexneri isolated from NHPs and warrants isolation and antibiotic testing of enteric pathogens before treating macaques with quinolones prophylactically or therapeutically.


September 22, 2019

Comparison of phasing strategies for whole human genomes.

Humans are a diploid species that inherit one set of chromosomes paternally and one homologous set of chromosomes maternally. Unfortunately, most human sequencing initiatives ignore this fact in that they do not directly delineate the nucleotide content of the maternal and paternal copies of the 23 chromosomes individuals possess (i.e., they do not ‘phase’ the genome) often because of the costs and complexities of doing so. We compared 11 different widely-used approaches to phasing human genomes using the publicly available ‘Genome-In-A-Bottle’ (GIAB) phased version of the NA12878 genome as a gold standard. The phasing strategies we compared included laboratory-based assays that prepare DNA in unique ways to facilitate phasing as well as purely computational approaches that seek to reconstruct phase information from general sequencing reads and constructs or population-level haplotype frequency information obtained through a reference panel of haplotypes. To assess the performance of the 11 approaches, we used metrics that included, among others, switch error rates, haplotype block lengths, the proportion of fully phase-resolved genes, phasing accuracy and yield between pairs of SNVs. Our comparisons suggest that a hybrid or combined approach that leverages: 1. population-based phasing using the SHAPEIT software suite, 2. either genome-wide sequencing read data or parental genotypes, and 3. a large reference panel of variant and haplotype frequencies, provides a fast and efficient way to produce highly accurate phase-resolved individual human genomes. We found that for population-based approaches, phasing performance is enhanced with the addition of genome-wide read data; e.g., whole genome shotgun and/or RNA sequencing reads. Further, we found that the inclusion of parental genotype data within a population-based phasing strategy can provide as much as a ten-fold reduction in phasing errors. We also considered a majority voting scheme for the construction of a consensus haplotype combining multiple predictions for enhanced performance and site coverage. Finally, we also identified DNA sequence signatures associated with the genomic regions harboring phasing switch errors, which included regions of low polymorphism or SNV density.
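Two ideas from this abstract, the switch error rate and a majority-vote consensus haplotype, can be illustrated with a short sketch. This is not the authors' implementation: the function names, the 0/1 encoding of one haplotype at heterozygous sites, and the toy data are all assumptions made for this example. The vote also assumes that every prediction has already been oriented the same way, with no global haplotype flip between callers.

```python
from collections import Counter


def switch_error_rate(truth, predicted):
    """Fraction of adjacent heterozygous-site pairs whose relative
    phase in `predicted` disagrees with `truth`.

    Both arguments list the allele (0 or 1) carried by one haplotype
    at each heterozygous site, in genomic order.
    """
    if len(truth) != len(predicted):
        raise ValueError("haplotypes must cover the same sites")
    if len(truth) < 2:
        return 0.0
    agrees = [t == p for t, p in zip(truth, predicted)]
    # A switch occurs wherever agreement with the truth changes
    # between one site and the next.
    switches = sum(1 for a, b in zip(agrees, agrees[1:]) if a != b)
    return switches / (len(truth) - 1)


def majority_vote(predictions):
    """Per-site majority allele across several phased predictions.

    Assumes an odd number of consistently oriented predictions, so
    that ties between the two alleles cannot occur.
    """
    consensus = []
    for calls in zip(*predictions):
        allele, _count = Counter(calls).most_common(1)[0]
        consensus.append(allele)
    return consensus


# Hypothetical toy data: six heterozygous sites, three callers.
truth = [0, 1, 1, 0, 1, 0]
caller_a = [0, 1, 1, 0, 1, 0]  # matches the truth
caller_b = [0, 1, 0, 1, 0, 1]  # one switch after the second site
caller_c = [0, 0, 1, 0, 1, 0]  # single wrong site (two switches)

consensus = majority_vote([caller_a, caller_b, caller_c])
print(switch_error_rate(truth, caller_b))
print(switch_error_rate(truth, caller_c))
print(switch_error_rate(truth, consensus))
```

In this toy case the callers make their errors at different sites, so the per-site vote recovers the true haplotype even though two of the three inputs contain switch errors.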


September 22, 2019

Identification and pathogenomic analysis of an Escherichia coli strain producing a novel Shiga toxin 2 subtype.

Shiga toxin (Stx) is the key virulence factor in Shiga toxin-producing Escherichia coli (STEC). To date, three Stx1 subtypes and seven Stx2 subtypes have been described in E. coli, which differ in receptor preference and toxin potency. Here, we identified a novel Stx2 subtype designated Stx2h in E. coli strains isolated from wild marmots in the Qinghai-Tibetan plateau, China. Stx2h shares 91.9% nucleic acid sequence identity and 92.9% amino acid identity with the nearest Stx2 subtype. The expression of Stx2h in type strain STEC299 was inducible by mitomycin C, and culture supernatant from STEC299 was cytotoxic to Vero cells. The Stx2h-converting prophage was unique in terms of insertion site and genetic composition. Whole genome-based phylo- and patho-genomic analysis revealed that STEC299 was closer to other pathotypes of E. coli than to STEC, and possesses virulence factors from other pathotypes. Our finding enlarges the pool of Stx2 subtypes and highlights the extraordinary genomic plasticity of E. coli strains. As the emergence of new Shiga toxin genotypes and new Stx-producing pathotypes poses a great threat to public health, Stx2h should be included in E. coli molecular typing and in epidemiological surveillance of E. coli infections.


September 22, 2019

Epigenetic landscape influences the liver cancer genome architecture.

The accumulation of different types of genetic alterations, such as nucleotide substitutions, structural rearrangements and viral genome integrations, together with epigenetic alterations, contributes to carcinogenesis. Here, we report correlation between the occurrence of epigenetic features and genetic aberrations by whole-genome bisulfite, whole-genome shotgun, long-read, and virus capture sequencing of 373 liver cancers. Somatic substitutions and rearrangement breakpoints are enriched in tumor-specific hypo-methylated regions with inactive chromatin marks and actively transcribed highly methylated regions in the cancer genome. Individual mutation signatures depend on chromatin status; in particular, signatures with a higher transcriptional strand bias occur within active chromatin areas. Hepatitis B virus (HBV) integration sites are frequently detected within inactive chromatin regions in cancer cells, as a consequence of negative selection for integrations in active chromatin regions. Ultra-high structural instability and preserved unmethylation of integrated HBV genomes are observed. We conclude that both precancerous and somatic epigenetic features contribute to the cancer genome architecture.


September 22, 2019

IMSindel: An accurate intermediate-size indel detection tool incorporating de novo assembly and gapped global-local alignment with split read analysis.

Insertions and deletions (indels) have been implicated in dozens of human diseases through the radical alteration of gene function by short frameshift indels as well as long indels. However, the accurate detection of these indels from next-generation sequencing data is still challenging. This is particularly true for intermediate-size indels (≥50 bp), due to the short DNA sequencing reads. Here, we developed a new method that predicts intermediate-size indels using BWA soft-clipped fragments (unmatched fragments in partially mapped reads) and unmapped reads. We report the performance comparison of our method, GATK, PINDEL and ScanIndel, using whole exome sequencing data from the same samples. False positive and false negative counts were determined through Sanger sequencing of all predicted indels across these four methods. The harmonic mean of the recall and precision, F-measure, was used to measure the performance of each method. Our method achieved the highest F-measure of 0.84 in one sample, compared to 0.56 for GATK, 0.52 for PINDEL and 0.46 for ScanIndel. Similar results were obtained in additional samples, demonstrating that our method was superior to the other methods for detecting intermediate-size indels. We believe that this methodology will contribute to the discovery of intermediate-size indels associated with human disease.
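The F-measure used in this abstract is the harmonic mean of precision and recall, both of which follow from the true positive, false positive and false negative counts. The sketch below shows the arithmetic only; the function name `f_measure` and the example counts are hypothetical and do not reproduce the validation counts behind the reported scores.

```python
def f_measure(true_pos, false_pos, false_neg):
    """Harmonic mean of precision and recall (the F1 score).

    precision = TP / (TP + FP), recall = TP / (TP + FN).
    Returns 0.0 when there are no true positives, which avoids a
    division by zero.
    """
    if true_pos == 0:
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts: 6 validated calls, 2 false calls, 6 missed indels.
# precision = 0.75, recall = 0.5, so F = 2 * 0.375 / 1.25 = 0.6
print(f_measure(6, 2, 6))
```

Because it is a harmonic mean, the F-measure is pulled toward the lower of the two values, so a caller cannot score well by maximizing recall at the expense of precision or the reverse.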


September 22, 2019

SvABA: genome-wide detection of structural variants and indels by local assembly.

Structural variants (SVs), including small insertion and deletion variants (indels), are challenging to detect through standard alignment-based variant calling methods. Sequence assembly offers a powerful approach to identifying SVs, but is difficult to apply at scale genome-wide for SV detection due to its computational complexity and the difficulty of extracting SVs from assembly contigs. We describe SvABA, an efficient and accurate method for detecting SVs from short-read sequencing data using genome-wide local assembly with low memory and computing requirements. We evaluated SvABA’s performance on the NA12878 human genome and in simulated and real cancer genomes. SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs and substantially improves detection performance for variants in the 20-300 bp range, compared with existing methods. SvABA also identifies complex somatic rearrangements with chains of short (<1000 bp) templated-sequence insertions copied from distant genomic regions. We applied SvABA to 344 cancer genomes from 11 cancer types and found that short templated-sequence insertions occur in ~4% of all somatic rearrangements. Finally, we demonstrate that SvABA can identify sites of viral integration and cancer driver alterations containing medium-sized (50-300 bp) SVs. © 2018 Wala et al.; Published by Cold Spring Harbor Laboratory Press.

