SMRT Sequencing Archives - Page 377 of 463

July 7, 2019

Disease onset in X-linked dystonia-parkinsonism correlates with expansion of a hexameric repeat within an SVA retrotransposon in TAF1.

X-linked dystonia-parkinsonism (XDP) is a neurodegenerative disease associated with an antisense insertion of a SINE-VNTR-Alu (SVA)-type retrotransposon within an intron ofTAF1This unique insertion coincides with six additional noncoding sequence changes inTAF1, the gene that encodes TATA-binding protein-associated factor-1, which appear to be inherited together as an identical haplotype in all reported cases. Here we examined the sequence of this SVA in XDP patients (n= 140) and detected polymorphic variation in the length of a hexanucleotide repeat domain, (CCCTCT)nThe number of repeats in these cases ranged from 35 to 52 and showed a highly significant inverse correlation with age at disease onset. Because other SVAs exhibit intrinsic promoter activity that depends in part on the hexameric domain, we assayed the transcriptional regulatory effects of varying hexameric lengths found in the unique XDP SVA retrotransposon using luciferase reporter constructs. When inserted sense or antisense to the luciferase reading frame, the XDP variants repressed or enhanced transcription, respectively, to an extent that appeared to vary with length of the hexamer. Further in silico analysis of this SVA sequence revealed multiple motifs predicted to form G-quadruplexes, with the greatest potential detected for the hexameric repeat domain. These data directly link sequence variation within the XDP-specific SVA sequence to phenotypic variability in clinical disease manifestation and provide insight into potential mechanisms by which this intronic retroelement may induce transcriptional interference inTAF1expression. Copyright © 2017 the Author(s). Published by PNAS.

July 7, 2019

Sex-specific influences of mtDNA mitotype and diet on mitochondrial functions and physiological traits in Drosophila melanogaster.

Here we determine the sex-specific influence of mtDNA type (mitotype) and diet on mitochondrial functions and physiology in two Drosophila melanogaster lines. In many species, males and females differ in aspects of their energy production. These sex-specific influences may be caused by differences in evolutionary history and physiological functions. We predicted the influence of mtDNA mutations should be stronger in males than females as a result of the organelle’s maternal mode of inheritance in the majority of metazoans. In contrast, we predicted the influence of diet would be greater in females due to higher metabolic flexibility. We included four diets that differed in their protein: carbohydrate (P:C) ratios as they are the two-major energy-yielding macronutrients in the fly diet. We assayed four mitochondrial function traits (Complex I oxidative phosphorylation, reactive oxygen species production, superoxide dismutase activity, and mtDNA copy number) and four physiological traits (fecundity, longevity, lipid content, and starvation resistance). Traits were assayed at 11 d and 25 d of age. Consistent with predictions we observe that the mitotype influenced males more than females supporting the hypothesis of a sex-specific selective sieve in the mitochondrial genome caused by the maternal inheritance of mitochondria. Also, consistent with predictions, we found that the diet influenced females more than males.

July 7, 2019

On the importance of homology in the age of phylogenomics

Homology is perhaps the most central concept of phylogenetic biology. Molecular systematists have traditionally paid due attention to the homology statements that are implied by their alignments of orthologous sequences, but some authors have suggested that manual gene-by-gene curation is not sustainable in the phylogenomics era. Here, we show that there are multiple ways to efficiently screen for and detect homology errors in phylogenomic data sets. Application of these screening approaches to two phylogenomic data sets, one for birds and another for mammals, shows that these data are replete with homology errors including alignments of different exons to each other, alignments of exons to introns, and alignments of paralogues to each other. The extent of these homology errors weakens the conclusions of studies based on these data sets. Despite advances in automated phylogenomic pipelines, we contend that much of the long, difficult, and sometimes tedious work of systematics is still required to guard against pervasive homology errors. This conclusion is underscored by recent studies that show that just a few outlier genes can impact phylogenetic results at short, tightly spaced internodes that are deep in the Tree of Life. The view that widespread DNA sequence alignment errors are not a major concern for rigorous systematic research is not tenable. If a primary goal of phylogenomics is to resolve the most challenging phylogenetic problems with the abundant data that are now available, researchers must employ effective procedures to screen for and correct homology errors prior to performing downstream phylogenetic analyses.

July 7, 2019

COSINE: non-seeding method for mapping long noisy sequences.

Third generation sequencing (TGS) are highly promising technologies but the long and noisy reads from TGS are difficult to align using existing algorithms. Here, we present COSINE, a conceptually new method designed specifically for aligning long reads contaminated by a high level of errors. COSINE computes the context similarity of two stretches of nucleobases given the similarity over distributions of their short k-mers (k = 3-4) along the sequences. The results on simulated and real data show that COSINE achieves high sensitivity and specificity under a wide range of read accuracies. When the error rate is high, COSINE can offer substantial advantages over existing alignment methods.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

July 7, 2019

Comparative whole-genomic analysis of an ancient L2 lineage Mycobacterium novel phylogenetic clade and common genetic determinants of hypervirulent strains.

Background: Development of improved therapeutics against tuberculosis (TB) is hindered by an inadequate understanding of the relationship between disease severity and genetic diversity of its causative agent, Mycobacterium tuberculosis. We previously isolated a hypervirulent M. tuberculosis strain H112 from an HIV-negative patient with an aggressive disease progression from pulmonary TB to tuberculous meningitis—the most severe manifestation of tuberculosis. Human macrophage challenge experiment demonstrated that the strain H112 exhibited significantly better intracellular survivability and induced lower level of TNF-a than the reference virulent strain H37Rv and other 123 clinical isolates. Aim: The present study aimed to identify the potential genetic determinants of mycobacterial virulence that were common to strain H112 and hypervirulent M. tuberculosis strains of the same phylogenetic clade isolated in other global regions. Methods: A low-virulent M. tuberculosis strain H54 which belonged to the same phylogenetic lineage (L2) as strain H112 was selected from a collection of 115 clinical isolates. Both H112 and H54 were whole-genome-sequenced using PacBio sequencing technology. A comparative genomics approach was adopted to identify mutations present in strain H112 but absent in strain H54. Subsequently, an extensive phylogenetic analysis was conducted by including all publically available M. tuberculosis genomes. Single-nucleotide-polymorphisms (SNPs) and structural variations (SVs) common to hypervirulent strains in the global collection of genomes were considered as potential genetic determinants of hypervirulence. Results: Sequencing data revealed that both H112 and H54 were identified as members of the same sub-lineage L2.2.1. After excluding the lineage-related mutations shared between H112 and H54, we analyzed the phylogenetic relatedness of H112 with global collection of M. tuberculosis genomes (n = 4,338), and identified a novel phylogenetic clade in which four hypervirulent strains isolated from geographically diverse regions were clustered together. All hypervirulent strains in the clade shared 12 SNPs and 5 SVs with H112, including those affecting key virulence-associated loci, notably, a deleterious SNP (rv0178 p. D150E) within mce1 operon and an intergenic deletion (854259_ 854261delCC) in close-proximity to phoP. Conclusion: The present study identified common genetic factors in a novel phylogenetic clade of hypervirulent M. tuberculosis. The causative role of these mutations in mycobacterial virulence should be validated in future study.

July 7, 2019

A feast of malaria parasite genomes.

The Plasmodium genus has evolved over time and across hosts, complexifying our understanding of malaria. In a recent Nature paper, Rutledge et al. (2017) describe the genome sequences of three major human malaria parasite species, providing insight into Plasmodium evolution and raising the question of how many species there are. Copyright © 2017 Elsevier Inc. All rights reserved.

July 7, 2019

A recurrence-based approach for validating structural variation using long-read sequencing technology.

Although numerous algorithms have been developed to identify structural variations (SVs) in genomic sequences, there is a dearth of approaches that can be used to evaluate their results. This is significant as the accurate identification of structural variation is still an outstanding but important problem in genomics. The emergence of new sequencing technologies that generate longer sequence reads can, in theory, provide direct evidence for all types of SVs regardless of the length of the region through which it spans. However, current efforts to use these data in this manner require the use of large computational resources to assemble these sequences as well as visual inspection of each region. Here we present VaPoR, a highly efficient algorithm that autonomously validates large SV sets using long-read sequencing data. We assessed the performance of VaPoR on SVs in both simulated and real genomes and report a high-fidelity rate for overall accuracy across different levels of sequence depths. We show that VaPoR can interrogate a much larger range of SVs while still matching existing methods in terms of false positive validations and providing additional features considering breakpoint precision and predicted genotype. We further show that VaPoR can run quickly and efficiency without requiring a large processing or assembly pipeline. VaPoR provides a long read-based validation approach for genomic SVs that requires relatively low read depth and computing resources and thus will provide utility with targeted or low-pass sequencing coverage for accurate SV assessment. The VaPoR Software is available at: https://github.com/mills-lab/vapor.© The Authors 2017. Published by Oxford University Press.

July 7, 2019

De novo design and synthesis of a 30-cistron translation-factor module.

Two of the many goals of synthetic biology are synthesizing large biochemical systems and simplifying their assembly. While several genes have been assembled together by modular idempotent cloning, it is unclear if such simplified strategies scale to very large constructs for expression and purification of whole pathways. Here we synthesize from oligodeoxyribonucleotides a completely de-novo-designed, 58-kb multigene DNA. This BioBrick plasmid insert encodes 30 of the 31 translation factors of the PURE translation system, each His-tagged and in separate transcription cistrons. Dividing the insert between three high-copy expression plasmids enables the bulk purification of the aminoacyl-tRNA synthetases and translation factors necessary for affordable, scalable reconstitution of an in vitro transcription and translation system, PURE 3.0.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

July 7, 2019

Nitrogen fixation genes and nitrogenase activity of the non-heterocystous cyanobacterium Thermoleptolyngbya sp. O-77.

Cyanobacteria are widely distributed in marine, aquatic, and terrestrial ecosystems, and play an important role in the global nitrogen cycle. In the present study, we examined the genome sequence of the thermophilic non-heterocystous N2-fixing cyanobacterium, Thermoleptolyngbya sp. O-77 (formerly known as Leptolyngbya sp. O-77) and characterized its nitrogenase activity. The genome of this cyanobacterial strain O-77 consists of a single chromosome containing a nitrogen fixation gene cluster. A phylogenetic analysis indicated that the NifH amino acid sequence from strain O-77 was clustered with those from a group of mesophilic species: the highest identity was found in Leptolyngbya sp. KIOST-1 (97.9% sequence identity). The nitrogenase activity of O-77 cells was dependent on illumination, whereas a high intensity of light of 40 µmol m-2 s-1 suppressed the effects of illumination.

July 7, 2019

The draft genome sequence of Pectobacterium carotovorum subsp. actinidiae KKH3 that infects kiwi plant and potential bioconversion applications

Pectobacterium carotovorum subsp. actinidiae KKH3 is an Enterobacteriaceae bacterial pathogen that infects kiwi plants, causing canker-like symptoms that pose a threat to the kiwifruit industry. Because the strain was originally isolated from woody plants and possesses numerous plant cell wall-degrading enzymes, this draft genome report provides insight into possible bioconversion applications, as well as a better understanding of this important plant pathogen.

July 7, 2019

Complete genome sequence of the Vibrio vulnificus strain VV2014DJH, a human-pathogenic bacterium isolated from a death case in China.

Vibrio vulnificus, an opportunistic pathogen, is the causative agent of life-threatening septicemia and severe wound infections. However, the pathogenicity and virulence factors of V. vulnificus are not fully understood. Here we report the complete genome sequence of V. vulnificus VV2014DJH, which was isolated from a death case.The genome of the V. vulnificus VV2014DJH contains two circular chromosomes with a mean G+C content of 46.8%, but does not consists of any plasmids. The chromosome I and chromosome II consist of 3,303,590 and 1,770,972 bp, respectively. In addition, the genome consists of 4617 protein coding genes, 172 RNA genes and type I, II and III secretion systems were predicted.In this study, the genomic information of the V. vulnificus VV2014DJH has been described. The information would contribute to the increasing scope and depth of Vibrio genome database, and provide insights into the pathogenicity and virulence factors of V. vulnificus.

July 7, 2019

Unlocking the biological potential of Euglena gracilis: evolution, cell biology and significance to parasitism

Photosynthetic euglenids are major components of aquatic ecosystems and relatives of trypanosomes. Euglena gracilis has considerable biotechnological potential and great adaptability, but exploitation remains hampered by the absence of a comprehensive gene catalogue. We address this by genome, RNA and protein sequencing: the E. gracilis genome is >2Gb, with 36,526 predicted proteins. Large lineage-specific paralog families are present, with evidence for flexibility in environmental monitoring, divergent mechanisms for metabolic control, and novel solutions for adaptation to extreme environments. Contributions from photosynthetic eukaryotes to the nuclear genome, consistent with the shopping bag model are found, together with transitions between kinetoplastid and canonical systems. Control of protein expression is almost exclusively post-transcriptional. These data are a major advance in understanding the nuclear genomes of euglenids and provide a platform for investigating the contributions of E. gracilis and its relatives to the biosphere.

July 7, 2019

The plastid genome in Cladophorales green algae is encoded by hairpin chromosomes.

Virtually all plastid (chloroplast) genomes are circular double-stranded DNA molecules, typically between 100 and 200 kb in size and encoding circa 80-250 genes. Exceptions to this universal plastid genome architecture are very few and include the dinoflagellates, where genes are located on DNA minicircles. Here we report on the highly deviant chloroplast genome of Cladophorales green algae, which is entirely fragmented into hairpin chromosomes. Short- and long-read high-throughput sequencing of DNA and RNA demonstrated that the chloroplast genes of Boodlea composita are encoded on 1- to 7-kb DNA contigs with an exceptionally high GC content, each containing a long inverted repeat with one or two protein-coding genes and conserved non-coding regions putatively involved in replication and/or expression. We propose that these contigs correspond to linear single-stranded DNA molecules that fold onto themselves to form hairpin chromosomes. The Boodlea chloroplast genes are highly divergent from their corresponding orthologs, and display an alternative genetic code. The origin of this highly deviant chloroplast genome most likely occurred before the emergence of the Cladophorales, and coincided with an elevated transfer of chloroplast genes to the nucleus. A chloroplast genome that is composed only of linear DNA molecules is unprecedented among eukaryotes, and highlights unexpected variation in plastid genome architecture. Copyright © 2017 Elsevier Ltd. All rights reserved.

July 7, 2019

Community-acquired cavitary Pseudomonas pneumonia linked to use of a home humidifier.

Pseudomonas aeruginosa is an opportunistic pathogen that rarely causes pneumonia in otherwise healthy patients. We describe a case of community-acquired P. aeruginosa pneumonia in a previously healthy individual who likely acquired the infection from a home humidifier.

July 7, 2019

The complete mitochondrial genome of Wonwhang (Pyrus pyrifolia)

This is a de novo assembly and annotation of a complete mitochondrial genome from Pyrus pyrifolia in the family Rosaceae. The complete mitochondrial genome of P. pyrifolia was assembled from PacBio RSII P6-C4 sequencing reads. The circular genome was 458,873?bp in length, containing 39 protein-coding genes, 23 tRNA genes and three rRNA genes. The nucleotide composition was A (27.5%), T (27.3%), G (22.6%) and C (22.6%) with GC content of 45.2%. Most of protein-coding genes use the canonical start codon ATG, whereas nad1, cox1, matR and rps4 use ACG, mttB uses ATT, rpl16 and rps19 uses GTG. The stop codon is also common in all mitochondrial genes. The phylogenetic analysis showed that P. pyrifolia was clustered with the Malus of Rosaceae family. Maximum-likelihood analysis suggests a clear relationship of Rosids and Asterids, which support the traditional classification.

Asset Tag: SMRT Sequencing

Disease onset in X-linked dystonia-parkinsonism correlates with expansion of a hexameric repeat within an SVA retrotransposon in TAF1.

Sex-specific influences of mtDNA mitotype and diet on mitochondrial functions and physiological traits in Drosophila melanogaster.

On the importance of homology in the age of phylogenomics

COSINE: non-seeding method for mapping long noisy sequences.

Comparative whole-genomic analysis of an ancient L2 lineage Mycobacterium novel phylogenetic clade and common genetic determinants of hypervirulent strains.

A feast of malaria parasite genomes.

A recurrence-based approach for validating structural variation using long-read sequencing technology.

De novo design and synthesis of a 30-cistron translation-factor module.

Nitrogen fixation genes and nitrogenase activity of the non-heterocystous cyanobacterium Thermoleptolyngbya sp. O-77.

The draft genome sequence of Pectobacterium carotovorum subsp. actinidiae KKH3 that infects kiwi plant and potential bioconversion applications

Complete genome sequence of the Vibrio vulnificus strain VV2014DJH, a human-pathogenic bacterium isolated from a death case in China.

Unlocking the biological potential of Euglena gracilis: evolution, cell biology and significance to parasitism

The plastid genome in Cladophorales green algae is encoded by hairpin chromosomes.

Community-acquired cavitary Pseudomonas pneumonia linked to use of a home humidifier.

The complete mitochondrial genome of Wonwhang (Pyrus pyrifolia)

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert