Bioinformatics Archives - Page 238 of 267

July 7, 2019

Complete genome sequence of the D-amino acid catabolism bacterium Phaeobacter sp. strain JL2886, isolated from deep seawater of the South China Sea

Phaeobacter sp. strain JL2886, isolated from deep seawater of the South China Sea, can catabolize d-amino acids. Here, we report the complete genome sequence of Phaeobacter sp. JL2886. It comprises ~4.06 Mbp, with a G+C content of 61.52%. A total of 3,913 protein-coding genes and 10 genes related to d-amino acid catabolism were obtained. Copyright © 2016 Fu et al.

July 7, 2019

Privacy-preserving read mapping using locality sensitive hashing and secure kmer voting

The recent explosion in the amount of available genome sequencing data imposes high computational demands on the tools designed to analyze it. Low-cost cloud computing has the potential to alleviate this burden. However, moving personal genome data analysis to the cloud raises serious privacy concerns. Read alignment is a critical and computationally intensive first step of most genomic data analysis pipelines. While significant effort has been dedicated to optimize the sensitivity and runtime efficiency of this step, few approaches have addressed outsourcing this computation securely to an untrusted party. The few secure solutions that have been proposed either do not scale to whole genome sequencing datasets or are not competitive with the state of the art in read mapping. In this paper, we present BALAUR, a privacy-preserving read mapping algorithm based on locality sensitive hashing and secure kmer voting. BALAUR securely outsources a significant portion of the computation to the public cloud by formulating the alignment task as a voting scheme between encrypted read and reference kmers. Our approach can easily handle typical genome-scale datasets and is highly competitive with non-cryptographic state-of-the-art read aligners in both accuracy and runtime performance on simulated and real read data. Moreover, our approach is significantly faster than state-of-the-art read aligners in long read mapping.

July 7, 2019

Genomic and transcriptomic analyses of the tangerine pathotype of Alternaria alternata in response to oxidative stress.

The tangerine pathotype of Alternaria alternata produces the A. citri toxin (ACT) and is the causal agent of citrus brown spot that results in significant yield losses worldwide. Both the production of ACT and the ability to detoxify reactive oxygen species (ROS) are required for A. alternata pathogenicity in citrus. In this study, we report the 34.41?Mb genome sequence of strain Z7 of the tangerine pathotype of A. alternata. The host selective ACT gene cluster in strain Z7 was identified, which included 25 genes with 19 of them not reported previously. Of these, 10 genes were present only in the tangerine pathotype, representing the most likely candidate genes for this pathotype specialization. A transcriptome analysis of the global effects of H2O2 on gene expression revealed 1108 up-regulated and 498 down-regulated genes. Expressions of those genes encoding catalase, peroxiredoxin, thioredoxin and glutathione were highly induced. Genes encoding several protein families including kinases, transcription factors, transporters, cytochrome P450, ubiquitin and heat shock proteins were found associated with adaptation to oxidative stress. Our data not only revealed the molecular basis of ACT biosynthesis but also provided new insights into the potential pathways that the phytopathogen A. alternata copes with oxidative stress.

July 7, 2019

Building two indica rice reference genomes with PacBio long-read and Illumina paired-end sequencing data.

Over the past 30 years, we have performed many fundamental studies on two Oryza sativa subsp. indica varieties, Zhenshan 97 (ZS97) and Minghui 63 (MH63). To improve the resolution of many of these investigations, we generated two reference-quality reference genome assemblies using the most advanced sequencing technologies. Using PacBio SMRT technology, we produced over 108 (ZS97) and 174 (MH63) Gb of raw sequence data from 166 (ZS97) and 209 (MH63) pools of BAC clones, and generated ~97 (ZS97) and ~74 (MH63) Gb of paired-end whole-genome shotgun (WGS) sequence data with Illumina sequencing technology. With these data, we successfully assembled two platinum standard reference genomes that have been publicly released. Here we provide the full sets of raw data used to generate these two reference genome assemblies. These data sets can be used to test new programs for better genome assembly and annotation, aid in the discovery of new insights into genome structure, function, and evolution, and help to provide essential support to biological research in general.

July 7, 2019

The genome sequence of allopolyploid Brassica juncea and analysis of differential homoeolog gene expression influencing selection.

The Brassica genus encompasses three diploid and three allopolyploid genomes, but a clear understanding of the evolution of agriculturally important traits via polyploidy is lacking. We assembled an allopolyploid Brassica juncea genome by shotgun and single-molecule reads integrated to genomic and genetic maps. We discovered that the A subgenomes of B. juncea and Brassica napus each had independent origins. Results suggested that A subgenomes of B. juncea were of monophyletic origin and evolved into vegetable-use and oil-use subvarieties. Homoeolog expression dominance occurs between subgenomes of allopolyploid B. juncea, in which differentially expressed genes display more selection potential than neutral genes. Homoeolog expression dominance in B. juncea has facilitated selection of glucosinolate and lipid metabolism genes in subvarieties used as vegetables and for oil production. These homoeolog expression dominance relationships among Brassicaceae genomes have contributed to selection response, predicting the directional effects of selection in a polyploid crop genome.

July 7, 2019

Strategies for sequence assembly of plant genomes

The field of plant genome assembly has greatly benefited from the development and widespread adoption of next-generation DNA sequencing platforms. Very high sequencing throughputs and low costs per nucleotide have considerably reduced the technical and budgetary constraints associated with early assembly projects done primarily with a traditional Sanger-based approach. Those improvements led to a sharp increase in the number of plant genomes being sequenced, including large and complex genomes of economically important crops. Although next-generation DNA sequencing has considerably improved our understanding of the overall structure and dynamics of many plant genomes, severe limitations still remain because next-generation DNA sequencing reads typically are shorter than Sanger reads. In addition, the software tools used to de novo assemble sequences are not necessarily designed to optimize the use of short reads. These cause challenges, common to many plant species with large genome sizes, high repeat contents, polyploidy and genome-wide duplications. This chapter provides an overview of historical and current methods used to sequence and assemble plant genomes, along with new solutions offered by the emergence of technologies such as single molecule sequencing and optical mapping to address the limitations of current sequence assemblies.

July 7, 2019

Development of Streptomyces sp. FR-008 as an emerging chassis

Microbial-derived natural products are important in both the pharmaceutical industry and academic research. As the metabolic potential of original producer especially Streptomyces is often limited by slow growth rate, complicated cultivation profile, and unfeasible genetic manipulation, so exploring a Streptomyces as a super industrial chassis is valuable and urgent. Streptomyces sp. FR-008 is a fast-growing microorganism and can also produce a considerable amount of macrolide candicidin via modular polyketide synthase. In this study, we evaluated Streptomyces sp. FR-008 as a potential industrial-production chassis. First, PacBio sequencing and transcriptome analyses indicated that the Streptomyces sp. FR-008 genome size is 7.26 Mb, which represents one of the smallest of currently sequenced Streptomyces genomes. In addition, we simplified the conjugation procedure without heat-shock and pre-germination treatments but with high conjugation efficiency, suggesting it is inherently capable of accepting heterologous DNA. In addition, a series of promoters selected from literatures was assessed based on GusA activity in Streptomyces sp. FR-008. Compared with the common used promoter ermE*-p, the strength of these promoters comprise a library with a constitutive range of 60–860%, thus providing the useful regulatory elements for future genetic engineering purpose. In order to minimum the genome, we also target deleted three endogenous polyketide synthase (PKS) gene clusters to generate a mutant LQ3. LQ3 is thus an “updated” version of Streptomyces sp. FR-008, producing fewer secondary metabolites profiles than Streptomyces sp. FR-008. We believe this work could facilitate further development of Streptomyces sp. FR-008 for use in biotechnological applications.

July 7, 2019

Emergence of epidemic Neisseria meningitidis serogroup C in Niger, 2015: an analysis of national surveillance data.

To combat Neisseria meningitidis serogroup A epidemics in the meningitis belt of sub-Saharan Africa, a meningococcal serogroup A conjugate vaccine (MACV) has been progressively rolled out since 2010. We report the first meningitis epidemic in Niger since the nationwide introduction of MACV.We compiled and analysed nationwide case-based meningitis surveillance data in Niger. Cases were confirmed by culture or direct real-time PCR, or both, of cerebrospinal fluid specimens, and whole-genome sequencing was used to characterise isolates. Information on vaccination campaigns was collected by the Niger Ministry of Health and WHO.From Jan 1 to June 30, 2015, 9367 suspected meningitis cases and 549 deaths were reported in Niger. Among 4301 cerebrospinal fluid specimens tested, 1603 (37·3%) were positive for a bacterial pathogen, including 1147 (71·5%) that were positive for N meningitidis serogroup C (NmC). Whole-genome sequencing of 77 NmC isolates revealed the strain to be ST-10217. Although vaccination campaigns were limited in scope because of a global vaccine shortage, 1·4 million people were vaccinated from March to June, 2015.This epidemic represents the largest global NmC outbreak so far and shows the continued threat of N meningitidis in sub-Saharan Africa. The risk of further regional expansion of this novel clone highlights the need for continued strengthening of case-based surveillance. The availability of an affordable, multivalent conjugate vaccine may be important in future epidemic response.MenAfriNet consortium, a partnership between the US Centers for Disease Control and Prevention, WHO, and Agence de Médecine Preventive, through a grant from the Bill & Melinda Gates Foundation. Copyright © 2016 World Health Organization. Published by Elsevier Ltd/Inc/BV. All rights reserved. Published by Elsevier Ltd.. All rights reserved.

July 7, 2019

Complete genome sequences of the historical Legionella pneumophila strains OLDA and Pontiac.

Here, we report the complete genome sequences of Legionella pneumophila serogroup 1 strains OLDA and Pontiac, which predate the 1976 Philadelphia Legionnaires’ disease outbreak. Strain OLDA was isolated in 1947 from an apparent sporadic case, and strain Pontiac caused an explosive outbreak at a Michigan health department in 1968. Copyright © 2016 Mercante et al.

July 7, 2019

Complete genome sequence of the proteorhodopsin-containing marine flavobacterium Dokdonia donghaensis DSW-1T, isolated from seawater off Dokdo in the East Sea (Sea of Korea).

Dokdonia spp. have been used for investigating the lifestyles of proteorhodopsin-containing photoheterotrophs and for understanding marine photobiology. Here, we report the complete genome sequence of Dokdonia donghaensis DSW-1(T) using the PacBio sequencing platform. It should provide a valuable resource for comparative genomic studies of marine life harboring microbial rhodopsins among others. Copyright © 2016 Kim et al.

July 7, 2019

Complete genome sequence of the intracellular bacterial symbiont TC1 in the anaerobic ciliate Trimyema compressum.

A free-living ciliate, Trimyema compressum, found in anoxic freshwater environments harbors methanogenic archaea and a bacterial symbiont named TC1 in its cytoplasm. Here, we report the complete genome sequence of the TC1 symbiont, consisting of a 1.59-Mb chromosome and a 35.8-kb plasmid, which was determined using the PacBio RSII sequencer. Copyright © 2016 Shinzato et al.

July 7, 2019

Draft genome sequence of Escherichia coli S51, a chicken isolate harboring a chromosomally encoded mcr-1 gene.

We present the draft genome of Escherichia coli S51, a colistin-resistant extended-spectrum ß-lactamase-producing strain isolated in 2015 from raw chicken meat imported from Germany. Assembly and annotation of this draft genome resulted in a 4,994,918-bp chromosome and revealed a chromosomally encoded mcr-1 gene responsible for the colistin resistance of the strain. Copyright © 2016 Zurfluh et al.

July 7, 2019

Lysosomal Cathepsin A plays a significant role in the processing of endogenous bioactive peptides.

Lysosomal serine carboxypeptidase Cathepsin A (CTSA) is a multifunctional enzyme with distinct protective and catalytic function. CTSA present in the lysosomal multienzyme complex to facilitate the correct lysosomal routing, stability and activation of with beta-galactosidase and alpha-neuraminidase. Beside CTSA has role in inactivation of bioactive peptides including bradykinin, substances P, oxytocin, angiotensin I and endothelin-I by cleavage of 1 or 2 amino acid(s) from C-terminal ends. In this study, we aimed to elucidate the regulatory role of CTSA on bioactive peptides in knock-in mice model of CTSA(S190A) . We investigated the level of bradykinin, substances P, oxytocin, angiotensin I and endothelin-I in the kidney, liver, lung, brain and serum from CTSA(S190A) mouse model at 3- and 6-months of age. Our results suggest CTSA selectively contributes to processing of bioactive peptides in different tissues from CTSA(S190A) mice compared to age matched WT mice.

July 7, 2019

Genome sequence of Arenibacter algicola strain TG409, a hydrocarbon-degrading bacterium associated with marine eukaryotic phytoplankton.

Arenibacter algicola strain TG409 was isolated from Skeletonema costatum and exhibits the ability to utilize polycyclic aromatic hydrocarbons as sole sources of carbon and energy. Here, we present the genome sequence of this strain, which is 5,550,230 bp with 4,722 genes and an average G+C content of 39.7%. Copyright © 2016 Gutierrez et al.

July 7, 2019

Complete genome sequence of a cylindrospermopsin-producing cyanobacterium, Cylindrospermopsis raciborskii CS505, containing a circular chromosome and a single extrachromosomal element.

Cylindrospermopsis raciborskii is a freshwater cyanobacterium producing bloom events and toxicity in drinking water source reservoirs. We present the first genome sequence for C. raciborskii CS505 (Australia), containing one 4.1-Mbp chromosome and one 110-Kbp plasmid having G+C contents of 40.3% (3933 genes) and 39.3% (111 genes), respectively. Copyright © 2016 Fuentes-Valdés et al.

Auto Tag: Bioinformatics

Complete genome sequence of the D-amino acid catabolism bacterium Phaeobacter sp. strain JL2886, isolated from deep seawater of the South China Sea

Privacy-preserving read mapping using locality sensitive hashing and secure kmer voting

Genomic and transcriptomic analyses of the tangerine pathotype of Alternaria alternata in response to oxidative stress.

Building two indica rice reference genomes with PacBio long-read and Illumina paired-end sequencing data.

The genome sequence of allopolyploid Brassica juncea and analysis of differential homoeolog gene expression influencing selection.

Strategies for sequence assembly of plant genomes

Development of Streptomyces sp. FR-008 as an emerging chassis

Emergence of epidemic Neisseria meningitidis serogroup C in Niger, 2015: an analysis of national surveillance data.

Complete genome sequences of the historical Legionella pneumophila strains OLDA and Pontiac.

Complete genome sequence of the proteorhodopsin-containing marine flavobacterium Dokdonia donghaensis DSW-1T, isolated from seawater off Dokdo in the East Sea (Sea of Korea).

Complete genome sequence of the intracellular bacterial symbiont TC1 in the anaerobic ciliate Trimyema compressum.

Draft genome sequence of Escherichia coli S51, a chicken isolate harboring a chromosomally encoded mcr-1 gene.

Lysosomal Cathepsin A plays a significant role in the processing of endogenous bioactive peptides.

Genome sequence of Arenibacter algicola strain TG409, a hydrocarbon-degrading bacterium associated with marine eukaryotic phytoplankton.

Complete genome sequence of a cylindrospermopsin-producing cyanobacterium, Cylindrospermopsis raciborskii CS505, containing a circular chromosome and a single extrachromosomal element.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert