Genome annotation Archives - Page 82 of 149

July 7, 2019

Draft genome sequences of Escherichia coli strains isolated from septic patients.

We present the draft genome sequences of six strains of Escherichia coli isolated from blood cultures collected from patients with sepsis. The strains were collected from two patient sets, those with a high severity of illness, and those with a low severity of illness. Each genome was sequenced by both Illumina and PacBio for comparison. Copyright © 2014 Dunitz et al.

July 7, 2019

Complete genome sequence of the larval shellfish pathogen Vibrio tubiashii type strain ATCC 19109.

Vibrio tubiashii is a larval shellfish pathogen. Here, we report the first closed genome sequence for this species (ATCC type strain 19109), which consists of two chromosomes (3,294,490 and 1,766,582 bp), two megaplasmids (251,408 and 122,808 bp), and two plasmids (57,076 and 47,973 bp). Copyright © 2014 Richards et al.

July 7, 2019

Complete genome sequence for the shellfish pathogen Vibrio coralliilyticus RE98 isolated from a shellfish hatchery.

Vibrio coralliilyticus is a pathogen of corals and larval shellfish. Publications on strain RE98 list it as a Vibrio tubiashii; however, whole genome sequencing confirms RE98 as V. coralliilyticus containing a total of 6,037,824 bp consisting of two chromosomes (3,420,228 and 1,917,482 bp) and two megaplasmids (380,714 and 319,400 bp). Copyright © 2014 Richards et al.

July 7, 2019

Complete genome sequence of the lignin-degrading bacterium Klebsiella sp. strain BRL6-2.

In an effort to discover anaerobic bacteria capable of lignin degradation, we isolated Klebsiella sp. strain BRL6-2 on minimal media with alkali lignin as the sole carbon source. This organism was isolated anaerobically from tropical forest soils collected from the Bisley watershed at the Ridge site in the El Yunque National Forest in Puerto Rico, USA, part of the Luquillo Long-Term Ecological Research Station. At this site, the soils experience strong fluctuations in redox potential and are characterized by cycles of iron oxidation and reduction. Genome sequencing was targeted because of its ability to grow on lignin anaerobically and lignocellulolytic activity via in vitro enzyme assays. The genome of Klebsiella sp. strain BRL6-2 is 5.80 Mbp with no detected plasmids, and includes a relatively small arsenal of genes encoding lignocellulolytic carbohydrate active enzymes. The genome revealed four putative peroxidases including glutathione and DyP-type peroxidases, and a complete protocatechuate pathway encoded in a single gene cluster. Physiological studies revealed Klebsiella sp. strain BRL6-2 to be relatively stress tolerant to high ionic strength conditions. It grows in increasing concentrations of ionic liquid (1-ethyl-3-methyl-imidazolium acetate) up to 73.44 mM and NaCl up to 1.5 M.

July 7, 2019

Genome sequence of the dark pink pigmented Listia bainesii microsymbiont Methylobacterium sp. WSM2598.

Strains of a pink-pigmented Methylobacterium sp. are effective nitrogen- (N2) fixing microsymbionts of species of the African crotalarioid genus Listia. Strain WSM2598 is an aerobic, motile, Gram-negative, non-spore-forming rod isolated in 2002 from a Listia bainesii root nodule collected at Estcourt Research Station in South Africa. Here we describe the features of Methylobacterium sp. WSM2598, together with information and annotation of a high-quality draft genome sequence. The 7,669,765 bp draft genome is arranged in 5 scaffolds of 83 contigs, contains 7,236 protein-coding genes and 18 RNA-only encoding genes. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 G enomic E ncyclopedia for B acteria and A rchaea- R oot N odule B acteria (GEBA-RNB) project.

July 7, 2019

Quality scores for 32,000 genomes.

More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). We have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences.Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes.The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.

July 7, 2019

Complete genome determination and analysis of Acholeplasma oculi strain 19L, highlighting the loss of basic genetic features in the Acholeplasmataceae.

BACKGROUND: Acholeplasma oculi belongs to the Acholeplasmataceae family, comprising the genera Acholeplasma and ‘Candidatus Phytoplasma’. Acholeplasmas are ubiquitous saprophytic bacteria. Several isolates are derived from plants or animals, whereas phytoplasmas are characterised as intracellular parasitic pathogens of plant phloem and depend on insect vectors for their spread. The complete genome sequences for eight strains of this family have been resolved so far, all of which were determined depending on clone-based sequencing. RESULTS:The A. oculi strain 19L chromosome was sequenced using two independent approaches. The first approach comprised sequencing by synthesis (Illumina) in combination with Sanger sequencing, while single molecule real time sequencing (PacBio) was used in the second. The genome was determined to be 1,587,120bp in size. Sequencing by synthesis resulted in six large genome fragments, while the single molecule real time sequencing approach yielded one circular chromosome sequence. High-quality sequences were obtained by both strategies differing in six positions, which are interpreted as reliable variations present in the culture population. Our genome analysis revealed 1,471 protein-coding genes and highlighted the absence of the F1FO-type Na+ ATPase system and GroEL/ES chaperone. Comparison of the four available Acholeplasma sequences revealed a core-genome encoding 703 proteins and a pan-genome of 2,867 proteins. CONCLUSIONS:The application of two state-of-the-art sequencing technologies highlights the potential of single molecule real time sequencing for complete genome determination. Comparative genome analyses revealed that the process of losing particular basic genetic features during genome reduction occurs in both genera, as indicated for several phytoplasma strains and at least A. oculi. The loss of the F1FO-type Na+ ATPase system may separate Acholeplasmataceae from other Mollicutes, while the loss of those genes encoding the chaperone GroEL/ES is not a rare exception in this bacterial class.

July 7, 2019

Global phylogenomic analysis of nonencapsulated Streptococcus pneumoniae reveals a deep-branching classic lineage that is distinct from multiple sporadic lineages.

The surrounding capsule of Streptococcus pneumoniae has been identified as a major virulence factor and is targeted by pneumococcal conjugate vaccines (PCV). However, nonencapsulated S. pneumoniae (non-Ec-Sp) have also been isolated globally, mainly in carriage studies. It is unknown if non-Ec-Sp evolve sporadically, if they have high antibiotic nonsusceptiblity rates and a unique, specific gene content. Here, whole-genome sequencing of 131 non-Ec-Sp isolates sourced from 17 different locations around the world was performed. Results revealed a deep-branching classic lineage that is distinct from multiple sporadic lineages. The sporadic lineages clustered with a previously sequenced, global collection of encapsulated S. pneumoniae (Ec-Sp) isolates while the classic lineage is comprised mainly of the frequently identified multilocus sequences types (STs) ST344 (n = 39) and ST448 (n = 40). All ST344 and nine ST448 isolates had high nonsusceptiblity rates to ß-lactams and other antimicrobials. Analysis of the accessory genome reveals that the classic non-Ec-Sp contained an increased number of mobile elements, than Ec-Sp and sporadic non-Ec-Sp. Performing adherence assays to human epithelial cells for selected classic and sporadic non-Ec-Sp revealed that the presence of a integrative conjugative element (ICE) results in increased adherence to human epithelial cells (P = 0.005). In contrast, sporadic non-Ec-Sp lacking the ICE had greater growth in vitro possibly resulting in improved fitness. In conclusion, non-Ec-Sp isolates from the classic lineage have evolved separately. They have spread globally, are well adapted to nasopharyngeal carriage and are able to coexist with Ec-Sp. Due to continued use of PCV, non-Ec-Sp may become more prevalent. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

July 7, 2019

Complete genome sequence of Bifidobacterium longum 105-A, a strain with high transformation efficiency.

Bifidobacterium longum 105-A shows high transformation efficiency and allows for the generation of gene knockout mutants through homologous recombination. Here, we report the complete genome sequence of strain 105-A. Genes encoding at least four putative restriction-modification systems were found in this genome, which might contribute to its transformation efficiency. Copyright © 2014 Kanesaki et al.

July 7, 2019

Potential impact on kidney infection: a whole-genome analysis of Leptospira santarosai serovar Shermani.

Leptospira santarosai serovar Shermani is the most frequently encountered serovar, and it causes leptospirosis and tubulointerstitial nephritis in Taiwan. This study aims to complete the genome sequence of L. santarosai serovar Shermani and analyze the transcriptional responses of L. santarosai serovar Shermani to renal tubular cells. To assemble this highly repetitive genome, we combined reads that were generated from four next-generation sequencing platforms by using hybrid assembly approaches to finish two-chromosome contiguous sequences without gaps by validating the data with optical restriction maps and Sanger sequencing. Whole-genome comparison studies revealed a 28-kb region containing genes that encode transposases and hypothetical proteins in L. santarosai serovar Shermani, but this region is absent in other pathogenic Leptospira spp. We found that lipoprotein gene expression in both L. santarosai serovar Shermani and L. interrogans serovar Copenhageni were upregulated upon interaction with renal tubular cells, and LSS19962, a L. santarosai serovar Shermani-specific gene within a 28-kb region that encodes hypothetical proteins, was upregulated in L. santarosai serovar Shermani-infected renal tubular cells. Lipoprotein expression during leptospiral infection might facilitate the interactions of leptospires within kidneys. The availability of the whole-genome sequence of L. santarosai serovar Shermani would make it the first completed sequence of this species, and its comparison with that of other Leptospira spp. may provide invaluable information for further studies in leptospiral pathogenesis.

July 7, 2019

Complete genome sequences of Bordetella pertussis isolates B1917 and B1920, representing two predominant global lineages.

Bordetella pertussis is the causative agent of pertussis, a disease which has resurged despite vaccination. We report the complete, annotated genomes of isolates B1917 and B1920, representing two lineages predominating globally in the last 50 years. The B1917 lineage has been associated with the resurgence of pertussis in the 1990s. Copyright © 2014 Bart et al.

July 7, 2019

Complete genome sequence of Listeria monocytogenes Lm60, a strain with an enhanced cold adaptation capacity.

July 7, 2019

Comparative genomics of the Campylobacter lari group.

The Campylobacter lari group is a phylogenetic clade within the epsilon subdivision of the Proteobacteria and is part of the thermotolerant Campylobacter spp., a division within the genus that includes the human pathogen Campylobacter jejuni. The C. lari group is currently composed of five species (C. lari, Campylobacter insulaenigrae, Campylobacter volucris, Campylobacter subantarcticus, and Campylobacter peloridis), as well as a group of strains termed the urease-positive thermophilic Campylobacter (UPTC) and other C. lari-like strains. Here we present the complete genome sequences of 11 C. lari group strains, including the five C. lari group species, four UPTC strains, and a lari-like strain isolated in this study. The genome of C. lari subsp. lari strain RM2100 was described previously. Analysis of the C. lari group genomes indicates that this group is highly related at the genome level. Furthermore, these genomes are strongly syntenic with minor rearrangements occurring only in 4 of the 12 genomes studied. The C. lari group can be bifurcated, based on the flagella and flagellar modification genes. Genomic analysis of the UPTC strains indicated that these organisms are variable but highly similar, closely related to but distinct from C. lari. Additionally, the C. lari group contains multiple genes encoding hemagglutination domain proteins, which are either contingency genes or linked to conserved contingency genes. Many of the features identified in strain RM2100, such as major deficiencies in amino acid biosynthesis and energy metabolism, are conserved across all 12 genomes, suggesting that these common features may play a role in the association of the C. lari group with coastal environments and watersheds. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2014. This work is written by US Government employees and is in the public domain in the US.

July 7, 2019

The challenges and importance of structural variation detection in livestock.

Recent studies in humans and other model organisms have demonstrated that structural variants (SVs) comprise a substantial proportion of variation among individuals of each species. Many of these variants have been linked to debilitating diseases in humans, thereby cementing the importance of refining methods for their detection. Despite progress in the field, reliable detection of SVs still remains a problem even for human subjects. Many of the underlying problems that make SVs difficult to detect in humans are amplified in livestock species, whose lower quality genome assemblies and incomplete gene annotation can often give rise to false positive SV discoveries. Regardless of the challenges, SV detection is just as important for livestock researchers as it is for human researchers, given that several productive traits and diseases have been linked to copy number variations (CNVs) in cattle, sheep, and pig. Already, there is evidence that many beneficial SVs have been artificially selected in livestock such as a duplication of the agouti signaling protein gene that causes white coat color in sheep. In this review, we will list current SV and CNV discoveries in livestock and discuss the problems that hinder routine discovery and tracking of these polymorphisms. We will also discuss the impacts of selective breeding on CNV and SV frequencies and mention how SV genotyping could be used in the future to improve genetic selection.

July 7, 2019

Genome sequence of Burkholderia cenocepacia H111, a cystic fibrosis airway isolate.

The Burkholderia cepacia complex (BCC) is a group of related bacterial species that are commonly isolated from environmental samples. Members of the BCC can cause respiratory infections in cystic fibrosis patients and immunocompromised individuals. We report here the genome sequence of Burkholderia cenocepacia H111, a well-studied model strain of the BCC.

Auto Tag: Genome annotation

Draft genome sequences of Escherichia coli strains isolated from septic patients.

Complete genome sequence of the larval shellfish pathogen Vibrio tubiashii type strain ATCC 19109.

Complete genome sequence for the shellfish pathogen Vibrio coralliilyticus RE98 isolated from a shellfish hatchery.

Complete genome sequence of the lignin-degrading bacterium Klebsiella sp. strain BRL6-2.

Genome sequence of the dark pink pigmented Listia bainesii microsymbiont Methylobacterium sp. WSM2598.

Quality scores for 32,000 genomes.

Complete genome determination and analysis of Acholeplasma oculi strain 19L, highlighting the loss of basic genetic features in the Acholeplasmataceae.

Global phylogenomic analysis of nonencapsulated Streptococcus pneumoniae reveals a deep-branching classic lineage that is distinct from multiple sporadic lineages.

Complete genome sequence of Bifidobacterium longum 105-A, a strain with high transformation efficiency.

Potential impact on kidney infection: a whole-genome analysis of Leptospira santarosai serovar Shermani.

Complete genome sequences of Bordetella pertussis isolates B1917 and B1920, representing two predominant global lineages.

Complete genome sequence of Listeria monocytogenes Lm60, a strain with an enhanced cold adaptation capacity.

Comparative genomics of the Campylobacter lari group.

The challenges and importance of structural variation detection in livestock.

Genome sequence of Burkholderia cenocepacia H111, a cystic fibrosis airway isolate.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert