Menu
July 7, 2019

It’s more than stamp collecting: how genome sequencing can unify biological research.

The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, while the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to ‘big science’ survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. Copyright © 2015 Elsevier Ltd. All rights reserved.


July 7, 2019

Genetic stability of pneumococcal isolates during 35 days of human experimental carriage.

Pneumococcal carriage is a reservoir for transmission and a precursor to pneumococcal disease. The experimental human pneumococcal carriage model provides a useful tool to aid vaccine licensure through the measurement of vaccine efficacy against carriage (VEcol). Documentation of the genetic stability of the experimental human pneumococcal carriage model is important to further strengthen confidence in its safety and conclusions, enabling it to further facilitate vaccine licensure through providing evidence of VEcol.229 isolates were sequenced from 10 volunteers in whom experimental human pneumococcal carriage was established, sampled over a period of 35 days. Multiple isolates from within a single volunteer at a single time provided a deep resolution for detecting variation. HiSeq data from the isolates were mapped against a PacBio reference of the inoculum to call variable sites.The observed variation between experimental carriage isolates was minimal with the maximum SNP distance between any isolate and the reference being 3 SNPs.The low-level variation described provides evidence for the stability of the experimental human pneumococcal carriage model over 35 days, which can be reliably and confidently used to measure VEcol and aid future progression of pneumococcal vaccination. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.


July 7, 2019

Complete genome sequence of Spirosoma radiotolerans, a gamma-radiation-resistant bacterium isolated from rice field in South Korea.

Spirosoma radiotolerans is a Gram-negative, short rod-shaped and gamma-radiation-resistant bacterium isolated from rice field in South Korea (GPS; 37°34’30?N, 127°00’30?E). The complete genome of S. radiotolerans consists of a chromosome (7,029,352bp). From the genome sequence database, we have identified the cluster of genes responsible for DNA recovery from ionizing radiation. The key enzymes for the nucleotide excision repair (NER) were investigated and were identified, suggesting that S. radiotolerans DG5A use (NER) pathways for efficient removal of pyrimidine dimers which are the most abundant type of UV-induced damage. Complete genome information enables further studies on the DNA repair mechanisms during the ionizing radiation. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Whole-genome sequence of Serratia liquefaciens HUMV-21, a cytotoxic, auorum-sensing, and biofilm-producing clinical isolate.

A clinical isolate of Serratia liquefaciens (strain HUMV-21) was obtained from a skin ulcer of an adult patient. We report here its complete genome assembly using PacBio single-molecule real-time (SMRT) sequencing, which resulted in a single circular chromosome with 5.3 Mb. About 5,844 protein-coding genes are predicted from this assembly. Copyright © 2015 Lázaro-Díez et al.


July 7, 2019

Complete genome sequence of Microcystis aeruginosa NIES-2549, a bloom-forming cyanobacterium from Lake Kasumigaura, Japan.

Microcystis aeruginosa NIES-2549 is a freshwater bloom-forming cyanobacterium isolated from Lake Kasumigaura, Japan. We report the complete 4.29-Mbp genome sequence of NIES-2549 and its annotation and discuss the genetic diversity of M. aeruginosa strains. This is the third genome sequence of M. aeruginosa isolated from Lake Kasumigaura. Copyright © 2015 Yamaguchi et al.


July 7, 2019

Full genome sequence of Brevibacillus laterosporus strain B9, a biological control strain isolated from Zhejiang, China.

Brevibacillus laterosporus was newly classified from Bacillus laterosporus, which has ability to be used as a biological control agent in crop field. B. laterosporus strain B9 is an aerobic, motile, Gram-positive, spore-forming rod that was isolated from a field of Oryza sativa in Zhejiang, China in 2011. This bacterium has been confirmed to be a strong antagonist against bacterial brown strip of rice caused by Acidovorex avenae subsp. avenae. Here we describe the features of B. laterosporus strain B9, together with the complete genome sequence and its annotation. The 5,272,435bp genome contains 4804 protein-coding genes and 227 RNA-only encoding genes with 2 plasmids. Copyright © 2015. Published by Elsevier B.V.


July 7, 2019

Complete genome sequence of uropathogenic Escherichia coli strain CI5.

Escherichia coli represents the primary etiological agent responsible for urinary tract infections, one of the most common infections in humans. We report here the complete genome sequence of uropathogenic Escherichia coli strain CI5, a clinical pyelonephritis isolate used for studying pathogenesis. Copyright © 2015 Mehershahi et al.


July 7, 2019

GAML: genome assembly by maximum likelihood.

Resolution of repeats and scaffolding of shorter contigs are critical parts of genome assembly. Modern assemblers usually perform such steps by heuristics, often tailored to a particular technology for producing paired or long reads.We propose a new framework that allows systematic combination of diverse sequencing datasets into a single assembly. We achieve this by searching for an assembly with the maximum likelihood in a probabilistic model capturing error rate, insert lengths, and other characteristics of the sequencing technology used to produce each dataset. We have implemented a prototype genome assembler GAML that can use any combination of insert sizes with Illumina or 454 reads, as well as PacBio reads. Our experiments show that we can assemble short genomes with N50 sizes and error rates comparable to ALLPATHS-LG or Cerulean. While ALLPATHS-LG and Cerulean require each a specific combination of datasets, GAML works on any combination.We have introduced a new probabilistic approach to genome assembly and demonstrated that this approach can lead to superior results when used to combine diverse set of datasets from different sequencing technologies. Data and software is available at http://compbio.fmph.uniba.sk/gaml.


July 7, 2019

Complete genome sequence of endophytic nitrogen-fixing Klebsiella variicola strain DX120E.

Klebsiella variicola strain DX120E (=CGMCC 1.14935) is an endophytic nitrogen-fixing bacterium isolated from sugarcane crops grown in Guangxi, China and promotes sugarcane growth. Here we summarize the features of the strain DX120E and describe its complete genome sequence. The genome contains one circular chromosome and two plasmids, and contains 5,718,434 nucleotides with 57.1% GC content, 5,172 protein-coding genes, 25 rRNA genes, 87 tRNA genes, 7 ncRNA genes, 25 pseudo genes, and 2 CRISPR repeats.


July 7, 2019

Symbiosis island shuffling with abundant insertion sequences in the genomes of extra-slow-growing strains of soybean bradyrhizobia.

Extra-slow-growing bradyrhizobia from root nodules of field-grown soybeans harbor abundant insertion sequences (ISs) and are termed highly reiterated sequence-possessing (HRS) strains. We analyzed the genome organization of HRS strains with the focus on IS distribution and symbiosis island structure. Using pulsed-field gel electrophoresis, we consistently detected several plasmids (0.07 to 0.4 Mb) in the HRS strains (NK5, NK6, USDA135, 2281, USDA123, and T2), whereas no plasmids were detected in the non-HRS strain USDA110. The chromosomes of the six HRS strains (9.7 to 10.7 Mb) were larger than that of USDA110 (9.1 Mb). Using MiSeq sequences of 6 HRS and 17 non-HRS strains mapped to the USDA110 genome, we found that the copy numbers of ISRj1, ISRj2, ISFK1, IS1632, ISB27, ISBj8, and IS1631 were markedly higher in HRS strains. Whole-genome sequencing showed that the HRS strain NK6 had four small plasmids (136 to 212 kb) and a large chromosome (9,780 kb). Strong colinearity was found between 7.4-Mb core regions of the NK6 and USDA110 chromosomes. USDA110 symbiosis islands corresponded mainly to five small regions (S1 to S5) within two variable regions, V1 (0.8 Mb) and V2 (1.6 Mb), of the NK6 chromosome. The USDA110 nif gene cluster (nifDKENXSBZHQW-fixBCX) was split into two regions, S2 and S3, where ISRj1-mediated rearrangement occurred between nifS and nifB. ISs were also scattered in NK6 core regions, and ISRj1 insertion often disrupted some genes important for survival and environmental responses. These results suggest that HRS strains of soybean bradyrhizobia were subjected to IS-mediated symbiosis island shuffling and core genome degradation. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Draft genome sequence of Erwinia tracheiphila, an economically important bacterial pathogen of cucurbits.

Erwinia tracheiphila is one of the most economically important pathogens of cucumbers, melons, squashes, pumpkins, and gourds in the northeastern and midwestern United States, yet its molecular pathology remains uninvestigated. Here, we report the first draft genome sequence of an E. tracheiphila strain isolated from an infected wild gourd (Cucurbita pepo subsp. texana) plant. The genome assembly consists of 7 contigs and includes a putative plasmid and at least 20 phage and prophage elements. Copyright © 2015 Shapiro et al.


July 7, 2019

Complete genome of Kangiella geojedonensis KCTC 23420(T), putative evidence for recent genome reduction in marine environments.

Kangiella geojedonensis KCTC 23420(T) is an aerobic, Gram-negative, non-motile, non-spore-forming, rod-shaped bacterium that was isolated from seawater off the southern coast of Korea. We here report the complete genome of K. geojedonensis KCTC 23420(T), which consists of 2,495,242 bp (G+C content of 43.78%) with 2,257 protein-coding genes, 41 tRNAs, 2 rRNA operons. The genome is smaller than the other closely related genomes, indicating that K. geojedonensis has recently experienced reductive evolution. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of Actinobacillus equuli subspecies equuli ATCC 19392(T).

Actinobacillus equuli subsp. equuli is a member of the family Pasteurellaceae that is a common resident of the oral cavity and alimentary tract of healthy horses. At the same time, it can also cause a fatal septicemia in foals, commonly known as sleepy foal disease or joint ill disease. In addition, A. equuli subsp. equuli has recently been reported to act as a primary pathogen in breeding sows and piglets. To better understand how A. equuli subsp. equuli can cause disease, the genome of the type strain of A. equuli subsp. equuli, ATCC 19392(T), was sequenced using the PacBio RS II sequencing system. Its genome is comprised of 2,431,533 bp and is predicted to encode 2,264 proteins and 82 RNAs.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.