Menu
July 7, 2019

Best practices in insect genome sequencing: What works and what doesn’t.

The last decade of decreasing DNA sequencing costs and proliferating sequencing services in core labs and companies has brought the de-novo genome sequencing and assembly of insect species within reach for many entomologists. However, sequence production alone is not enough to generate a high quality reference genome, and in many cases, poor planning can lead to extremely fragmented genome assemblies preventing high quality gene annotation and other desired analyses. Insect genomes can be problematic to assemble, due to combinations of high polymorphism, inability to breed for genome homozygocity, and small physical sizes limiting the quantity of DNA able to be isolated from a single individual. Recent advances in sequencing technology and assembly strategies are enabling a revolution for insect genome reference sequencing and assembly. Here we review historical and new genome sequencing and assembly strategies, with a particular focus on their application to arthropod genomes. We highlight both the need to design sequencing strategies for the requirements of the assembly software, and new long-read technologies that are enabling a return to traditional assembly approaches. Finally, we compare and contrast very cost effective short read draft genome strategies with the long read approaches that although entailing additional cost, bring a higher likelihood of success and the possibility of archival assembly qualities approaching that of finished genomes.


July 7, 2019

Active site and laminarin binding in glycoside hydrolase family 55.

The Carbohydrate Active Enzyme (CAZy) database indicates that glycoside hydrolase family 55 (GH55) contains both endo- and exo-ß-1,3-glucanases. The founding structure in the GH55 is PcLam55A from the white rot fungus Phanerochaete chrysosporium (Ishida, T., Fushinobu, S., Kawai, R., Kitaoka, M., Igarashi, K., and Samejima, M. (2009) Crystal structure of glycoside hydrolase family 55 ß-1,3-glucanase from the basidiomycete Phanerochaete chrysosporium. J. Biol. Chem. 284, 10100-10109). Here, we present high resolution crystal structures of bacterial SacteLam55A from the highly cellulolytic Streptomyces sp. SirexAA-E with bound substrates and product. These structures, along with mutagenesis and kinetic studies, implicate Glu-502 as the catalytic acid (as proposed earlier for Glu-663 in PcLam55A) and a proton relay network of four residues in activating water as the nucleophile. Further, a set of conserved aromatic residues that define the active site apparently enforce an exo-glucanase reactivity as demonstrated by exhaustive hydrolysis reactions with purified laminarioligosaccharides. Two additional aromatic residues that line the substrate-binding channel show substrate-dependent conformational flexibility that may promote processive reactivity of the bound oligosaccharide in the bacterial enzymes. Gene synthesis carried out on ~30% of the GH55 family gave 34 active enzymes (19% functional coverage of the nonredundant members of GH55). These active enzymes reacted with only laminarin from a panel of 10 different soluble and insoluble polysaccharides and displayed a broad range of specific activities and optima for pH and temperature. Application of this experimental method provides a new, systematic way to annotate glycoside hydrolase phylogenetic space for functional properties.© 2015 by The American Society for Biochemistry and Molecular Biology, Inc.


July 7, 2019

Complete genome sequence of oxalate-degrading bacterium Pandoraea vervacti DSM 23571(T).

Pandoraea vervacti DSM 23571(T) is an oxalate metabolizing bacterium isolated from an uncultivated field soil in Mugla, Turkey. Here, we present the first complete genome sequence of P. vervacti DSM 23571(T). A complete pathway for degradation of oxalate was revealed from the genome analysis. These data are important to path new opportunities for genetic engineering in the field of biotechnology. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of biocontrol strain Pseudomonas fluorescens LBUM223.

Pseudomonas fluorescens LBUM223 is a plant growth-promoting rhizobacterium (PGPR) with biocontrol activity against various plant pathogens. It produces the antimicrobial metabolite phenazine-1-carboxylic acid, which is involved in the biocontrol of Streptomyces scabies, the causal agent of common scab of potato. Here, we report the complete genome sequence of P. fluorescens LBUM223. Copyright © 2015 Roquigny et al.


July 7, 2019

Genome resequencing of the virulent and multidrug-resistant reference strain Clostridium difficile 630.

We resequenced the complete genome of the virulent and multidrug-resistant pathogen Clostridium difficile strain 630. A combination of single-molecule real-time and Illumina sequencing technology revealed the presence of an additional rRNA gene cluster, additional tRNAs, and the absence of a transposon in comparison to the published and reannotated genome sequence. Copyright © 2015 Riedel et al.


July 7, 2019

Complete genome sequence of Haloarcula sp. CBA1115 isolated from non-purified solar salts.

Haloarcula sp. CBA1115, isolated from non-purified solar salts from South Korea, is a halophilic archaeon belonging to the family Halobacteriaceae. Here, we present the complete genome sequence of the strain Haloarcula sp. CBA1115 (4,225,046bp, with a G+C content of 61.98%), which is distributed over one chromosome and five plasmids. A comparison of the genome sequence of Haloarcula sp. CBA1115 with those of members of its closely related taxa showed that the closest neighbor is Haloarcula hispanica Y27, a popular model organism for archaeal studies. The strain was found to possess a number of genes predicted to be involved in osmo-regulatory strategies and metal regulation, suggesting that it might be useful for bioremediation in extreme environments. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Development of an orthogonal fatty acid biosynthesis system in E. coli for oleochemical production.

Here we report recombinant expression and activity of several type I fatty acid synthases that can function in parallel with the native Escherichia coli fatty acid synthase. Corynebacterium glutamicum FAS1A was the most active in E. coli and this fatty acid synthase was leveraged to produce oleochemicals including fatty alcohols and methyl ketones. Coexpression of FAS1A with the ACP/CoA-reductase Maqu2220 from Marinobacter aquaeolei shifted the chain length distribution of fatty alcohols produced. Coexpression of FAS1A with FadM, FadB, and an acyl-CoA-oxidase from Micrococcus luteus resulted in the production of methyl ketones, although at a lower level than cells using the native FAS. This work, to our knowledge, is the first example of in vivo function of a heterologous fatty acid synthase in E. coli. Using FAS1 enzymes for oleochemical production have several potential advantages, and further optimization of this system could lead to strains with more efficient conversion to desired products. Finally, functional expression of these large enzyme complexes in E. coli will enable their study without culturing the native organisms. Published by Elsevier Inc.


July 7, 2019

Complete genome sequence of Serratia multitudinisentens RB-25(T), a novel chitinolytic bacterium.

Serratia multitudinisentens RB-25(T) (=DSM 28811(T) =LMG 28304(T)) is a newly proposed type strain in the genus of Serratia isolated from a municipal landfill site. Here, we present the complete genome of S. multitudinisentens RB-25(T) which contains a complete chitinase operon and other chitin and N-acetylglucosamine utilisation enzymes. To our knowledge, this is the first report of the complete genome sequence of this novel isolate and its chitinase gene discovery. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Genome sequence of Penicillium capsulatum strain ATCC 48735, a rare Penicillium species used in paper manufactories but that recently caused invasive infection.

The genus Penicillium phylogenetically belongs to Trichocomaceae, with approximately 300 reported species. The majority of these species are saprobic and commonly occur in soil. This paper reports the genome sequence of Penicillium capsulatum strain ATCC 48735, a rare Penicillium species used in paper manufactories and that was recently reported as a human-invasive opportunist. Copyright © 2015 Yang et al.


July 7, 2019

Symbiosis island shuffling with abundant insertion sequences in the genomes of extra-slow-growing strains of soybean bradyrhizobia.

Extra-slow-growing bradyrhizobia from root nodules of field-grown soybeans harbor abundant insertion sequences (ISs) and are termed highly reiterated sequence-possessing (HRS) strains. We analyzed the genome organization of HRS strains with the focus on IS distribution and symbiosis island structure. Using pulsed-field gel electrophoresis, we consistently detected several plasmids (0.07 to 0.4 Mb) in the HRS strains (NK5, NK6, USDA135, 2281, USDA123, and T2), whereas no plasmids were detected in the non-HRS strain USDA110. The chromosomes of the six HRS strains (9.7 to 10.7 Mb) were larger than that of USDA110 (9.1 Mb). Using MiSeq sequences of 6 HRS and 17 non-HRS strains mapped to the USDA110 genome, we found that the copy numbers of ISRj1, ISRj2, ISFK1, IS1632, ISB27, ISBj8, and IS1631 were markedly higher in HRS strains. Whole-genome sequencing showed that the HRS strain NK6 had four small plasmids (136 to 212 kb) and a large chromosome (9,780 kb). Strong colinearity was found between 7.4-Mb core regions of the NK6 and USDA110 chromosomes. USDA110 symbiosis islands corresponded mainly to five small regions (S1 to S5) within two variable regions, V1 (0.8 Mb) and V2 (1.6 Mb), of the NK6 chromosome. The USDA110 nif gene cluster (nifDKENXSBZHQW-fixBCX) was split into two regions, S2 and S3, where ISRj1-mediated rearrangement occurred between nifS and nifB. ISs were also scattered in NK6 core regions, and ISRj1 insertion often disrupted some genes important for survival and environmental responses. These results suggest that HRS strains of soybean bradyrhizobia were subjected to IS-mediated symbiosis island shuffling and core genome degradation. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Complete genome sequence of Actinobacillus equuli subspecies equuli ATCC 19392(T).

Actinobacillus equuli subsp. equuli is a member of the family Pasteurellaceae that is a common resident of the oral cavity and alimentary tract of healthy horses. At the same time, it can also cause a fatal septicemia in foals, commonly known as sleepy foal disease or joint ill disease. In addition, A. equuli subsp. equuli has recently been reported to act as a primary pathogen in breeding sows and piglets. To better understand how A. equuli subsp. equuli can cause disease, the genome of the type strain of A. equuli subsp. equuli, ATCC 19392(T), was sequenced using the PacBio RS II sequencing system. Its genome is comprised of 2,431,533 bp and is predicted to encode 2,264 proteins and 82 RNAs.


July 7, 2019

The Streptomyces leeuwenhoekii genome: de novo sequencing and assembly in single contigs of the chromosome, circular plasmid pSLE1 and linear plasmid pSLE2.

Next Generation DNA Sequencing (NGS) and genome mining of actinomycetes and other microorganisms is currently one of the most promising strategies for the discovery of novel bioactive natural products, potentially revealing novel chemistry and enzymology involved in their biosynthesis. This approach also allows rapid insights into the biosynthetic potential of microorganisms isolated from unexploited habitats and ecosystems, which in many cases may prove difficult to culture and manipulate in the laboratory. Streptomyces leeuwenhoekii (formerly Streptomyces sp. strain C34) was isolated from the hyper-arid high-altitude Atacama Desert in Chile and shown to produce novel polyketide antibiotics.Here we present the de novo sequencing of the S. leeuwenhoekii linear chromosome (8 Mb) and two extrachromosomal replicons, the circular pSLE1 (86 kb) and the linear pSLE2 (132 kb), all in single contigs, obtained by combining Pacific Biosciences SMRT (PacBio) and Illumina MiSeq technologies. We identified the biosynthetic gene clusters for chaxamycin, chaxalactin, hygromycin A and desferrioxamine E, metabolites all previously shown to be produced by this strain (J Nat Prod, 2011, 74:1965) and an additional 31 putative gene clusters for specialised metabolites. As well as gene clusters for polyketides and non-ribosomal peptides, we also identified three gene clusters encoding novel lasso-peptides.The S. leeuwenhoekii genome contains 35 gene clusters apparently encoding the biosynthesis of specialised metabolites, most of them completely novel and uncharacterised. This project has served to evaluate the current state of NGS for efficient and effective genome mining of high GC actinomycetes. The PacBio technology now permits the assembly of actinomycete replicons into single contigs with >99 % accuracy. The assembled Illumina sequence permitted not only the correction of omissions found in GC homopolymers in the PacBio assembly (exacerbated by the high GC content of actinomycete DNA) but it also allowed us to obtain the sequences of the termini of the chromosome and of a linear plasmid that were not assembled by PacBio. We propose an experimental pipeline that uses the Illumina assembled contigs, in addition to just the reads, to complement the current limitations of the PacBio sequencing technology and assembly software.


July 7, 2019

Complete genome sequence of Pragia fontium 24613, an environmental bacterium from the family Enterobacteriaceae.

The complete genome sequence of Pragia fontium 24613 was determined using PacBio RSII, Roche 454, and SOLiD sequencing. A total of 3,579 genes were predicted, including 3,338 protein-coding sequences and 146 pseudogenes. This is the first whole-genome sequence of a strain belonging to the environmental genera of the family Enterobacteriaceae. Copyright © 2015 Snopková et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.