Menu
July 7, 2019

A step-by-step guide to assemble a reptilian genome.

Multiple technologies and software are now available facilitating the de novo sequencing and assembly of any vertebrate genome. Yet the quality of most available sequenced genomes is substantially poorer than that of the golden standard in the field: the human genome. Here, we present a step-by-step protocol for the successful sequencing and assembly of a high-quality snake genome that can be applied to any other reptilian or avian species. We combine the great sequencing depth and accuracy of short reads with the use of different insert size libraries for extended scaffolding followed by optical mapping. We show that this procedure improved the corn snake scaffold N50 from 3.7 kbp to 1.4 Mbp, currently making it one of the snake genomes with the longest scaffolds. Short guidelines are also given on the extraction of long DNA molecules from reptilian blood and the necessary modifications in DNA extraction protocols. This chapter is accompanied by a website ( www.reptilomics.org/stepbystep.html ), where we provide links to the suggested software, examples of input and output files, and running parameters.


July 7, 2019

Draft genome sequences from a novel clade of Bacillus cereus sensu lato strains, isolated from the International Space Station.

The draft genome sequences of six Bacillus strains, isolated from the International Space Station and belonging to the Bacillus anthracis-B. cereus-B. thuringiensis group, are presented here. These strains were isolated from the Japanese Experiment Module (one strain), U.S. Harmony Node 2 (three strains), and Russian Segment Zvezda Module (two strains). Copyright © 2017 Venkateswaran et al.


July 7, 2019

Complete genome sequence of Sulfuriferula sp. strain AH1, a sulfur-oxidizing autotroph isolated from weathered mine tailings from the Duluth Complex in Minnesota.

We report the closed and annotated genome sequence of Sulfuriferula sp. strain AH1. Strain AH1 has a 2,877,007-bp chromosome that includes a partial Sox system for inorganic sulfur oxidation and a complete nitrogen fixation pathway. It also has a single 39,138-bp plasmid with genes for arsenic and mercury resistance. Copyright © 2017 Jones et al.


July 7, 2019

Identification and characterization of the novel colonization factor CS30 based on whole genome sequencing in enterotoxigenic Escherichia coli (ETEC).

The ability to colonize the small intestine is essential for enterotoxigenic Escherichia coli (ETEC) to cause diarrhea. Although 22 antigenically different colonization factors (CFs) have been identified and characterized in ETEC at least 30% of clinical ETEC isolates lack known CFs. Ninety-four whole genome sequenced “CF negative” isolates were searched for novel CFs using a reverse genetics approach followed by phenotypic analyses. We identified a novel CF, CS30, encoded by a set of seven genes, csmA-G, related to the human CF operon CS18 and the porcine CF operon 987P (F6). CS30 was shown to be thermo-regulated, expressed at 37?°C, but not at 20?°C, by SDS-page and mass spectrometry analyses as well as electron microscopy imaging. Bacteria expressing CS30 were also shown to bind to differentiated human intestinal Caco-2 cells. The genes encoding CS30 were located on a plasmid (E873p3) together with the genes encoding LT and STp. PCR screening of ETEC isolates revealed that 8.6% (n?=?13) of “CF negative” (n?=?152) and 19.4% (n?=?13) of “CF negative” LT?+?STp (n?=?67) expressing isolates analyzed harbored CS30. Hence, we conclude that CS30 is common among “CF negative” LT?+?STp isolates and is associated with ETEC that cause diarrhea.


July 7, 2019

The Apostasia genome and the evolution of orchids.

Constituting approximately 10% of flowering plant species, orchids (Orchidaceae) display unique flower morphologies, possess an extraordinary diversity in lifestyle, and have successfully colonized almost every habitat on Earth. Here we report the draft genome sequence of Apostasia shenzhenica, a representative of one of two genera that form a sister lineage to the rest of the Orchidaceae, providing a reference for inferring the genome content and structure of the most recent common ancestor of all extant orchids and improving our understanding of their origins and evolution. In addition, we present transcriptome data for representatives of Vanilloideae, Cypripedioideae and Orchidoideae, and novel third-generation genome data for two species of Epidendroideae, covering all five orchid subfamilies. A. shenzhenica shows clear evidence of a whole-genome duplication, which is shared by all orchids and occurred shortly before their divergence. Comparisons between A. shenzhenica and other orchids and angiosperms also permitted the reconstruction of an ancestral orchid gene toolkit. We identify new gene families, gene family expansions and contractions, and changes within MADS-box gene classes, which control a diverse suite of developmental processes, during orchid evolution. This study sheds new light on the genetic mechanisms underpinning key orchid innovations, including the development of the labellum and gynostemium, pollinia, and seeds without endosperm, as well as the evolution of epiphytism; reveals relationships between the Orchidaceae subfamilies; and helps clarify the evolutionary history of orchids within the angiosperms.


July 7, 2019

Genome sequence of an Australian monophasic Salmonella enterica subsp. enterica Typhimurium isolate (TW-Stm6) carrying a large plasmid with multiple antimicrobial resistance genes.

We report the genome sequence of a monophasic Salmonella enterica subsp. enterica Typhimurium strain (TW-Stm6) isolated in Australia that is similar to epidemic multidrug-resistant strains from Europe and elsewhere. This strain carries additional antibiotic and heavy-metal resistance genes on a large (275-kb) IncHI2 plasmid. Copyright © 2017 Dyall-Smith et al.


July 7, 2019

Single-molecule sequencing and Hi-C-based proximity-guided assembly of amaranth (Amaranthus hypochondriacus) chromosomes provide insights into genome evolution.

Amaranth (Amaranthus hypochondriacus) was a food staple among the ancient civilizations of Central and South America that has recently received increased attention due to the high nutritional value of the seeds, with the potential to help alleviate malnutrition and food security concerns, particularly in arid and semiarid regions of the developing world. Here, we present a reference-quality assembly of the amaranth genome which will assist the agronomic development of the species.Utilizing single-molecule, real-time sequencing (Pacific Biosciences) and chromatin interaction mapping (Hi-C) to close assembly gaps and scaffold contigs, respectively, we improved our previously reported Illumina-based assembly to produce a chromosome-scale assembly with a scaffold N50 of 24.4 Mb. The 16 largest scaffolds contain 98% of the assembly and likely represent the haploid chromosomes (n?=?16). To demonstrate the accuracy and utility of this approach, we produced physical and genetic maps and identified candidate genes for the betalain pigmentation pathway. The chromosome-scale assembly facilitated a genome-wide syntenic comparison of amaranth with other Amaranthaceae species, revealing chromosome loss and fusion events in amaranth that explain the reduction from the ancestral haploid chromosome number (n?=?18) for a tetraploid member of the Amaranthaceae.The assembly method reported here minimizes cost by relying primarily on short-read technology and is one of the first reported uses of in vivo Hi-C for assembly of a plant genome. Our analyses implicate chromosome loss and fusion as major evolutionary events in the 2n?=?32 amaranths and clearly establish the homoeologous relationship among most of the subgenome chromosomes, which will facilitate future investigations of intragenomic changes that occurred post polyploidization.


July 7, 2019

Identification of novel conjugative plasmids with multiple copies of fosB that confer high-level fosfomycin resistance to vancomycin-resistant Enterococci.

To further characterize the fosB-carrying plasmids of 19 vancomycin-resistant enterococci, the complete sequences of the fosB- and vanA-containing plasmids of Enterococcus faecium (pEMA120) and E. avium (pEA19081) were obtained by single-molecule, real-time sequencing. We found that these two plasmids are essentially identical (99.99% nucleotide sequence identity), which proved the possibility of interspecies transmission. Comparative analysis of the plasmids revealed that the backbone of pEMA120 is 99% similar to a conjugative fosB-negative E. faecium plasmid, pZB18. There is a traE disrupted in the transfer region of pEMA120, in comparison to pZB18 with an intact traE. The difference of their transfer frequencies between pEMA120 and pZB18 suggests this interruption of traE might affect conjugative transfer. Two copies of the fosB gene linked to a tnpA gene, forming an ISL3-like transposon, were found at separate locations within pEMA120, which had not been reported previously. These two fosB-carrying transposons were confirmed to form circular intermediates by inverse PCR. The hybridization of plasmid DNA digested by BsaI, having restriction site within the fosB sequence, demonstrated that the presence of multiple copies of fosB per plasmid is common. The total copy number of the fosB gene as revealed by qRT-PCR did not correlate with fosfomycin MICs or growth rates at sub-MICs of fosfomycin in different transconjugants. From susceptibility tests, the fosB gene, regardless of the copy number, conferred high fosfomycin MICs that ranged from 16384 to 65536 µg/ml. This first complete nucleotide sequence of a plasmid carrying two copies of fosB in VRE suggests that the fosB gene can transfer to multiple loci of plasmids by the ISL3 family transposase TnpA, possibly in the form of circular intermediates, leading to the dissemination of high fosfomycin resistance in VRE.


July 7, 2019

The mobilome; A major contributor to Escherichia coli stx2-positive O26:H11 strains intra-serotype diversity.

Shiga toxin-producing Escherichia coli of serotype O26:H11/H- constitute a diverse group of strains and several clones with distinct genetic characteristics have been identified and characterized. Whole genome sequencing was performed using Illumina and PacBio technologies on eight stx2-positive O26:H11 strains circulating in France. Comparative analyses of the whole genome of the stx2-positive O26:H11 strains indicate that several clones of EHEC O26:H11 are co-circulating in France. Phylogenetic analysis of the French strains together with stx2-positive and stx-negative E. coli O26:H11 genomes obtained from Genbank indicates the existence of four clonal complexes (SNP-CCs) separated in two distinct lineages, one of which comprises the “new French clone” (SNP-CC1) that appears genetically closely related to stx-negative attaching and effacing E. coli (AEEC) strains. Interestingly, the whole genome SNP (wgSNP) phylogeny is summarized in the cas gene phylogeny, and a simple qPCR assay targeting the CRISPR array specific to SNP-CC1 (SP_O26-E) can distinguish between the two main lineages. The PacBio sequencing allowed a detailed analysis of the mobile genetic elements (MGEs) of the strains. Numerous MGEs were identified in each strain, including a large number of prophages and up to four large plasmids, representing overall 8.7-19.8% of the total genome size. Analysis of the prophage pool of the strains shows a considerable diversity with a complex history of recombination. Each clonal complex (SNP-CC) is characterized by a unique set of plasmids and phages, including stx-prophages, suggesting evolution through separate acquisition events. Overall, the MGEs appear to play a major role in O26:H11 intra-serotype clonal diversification.


July 7, 2019

The complete genome sequence of Streptomyces autolyticus CGMCC 0516, the producer of geldanamycin, autolytimycin, reblastatin and elaiophylin.

Streptomyces autolyticus CGMCC 0516 produces the anti-tumor benzoquinone ansamycins geldanamycin, autolytimycin, and reblastatin and the 16-membered macrodiolide elaiophylin. Here, we report the complete genome sequence of S. autolyticus CGMCC 0516, which consists of a 10,029,028bp linear chromosome and seven circular plasmids. Fifty-seven putative biosynthetic gene clusters for secondary metabolites were found. The geldanamycin, autolytimycin, and reblastatin biosynthetic gene clusters were located on the left arm (2.06-2.15Mb) of the chromosome, and the elaiophylin gene cluster was located on the right arm (9.45-9.53Mb). Twenty-one putative gene clusters with high or moderate similarity to important antibiotic biosynthetic gene clusters were found, including the antitumor agents echoside, bafilomycin, hygrocin, and toxoflavin; the antibacterial/antifungal agents nigericin, skyllamycin, kanamycin, naphthomycin, eco-02301, and bottromycin A2; the immunosuppressants meridamycin and brasilicardin A; the anti-inflammatory agent cyclooctatin; and the acute iron poisoning medication desferrioxamine B. The genome sequence reported here will enable us to study the biosynthetic mechanism of these important antibiotics and will facilitate the discovery of novel secondary metabolites with potential applications to human health. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

The complete genome sequence of the fish pathogen Tenacibaculum maritimum provides insights into virulence mechanisms.

Tenacibaculum maritimum is a devastating bacterial pathogen of wild and farmed marine fish with a broad host range and a worldwide distribution. We report here the complete genome sequence of the T. maritimum type strain NCIMB 2154(T). The genome consists of a 3,435,971-base pair circular chromosome with 2,866 predicted protein-coding genes. Genes encoding the biosynthesis of exopolysaccharides, the type IX secretion system, iron uptake systems, adhesins, hemolysins, proteases, and glycoside hydrolases were identified. They are likely involved in the virulence process including immune escape, invasion, colonization, destruction of host tissues, and nutrient scavenging. Among the predicted virulence factors, type IX secretion-mediated and cell-surface exposed proteins were identified including an atypical sialidase, a sphingomyelinase and a chondroitin AC lyase which activities were demonstrated in vitro.


July 7, 2019

Glycolytic functions are conserved in the genome of the wine yeast Hanseniaspora uvarum, and pyruvate kinase limits its capacity for alcoholic fermentation.

Hanseniaspora uvarum (anamorph Kloeckera apiculata) is a predominant yeast on wine grapes and other fruits and has a strong influence on wine quality, even when Saccharomyces cerevisiae starter cultures are employed. In this work, we sequenced and annotated approximately 93% of the H. uvarum genome. Southern and synteny analyses were employed to construct a map of the seven chromosomes present in a type strain. Comparative determinations of specific enzyme activities within the fermentative pathway in H. uvarum and S. cerevisiae indicated that the reduced capacity of the former yeast for ethanol production is caused primarily by an ~10-fold-lower activity of the key glycolytic enzyme pyruvate kinase. The heterologous expression of the encoding gene, H. uvarumPYK1 (HuPYK1), and two genes encoding the phosphofructokinase subunits, HuPFK1 and HuPFK2, in the respective deletion mutants of S. cerevisiae confirmed their functional homology.IMPORTANCEHanseniaspora uvarum is a predominant yeast species on grapes and other fruits. It contributes significantly to the production of desired as well as unfavorable aroma compounds and thus determines the quality of the final product, especially wine. Despite this obvious importance, knowledge on its genetics is scarce. As a basis for targeted metabolic modifications, here we provide the results of a genomic sequencing approach, including the annotation of 3,010 protein-encoding genes, e.g., those encoding the entire sugar fermentation pathway, key components of stress response signaling pathways, and enzymes catalyzing the production of aroma compounds. Comparative analyses suggest that the low fermentative capacity of H. uvarum compared to that of Saccharomyces cerevisiae can be attributed to low pyruvate kinase activity. The data reported here are expected to aid in establishing H. uvarum as a non-Saccharomyces yeast in starter cultures for wine and cider fermentations. Copyright © 2017 American Society for Microbiology.


July 7, 2019

The cacao Criollo genome v2.0: an improved version of the genome for genetic and functional genomic studies.

Theobroma cacao L., native to the Amazonian basin of South America, is an economically important fruit tree crop for tropical countries as a source of chocolate. The first draft genome of the species, from a Criollo cultivar, was published in 2011. Although a useful resource, some improvements are possible, including identifying misassemblies, reducing the number of scaffolds and gaps, and anchoring un-anchored sequences to the 10 chromosomes.We used a NGS-based approach to significantly improve the assembly of the Belizian Criollo B97-61/B2 genome. We combined four Illumina large insert size mate paired libraries with 52x of Pacific Biosciences long reads to correct misassembled regions and reduced the number of scaffolds. We then used genotyping by sequencing (GBS) methods to increase the proportion of the assembly anchored to chromosomes.The scaffold number decreased from 4,792 in assembly V1 to 554 in V2 while the scaffold N50 size has increased from 0.47 Mb in V1 to 6.5 Mb in V2. A total of 96.7% of the assembly was anchored to the 10 chromosomes compared to 66.8% in the previous version. Unknown sites (Ns) were reduced from 10.8% to 5.7%. In addition, we updated the functional annotations and performed a new RefSeq structural annotation based on RNAseq evidence.Theobroma cacao Criollo genome version 2 will be a valuable resource for the investigation of complex traits at the genomic level and for future comparative genomics and genetics studies in cacao tree. New functional tools and annotations are available on the Cocoa Genome Hub ( http://cocoa-genome-hub.southgreen.fr ).


July 7, 2019

Complete genome sequences of four extensively drug-resistant Pseudomonas aeruginosa strains, isolated from adults with ventilator-associated pneumonia at a tertiary referral hospital in Mexico City.

Four extensively drug-resistant Pseudomonas aeruginosa strains, isolated from patients with pneumonia, were sequenced using PacBio RS-II single-molecule real-time (SMRT) technology. Genome sequence analysis identified great variability among mobile genetic elements, as well as some previously undescribed genomic islands and new variants of class 1 integrons (In1402, In1403, In1404, and In1408). Copyright © 2017 Espinosa-Camacho et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.