July 7, 2019

Genome sequencing and analysis of the first complete genome of Lactobacillus kunkeei strain MP2, an Apis mellifera gut isolate

Background. The honey bee (Apis mellifera) is the most important pollinator in agriculture worldwide. However, the number of honey bees has fallen significantly since 2006, becoming a major ecological problem. The principal cause is Colony Collapse Disorder (CCD), characterized by the seemingly spontaneous abandonment of hives by their workers. One of the characteristics of CCD in honey bees is the alteration of the bacterial communities in their gastrointestinal tract, mainly due to the decrease of Firmicutes populations, such as the Lactobacilli. At this time, the causes of these alterations remain unknown. We recently isolated a strain of Lactobacillus kunkeei (L. kunkeei strain MP2) from the gut of Chilean honey bees. L. kunkeei is one of the bacteria most commonly isolated from the honey bee gut and is highly versatile in different ecological niches. In this study, we aimed to elucidate in detail the L. kunkeei genetic background and perform a comparative genome analysis with other Lactobacillus species. Methods. L. kunkeei MP2 was originally isolated from the guts of Chilean A. mellifera individuals. Genome sequencing was done using Pacific Biosciences single-molecule real-time sequencing technology. De novo assembly was performed using Celera assembler. The genome was annotated using Prokka, and functional information was added using the EggNOG 3.1 database. In addition, genomic islands were predicted using IslandViewer, and prophage sequences using PHAST. Comparisons of L. kunkeei MP2 with other L. kunkeei and Lactobacillus strains were done using Roary. Results. The complete genome of L. kunkeei MP2 comprises one circular chromosome of 1,614,522 nt with a GC content of 36.9%. Pangenome analysis with 16 L. kunkeei strains identified 113 unique genes, most of them related to phage insertions. A large and unique region of the L. kunkeei MP2 genome contains several genes that encode phage structural proteins and replication components. Comparative analysis of MP2 with other Lactobacillus species identified several unique genes of L. kunkeei MP2 related to metabolism, biofilm generation, survival under stress conditions, and mobile genetic elements (MGEs). Discussion. The presence of multiple mobile genetic elements, including phage sequences, suggests a high degree of genetic variability in L. kunkeei. Its versatility and ability to survive in different ecological niches (bee guts, flowers, and fruits, among others) could be explained by its genetic capacity to change and adapt to different environments. L. kunkeei could be a new source of Lactobacillus with beneficial properties. Indeed, L. kunkeei MP2 could play an important role in honey bee nutrition through the synthesis of components such as isoprenoids.


July 7, 2019

A pigeonpea gene confers resistance to Asian soybean rust in soybean.

Asian soybean rust (ASR), caused by the fungus Phakopsora pachyrhizi, is one of the most economically important crop diseases, but is only treatable with fungicides, which are becoming less effective owing to the emergence of fungicide resistance. There are no commercial soybean cultivars with durable resistance to P. pachyrhizi, and although soybean resistance loci have been mapped, no resistance genes have been cloned. We report the cloning of a P. pachyrhizi resistance gene CcRpp1 (Cajanus cajan Resistance against Phakopsora pachyrhizi 1) from pigeonpea (Cajanus cajan) and show that CcRpp1 confers full resistance to P. pachyrhizi in soybean. Our findings show that legume species related to soybean such as pigeonpea, cowpea, common bean and others could provide a valuable and diverse pool of resistance traits for crop improvement.


July 7, 2019

First complete genome sequence of the Dutch veterinary Coxiella burnetii strain NL3262, originating from the largest global Q fever outbreak, and draft genome sequence of its epidemiologically linked chronic human isolate NLhu3345937

The largest global Q fever outbreak occurred in The Netherlands during 2007 to 2010. Goats and sheep were identified as the major sources of disease. Here, we report the first complete genome sequence of Coxiella burnetii goat outbreak strain NL3262 and that of an epidemiologically linked chronic human strain, both having the outbreak-related CbNL01 multilocus variable-number tandem-repeat analysis (MLVA) genotype. Copyright © 2016 Kuley et al.


July 7, 2019

Complete genome sequence of a low-temperature active and alkaline-stable endoglucanase-producing Paenibacillus sp. strain IHB B 3084 from the Indian Trans-Himalayas.

A genome of 5.88 Mb with 46.83% G+C content is reported for an endoglucanase-producing bacterium, Paenibacillus sp. strain IHB B 3084, isolated from the cold environments of the Indian Trans-Himalayas. The psychrotrophic bacterium produces low-temperature active and alkaline-stable endoglucanases of industrial importance. The genomic data provide insight into the genomic basis of cellulase production and the survival of the bacterium in cold environments. Copyright © 2016. Published by Elsevier B.V.
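
The G+C content quoted above is the percentage of G and C bases among the unambiguous bases of the assembled sequence. As a minimal sketch of that arithmetic (the function name `gc_percent` and the choice to skip ambiguous bases such as N are assumptions made for illustration, not the authors' method):

```python
def gc_percent(sequence):
    """Return the G+C percentage of a DNA sequence.

    Assumption: only unambiguous bases (A, C, G, T) are counted in the
    denominator; ambiguity codes such as N are skipped.
    """
    seq = sequence.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / counted


if __name__ == "__main__":
    # Toy sequence, not real genome data.
    print(round(gc_percent("ATGCGCNNAT"), 2))
```

For a multi-contig assembly, the same calculation would be applied to the concatenation of all contigs, or per replicon when a chromosome and plasmids are reported separately.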


July 7, 2019

Variable presence of the inverted repeat and plastome stability in Erodium.

Several unrelated lineages such as plastids, viruses and plasmids, have converged on quadripartite genomes of similar size with large and small single copy regions and a large inverted repeat (IR). Except for Erodium (Geraniaceae), saguaro cactus and some legumes, the plastomes of all photosynthetic angiosperms display this structure. The functional significance of the IR is not understood and Erodium provides a system to examine the role of the IR in the long-term stability of these genomes. We compared the degree of genomic rearrangement in plastomes of Erodium that differ in the presence and absence of the IR. We sequenced 17 new Erodium plastomes. Using 454, Illumina, PacBio and Sanger sequences, 16 genomes were assembled and categorized along with one incomplete and two previously published Erodium plastomes. We conducted phylogenetic analyses among these species using a dataset of 19 protein-coding genes and determined if significantly higher evolutionary rates had caused the long branch seen previously in phylogenetic reconstructions within the genus. Bioinformatic comparisons were also performed to evaluate plastome evolution across the genus. Erodium plastomes fell into four types (Type 1-4) that differ in their substitution rates, short dispersed repeat content and degree of genomic rearrangement, gene and intron content and GC content. Type 4 plastomes had significantly higher rates of synonymous substitutions (dS) for all genes and for 14 of the 19 genes non-synonymous substitutions (dN) were significantly accelerated. We evaluated the evidence for a single IR loss in Erodium and in doing so discovered that Type 4 plastomes contain a novel IR. The presence or absence of the IR does not affect plastome stability in Erodium. Rather, the overall repeat content shows a negative correlation with genome stability, a pattern in agreement with other angiosperm groups and recent findings on genome stability in bacterial endosymbionts. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 7, 2019

Prediction of putative resistance islands in a carbapenem-resistant Acinetobacter baumannii global clone 2 clinical isolate.

We investigated the whole genome sequence (WGS) of a carbapenem-resistant Acinetobacter baumannii isolate belonging to the global clone 2 (GC2) and predicted resistance islands using a software tool. A. baumannii strain YU-R612 was isolated from the sputum of a 61-yr-old man with sepsis. The WGS of the YU-R612 strain was obtained by using the PacBio RS II Sequencing System (Pacific Biosciences Inc., USA). Antimicrobial resistance genes and resistance islands were analyzed by using ResFinder and Genomic Island Prediction software (GIPSy), respectively. The YU-R612 genome consisted of a circular chromosome (ca. 4,075 kb) and two plasmids (ca. 74 kb and 5 kb). Its sequence type (ST) under the Oxford scheme was ST191, consistent with assignment to GC2. ResFinder analysis showed that YU-R612 possessed the following resistance genes: four β-lactamase genes bla(ADC-30), bla(OXA-66), bla(OXA-23), and bla(TEM-1); armA, aadA1, and aacA4 as aminoglycoside resistance-encoding genes; aac(6′)Ib-cr for fluoroquinolone resistance; msr(E) for macrolide, lincosamide, and streptogramin B resistance; catB8 for phenicol resistance; and sul1 for sulfonamide resistance. By GIPSy analysis, six putative resistance islands (PRIs) were determined on the YU-R612 chromosome. Among them, PRI1 possessed two copies of Tn2009 carrying bla(OXA-23), and PRI5 carried two copies of a class I integron carrying sul1 and armA genes. By prediction of resistance islands in the carbapenem-resistant A. baumannii YU-R612 GC2 strain isolated in Korea, PRIs were detected on the chromosome that possessed Tn2009 and class I integrons. The prediction of resistance islands using software tools was useful for analysis of the WGS.


July 7, 2019

Haemonchus contortus: genome structure, organization and comparative genomics

One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. Copyright © 2016 Elsevier Ltd. All rights reserved.


July 7, 2019

Analysis of the genome sequence of the medicinal plant Salvia miltiorrhiza.

Salvia miltiorrhiza Bunge (Danshen) is a medicinal plant of the Lamiaceae family, and its dried roots have long been used in traditional Chinese medicine with hydrophilic phenolic acids and tanshinones as pharmaceutically active components (Zhang et al., 2014; Xu et al., 2016). The first step of tanshinone biosynthesis is bicyclization of the general diterpene precursor (E,E,E)-geranylgeranyl diphosphate (GGPP) to copalyl diphosphate (CPP) by CPP synthases (CPSs), which is followed by a cyclization or rearrangement reaction catalyzed by kaurene synthase-like enzymes (KSL).


July 7, 2019

Genome sequence and analysis of a stress-tolerant, wild-derived strain of Saccharomyces cerevisiae used in biofuels research

The genome sequences of more than 100 strains of the yeast Saccharomyces cerevisiae have been published. Unfortunately, most of these genome assemblies contain dozens to hundreds of gaps at repetitive sequences, including transposable elements, tRNAs, and subtelomeric regions, which is where novel genes generally reside. Relatively few strains have been chosen for genome sequencing based on their biofuel production potential, leaving an additional knowledge gap. Here, we describe the nearly complete genome sequence of GLBRCY22-3 (Y22-3), a strain of S. cerevisiae derived from the stress-tolerant wild strain NRRL YB-210 and subsequently engineered for xylose metabolism. After benchmarking several genome assembly approaches, we developed a pipeline to integrate Pacific Biosciences (PacBio) and Illumina sequencing data and achieved one of the highest quality genome assemblies for any S. cerevisiae strain. Specifically, the contig N50 is 693 kbp, and the sequences of most chromosomes, the mitochondrial genome, and the 2-micron plasmid are complete. Our annotation predicts 92 genes that are not present in the reference genome of the laboratory strain S288c, over 70% of which were expressed. We predicted functions for 43 of these genes, 28 of which were previously uncharacterized and unnamed. Remarkably, many of these genes are predicted to be involved in stress tolerance and carbon metabolism and are shared with a Brazilian bioethanol production strain, even though the strains differ dramatically at most genetic loci. The Y22-3 genome sequence provides an exceptionally high-quality resource for basic and applied research in bioenergy and genetics. Copyright © 2016 McIlwain et al.


July 7, 2019

An improved genome assembly of Azadirachta indica A. Juss.

Neem (Azadirachta indica A. Juss.), an evergreen tree of the Meliaceae family, is known for its medicinal, cosmetic, pesticidal and insecticidal properties. We had previously sequenced and published the draft genome of the plant, using mainly short read sequencing data. In this report, we present an improved genome assembly generated using additional short reads from Illumina and long reads from Pacific Biosciences SMRT sequencer. We assembled short reads and error corrected long reads using Platanus, an assembler designed to perform well for heterozygous genomes. The updated genome assembly (v2.0) yielded 3- and 3.5-fold increase in N50 and N75, respectively; 2.6-fold decrease in the total number of scaffolds; 1.25-fold increase in the number of valid transcriptome alignments; 13.4-fold less mis-assembly and 1.85-fold increase in the percentage repeat, over the earlier assembly (v1.0). The current assembly also maps better to the genes known to be involved in the terpenoid biosynthesis pathway. Together, the data represents an improved assembly of the A. indica genome. The raw data described in this manuscript are submitted to the NCBI Short Read Archive under the accession numbers SRX1074131, SRX1074132, SRX1074133, and SRX1074134 (SRP013453). Copyright © 2016 Author et al.
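
The N50 and N75 figures above are contiguity statistics: Nx is the length L such that scaffolds of length L or longer together cover at least x% of the total assembly size. A minimal sketch of that calculation follows; the function name `nx` and the toy lengths are illustrative assumptions, not values from the neem assembly.

```python
def nx(lengths, percent):
    """Return the Nx statistic for a list of contig or scaffold lengths.

    Sequences are taken from longest to shortest until their cumulative
    length reaches `percent` of the total; the length of the last sequence
    added is Nx (percent=50 gives N50, percent=75 gives N75).
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    if not 0 < percent <= 100:
        raise ValueError("percent must be in (0, 100]")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point rounding at the threshold.
        if running * 100 >= percent * total:
            return length


if __name__ == "__main__":
    toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]  # hypothetical scaffold lengths
    print(nx(toy_lengths, 50), nx(toy_lengths, 75))
```

Because N75 requires covering a larger share of the assembly, it is never larger than N50 for the same set of scaffolds, which is why the two statistics are usually reported together.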


July 7, 2019

Isolation and complete genome sequence of the thermophilic Geobacillus sp. 12AMOR1 from an Arctic deep-sea hydrothermal vent site.

Members of the genus Geobacillus have been isolated from a wide variety of habitats worldwide and are the subject of targeted enzyme utilization in various industrial applications. Here we report the isolation and complete genome sequence of the thermophilic starch-degrading Geobacillus sp. 12AMOR1. The strain 12AMOR1 was isolated from deep-sea hot sediment at the Jan Mayen hydrothermal vent site. The genome of Geobacillus sp. 12AMOR1 consists of a 3,410,035 bp circular chromosome and a 32,689 bp plasmid with a G+C content of 52 % and 47 %, respectively. The genome comprises 3323 protein-coding genes, 88 tRNA species and 10 rRNA operons. The isolate grows on a suite of sugars, complex polysaccharides and proteinous carbon sources. Accordingly, a variety of genes encoding carbohydrate-active enzymes (CAZy) and peptidases were identified in the genome. Expression, purification and characterization of an enzyme of the glycoside hydrolase family 13 revealed a starch-degrading capacity and high thermal stability with a melting temperature of 76.4 °C. Altogether, the data obtained point to a new isolate from a marine hydrothermal vent with a large bioprospecting potential.


July 7, 2019

The kiwifruit genome

The whole-genome sequence of Actinidia chinensis var. chinensis ‘Hongyang’ was published in 2013 and was presented as the first publicly available Ericales genome sequence. Publication in 2015 of an improved linkage map for A. chinensis and interspecific comparison analyses, coupled with the availability of a second whole-genome sequence of a genotype closely related to ‘Hongyang’, have enabled the kiwifruit research community to improve the existing whole-genome sequence. This chapter describes the original genome sequence and steps towards its improvement.


July 7, 2019

Near-Complete Genome Sequence of Clostridium paradoxum Strain JW-YL-7.

Clostridium paradoxum strain JW-YL-7 is a moderately thermophilic anaerobic alkaliphile isolated from the municipal sewage treatment plant in Athens, GA. We report the near-complete genome sequence of C. paradoxum strain JW-YL-7 obtained by using PacBio DNA sequencing and Pilon for sequence assembly refinement with Illumina data. Copyright © 2016 Lancaster et al.


July 7, 2019

Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes.

This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidence on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop, and its origin and phylogenetic position in the tribe Brassiceae are controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidence that the current form of nine chromosomes in radish was rearranged from the chromosomes of a hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into the evolution of the mesohexaploid genomes in the tribe Brassiceae.


July 7, 2019

OPERA-LG: efficient and exact scaffolding of large, repeat-rich eukaryotic genomes with performance guarantees.

The assembly of large, repeat-rich eukaryotic genomes represents a significant challenge in genomics. While long-read technologies have made the high-quality assembly of small, microbial genomes increasingly feasible, data generation can be expensive for larger genomes. OPERA-LG is a scalable, exact algorithm for the scaffold assembly of large, repeat-rich genomes, out-performing state-of-the-art programs for scaffold correctness and contiguity. It provides a rigorous framework for scaffolding of repetitive sequences and a systematic approach for combining data from different second-generation and third-generation sequencing technologies. OPERA-LG provides an avenue for systematic augmentation and improvement of thousands of existing draft eukaryotic genome assemblies.

