Menu
April 21, 2020

A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds.

The ability to generate long sequencing reads and access long-range linkage information is revolutionizing the quality and completeness of genome assemblies. Here we use a hybrid approach that combines data from four genome sequencing and mapping technologies to generate a new genome assembly of the honeybee Apis mellifera. We first generated contigs based on PacBio sequencing libraries, which were then merged with linked-read 10x Chromium data followed by scaffolding using a BioNano optical genome map and a Hi-C chromatin interaction map, complemented by a genetic linkage map.Each of the assembly steps reduced the number of gaps and incorporated a substantial amount of additional sequence into scaffolds. The new assembly (Amel_HAv3) is significantly more contiguous and complete than the previous one (Amel_4.5), based mainly on Sanger sequencing reads. N50 of contigs is 120-fold higher (5.381 Mbp compared to 0.053 Mbp) and we anchor >?98% of the sequence to chromosomes. All of the 16 chromosomes are represented as single scaffolds with an average of three sequence gaps per chromosome. The improvements are largely due to the inclusion of repetitive sequence that was unplaced in previous assemblies. In particular, our assembly is highly contiguous across centromeres and telomeres and includes hundreds of AvaI and AluI repeats associated with these features.The improved assembly will be of utility for refining gene models, studying genome function, mapping functional genetic variation, identification of structural variants, and comparative genomics.


April 21, 2020

Chromosome-level assembly of the water buffalo genome surpasses human and goat genomes in sequence contiguity.

Rapid innovation in sequencing technologies and improvement in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5?kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with centromeres spanned. Although the index animal was partly inbred, 58% of the genome was haplotype-phased by FALCON-Unzip. This new reference genome improves the contig N50 of the previous short-read based buffalo assembly more than a thousand-fold and contains only 383 gaps. It surpasses the human and goat references in sequence contiguity and facilitates the annotation of hard to assemble gene clusters such as the major histocompatibility complex (MHC).


April 21, 2020

Genome and proteome of the chlorophyll f-producing cyanobacterium Halomicronema hongdechloris: adaptative proteomic shifts under different light conditions.

Halomicronema hongdechloris was the first cyanobacterium to be identified that produces chlorophyll (Chl) f. It contains Chl a and uses phycobiliproteins as its major light-harvesting components under white light conditions. However, under far-red light conditions H. hongdechloris produces Chl f and red-shifted phycobiliprotein complexes to absorb and use far-red light. In this study, we report the genomic sequence of H. hongdechloris and use quantitative proteomic approaches to confirm the deduced metabolic pathways as well as metabolic and photosynthetic changes in response to different photo-autotrophic conditions.The whole genome of H. hongdechloris was sequenced using three different technologies and assembled into a single circular scaffold with a genome size of 5,577,845?bp. The assembled genome has 54.6% GC content and encodes 5273 proteins covering 83.5% of the DNA sequence. Using Tandem Mass Tag labelling, the total proteome of H. hongdechloris grown under different light conditions was analyzed. A total of 1816 proteins were identified, with photosynthetic proteins accounting for 24% of the total mass spectral readings, of which 35% are phycobiliproteins. The proteomic data showed that essential cellular metabolic reactions remain unchanged under shifted light conditions. The largest differences in protein content between white and far-red light conditions reflect the changes to photosynthetic complexes, shifting from a standard phycobilisome and Chl a-based light harvesting system under white light, to modified, red-shifted phycobilisomes and Chl f-containing photosystems under far-red light conditions.We demonstrate that essential cellular metabolic reactions under different light conditions remain constant, including most of the enzymes in chlorophyll biosynthesis and photosynthetic carbon fixation. The changed light conditions cause significant changes in the make-up of photosynthetic protein complexes to improve photosynthetic light capture and reaction efficiencies. The integration of the global proteome with the genome sequence highlights that cyanobacterial adaptation strategies are focused on optimizing light capture and utilization, with minimal changes in other metabolic pathways. Our quantitative proteomic approach has enabled a deeper understanding of both the stability and the flexibility of cellular metabolic networks of H. hongdechloris in response to changes in its environment.


April 21, 2020

Comparative genomics reveals structural and functional features specific to the genome of a foodborne Escherichia coli O157:H7.

Escherichia coli O157:H7 (O157) has been linked to numerous foodborne disease outbreaks. The ability to rapidly sequence and analyze genomes is important for understanding epidemiology, virulence, survival, and evolution of outbreak strains. In the current study, we performed comparative genomics to determine structural and functional features of the genome of a foodborne O157 isolate NADC 6564 and infer its evolutionary relationship to other O157 strains.The chromosome of NADC 6564 contained 5466?kb compared to reference strains Sakai (5498?kb) and EDL933 (5547?kb) and shared 41 of its 43 Linear Conserved Blocks (LCB) with the reference strains. However, 18 of 41 LCB had inverse orientation in NADC 6564 compared to the reference strains. NADC 6564 shared 18 of 19 bacteriophages with reference strains except that the chromosomal positioning of some of the phages differed among these strains. The additional phage (P19) of NADC 6564 was located on a 39-kb insertion element (IE) encoding several hypothetical proteins, an integrase, transposases, transcriptional regulators, an adhesin, and a phosphoethanolamine transferase (PEA). The complete homologs of the 39-kb?IE were found in E. coli PCN061 of porcine origin. The IE-encoded PEA showed low homology (32-33%) to four other PEA in NADC 6564 and PEA linked to mobilizable colistin resistance in E. coli but was highly homologous (95%) to a PEA of uropathogenic, avian pathogenic, and enteroaggregative E. coli. NADC 6564 showed slightly higher minimum inhibitory concentration of colistin compared to the reference strains. The 39-kb?IE also contained dndBCDE and dptFGH operons encoding DNA S-modification and a restriction pathway, linked to oxidative stress tolerance and self-defense against foreign DNA, respectively. Evolutionary tree analysis grouped NADC 6564 with lineage I O157 strains.These results indicated that differential phage counts and different chromosomal positioning of many bacteriophages and genomic islands might have resulted in recombination events causing altered chromosomal organization in NADC 6564. Evolutionary analysis grouped NADC 6564 with lineage I strains and suggested its earlier divergence from these strains. The ability to perform S-DNA modification might affect tolerance of NADC 6564 to various stressors.


April 21, 2020

Comparative genomics and pathogenicity potential of members of the Pseudomonas syringae species complex on Prunus spp.

Diseases on Prunus spp. have been associated with a large number of phylogenetically different pathovars and species within the P. syringae species complex. Despite their economic significance, there is a severe lack of genomic information of these pathogens. The high phylogenetic diversity observed within strains causing disease on Prunus spp. in nature, raised the question whether other strains or species within the P. syringae species complex were potentially pathogenic on Prunus spp.To gain insight into the genomic potential of adaptation and virulence in Prunus spp., a total of twelve de novo whole genome sequences of P. syringae pathovars and species found in association with diseases on cherry (sweet, sour and ornamental-cherry) and peach were sequenced. Strains sequenced in this study covered three phylogroups and four clades. These strains were screened in vitro for pathogenicity on Prunus spp. together with additional genome sequenced strains thus covering nine out of thirteen of the currently defined P. syringae phylogroups. Pathogenicity tests revealed that most of the strains caused symptoms in vitro and no obvious link was found between presence of known virulence factors and the observed pathogenicity pattern based on comparative genomics. Non-pathogenic strains were displaying a two to three times higher generation time when grown in rich medium.In this study, the first set of complete genomes of cherry associated P. syringae strains as well as the draft genome of the quarantine peach pathogen P. syringae pv. persicae were generated. The obtained genomic data were matched with phenotypic data in order to determine factors related to pathogenicity to Prunus spp. Results of this study suggest that the inability to cause disease on Prunus spp. in vitro is not the result of host specialization but rather linked to metabolic impairments of individual strains.


April 21, 2020

Metaepigenomic analysis reveals the unexplored diversity of DNA methylation in an environmental prokaryotic community.

DNA methylation plays important roles in prokaryotes, and their genomic landscapes-prokaryotic epigenomes-have recently begun to be disclosed. However, our knowledge of prokaryotic methylation systems is focused on those of culturable microbes, which are rare in nature. Here, we used single-molecule real-time and circular consensus sequencing techniques to reveal the ‘metaepigenomes’ of a microbial community in the largest lake in Japan, Lake Biwa. We reconstructed 19 draft genomes from diverse bacterial and archaeal groups, most of which are yet to be cultured. The analysis of DNA chemical modifications in those genomes revealed 22 methylated motifs, nine of which were novel. We identified methyltransferase genes likely responsible for methylation of the novel motifs, and confirmed the catalytic specificities of four of them via transformation experiments using synthetic genes. Our study highlights metaepigenomics as a powerful approach for identification of the vast unexplored variety of prokaryotic DNA methylation systems in nature.


April 21, 2020

Assignment of virus and antimicrobial resistance genes to microbial hosts in a complex microbial community by combined long-read assembly and proximity ligation.

We describe a method that adds long-read sequencing to a mix of technologies used to assemble a highly complex cattle rumen microbial community, and provide a comparison to short read-based methods. Long-read alignments and Hi-C linkage between contigs support the identification of 188 novel virus-host associations and the determination of phage life cycle states in the rumen microbial community. The long-read assembly also identifies 94 antimicrobial resistance genes, compared to only seven alleles in the short-read assembly. We demonstrate novel techniques that work synergistically to improve characterization of biological features in a highly complex rumen microbial community.


April 21, 2020

Long-read based de novo assembly of low-complexity metagenome samples results in finished genomes and reveals insights into strain diversity and an active phage system.

Complete and contiguous genome assemblies greatly improve the quality of subsequent systems-wide functional profiling studies and the ability to gain novel biological insights. While a de novo genome assembly of an isolated bacterial strain is in most cases straightforward, more informative data about co-existing bacteria as well as synergistic and antagonistic effects can be obtained from a direct analysis of microbial communities. However, the complexity of metagenomic samples represents a major challenge. While third generation sequencing technologies have been suggested to enable finished metagenome-assembled genomes, to our knowledge, the complete genome assembly of all dominant strains in a microbiome sample has not been demonstrated. Natural whey starter cultures (NWCs) are used in cheese production and represent low-complexity microbiomes. Previous studies of Swiss Gruyère and selected Italian hard cheeses, mostly based on amplicon metagenomics, concurred that three species generally pre-dominate: Streptococcus thermophilus, Lactobacillus helveticus and Lactobacillus delbrueckii.Two NWCs from Swiss Gruyère producers were subjected to whole metagenome shotgun sequencing using the Pacific Biosciences Sequel and Illumina MiSeq platforms. In addition, longer Oxford Nanopore Technologies MinION reads had to be generated for one to resolve repeat regions. Thereby, we achieved the complete assembly of all dominant bacterial genomes from these low-complexity NWCs, which was corroborated by a 16S rRNA amplicon survey. Moreover, two distinct L. helveticus strains were successfully co-assembled from the same sample. Besides bacterial chromosomes, we could also assemble several bacterial plasmids and phages and a corresponding prophage. Biologically relevant insights were uncovered by linking the plasmids and phages to their respective host genomes using DNA methylation motifs on the plasmids and by matching prokaryotic CRISPR spacers with the corresponding protospacers on the phages. These results could only be achieved by employing long-read sequencing data able to span intragenomic as well as intergenomic repeats.Here, we demonstrate the feasibility of complete de novo genome assembly of all dominant strains from low-complexity NWCs based on whole metagenomics shotgun sequencing data. This allowed to gain novel biological insights and is a fundamental basis for subsequent systems-wide omics analyses, functional profiling and phenotype to genotype analysis of specific microbial communities.


April 21, 2020

Characterization of an NDM-5 carbapenemase-producing Escherichia coli ST156 isolate from a poultry farm in Zhejiang, China.

The emergence of carbapenem-resistant Enterobacteriaceae strains has posed a severe threat to public health in recent years. The mobile elements carrying the New Delhi metallo-ß-lactqtamase (NDM) gene have been regarded as the major mechanism leading to the rapid increase of carbapenem-resistant Enterobacteriaceae strains isolated from clinics and animals.We describe an NDM-5-producing Escherichia coli strain, ECCRA-119 (sequence type 156 [ST156]), isolated from a poultry farm in Zhejiang, China. ECCRA-119 is a multidrug-resistant (MDR) isolate that exhibited resistance to 27 antimicrobial compounds, including imipenem and meropenem, as detected by antimicrobial susceptibility testing (AST). The complete genome sequence of the ECCRA-119 isolate was also obtained using the PacBio RS II platform. Eleven acquired resistance genes were identified in the chromosome; four were detected in plasmid pTB201, while six were detected in plasmid pTB202. Importantly, the carbapenem-resistant gene blaNDM-5 was detected in the IncX3 plasmid pTB203. In addition, seven virulence genes and one metal-resistance gene were also detected. The results of conjugation experiments and the transfer regions identification indicated that the blaNDM-5-harboring plasmid pTB203 could be transferred between E. coli strains.The results reflected the severe bacterial resistance in a poultry farm in Zhejiang province and increased our understanding of the presence and transmission of the blaNDM-5 gene.


April 21, 2020

A high-quality de novo genome assembly from a single mosquito using PacBio sequencing

A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives, however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.


April 21, 2020

Complete genome sequence of the halophilic PHA-producing bacterium Halomonas sp. SF2003: insights into its biotechnological potential.

A halophilic Gram-negative eubacterium was isolated from the Iroise Sea and identified as an efficient producer of polyhydroxyalkanoates (PHA). The strain, designated SF2003, was found to belong to the Halomonas genus on the basis of 16S rRNA gene sequence similarity. Previous biochemical tests indicated that the Halomonas sp. strain SF2003 is capable of supporting various culture conditions which sometimes can be constraining for marine strains. This versatility could be of great interest for biotechnological applications. Therefore, a complete bacterial genome sequencing and de novo assembly were performed using a PacBio RSII sequencer and Hierarchical Genome Assembly Process software in order to predict Halomonas sp. SF2003 metabolisms, and to identify genes involved in PHA production and stress tolerance. This study demonstrates the complete genome sequence of Halomonas sp. SF2003 which contains a circular 4,36 Mbp chromosome, and replaces the strain in a phylogenetic tree. Genes related to PHA metabolism, carbohydrate metabolism, fatty acid metabolism and stress tolerance were identified and a comparison was made with metabolisms of relative species. Genes annotation highlighted the presence of typical genes involved in PHA biosynthesis such as phaA, phaB and phaC and enabled a preliminary analysis of their organization and characteristics. Several genes of carbohydrates and fatty acid metabolisms were also identified which provided helpful insights into both a better knowledge of the intricacies of PHA biosynthetic pathways and of production purposes. Results show the strong versatility of Halomonas sp. SF2003 to adapt to various temperatures and salinity which can subsequently be exploited for industrial applications such as PHA production.


October 23, 2019

Efficient genome editing of a facultative thermophile using mesophilic spCas9.

Well-developed genetic tools for thermophilic microorganisms are scarce, despite their industrial and scientific relevance. Whereas highly efficient CRISPR/Cas9-based genome editing is on the rise in prokaryotes, it has never been employed in a thermophile. Here, we apply Streptococcus pyogenes Cas9 (spCas9)-based genome editing to a moderate thermophile, i.e., Bacillus smithii, including a gene deletion, gene knockout via insertion of premature stop codons, and gene insertion. We show that spCas9 is inactive in vivo above 42 °C, and we employ the wide temperature growth range of B. smithii as an induction system for spCas9 expression. Homologous recombination with plasmid-borne editing templates is performed at 45-55 °C, when spCas9 is inactive. Subsequent transfer to 37 °C allows for counterselection through production of active spCas9, which introduces lethal double-stranded DNA breaks to the nonedited cells. The developed method takes 4 days with 90, 100, and 20% efficiencies for gene deletion, knockout, and insertion, respectively. The major advantage of our system is the limited requirement for genetic parts: only one plasmid, one selectable marker, and a promoter are needed, and the promoter does not need to be inducible or well-characterized. Hence, it can be easily applied for genome editing purposes in both mesophilic and thermophilic nonmodel organisms with a limited genetic toolbox and ability to grow at, or tolerate, temperatures of 37 and at or above 42 °C.


October 23, 2019

Gene editing and genetic engineering approaches for advanced probiotics: A review.

The applications of probiotics are significant and thus resulted in need of genome analysis of probiotic strains. Various omics methods and systems biology approaches enables us to understand and optimize the metabolic processes. These techniques have increased the researcher’s attention towards gut microbiome and provided a new source for the revelation of uncharacterized biosynthetic pathways which enables novel metabolic engineering approaches. In recent years, the broad and quantitative analysis of modified strains relies on systems biology tools such as in silico design which are commonly used methods for improving strain performance. The genetic manipulation of probiotic microorganisms is crucial for defining their role in intestinal microbiota and exploring their beneficial properties. This review describes an overview of gene editing and systems biology approaches, highlighting the advent of omics methods which allows the study of new routes for studying probiotic bacteria. We have also summarized gene editing tools like TALEN, ZFNs and CRISPR-Cas that edits or cleave the specific target DNA. Furthermore, in this review an overview of proposed design of advanced customized probiotic is also hypothesized to improvise the probiotics.


September 22, 2019

Acidipropionibacterium virtanenii sp. nov., isolated from malted barley.

A Gram-stain-positive, catalase-positive and pleomorphic rod organism was isolated from malted barley in Finland, classified initially by partial 16S rRNA gene sequencing and originally deposited in the VTT Culture Collection as a strain of Propionibacterium acidipropionici (currently Acidipropionibacterium acidipropionici). The subsequent comparison of the whole 16S rRNA gene with other representatives of the genus Acidipropionibacterium revealed that the strain belongs to a novel species, most closely related to Acidipropionibacterium microaerophilum and Acidipropionibacterium acidipropionici, with similarity values of 98.46 and 98.31?%, respectively. The whole genome sequencing using PacBio RS II platform allowed further comparison of the genome with all of the other DNA sequences available for the type strains of the Acidipropionibacterium species. Those comparisons revealed the highest similarity of strain JS278T to A. acidipropionici, which was confirmed by the average nucleotide identity analysis. The genome of strain JS278T is intermediate in size compared to the A. acidipropionici and Acidipropionibacterium jensenii at 3?432?872?bp, the G+C?content is 68.4?mol%. The strain fermented a wide range of carbon sources, and produced propionic acid as the major fermentation product. Besides its poor ability to grow at 37?°C and positive catalase reaction, the observed phenotype was almost indistinguishable from those of A. acidipropionici and A. jensenii. Based on our findings, we conclude that the organism represents a novel member of the genus Acidipropionibacterium, for which we propose the name Acidipropionibacteriumvirtanenii sp. nov. The type strain is JS278T (=VTT E-113202T=DSM 106790T).


September 22, 2019

An environmental bacterial taxon with a large and distinct metabolic repertoire.

Cultivated bacteria such as actinomycetes are a highly useful source of biomedically important natural products. However, such ‘talented’ producers represent only a minute fraction of the entire, mostly uncultivated, prokaryotic diversity. The uncultured majority is generally perceived as a large, untapped resource of new drug candidates, but so far it is unknown whether taxa containing talented bacteria indeed exist. Here we report the single-cell- and metagenomics-based discovery of such producers. Two phylotypes of the candidate genus ‘Entotheonella’ with genomes of greater than 9 megabases and multiple, distinct biosynthetic gene clusters co-inhabit the chemically and microbially rich marine sponge Theonella swinhoei. Almost all bioactive polyketides and peptides known from this animal were attributed to a single phylotype. ‘Entotheonella’ spp. are widely distributed in sponges and belong to an environmental taxon proposed here as candidate phylum ‘Tectomicrobia’. The pronounced bioactivities and chemical uniqueness of ‘Entotheonella’ compounds provide significant opportunities for ecological studies and drug discovery.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.