Menu
July 19, 2019

De novo assembly of two Swedish genomes reveals missing segments from the human GRCh38 reference and improves variant calling of population-scale sequencing data.

The current human reference sequence (GRCh38) is a foundation for large-scale sequencing projects. However, recent studies have suggested that GRCh38 may be incomplete and give a suboptimal representation of specific population groups. Here, we performed a de novo assembly of two Swedish genomes that revealed over 10 Mb of sequences absent from the human GRCh38 reference in each individual. Around 6 Mb of these novel sequences (NS) are shared with a Chinese personal genome. The NS are highly repetitive, have an elevated GC-content, and are primarily located in centromeric or telomeric regions. Up to 1 Mb of NS can be assigned to chromosome Y, and large segments are also missing from GRCh38 at chromosomes 14, 17, and 21. Inclusion of NS into the GRCh38 reference radically improves the alignment and variant calling from short-read whole-genome sequencing data at several genomic loci. A re-analysis of a Swedish population-scale sequencing project yields > 75,000 putative novel single nucleotide variants (SNVs) and removes > 10,000 false positive SNV calls per individual, some of which are located in protein coding regions. Our results highlight that the GRCh38 reference is not yet complete and demonstrate that personal genome assemblies from local populations can improve the analysis of short-read whole-genome sequencing data.


July 7, 2019

Comparative genome analysis of Pseudomonas knackmussii B13, the first bacterium known to degrade chloroaromatic compounds.

Pseudomonas knackmussii B13 was the first strain to be isolated in 1974 that could degrade chlorinated aromatic hydrocarbons. This discovery was the prologue for subsequent characterization of numerous bacterial metabolic pathways, for genetic and biochemical studies, and which spurred ideas for pollutant bioremediation. In this study, we determined the complete genome sequence of B13 using next generation sequencing technologies and optical mapping. Genome annotation indicated that B13 has a variety of metabolic pathways for degrading monoaromatic hydrocarbons including chlorobenzoate, aminophenol, anthranilate and hydroxyquinol, but not polyaromatic compounds. Comparative genome analysis revealed that B13 is closest to Pseudomonas denitrificans and Pseudomonas aeruginosa. The B13 genome contains at least eight genomic islands [prophages and integrative conjugative elements (ICEs)], which were absent in closely related pseudomonads. We confirm that two ICEs are identical copies of the 103?kb self-transmissible element ICEclc that carries the genes for chlorocatechol metabolism. Comparison of ICEclc showed that it is composed of a variable and a ‘core’ region, which is very conserved among proteobacterial genomes, suggesting a widely distributed family of so far uncharacterized ICE. Resequencing of two spontaneous B13 mutants revealed a number of single nucleotide substitutions, as well as excision of a large 220?kb region and a prophage that drastically change the host metabolic capacity and survivability. © 2014 Society for Applied Microbiology and John Wiley & Sons Ltd.


July 7, 2019

Drug resistance analysis by next generation sequencing in Leishmania.

The use of next generation sequencing has the power to expedite the identification of drug resistance determinants and biomarkers and was applied successfully to drug resistance studies in Leishmania. This allowed the identification of modulation in gene expression, gene dosage alterations, changes in chromosome copy numbers and single nucleotide polymorphisms that correlated with resistance in Leishmania strains derived from the laboratory and from the field. An impressive heterogeneity at the population level was also observed, individual clones within populations often differing in both genotypes and phenotypes, hence complicating the elucidation of resistance mechanisms. This review summarizes the most recent highlights that whole genome sequencing brought to our understanding of Leishmania drug resistance and likely new directions.


July 7, 2019

The draft genome of Primula veris yields insights into the molecular basis of heterostyly.

The flowering plant Primula veris is a common spring blooming perennial that is widely cultivated throughout Europe. This species is an established model system in the study of the genetics, evolution, and ecology of heterostylous floral polymorphisms. Despite the long history of research focused on this and related species, the continued development of this system has been restricted due the absence of genomic and transcriptomic resources.We present here a de novo draft genome assembly of P. veris covering 301.8 Mb, or approximately 63% of the estimated 479.22 Mb genome, with an N50 contig size of 9.5 Kb, an N50 scaffold size of 164 Kb, and containing an estimated 19,507 genes. The results of a RADseq bulk segregant analysis allow for the confident identification of four genome scaffolds that are linked to the P. veris S-locus. RNAseq data from both P. veris and the closely related species P. vulgaris allow for the characterization of 113 candidate heterostyly genes that show significant floral morph-specific differential expression. One candidate gene of particular interest is a duplicated GLOBOSA homolog that may be unique to Primula (PveGLO2), and is completely silenced in L-morph flowers.The P. veris genome represents the first genome assembled from a heterostylous species, and thus provides an immensely important resource for future studies focused on the evolution and genetic dissection of heterostyly. As the first genome assembled from the Primulaceae, the P. veris genome will also facilitate the expanded application of phylogenomic methods in this diverse family and the eudicots as a whole.


July 7, 2019

A draft genome of field pennycress (Thlaspi arvense) provides tools for the domestication of a new winter biofuel crop.

Field pennycress (Thlaspi arvense L.) is being domesticated as a new winter cover crop and biofuel species for the Midwestern United States that can be double-cropped between corn and soybeans. A genome sequence will enable the use of new technologies to make improvements in pennycress. To generate a draft genome, a hybrid sequencing approach was used to generate 47 Gb of DNA sequencing reads from both the Illumina and PacBio platforms. These reads were used to assemble 6,768 genomic scaffolds. The draft genome was annotated using the MAKER pipeline, which identified 27,390 predicted protein-coding genes, with almost all of these predicted peptides having significant sequence similarity to Arabidopsis proteins. A comprehensive analysis of pennycress gene homologues involved in glucosinolate biosynthesis, metabolism, and transport pathways revealed high sequence conservation compared with other Brassicaceae species, and helps validate the assembly of the pennycress gene space in this draft genome. Additional comparative genomic analyses indicate that the knowledge gained from years of basic Brassicaceae research will serve as a powerful tool for identifying gene targets whose manipulation can be predicted to result in improvements for pennycress. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

Evolution of novel wood decay mechanisms in Agaricales revealed by the genome sequences of Fistulina hepatica and Cylindrobasidium torrendii.

Wood decay mechanisms in Agaricomycotina have been traditionally separated in two categories termed white and brown rot. Recently the accuracy of such a dichotomy has been questioned. Here, we present the genome sequences of the white-rot fungus Cylindrobasidium torrendii and the brown-rot fungus Fistulina hepatica both members of Agaricales, combining comparative genomics and wood decay experiments. C. torrendii is closely related to the white-rot root pathogen Armillaria mellea, while F. hepatica is related to Schizophyllum commune, which has been reported to cause white rot. Our results suggest that C. torrendii and S. commune are intermediate between white-rot and brown-rot fungi, but at the same time they show characteristics of decay that resembles soft rot. Both species cause weak wood decay and degrade all wood components but leave the middle lamella intact. Their gene content related to lignin degradation is reduced, similar to brown-rot fungi, but both have maintained a rich array of genes related to carbohydrate degradation, similar to white-rot fungi. These characteristics appear to have evolved from white-rot ancestors with stronger ligninolytic ability. F. hepatica shows characteristics of brown rot both in terms of wood decay genes found in its genome and the decay that it causes. However, genes related to cellulose degradation are still present, which is a plesiomorphic characteristic shared with its white-rot ancestors. Four wood degradation-related genes, homologs of which are frequently lost in brown-rot fungi, show signs of pseudogenization in the genome of F. hepatica. These results suggest that transition toward a brown-rot lifestyle could be an ongoing process in F. hepatica. Our results reinforce the idea that wood decay mechanisms are more diverse than initially thought and that the dichotomous separation of wood decay mechanisms in Agaricomycotina into white rot and brown rot should be revisited. Copyright © 2015 Elsevier Inc. All rights reserved.


July 7, 2019

Complete genome sequence of the unclassified iron-oxidizing, chemolithoautotrophic Burkholderiales bacterium GJ-E10, isolated from an acidic river.

Burkholderiales bacterium GJ-E10, isolated from the Tamagawa River in Akita Prefecture, Japan, is an unclassified, iron-oxidizing chemolithoautotrophic bacterium. Its single circular genome, consisting of 3,276,549 bp, was sequenced by using three types of next-generation sequencers and the sequences were then confirmed by PCR-based Sanger sequencing. Copyright © 2015 Fukushima et al.


July 7, 2019

A17, the first sequenced strain of Lactococcus lactis subsp. cremoris with potential immunomodulatory functions.

Lactococcus lactis subsp. cremoris A17, isolated from Taiwan fermented cabbage, is the first sequenced strain of L. lactis subsp. cremoris with immunomodulatory activity and antiallergic functions. The resulting A17 draft genome contains 2,679,936 bp and indicates that A17 is a potential exopolysaccharide-producing strain without any known virulence gene. Copyright © 2015 Yang et al.


July 7, 2019

Genetic determinants of reutericyclin biosynthesis in Lactobacillus reuteri.

Reutericyclin is a unique antimicrobial tetramic acid produced by some strains of Lactobacillus reuteri. This study aimed to identify the genetic determinants of reutericyclin biosynthesis. Comparisons of the genomes of reutericyclin-producing L. reuteri strains with those of non-reutericyclin-producing strains identified a genomic island of 14 open reading frames (ORFs) including genes coding for a nonribosomal peptide synthetase (NRPS), a polyketide synthase (PKS), homologues of PhlA, PhlB, and PhlC, and putative transport and regulatory proteins. The protein encoded by rtcN is composed of a condensation domain, an adenylation domain likely specific for d-leucine, and a thiolation domain. rtcK codes for a PKS that is composed of a ketosynthase domain, an acyl-carrier protein domain, and a thioesterase domain. The products of rtcA, rtcB, and rtcC are homologous to the diacetylphloroglucinol-biosynthetic proteins PhlABC and may acetylate the tetramic acid moiety produced by RtcN and RtcK, forming reutericyclin. Deletion of rtcN or rtcABC in L. reuteri TMW1.656 abrogated reutericyclin production but did not affect resistance to reutericyclin. Genes coding for transport and regulatory proteins could be deleted only in the reutericyclin-negative L. reuteri strain TMW1.656?rtcN, and these deletions eliminated reutericyclin resistance. The genomic analyses suggest that the reutericyclin genomic island was horizontally acquired from an unknown source during a unique event. The combination of PhlABC homologues with both an NRPS and a PKS has also been identified in the lactic acid bacteria Streptococcus mutans and Lactobacillus plantarum, suggesting that the genes in these organisms and those in L. reuteri share an evolutionary origin. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Analysis of a draft genome sequence of Kitasatospora cheerisanensis KCTC 2395 producing bafilomycin antibiotics.

Kitasatospora cheerisanensis KCTC 2395, producing bafilomycin antibiotics belonging to plecomacrolide group, was isolated from a soil sample at Mt. Jiri, Korea. The draft genome sequence contains 8.04 Mb with 73.6% G+C content and 7,810 open reading frames. All the genes for aerial mycelium and spore formations were confirmed in this draft genome. In phylogenetic analysis of MurE proteins (UDP-N-acetylmuramyl-L-alanyl-D-glutamate:DAP ligase) in a conserved dcw (division of cell wall) locus, MurE proteins of Kitasatospora species were placed in a separate clade between MurEs of Streptomyces species incorporating LL-diaminopimelic acid (DAP) and MurEs of Saccharopolyspora erythraea as well as Mycobacterium tuberculosis ligating meso-DAP. From this finding, it was assumed that Kitasatospora MurEs exhibit the substrate specificity for both LL-DAP and meso-DAP. The bafilomycin biosynthetic gene cluster was located in the left subtelomeric region. In 71.3 kb-long gene cluster, 17 genes probably involved in the biosynthesis of bafilomycin derivatives were deduced, including 5 polyketide synthase (PKS) genes comprised of 12 PKS modules.


July 7, 2019

Convergent losses of decay mechanisms and rapid turnover of symbiosis genes in mycorrhizal mutualists.

To elucidate the genetic bases of mycorrhizal lifestyle evolution, we sequenced new fungal genomes, including 13 ectomycorrhizal (ECM), orchid (ORM) and ericoid (ERM) species, and five saprotrophs, which we analyzed along with other fungal genomes. Ectomycorrhizal fungi have a reduced complement of genes encoding plant cell wall-degrading enzymes (PCWDEs), as compared to their ancestral wood decayers. Nevertheless, they have retained a unique array of PCWDEs, thus suggesting that they possess diverse abilities to decompose lignocellulose. Similar functional categories of nonorthologous genes are induced in symbiosis. Of induced genes, 7-38% are orphan genes, including genes that encode secreted effector-like proteins. Convergent evolution of the mycorrhizal habit in fungi occurred via the repeated evolution of a ‘symbiosis toolkit’, with reduced numbers of PCWDEs and lineage-specific suites of mycorrhiza-induced genes.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.