Menu
July 7, 2019

A draft genome of field pennycress (Thlaspi arvense) provides tools for the domestication of a new winter biofuel crop.

Field pennycress (Thlaspi arvense L.) is being domesticated as a new winter cover crop and biofuel species for the Midwestern United States that can be double-cropped between corn and soybeans. A genome sequence will enable the use of new technologies to make improvements in pennycress. To generate a draft genome, a hybrid sequencing approach was used to generate 47 Gb of DNA sequencing reads from both the Illumina and PacBio platforms. These reads were used to assemble 6,768 genomic scaffolds. The draft genome was annotated using the MAKER pipeline, which identified 27,390 predicted protein-coding genes, with almost all of these predicted peptides having significant sequence similarity to Arabidopsis proteins. A comprehensive analysis of pennycress gene homologues involved in glucosinolate biosynthesis, metabolism, and transport pathways revealed high sequence conservation compared with other Brassicaceae species, and helps validate the assembly of the pennycress gene space in this draft genome. Additional comparative genomic analyses indicate that the knowledge gained from years of basic Brassicaceae research will serve as a powerful tool for identifying gene targets whose manipulation can be predicted to result in improvements for pennycress. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

Exploring possible DNA structures in real-time polymerase kinetics using Pacific Biosciences sequencer data.

BackgroundPausing of DNA polymerase can indicate the presence of a DNA structure that differs from the canonical double-helix. Here we detail a method to investigate how polymerase pausing in the Pacific Biosciences sequencer reads can be related to DNA sequences. The Pacific Biosciences sequencer uses optics to view a polymerase and its interaction with a single DNA molecule in real-time, offering a unique way to detect potential alternative DNA structures.ResultsWe have developed a new way to examine polymerase kinetics data and relate it to the DNA sequence by using a wavelet transform of read information from the sequencer. We use this method to examine how polymerase kinetics are related to nucleotide base composition. We then examine tandem repeat sequences known for their ability to form different DNA structures: (CGG)n and (CG)n repeats which can, respectively, form G-quadruplex DNA and Z-DNA. We find pausing around the (CGG)n repeat that may indicate the presence of G-quadruplexes in some of the sequencer reads. The (CG)n repeat does not appear to cause polymerase pausing, but its kinetics signature nevertheless suggests the possibility that alternative nucleotide conformations may sometimes be present.ConclusionWe discuss the implications of using our method to discover DNA sequences capable of forming alternative structures. The analyses presented here can be reproduced on any Pacific Biosciences kinetics data for any DNA pattern of interest using an R package that we have made publicly available.


July 7, 2019

Do read errors matter for genome assembly?

While most current high-throughput DNA sequencing technologies generate short reads with low error rates, emerging sequencing technologies generate long reads with high error rates. A basic question of interest is the tradeoff between read length and error rate in terms of the information needed for the perfect assembly of the genome. Using an adversarial erasure error model, we make progress on this problem by establishing a critical read length, as a function of the genome and the error rate, above which perfect assembly is guaranteed. For several real genomes, including those from the GAGE dataset, we verify that this critical read length is not significantly greater than the read length required for perfect assembly from reads without errors.


July 7, 2019

Evolution of novel wood decay mechanisms in Agaricales revealed by the genome sequences of Fistulina hepatica and Cylindrobasidium torrendii.

Wood decay mechanisms in Agaricomycotina have been traditionally separated in two categories termed white and brown rot. Recently the accuracy of such a dichotomy has been questioned. Here, we present the genome sequences of the white-rot fungus Cylindrobasidium torrendii and the brown-rot fungus Fistulina hepatica both members of Agaricales, combining comparative genomics and wood decay experiments. C. torrendii is closely related to the white-rot root pathogen Armillaria mellea, while F. hepatica is related to Schizophyllum commune, which has been reported to cause white rot. Our results suggest that C. torrendii and S. commune are intermediate between white-rot and brown-rot fungi, but at the same time they show characteristics of decay that resembles soft rot. Both species cause weak wood decay and degrade all wood components but leave the middle lamella intact. Their gene content related to lignin degradation is reduced, similar to brown-rot fungi, but both have maintained a rich array of genes related to carbohydrate degradation, similar to white-rot fungi. These characteristics appear to have evolved from white-rot ancestors with stronger ligninolytic ability. F. hepatica shows characteristics of brown rot both in terms of wood decay genes found in its genome and the decay that it causes. However, genes related to cellulose degradation are still present, which is a plesiomorphic characteristic shared with its white-rot ancestors. Four wood degradation-related genes, homologs of which are frequently lost in brown-rot fungi, show signs of pseudogenization in the genome of F. hepatica. These results suggest that transition toward a brown-rot lifestyle could be an ongoing process in F. hepatica. Our results reinforce the idea that wood decay mechanisms are more diverse than initially thought and that the dichotomous separation of wood decay mechanisms in Agaricomycotina into white rot and brown rot should be revisited. Copyright © 2015 Elsevier Inc. All rights reserved.


July 7, 2019

Complete genome sequence of Sphingobacterium sp. strain ML3W, isolated from wings of Myotis lucifugus infected with white nose syndrome.

Sphingobacterium sp. strain ML3W was isolated from the wing of a bat infected with white nose syndrome. We report the complete 5.33-Mb genome sequence of Sphingobacterium sp. strain ML3W, obtained using Pacific Biosciences technology. Being the second complete Sphingobacterium sequence, this will increase knowledge of the genus. Copyright © 2015 Smith et al.


July 7, 2019

Complete genome sequence of the Clostridium difficile laboratory strain 630¿ erm reveals differences from strain 630, including translocation of the mobile element CTn 5.

Background Clostridium difficile strain 630¿erm is a spontaneous erythromycin sensitive derivative of the reference strain 630 obtained by serial passaging in antibiotic-free media. It is widely used as a defined and tractable C. difficile strain. Though largely similar to the ancestral strain, it demonstrates phenotypic differences that might be the result of underlying genetic changes. Here, we performed a de novo assembly based on single-molecule real-time sequencing and an analysis of major methylation patterns.ResultsIn addition to single nucleotide polymorphisms and various indels, we found that the mobile element CTn5 is present in the gene encoding the methyltransferase rumA rather than adhesin CD1844 where it is located in the reference strain.ConclusionsTogether, the genetic features identified in this study may help to explain at least part of the phenotypic differences. The annotated genome sequence of this lab strain, including the first analysis of major methylation patterns, will be a valuable resource for genetic research on C. difficile.


July 7, 2019

Complete genome sequence of BS49 and draft genome sequence of BS34A, Bacillus subtilis strains carrying Tn916.

Bacillus subtilis strains BS49 and BS34A, both derived from a common ancestor, carry one or more copies of Tn916, an extremely common mobile genetic element capable of transfer to and from a broad range of microorganisms. Here, we report the complete genome sequence of BS49 and the draft genome sequence of BS34A, which have repeatedly been used as donors to transfer Tn916, Tn916 derivatives or oriTTn916-containing plasmids to clinically important pathogens. © FEMS 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Complete genome sequence of the fish pathogen Flavobacterium psychrophilum ATCC 49418(T.).

Flavobacterium psychrophilum is the causative agent of bacterial cold water disease and rainbow trout fry mortality syndrome in salmonid fishes and is associated with significant losses in the aquaculture industry. The virulence factors and molecular mechanisms of pathogenesis of F. psychrophilum are poorly understood. Moreover, at the present time, there are no effective vaccines and control using antimicrobial agents is problematic due to growing antimicrobial resistance and the fact that sick fish don’t eat. In the hopes of identifying vaccine and therapeutic targets, we sequenced the genome of the type strain ATCC 49418 which was isolated from the kidney of a Coho salmon (Oncorhychus kisutch) in Washington State (U.S.A.) in 1989. The genome is 2,715,909 bp with a G+C content of 32.75%. It contains 6 rRNA operons, 49 tRNA genes, and is predicted to encode 2,329 proteins.


July 7, 2019

Draft genome of Janthinobacterium sp. RA13 isolated from Lake Washington sediment.

Sequencing the genome of Janthinobacterium sp. RA13 from Lake Washington sediment is announced. From the genome content, a versatile life-style is predicted, but not bona fide methylotrophy. With the availability of its genomic sequence, Janthinobacterium sp. RA13 presents a prospective model for studying microbial communities in lake sediments. Copyright © 2015 McTaggart et al.


July 7, 2019

Genetic determinants of reutericyclin biosynthesis in Lactobacillus reuteri.

Reutericyclin is a unique antimicrobial tetramic acid produced by some strains of Lactobacillus reuteri. This study aimed to identify the genetic determinants of reutericyclin biosynthesis. Comparisons of the genomes of reutericyclin-producing L. reuteri strains with those of non-reutericyclin-producing strains identified a genomic island of 14 open reading frames (ORFs) including genes coding for a nonribosomal peptide synthetase (NRPS), a polyketide synthase (PKS), homologues of PhlA, PhlB, and PhlC, and putative transport and regulatory proteins. The protein encoded by rtcN is composed of a condensation domain, an adenylation domain likely specific for d-leucine, and a thiolation domain. rtcK codes for a PKS that is composed of a ketosynthase domain, an acyl-carrier protein domain, and a thioesterase domain. The products of rtcA, rtcB, and rtcC are homologous to the diacetylphloroglucinol-biosynthetic proteins PhlABC and may acetylate the tetramic acid moiety produced by RtcN and RtcK, forming reutericyclin. Deletion of rtcN or rtcABC in L. reuteri TMW1.656 abrogated reutericyclin production but did not affect resistance to reutericyclin. Genes coding for transport and regulatory proteins could be deleted only in the reutericyclin-negative L. reuteri strain TMW1.656?rtcN, and these deletions eliminated reutericyclin resistance. The genomic analyses suggest that the reutericyclin genomic island was horizontally acquired from an unknown source during a unique event. The combination of PhlABC homologues with both an NRPS and a PKS has also been identified in the lactic acid bacteria Streptococcus mutans and Lactobacillus plantarum, suggesting that the genes in these organisms and those in L. reuteri share an evolutionary origin. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Analysis of a draft genome sequence of Kitasatospora cheerisanensis KCTC 2395 producing bafilomycin antibiotics.

Kitasatospora cheerisanensis KCTC 2395, producing bafilomycin antibiotics belonging to plecomacrolide group, was isolated from a soil sample at Mt. Jiri, Korea. The draft genome sequence contains 8.04 Mb with 73.6% G+C content and 7,810 open reading frames. All the genes for aerial mycelium and spore formations were confirmed in this draft genome. In phylogenetic analysis of MurE proteins (UDP-N-acetylmuramyl-L-alanyl-D-glutamate:DAP ligase) in a conserved dcw (division of cell wall) locus, MurE proteins of Kitasatospora species were placed in a separate clade between MurEs of Streptomyces species incorporating LL-diaminopimelic acid (DAP) and MurEs of Saccharopolyspora erythraea as well as Mycobacterium tuberculosis ligating meso-DAP. From this finding, it was assumed that Kitasatospora MurEs exhibit the substrate specificity for both LL-DAP and meso-DAP. The bafilomycin biosynthetic gene cluster was located in the left subtelomeric region. In 71.3 kb-long gene cluster, 17 genes probably involved in the biosynthesis of bafilomycin derivatives were deduced, including 5 polyketide synthase (PKS) genes comprised of 12 PKS modules.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.