July 7, 2019

Dissecting a hidden gene duplication: the Arabidopsis thaliana SEC10 locus.

Repetitive sequences present a challenge for genome sequence assembly, and highly similar segmental duplications may disappear from assembled genome sequences. Having found a surprising lack of observable phenotypic deviations and non-Mendelian segregation in Arabidopsis thaliana mutants in SEC10, a gene encoding a core subunit of the exocyst tethering complex, we examined whether this could be explained by a hidden gene duplication. Re-sequencing and manual assembly of the Arabidopsis thaliana SEC10 (At5g12370) locus revealed that this locus, comprising a single gene in the reference genome assembly, indeed contains two paralogous genes in tandem, SEC10a and SEC10b, and that a sequence segment of 7 kb in length is missing from the reference genome sequence. Differences between the two paralogs are concentrated in non-coding regions, while the predicted protein sequences exhibit 99% identity, differing only by substitution of five amino acid residues and an indel of four residues. Both SEC10 genes are expressed, although varying transcript levels suggest differential regulation. Homozygous T-DNA insertion mutants in either paralog exhibit a wild-type phenotype, consistent with proposed extensive functional redundancy of the two genes. By these observations we demonstrate that recently duplicated genes may remain hidden even in well-characterized genomes, such as that of A. thaliana. Moreover, we show that the use of the existing A. thaliana reference genome sequence as a guide for sequence assembly of new Arabidopsis accessions or related species has at least in some cases led to error propagation.


July 7, 2019

Novel giant siphovirus from Bacillus anthracis features unusual genome characteristics.

Here we present vB_BanS-Tsamsa, a novel temperate phage isolated from Bacillus anthracis, the agent responsible for anthrax infections in wildlife, livestock and humans. Tsamsa phage is a giant siphovirus (order Caudovirales), featuring a long, flexible and non-contractile tail of 440 nm (not including baseplate structure) and an isometric head of 82 nm in diameter. We induced Tsamsa phage in samples from two different carcass sites in Etosha National Park, Namibia. The Tsamsa phage genome is the largest sequenced Bacillus siphovirus, containing 168,876 bp and 272 ORFs. The genome features an integrase/recombinase enzyme, indicative of a temperate lifestyle. Among bacterial strains tested, the phage infected only certain members of the Bacillus cereus sensu lato group (B. anthracis, B. cereus and B. thuringiensis) and exhibited moderate specificity for B. anthracis. Tsamsa lysed seven out of 25 B. cereus strains, two out of five B. thuringiensis strains and six out of seven B. anthracis strains tested. It did not lyse B. anthracis PAK-1, an atypical strain that is also resistant to both gamma phage and cherry phage. The Tsamsa endolysin features a broader lytic spectrum than the phage host range, indicating possible use of the enzyme in Bacillus biocontrol.
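The host-range counts above are easier to compare as fractions. The following is a minimal sketch that only restates the lysis counts given in the abstract; the dictionary layout and the function name `lysis_rates` are illustrative assumptions, not part of the study.

```python
from fractions import Fraction

# Lysis counts taken from the abstract: (strains lysed, strains tested).
lysis_counts = {
    "B. cereus": (7, 25),
    "B. thuringiensis": (2, 5),
    "B. anthracis": (6, 7),
}


def lysis_rates(counts):
    """Return the exact fraction of tested strains lysed for each species."""
    return {species: Fraction(lysed, tested)
            for species, (lysed, tested) in counts.items()}


rates = lysis_rates(lysis_counts)
for species, rate in rates.items():
    # Print each rate as a percentage with one decimal place.
    print(f"{species}: {float(rate):.1%}")
```

On these counts the lysed fraction is highest for B. anthracis, in line with the moderate specificity described in the abstract, although the number of strains tested per species is small.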


July 7, 2019

Genome sequence of Pseudomonas sp. strain P482, a tomato rhizosphere isolate with broad-spectrum antimicrobial activity.

The tomato rhizosphere isolate Pseudomonas sp. strain P482 is a member of a diverse group of fluorescent pseudomonads. P482 produces one or more as yet unidentified broad-spectrum antimicrobial compounds, active against, among others, Dickeya spp. Here, we present a nearly complete genome of P482 obtained by a hybrid assembly of Illumina and PacBio sequencing data. Copyright © 2014 Krzyzanowska et al.


July 7, 2019

Genome sequencing of two Neorhizobium galegae strains reveals a noeT gene responsible for the unusual acetylation of the nodulation factors.

The species Neorhizobium galegae comprises two symbiovars that induce nodules on Galega plants. Strains of both symbiovars, orientalis and officinalis, induce nodules on the same plant species, but fix nitrogen only in their own host species. The mechanism behind this strict host specificity is not yet known. In this study, genome sequences of representatives of the two symbiovars were produced, providing new material for studying properties of N. galegae, with a special interest in genomic differences that may play a role in host specificity. The genome sequences confirmed that the two representative strains are much alike at a whole-genome level. Analysis of orthologous genes showed that N. galegae has a higher number of orthologs shared with Rhizobium than with Agrobacterium. The symbiosis plasmid of strain HAMBI 1141 was shown to transfer by conjugation under optimal conditions. In addition, both sequenced strains have an acetyltransferase gene which was shown to modify the Nod factor on the residue adjacent to the non-reducing-terminal residue. The working hypothesis that this gene is of major importance in directing host specificity of N. galegae could not, however, be confirmed. Strains of N. galegae have many genes differentiating them from strains of Agrobacterium, Rhizobium and Sinorhizobium. However, the mechanism behind their ecological difference is not evident. Although the final determinant for the strict host specificity of N. galegae remains to be identified, the gene responsible for the species-specific acetylation of the Nod factors was identified in this study. We propose the name noeT for this gene to reflect its role in symbiosis.


July 7, 2019

Whole-genome sequence of Serratia symbiotica strain CWBI-2.3T, a free-living symbiont of the black bean aphid Aphis fabae.

The gammaproteobacterium Serratia symbiotica is one of the major secondary symbionts found in aphids. Here, we report the draft genome sequence of S. symbiotica strain CWBI-2.3(T), previously isolated from the black bean aphid Aphis fabae. The 3.58-Mb genome sequence may provide new insights into the evolution of insect-microbe symbiosis. Copyright © 2014 Foray et al.


July 7, 2019

Safety of the surrogate microorganism Enterococcus faecium NRRL B-2354 for use in thermal process validation.

Enterococcus faecium NRRL B-2354 is a surrogate microorganism used in place of pathogens for validation of thermal processing technologies and systems. We evaluated the safety of strain NRRL B-2354 based on its genomic and functional characteristics. The genome of E. faecium NRRL B-2354 was sequenced and found to comprise a 2,635,572-bp chromosome and a 214,319-bp megaplasmid. A total of 2,639 coding sequences were identified, including 45 genes unique to this strain. Hierarchical clustering of the NRRL B-2354 genome with 126 other E. faecium genomes as well as pbp5 locus comparisons and multilocus sequence typing (MLST) showed that the genotype of this strain is most similar to commensal, or community-associated, strains of this species. E. faecium NRRL B-2354 lacks antibiotic resistance genes, and both NRRL B-2354 and its clonal relative ATCC 8459 are sensitive to clinically relevant antibiotics. This organism also lacks, or contains nonfunctional copies of, enterococcal virulence genes including acm, cyl, the ebp operon, esp, gelE, hyl, IS16, and associated phenotypes. It does contain scm, sagA, efaA, and pilA, although either these genes were not expressed or their roles in enterococcal virulence are not well understood. Compared with the clinical strains TX0082 and 1,231,502, E. faecium NRRL B-2354 was more resistant to acidic conditions (pH 2.4) and high temperatures (60°C) and was able to grow in 8% ethanol. These findings support the continued use of E. faecium NRRL B-2354 in thermal process validation of food products.


July 7, 2019

Genome Sequence of Pseudomonas brassicacearum DF41.

Pseudomonas brassicacearum DF41, a Gram-negative soil bacterium, is able to suppress the fungal pathogen Sclerotinia sclerotiorum through a process known as biological control. Here, we present a 6.8-Mb assembly of its genome, which is the second fully assembled genome of a P. brassicacearum strain.


July 7, 2019

Genome sequence of Ensifer adhaerens OV14 provides insights into its ability as a novel vector for the genetic transformation of plant genomes.

Recently it has been shown that Ensifer adhaerens can be used as a plant transformation technology, transferring genes into several plant genomes when equipped with a Ti plasmid. For this study, we have sequenced the genome of Ensifer adhaerens OV14 (OV14) and compared it with those of Agrobacterium tumefaciens C58 (C58) and Sinorhizobium meliloti 1021 (1021), the latter of which has also demonstrated a capacity to genetically transform crop genomes, albeit at significantly reduced frequencies. The 7.7 Mb OV14 genome comprises two chromosomes and two plasmids. All protein coding regions in the OV14 genome were functionally grouped based on an eggNOG database. No genes homologous to the A. tumefaciens Ti plasmid vir genes appeared to be present in the OV14 genome. Unexpectedly, OV14 and 1021 were found to possess homologs to chromosomally based genes cited as essential to A. tumefaciens T-DNA transfer. Of significance, genes that are non-essential but exert a positive influence on virulence and the ability to genetically transform host genomes were identified in OV14 but were absent from the 1021 genome. This study reveals the presence of homologs to chromosomally based Agrobacterium genes that support T-DNA transfer within the genome of OV14 and other alphaproteobacteria. The sequencing and analysis of the OV14 genome increases our understanding of T-DNA transfer by non-Agrobacterium species and creates a platform for the continued improvement of Ensifer-mediated transformation (EMT).


July 7, 2019

Complete genome of the switchgrass endophyte Enterobacter cloacae P101.

The Enterobacter cloacae complex is genetically very diverse. The increasing number of complete genomic sequences of E. cloacae is helping to determine the exact relationship among members of the complex. E. cloacae P101 is an endophyte of switchgrass (Panicum virgatum) and is closely related to other E. cloacae strains isolated from plants. The P101 genome consists of a 5,369,929 bp chromosome. The chromosome has 5,164 protein-coding regions, 100 tRNA sequences, and 8 rRNA operons.


July 7, 2019

Complete genome sequence of the sugar cane endophyte Pseudomonas aurantiaca PB-St2, a disease-suppressive bacterium with antifungal activity toward the plant pathogen Colletotrichum falcatum.

The endophytic bacterium Pseudomonas aurantiaca PB-St2 exhibits antifungal activity and represents a biocontrol agent to suppress red rot disease of sugar cane. Here, we report the completely sequenced 6.6-Mb genome of P. aurantiaca PB-St2. The sequence contains a repertoire of biosynthetic genes for secondary metabolites that putatively contribute to its antagonistic activity and its plant-microbe interactions.


July 7, 2019

Sequence alignment tools: one parallel pattern to rule them all?

In this paper, we advocate a high-level programming methodology for next-generation sequencing (NGS) alignment tools, for both productivity and absolute performance. We analyse the problem of parallel alignment and review the parallelisation strategies of the most popular alignment tools, which can all be abstracted to a single parallel paradigm. We compare these tools to their ports onto the FastFlow pattern-based programming framework, which provides programmers with high-level parallel patterns. By using a high-level approach, programmers are freed from the complex aspects of parallel programming, such as synchronisation protocols and task scheduling, and gain more scope for seamless performance tuning. In this work, we show some use cases in which, by using a high-level approach for parallelising NGS tools, it is possible to obtain comparable or even better absolute performance for all used datasets.
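The abstract does not name the single parallel paradigm it refers to. As an illustration only, the sketch below assumes a task-farm style pattern, in which independent reads are handed to a pool of workers that each align against a shared reference. It uses Python's standard `concurrent.futures` rather than FastFlow, and the toy aligner `align_read` (ungapped, minimum Hamming distance) and the helper `farm_align` are hypothetical stand-ins, not the method of any real alignment tool.

```python
from concurrent.futures import ThreadPoolExecutor


def align_read(reference: str, read: str) -> tuple[int, int]:
    """Toy ungapped aligner: slide the read along the reference and
    return (position, mismatches) for the first best-scoring window."""
    best_pos, best_mismatches = -1, len(read) + 1
    for pos in range(len(reference) - len(read) + 1):
        window = reference[pos:pos + len(read)]
        mismatches = sum(a != b for a, b in zip(window, read))
        if mismatches < best_mismatches:
            best_pos, best_mismatches = pos, mismatches
    return best_pos, best_mismatches


def farm_align(reference: str, reads: list[str], workers: int = 4):
    """Farm-style parallelism: each read is an independent task, and the
    executor hides scheduling and synchronisation from the caller.
    Results come back in the same order as the input reads."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda read: align_read(reference, read), reads))


# Hypothetical example data, not taken from any dataset in the paper.
REFERENCE = "ACGTACGTTAGCCGATAGGCTTAACG"
READS = ["TAGCCG", "GGCTTA", "ACGTTC"]

results = farm_align(REFERENCE, READS)
for read, (pos, mismatches) in zip(READS, results):
    print(f"{read}: position {pos}, {mismatches} mismatch(es)")
```

The caller only supplies the per-read function and the list of reads; worker creation, scheduling and result ordering are left to the executor. In CPython a thread pool will not speed up this pure-Python loop, so the sketch shows the structure of the pattern rather than its performance.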


July 7, 2019

Thirty-thousand-year-old distant relative of giant icosahedral DNA viruses with a pandoravirus morphology.

The largest known DNA viruses infect Acanthamoeba and belong to two markedly different families. The Megaviridae exhibit pseudo-icosahedral virions up to 0.7 µm in diameter and adenine-thymine (AT)-rich genomes of up to 1.25 Mb encoding a thousand proteins. Like their Mimivirus prototype discovered 10 y ago, they entirely replicate within cytoplasmic virion factories. In contrast, the recently discovered Pandoraviruses exhibit larger amphora-shaped virions 1 µm in length and guanine-cytosine-rich genomes up to 2.8 Mb long encoding up to 2,500 proteins. Their replication involves the host nucleus. Whereas the Megaviridae share some general features with the previously described icosahedral large DNA viruses, the Pandoraviruses appear unrelated to them. Here we report the discovery of a third type of giant virus combining an even larger pandoravirus-like particle 1.5 µm in length with a surprisingly smaller 600 kb AT-rich genome, a gene content more similar to Iridoviruses and Marseillevirus, and a fully cytoplasmic replication reminiscent of the Megaviridae. This suggests that pandoravirus-like particles may be associated with a variety of virus families more diverse than previously envisioned. This giant virus, named Pithovirus sibericum, was isolated from a >30,000-y-old radiocarbon-dated sample when we initiated a survey of the virome of Siberian permafrost. The revival of such an ancestral amoeba-infecting virus, used as a safe indicator of the possible presence of pathogenic DNA viruses, suggests that the thawing of permafrost, either from global warming or industrial exploitation of circumpolar regions, might not be exempt from future threats to human or animal health.


July 7, 2019

Detecting authorized and unauthorized genetically modified organisms containing vip3A by real-time PCR and next-generation sequencing.

The growing number of biotech crops with novel genetic elements increasingly complicates the detection of genetically modified organisms (GMOs) in food and feed samples using conventional screening methods. Unauthorized GMOs (UGMOs) in food and feed are currently identified through combining GMO element screening with sequencing the DNA flanking these elements. In this study, a specific and sensitive qPCR assay was developed for vip3A element detection based on the vip3Aa20 coding sequences of the recently marketed MIR162 maize and COT102 cotton. Furthermore, SiteFinding-PCR in combination with Sanger, Illumina or Pacific BioSciences (PacBio) sequencing was performed targeting the flanking DNA of the vip3Aa20 element in MIR162. De novo assembly and Basic Local Alignment Search Tool searches were used to mimic UGMO identification. PacBio data resulted in relatively long contigs in the upstream (1,326 nucleotides (nt); 95 % identity) and downstream (1,135 nt; 92 % identity) regions, whereas Illumina data resulted in two smaller contigs of 858 and 1,038 nt with higher sequence identity (>99 % identity). Both approaches outperformed Sanger sequencing, underlining the potential for next-generation sequencing in UGMO identification.

