September 22, 2019

The integrative conjugative element clc (ICEclc) of Pseudomonas aeruginosa JB2.

Integrative conjugative elements (ICE) are a diverse group of chromosomally integrated, self-transmissible mobile genetic elements (MGE) that are active in shaping the functions of bacteria and bacterial communities. Each type of ICE carries a characteristic set of core genes encoding functions essential for maintenance and self-transmission, and cargo genes that endow hosts with phenotypes beneficial for niche adaptation. An important area to which ICE can contribute beneficial functions is the biodegradation of xenobiotic compounds. In the biodegradation realm, the best-characterized ICE is ICEclc, which carries cargo genes encoding ortho-cleavage of chlorocatechols (clc genes) and aminophenol metabolism (amn genes). The element was originally identified in the 3-chlorobenzoate degrader Pseudomonas knackmussii B13, and the closest relative is a nearly identical element in Burkholderia xenovorans LB400 (designated ICEclc-B13 and ICEclc-LB400, respectively). In the present report, genome sequencing of the o-chlorobenzoate degrader Pseudomonas aeruginosa JB2 was used to identify a new member of the ICEclc family, ICEclc-JB2. The cargo of ICEclc-JB2 differs from that of ICEclc-B13 and ICEclc-LB400 in consisting of a unique combination of genes that encode the utilization of o-halobenzoates and o-hydroxybenzoate as growth substrates (ohb genes and hyb genes, respectively) and which are duplicated in a tandem repeat. Also, ICEclc-JB2 lacks an operon of regulatory genes (tciR-marR-mfsR) that is present in the other two ICEclc, and which controls excision from the host. Thus, the mechanisms regulating intracellular behavior of ICEclc-JB2 may differ from those of its close relatives. The entire tandem repeat in ICEclc-JB2 can excise independently from the element in a process apparently involving transposases/insertion sequences associated with the repeats. Excision of the repeats removes important niche adaptation genes from ICEclc-JB2, rendering it less beneficial to the host. However, the reduced version of ICEclc-JB2 could now acquire new genes that might be beneficial to a future host and, consequently, to the survival of ICEclc-JB2. Collectively, the present identification and characterization of ICEclc-JB2 provides insights into roles of MGE in bacterial niche adaptation and the evolution of catabolic pathways for biodegradation of xenobiotic compounds.


September 22, 2019

Genomes of ubiquitous marine and hypersaline Hydrogenovibrio, Thiomicrorhabdus and Thiomicrospira spp. encode a diversity of mechanisms to sustain chemolithoautotrophy in heterogeneous environments.

Chemolithoautotrophic bacteria from the genera Hydrogenovibrio, Thiomicrorhabdus and Thiomicrospira are common, sometimes dominant, isolates from sulfidic habitats including hydrothermal vents, soda and salt lakes and marine sediments. Their genome sequences confirm their membership in a deeply branching clade of the Gammaproteobacteria. Several adaptations to heterogeneous habitats are apparent. Their genomes include large numbers of genes for sensing and responding to their environment (EAL- and GGDEF-domain proteins and methyl-accepting chemotaxis proteins) despite their small sizes (2.1-3.1 Mbp). An array of sulfur-oxidizing complexes is encoded, likely to facilitate these organisms’ use of multiple forms of reduced sulfur as electron donors. Hydrogenase genes are present in some taxa, including group 1d and 2b hydrogenases in Hydrogenovibrio marinus and H. thermophilus MA2-6, acquired via horizontal gene transfer. In addition to high-affinity cbb3 cytochrome c oxidase, some also encode cytochrome bd-type quinol oxidase or ba3-type cytochrome c oxidase, which could facilitate growth under different oxygen tensions, or maintain redox balance. Carboxysome operons are present in most, with genes downstream encoding transporters from four evolutionarily distinct families, which may act with the carboxysomes to form CO2 concentrating mechanisms. These adaptations to habitat variability likely contribute to the cosmopolitan distribution of these organisms. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019

Human copy number variants are enriched in regions of low mappability.

Copy number variants (CNVs) are known to affect a large portion of the human genome and have been implicated in many diseases. Although whole-genome sequencing (WGS) can help identify CNVs, most analytical methods suffer from limited sensitivity and specificity, especially in regions of low mappability. To address this, we use PopSV, a CNV caller that relies on multiple samples to control for technical variation. We demonstrate that our calls are stable across different types of repeat-rich regions and validate the accuracy of our predictions using orthogonal approaches. Applying PopSV to 640 human genomes, we find that low-mappability regions are approximately 5 times more likely to harbor germline CNVs, in stark contrast to the nearly uniform distribution observed for somatic CNVs in 95 cancer genomes. In addition to known enrichments in segmental duplication and near centromeres and telomeres, we also report that CNVs are enriched in specific types of satellite and in some of the most recent families of transposable elements. Finally, using this comprehensive approach, we identify 3455 regions with recurrent CNVs that were missing from existing catalogs. In particular, we identify 347 genes with a novel exonic CNV in low-mappability regions, including 29 genes previously associated with disease.


September 22, 2019

Optical and physical mapping with local finishing enables megabase-scale resolution of agronomically important regions in the wheat genome.

Numerous scaffold-level sequences for wheat are now being released and, in this context, we report on a strategy for improving the overall assembly to a level comparable to that of the human genome. Using chromosome 7A of wheat as a model, sequence-finished megabase-scale sections of this chromosome were established by combining a new independent assembly using a bacterial artificial chromosome (BAC)-based physical map, BAC pool paired-end sequencing, chromosome-arm-specific mate-pair sequencing and Bionano optical mapping with the International Wheat Genome Sequencing Consortium RefSeq v1.0 sequence and its underlying raw data. The combined assembly results in 18 super-scaffolds across the chromosome. The value of finished genome regions is demonstrated for two approximately 2.5 Mb regions associated with yield and the grain quality phenotype of fructan carbohydrate grain levels. In addition, the 50 Mb centromere region analysis incorporates cytological data highlighting the importance of non-sequence data in the assembly of this complex genome region. Sufficient genome sequence information is shown to now be available for the wheat community to produce sequence-finished releases of each chromosome of the reference genome. The high-level completion identified that an array of seven fructosyl transferase genes underpins grain quality and that yield attributes are affected by five F-box-only-protein-ubiquitin ligase domain and four root-specific lipid transfer domain genes. The completed sequence also includes the centromere.


September 22, 2019

In vivo evolution of drug-resistant Mycobacterium tuberculosis in patients during long-term treatment.

Drug-resistant tuberculosis is a significant challenge in the control of tuberculosis worldwide. To investigate the in vivo evolution of drug-resistant M. tuberculosis, the present study sequenced the draft genomes of 18 serial isolates from four pre-extensively drug-resistant (pre-XDR) tuberculosis patients to track continuous genetic alterations. All of the isolates harbored single nucleotide polymorphisms (SNPs) ranging from 1303 to 1309 with M. tuberculosis H37Rv as the reference. SNPs ranged from 0 to 12 within patients. The evolution rates were higher than the reported SNPs of 0.5 in the four patients. All the isolates exhibited mutations at sites of known drug targets, while some contained mutations in uncertain drug targets including folC, proZ, and pyrG. The compensatory substitutions for rescuing these deleterious mutations during evolution were only found in RpoC I491T in one patient. Many loci with microheterogeneity showed transient mutations in different isolates. Ninety-three SNPs exhibited significant association with refractory pre-XDR TB isolates. Our results showed evolutionary changes in the serial genetic characteristics of the pre-XDR TB patients due to accumulation of the fixed drug-resistance-related mutations, and the transient mutations under continuous antibiotic pressure over several years.


September 22, 2019

Functional and genome sequence-driven characterization of tal effector gene repertoires reveals novel variants with altered specificities in closely related Malian Xanthomonas oryzae pv. oryzae strains.

Rice bacterial leaf blight (BLB) is caused by Xanthomonas oryzae pv. oryzae (Xoo), which injects Transcription Activator-Like Effectors (TALEs) into the host cell to modulate the expression of target disease susceptibility genes. Xoo major-virulence TALEs universally target susceptibility genes of the SWEET sugar transporter family. TALE-unresponsive alleles of OsSWEET genes have been identified in the rice germplasm or created by genome editing and confer resistance to BLB. In recent years, BLB has become one of the major biotic constraints to rice cultivation in Mali. To inform the deployment of alternative sources of resistance in this country, rice lines carrying alleles of OsSWEET14 unresponsive to either TalF (formerly Tal5) or TalC, two important TALEs previously identified in West African Xoo, were challenged with a panel of strains recently isolated in Mali and were found to remain susceptible to these isolates. The characterization of TALE repertoires revealed that talF- and talC-specific molecular markers were simultaneously present in all surveyed Malian strains, suggesting that the corresponding TALEs are broadly deployed by Malian Xoo to redundantly target the OsSWEET14 gene promoter. Consistent with this, the capacity of most Malian Xoo to induce OsSWEET14 was unaffected by either talC- or talF-unresponsive alleles of this gene. Long-read sequencing and assembly of eight Malian Xoo genomes confirmed the widespread occurrence of active TalF and TalC variants and provided a detailed insight into the diversity of TALE repertoires. All sequenced strains shared nine evolutionarily related tal effector genes. Notably, a new TalF variant that is unable to induce OsSWEET14 was identified. Furthermore, two distinct TalB variants were shown to have lost the ability to simultaneously induce two susceptibility genes as previously reported for the founding members of this group from strains MAI1 and BAI3. Yet, both new TalB variants retained the ability to induce one or the other of the two susceptibility genes. These results reveal molecular and functional differences in tal repertoires and will be important for the sustainable deployment of broad-spectrum and durable resistance to BLB in West Africa.


September 22, 2019

Comparison of highly and weakly virulent Dickeya solani strains, with a view on the pangenome and panregulon of this species.

Bacteria belonging to the genera Dickeya and Pectobacterium are responsible for significant economic losses in a wide variety of crops and ornamentals. In recent years, increasing losses in potato production have been attributed to the appearance of Dickeya solani. The D. solani strains investigated so far share genetic homogeneity, although different virulence levels were observed among strains of various origins. The purpose of this study was to investigate the genetic traits possibly related to the diverse virulence levels by means of comparative genomics. First, we developed a new genome assembly pipeline which allowed us to complete the D. solani genomes. Four de novo sequenced and ten publicly available genomes were used to identify the structure of the D. solani pangenome, in which 74.8 and 25.2% of genes were grouped into the core and dispensable genome, respectively. For D. solani panregulon analysis, we performed a binding site prediction for four transcription factors, namely CRP, KdgR, PecS and Fur, to detect the regulons of these virulence regulators. Most of the D. solani potential virulence factors were predicted to belong to the accessory regulons of CRP, KdgR, and PecS. Thus, some differences in gene expression could exist between D. solani strains. The comparison between a highly and a weakly virulent strain, IFB0099 and IFB0223, respectively, disclosed only small differences between their genomes but significant differences in the production of virulence factors like pectinases, cellulases and proteases, and in their mobility. The D. solani strains also diverge in the number and size of prophages present in their genomes. Another relevant difference is the disruption of the adhesin gene fhaB2 in the highly virulent strain. Strain IFB0223, which has a complete adhesin gene, is less mobile and less aggressive than IFB0099. This suggests that in this case, mobility rather than adherence is needed in order to trigger disease symptoms. This study highlights the utility of comparative genomics in predicting D. solani traits involved in the aggressiveness of this emerging plant pathogen.
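The core/dispensable split reported in the abstract above can be illustrated with a minimal sketch. It assumes each genome has already been reduced to a set of gene-family identifiers, treats "core" as present in every genome and "dispensable" as everything else, and uses hypothetical strain and gene names; it is not the authors' pipeline.

```python
def partition_pangenome(genomes):
    """Split a pangenome into core and dispensable gene families.

    genomes: dict mapping a strain name to the set of gene families it carries.
    Core = families present in every strain; dispensable = all remaining families.
    """
    if not genomes:
        raise ValueError("need at least one genome")
    gene_sets = list(genomes.values())
    pangenome = set().union(*gene_sets)        # every family seen in any strain
    core = set.intersection(*gene_sets)        # families shared by all strains
    dispensable = pangenome - core
    return core, dispensable


# Hypothetical toy input, not data from the study.
toy_genomes = {
    "strain_A": {"g1", "g2", "g3", "g4"},
    "strain_B": {"g1", "g2", "g3"},
    "strain_C": {"g1", "g2", "g3", "g5"},
}
core, dispensable = partition_pangenome(toy_genomes)
core_fraction = len(core) / (len(core) + len(dispensable))
print(sorted(core), sorted(dispensable), f"{core_fraction:.1%}")
```

Real analyses usually cluster proteins into orthologous families first, and some relax the core definition to tolerate assembly gaps; both choices change the percentages.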


September 22, 2019

The chromosome-level genome assemblies of two rattans (Calamus simplicifolius and Daemonorops jenkinsiana).

Calamus simplicifolius and Daemonorops jenkinsiana are two representative rattans, the most significant material sources for the rattan industry. However, the lack of reference genome sequences is a major obstacle for basic and applied biology on rattan. We produced two chromosome-level genome assemblies of C. simplicifolius and D. jenkinsiana using Illumina, Pacific Biosciences, and Hi-C sequencing data. A total of ~730 Gb and ~682 Gb of raw data covered the predicted genome lengths (~1.98 Gb of C. simplicifolius and ~1.61 Gb of D. jenkinsiana) to ~372× and ~426× read depths, respectively. The two de novo genome assemblies, ~1.94 Gb and ~1.58 Gb, were generated with scaffold N50s of ~160 Mb and ~119 Mb in C. simplicifolius and D. jenkinsiana, respectively. The C. simplicifolius and D. jenkinsiana genomes were predicted to harbor ~51,235 and ~53,342 intact protein-coding gene models, respectively. Benchmarking Universal Single-Copy Orthologs evaluation demonstrated that genome completeness reached 96.4% and 91.3% in the C. simplicifolius and D. jenkinsiana genomes, respectively. Genome evolution showed that four Arecaceae plants clustered together, and the divergence time between the two rattans was ~19.3 million years ago. Additionally, we identified 193 and 172 genes involved in the lignin biosynthesis pathway in the C. simplicifolius and D. jenkinsiana genomes, respectively. We present the first de novo assemblies of two rattan genomes (C. simplicifolius and D. jenkinsiana). These data will not only provide a fundamental resource for functional genomics, particularly in promoting germplasm utilization for breeding, but also serve as reference genomes for comparative studies between and among different species.
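The read depths quoted in the abstract above follow from dividing total sequenced bases by genome length. The sketch below applies that arithmetic to the rounded figures given in the abstract; because those inputs are approximate, the results land near, but not exactly on, the reported ~372× and ~426×. The function name is an illustrative choice.

```python
def read_depth(raw_data_gb, genome_size_gb):
    """Average read depth: total sequenced bases divided by genome length."""
    if genome_size_gb <= 0:
        raise ValueError("genome size must be positive")
    return raw_data_gb / genome_size_gb


# Rounded values taken from the abstract (Gb of raw data, predicted genome size in Gb).
depth_c_simplicifolius = read_depth(730, 1.98)
depth_d_jenkinsiana = read_depth(682, 1.61)
print(f"C. simplicifolius: about {depth_c_simplicifolius:.0f}x")
print(f"D. jenkinsiana: about {depth_d_jenkinsiana:.0f}x")
```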


September 22, 2019

PacBio-based mitochondrial genome assembly of Leucaena trichandra (Leguminosae) and an intrageneric assessment of mitochondrial RNA editing.

Reconstructions of vascular plant mitochondrial genomes (mt-genomes) are notoriously complicated by rampant recombination that has resulted in comparatively few plant mt-genomes being available. The dearth of plant mitochondrial resources has limited our understanding of mt-genome structural diversity, complex patterns of RNA editing, and the origins of novel mt-genome elements. Here, we use an efficient long-read (PacBio) iterative assembly pipeline to generate mt-genome assemblies for Leucaena trichandra (Leguminosae: Caesalpinioideae: mimosoid clade), providing the first assessment of non-papilionoid legume mt-genome content and structure to date. The efficiency of the assembly approach facilitated the exploration of alternative structures that are commonplace among plant mitochondrial genomes. A compact version (729 kbp) of the recovered assemblies was used to investigate sources of mt-genome size variation among legumes and mt-genome sequence similarity to the legume-associated root holoparasite Lophophytum. The genome and an associated suite of transcriptome data from select species of Leucaena permitted an in-depth exploration of RNA editing in a diverse clade of closely related species that includes hybrid lineages. RNA editing in the allotetraploid, Leucaena leucocephala, is consistent with co-option of nearly equal maternal and paternal C-to-U edit components, generating novel combinations of RNA edited sites. A preliminary investigation of L. leucocephala C-to-U edit frequencies identified the potential for a hybrid to generate unique pools of alleles from parental variation through edit frequencies shared with one parental lineage, those intermediate between parents, and transgressive patterns.


September 22, 2019

A draft genome assembly of the Chinese sillago (Sillago sinica), the first reference genome for Sillaginidae fishes.

Sillaginidae, also known as smelt-whitings, is a family of benthic coastal marine fishes in the Indo-West Pacific that have high ecological and economic importance. Many Sillaginidae species, including the Chinese sillago (Sillago sinica), have been recently described in China, providing valuable material to analyze genetic diversification of the family Sillaginidae. Here, we constructed a reference genome for the Chinese sillago, with the aim to set up a platform for comparative analysis of all species in this family. Using the single-molecule real-time DNA sequencing platform Pacific Biosciences (PacBio) Sequel, we generated ~27.3 Gb genomic DNA sequences for the Chinese sillago. We reconstructed a genome assembly of 534 Mb using a strategy that takes advantage of complementary strengths of two genome assembly programs, Canu and FALCON. The genome size was consistent with the estimated genome size based on k-mer analysis. The assembled genome consisted of 802 contigs with a contig N50 length of 2.6 Mb. We annotated 22,122 protein-coding genes in the Chinese sillago genome using a de novo method as well as RNA sequencing data and homologies to other teleosts. According to the phylogenetic analysis using protein-coding genes, the Chinese sillago is closely related to Larimichthys crocea and Dicentrarchus labrax and diverged from their ancestor around 69.5-82.6 million years ago. Using long reads generated with PacBio sequencing technology, we have built a draft genome assembly for the Chinese sillago, which is the first reference genome for Sillaginidae species. This genome assembly sets a stage for comparative analysis of the diversification and adaptation of fishes in Sillaginidae.
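The contig N50 cited in the abstract above is the length of the contig at which, going from longest to shortest, the running total first reaches half of the assembly size. A minimal sketch of that calculation follows; the contig lengths are hypothetical toy values, not the sillago assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths."""
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)   # longest contigs first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:             # half the assembly is now covered
            return length


# Hypothetical contig lengths in Mb (total 30 Mb).
toy_contigs = [2, 3, 4, 5, 6, 10]
print(n50(toy_contigs))  # 6: the two longest contigs (10 + 6) cover at least 15 Mb
```

N50 rewards a few long contigs, so it is usually read alongside contig count and total assembly length, as the abstract does.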


September 22, 2019

Spread of carbapenem resistance by transposition and conjugation among Pseudomonas aeruginosa.

The emergence of carbapenem-resistant Pseudomonas aeruginosa represents a worldwide problem. To understand the carbapenem-resistance mechanisms and their spread among P. aeruginosa strains, the whole genome sequences of two extensively drug-resistant strains that are endemic in Dutch hospitals were determined. Strain Carb01 63 is of O-antigen serotype O12 and of sequence type ST111, whilst S04 90 is a serotype O11 strain of ST446. Both strains carry a gene for metallo-β-lactamase VIM-2 flanked by two aacA29 genes encoding aminoglycoside acetyltransferases on a class 1 integron. The integron is located on the chromosome in strain Carb01 63 and on a plasmid in strain S04 90. The backbone of the 159-kb plasmid, designated pS04 90, is similar to a previously described plasmid, pND6-2, from Pseudomonas putida. Analysis of the context of the integron showed that it is present in both strains on a ~30-kb mosaic DNA segment composed of four different transposons that can presumably act together as a novel, active, composite transposon. Apart from the presence of a 1237-bp insertion sequence element in the composite transposon on pS04 90, these transposons show >99% sequence identity, indicating that transposition between plasmid and chromosome could have occurred only very recently. The pS04 90 plasmid could be transferred by conjugation to a susceptible P. aeruginosa strain. A second class 1 integron containing a gene for a CARB-2 β-lactamase flanked by an aacA4′-8 and an aadA2 gene, encoding an aminoglycoside acetyltransferase and adenylyltransferase, respectively, was present only in strain Carb01 63. This integron is also located on a composite transposon that is inserted in an integrative and conjugative element on the chromosome. Additionally, this strain contains a frameshift mutation in the oprD gene encoding a porin involved in the transport of carbapenems across the outer membrane. Together, the results demonstrate that integron-encoded carbapenem and carbapenicillin resistance can easily be disseminated by transposition and conjugation among Pseudomonas aeruginosa strains.


September 22, 2019

Genome plasticity of agr-defective Staphylococcus aureus during clinical infection.

Therapy for bacteremia caused by Staphylococcus aureus is often ineffective, even when treatment conditions are optimal according to experimental protocols. Adapted subclones, such as those bearing mutations that attenuate agr-mediated virulence activation, are associated with persistent infection and patient mortality. To identify additional alterations in agr-defective mutants, we sequenced and assembled the complete genomes of clone pairs from colonizing and infected sites of several patients in whom S. aureus demonstrated a within-host loss of agr function. We report that events associated with agr inactivation result in agr-defective blood and nares strain pairs that are enriched in mutations compared to pairs from wild-type controls. The random distribution of mutations between colonizing and infecting strains from the same patient, and between strains from different patients, suggests that much of the genetic complexity of agr-defective strains results from prolonged infection or therapy-induced stress. However, in one of the agr-defective infecting strains, multiple genetic changes resulted in increased virulence in a murine model of bloodstream infection, bypassing the mutation of agr and raising the possibility that some changes were selected. Expression profiling correlated the elevated virulence of this agr-defective mutant to restored expression of the agr-regulated ESAT6-like type VII secretion system, a known virulence factor. Thus, additional mutations outside the agr locus can contribute to diversification and adaptation during infection by S. aureus agr mutants associated with poor patient outcomes. Copyright © 2018 Altman et al.


September 22, 2019

The genome of tapeworm Taenia multiceps sheds light on understanding parasitic mechanism and control of coenurosis disease.

Coenurosis, caused by the larval coenurus of the tapeworm Taenia multiceps, is a fatal central nervous system disease in both sheep and humans. Though treatment and prevention options are available, the control of coenurosis still presents great challenges. Here, we present a high-quality genome sequence of T. multiceps in which 240 Mb (96%) of the genome has been successfully assembled using PacBio single-molecule real-time (SMRT) and Hi-C data with an N50 length of 44.8 Mb. In total, 49.5 Mb (20.6%) repeat sequences and 13,013 gene models were identified. We found that Taenia spp. have an expansion of transposable elements and recent small-scale gene duplications following the divergence of Taenia from Echinococcus, but not in Echinococcus genomes, and the genes underlying environmental adaptability and dosage effect tend to be over-retained in the T. multiceps genome. Moreover, we identified several genes encoding proteins involved in proglottid formation and interactions with the host central nervous system, which may contribute to the adaptation of T. multiceps to its parasitic lifestyle. Our study not only provides insights into the biology and evolution of T. multiceps, but also identifies a set of species-specific gene targets for developing novel treatment and control tools for coenurosis.


September 22, 2019

Comparative analyses of CTX prophage region of Vibrio cholerae seventh pandemic wave 1 strains isolated in Asia.

Vibrio cholerae O1 causes cholera, and cholera toxin, the principal mediator of massive diarrhea, is encoded by ctxAB in the cholera toxin (CTX) prophage. In this study, the structures of the CTX prophage region of V. cholerae strains isolated during the seventh pandemic wave 1 in Asian countries were determined and compared. Eighteen strains were categorized into eight groups by CTX prophage region-specific restriction fragment length polymorphism and PCR profiles, and the structure of the region of a representative strain from each group was determined by DNA sequencing. Eight representative strains revealed eight distinct CTX prophage regions with various combinations of CTX-1, RS1 and a novel genomic island on chromosome I. CTX prophage regions carried by the wave 1 strains were diverse in structure. V. cholerae strains with an area-specific CTX prophage region are believed to circulate in South-East Asian countries; additionally, multiple strains with distinct types of CTX prophage region are co-circulating in the area. Analysis of a phylogenetic tree generated by single nucleotide polymorphism differences across 2483 core genes revealed that V. cholerae strains categorized in the same group based on CTX prophage region structure were segregated in closer clusters. CTX prophage region-specific recombination events or gain and loss of genomic elements within the region may have occurred at much higher frequencies and contributed to producing a panel of CTX prophage regions with distinct structures among V. cholerae pathogenic strains in lineages with close genetic backgrounds in the early wave 1 period of the seventh cholera pandemic. © 2018 The Authors. Microbiology and Immunology published by The Societies and John Wiley & Sons Australia, Ltd.


September 22, 2019

Deletions linked to PROG1 gene participate in plant architecture domestication in Asian and African rice.

Improving the yield by modifying plant architecture was a key step during crop domestication. Here, we show that a 110-kb deletion on the short arm of chromosome 7 in Asian cultivated rice (Oryza sativa), which is closely linked to the previously identified PROSTRATE GROWTH 1 (PROG1) gene, harbors a tandem repeat of seven zinc-finger genes. Three of these genes regulate the plant architecture, suggesting that the deletion also promoted the critical transition from the prostrate growth and low yield of wild rice (O. rufipogon) to the erect growth and high yield of Asian cultivated rice. We refer to this locus as RICE PLANT ARCHITECTURE DOMESTICATION (RPAD). Further, a similar but independent 113-kb deletion is detected at the RPAD locus in African cultivated rice. These results indicate that the deletions, eliminating a tandem repeat of zinc-finger genes, may have been involved in the parallel domestication of plant architecture in Asian and African rice.

