Menu
July 7, 2019

Methylomic and phenotypic analysis of the ModH5 phasevarion of Helicobacter pylori.

The Helicobacter pylori phase variable gene modH, typified by gene HP1522 in strain 26695, encodes a N6-adenosine type III DNA methyltransferase. Our previous studies identified multiple strain-specific modH variants (modH1 – modH19) and showed that phase variation of modH5 in H. pylori P12 influenced expression of motility-associated genes and outer membrane protein gene hopG. However, the ModH5 DNA recognition motif and the mechanism by which ModH5 controls gene expression were unknown. Here, using comparative single molecule real-time sequencing, we identify the DNA site methylated by ModH5 as 5′-Gm6ACC-3′. This motif is vastly underrepresented in H. pylori genomes, but overrepresented in a number of virulence genes, including motility-associated genes, and outer membrane protein genes. Motility and the number of flagella of H. pylori P12 wild-type were significantly higher than that of isogenic modH5 OFF or ?modH5 mutants, indicating that phase variable switching of modH5 expression plays a role in regulating H. pylori motility phenotypes. Using the flagellin A (flaA) gene as a model, we show that ModH5 modulates flaA promoter activity in a GACC methylation-dependent manner. These findings provide novel insights into the role of ModH5 in gene regulation and how it mediates epigenetic regulation of H. pylori motility.


July 7, 2019

Contributions of Zea mays subspecies mexicana haplotypes to modern maize.

Maize was domesticated from lowland teosinte (Zea mays ssp. parviglumis), but the contribution of highland teosinte (Zea mays ssp. mexicana, hereafter mexicana) to modern maize is not clear. Here, two genomes for Mo17 (a modern maize inbred) and mexicana are assembled using a meta-assembly strategy after sequencing of 10 lines derived from a maize-teosinte cross. Comparative analyses reveal a high level of diversity between Mo17, B73, and mexicana, including three Mb-size structural rearrangements. The maize spontaneous mutation rate is estimated to be 2.17?×?10-8 ~3.87?×?10-8 per site per generation with a nonrandom distribution across the genome. A higher deleterious mutation rate is observed in the pericentromeric regions, and might be caused by differences in recombination frequency. Over 10% of the maize genome shows evidence of introgression from the mexicana genome, suggesting that mexicana contributed to maize adaptation and improvement. Our data offer a rich resource for constructing the pan-genome of Zea mays and genetic improvement of modern maize varieties.


July 7, 2019

Integrated genomic and proteomic analyses of high-level chloramphenicol resistance in Campylobacter jejuni.

Campylobacter jejuni is a major zoonotic pathogen, and its resistance to antibiotics is of great concern for public health. However, few studies have investigated the global changes of the entire organism with respect to antibiotic resistance. Here, we provide mechanistic insights into high-level resistance to chloramphenicol in C. jejuni, using integrated genomic and proteomic analyses. We identified 27 single nucleotide polymorphisms (SNPs) as well as an efflux pump cmeB mutation that conferred modest resistance. We determined two radical S-adenosylmethionine (SAM) enzymes, one each from an SNP gene and a differentially expressed protein. Validation of major metabolic pathways demonstrated alterations in oxidative phosphorylation and ABC transporters, suggesting energy accumulation and increase in methionine import. Collectively, our data revealed a novel rRNA methylation mechanism by a radical SAM superfamily enzyme, indicating that two resistance mechanisms existed in Campylobacter. This work provided a systems biology perspective on understanding the antibiotic resistance mechanisms in bacteria.


July 7, 2019

Identification of sRNA mediated responses to nutrient depletion in Burkholderia pseudomallei.

The Burkholderia genus includes many species that are known to survive in diverse environmental conditions including low nutrient environments. One species, Burkholderia pseudomallei is a versatile pathogen that can survive in a wide range of hosts and environmental conditions. In this study, we investigated how a nutrient depleted growth environment evokes sRNA mediated responses by B. pseudomallei. Computationally predicted B. pseudomallei D286 sRNAs were mapped to RNA-sequencing data for cultures grown under two conditions: (1) BHIB as a nutrient rich media reference environment and (2) M9 media as a nutrient depleted stress environment. The sRNAs were further selected to identify potentially cis-encoded systems by investigating their possible interactions with their flanking genes. The mappings of predicted sRNA genes and interactions analysis to their flanking genes identified 12 sRNA candidates that may possibly have cis-acting regulatory roles that are associated to a nutrient depleted growth environment. Our approach can be used for identifying novel sRNA genes and their possible role as cis-mediated regulatory systems.


July 7, 2019

Assembly of an early-matured japonica (Geng) rice genome, Suijing18, based on PacBio and Illumina sequencing.

The early-matured japonica (Geng) rice variety, Suijing18 (SJ18), carries multiple elite traits including durable blast resistance, good grain quality, and high yield. Using PacBio SMRT technology, we produced over 25?Gb of long-read sequencing raw data from SJ18 with a coverage of 62×. Using Illumina paired-end whole-genome shotgun sequencing technology, we generated 59?Gb of short-read sequencing data from SJ18 (23.6?Gb from a 200?bp library with a coverage of 59× and 35.4?Gb from an 800?bp library with a coverage of 88×). With these data, we assembled a single SJ18 genome and then generated a set of annotation data. These data sets can be used to test new programs for variation deep mining, and will provide new insights into the genome structure, function, and evolution of SJ18, and will provide essential support for biological research in general.


July 7, 2019

Complete genome sequencing and genomic characterization of two Escherichia coli strains co-producing MCR-1 and NDM-1 from bloodstream infection.

We previously described the discovery of two Escherichia coli isolates (EC1002 and EC2474) co-harbouring mcr-1 and bla NDM-1 genes, which were recovered from bloodstream infection in China. More importantly, these antibiotic resistance genes were located on different plasmids and signaling the potential spread of pandrug-resistant bacteria. Here, the complete genome sequences of both isolates were determined using Pacbio RS II and Illumina HiSeq2000 systems. The genome of EC1002 consists of a 5,177,501 base pair chromosome and four circular plasmids, while the genome of EC2474 consists of a 5,013,813 base pair chromosome and three plasmids. The plasmid replicon type of pEC1002_NDM and pEC2474_NDM were identified as IncA/C2 and IncF, respectively. The genetic environment of bla NDM-1 in this study was similar to bla NDM-carrying plasmids detected in China, although the overall nucleotide identity and query coverage were variable. The plasmid replicon type of pEC1002_MCR and pEC2474_MCR were identified as IncI2 and IncHI2, respectively. Two different genetic strategies for mcr-1 gene spread were observed in this study and bla NDM-1 genes were also found transferred by two different mobile genetic elements in two plasmids. The findings of this study further support that the diversified transfer mechanisms of bla NDM-1 and mcr-1 present in Enterobacteriaceae.


July 7, 2019

Complete genome sequences of Clostridium perfringens Del1 strain isolated from chickens affected by necrotic enteritis.

Clostridium perfringens is ubiquitous in nature. It is a normal inhabitant in the intestinal tract of animals and humans. As the primary etiological agent of gas gangrene, necrosis and bacteremia, C. perfringens causes food poisoning, necrotic enteritis (NE), and even death. Epidemiology research has indicated that the increasing incidence of NE in poultry is associated with the withdrawal of in-feed antibiotic growth promoters in poultry production in response to government regulations. The recent omics studies have indicated that bacterial virulence is typically linked to highly efficient conjugative transfer of toxins, or plasmids carrying antibiotic-resistance traits. Currently, there is limited information on understanding of host-pathogen interaction in NE caused by virulent strains of C. perfringens. Elucidating such pathogenesis has practical impacts on fighting infectious diseases through adopting strategies of prophylactic or therapeutic interventions. In this report, we sequenced and analyzed the genome of C. perfringens Del1 strain using the hybrid of PacBio and Illumina sequencing technologies.Sequence analysis indicated that Del1 strain comprised a single circular chromosome with a complete 3,559,163 bp and 4 plasmids: pDel1_1 (82,596 bp), pDel1_2 (69,827 bp), pDel1_3 (49,582 bp), and pDel1_4 (49,728 bp). The genome had 3361 predicted coding DNA sequences, harbored numerous genes for pathogenesis and virulence factors, including 6 for antibiotic and antimicrobial resistance, and 3 phage-encoded genes. Phylogenetic analysis revealed that Del1 strain had similar genome and plasmid sequences to the CP4 strain.Complete chromosomal and plasmid sequences of Del1 strain are presented in this report. Since Del1 was isolated from a field disease outbreak, this strain is a good source to identify virulent genes that cause many damaging effects of Clostridial infections in chicken gut. Genome sequencing of the chicken pathogenic isolates from commercial farms provides valuable insights into the molecular pathogenesis of C. perfringens as a gastrointestinal pathogen in food animals. The detailed information on gene sequencing of this important field strain will benefit the development of novel vaccines specific for C. perfringens-induced NE in chickens.


July 7, 2019

RNA-seq and Tn-seq reveal fitness determinants of vancomycin-resistant Enterococcus faecium during growth in human serum.

The Gram-positive bacterium Enterococcus faecium is a commensal of the human gastrointestinal tract and a frequent cause of bloodstream infections in hospitalized patients. The mechanisms by which E. faecium can survive and grow in blood during an infection have not yet been characterized. Here, we identify genes that contribute to growth of E. faecium in human serum through transcriptome profiling (RNA-seq) and a high-throughput transposon mutant library sequencing approach (Tn-seq).We first sequenced the genome of E. faecium E745, a vancomycin-resistant clinical isolate, using a combination of short- and long read sequencing, revealing a 2,765,010 nt chromosome and 6 plasmids, with sizes ranging between 9.3 kbp and 223.7 kbp. We then compared the transcriptome of E. faecium E745 during exponential growth in rich medium and in human serum by RNA-seq. This analysis revealed that 27.8% of genes on the E. faecium E745 genome were differentially expressed in these two conditions. A gene cluster with a role in purine biosynthesis was among the most upregulated genes in E. faecium E745 upon growth in serum. The E. faecium E745 transposon mutant library was then used to identify genes that were specifically required for growth of E. faecium in serum. Genes involved in de novo nucleotide biosynthesis (including pyrK_2, pyrF, purD, purH) and a gene encoding a phosphotransferase system subunit (manY_2) were thus identified to be contributing to E. faecium growth in human serum. Transposon mutants in pyrK_2, pyrF, purD, purH and manY_2 were isolated from the library and their impaired growth in human serum was confirmed. In addition, the pyrK_2 and manY_2 mutants were tested for their virulence in an intravenous zebrafish infection model and exhibited significantly attenuated virulence compared to E. faecium E745.Genes involved in carbohydrate metabolism and nucleotide biosynthesis of E. faecium are essential for growth in human serum and contribute to the pathogenesis of this organism. These genes may serve as targets for the development of novel anti-infectives for the treatment of E. faecium bloodstream infections.


July 7, 2019

The complete genome sequence of Ensifer meliloti strain CCMM B554 (FSM-MA), a highly effective nitrogen-fixing microsymbiont of Medicago truncatula Gaertn.

Strain CCMM B554, also known as FSM-MA, is a soil dwelling and nodule forming, nitrogen-fixing bacterium isolated from the nodules of the legume Medicago arborea L. in the Maamora Forest, Morocco. The strain forms effective nitrogen fixing nodules on species of the Medicago, Melilotus and Trigonella genera and is exceptional because it is a highly effective symbiotic partner of the two most widely used accessions, A17 and R108, of the model legume Medicago truncatula Gaertn. Based on 16S rRNA gene sequence, multilocus sequence and average nucleotide identity analyses, FSM-MA is identified as a new Ensifer meliloti strain. The genome is 6,70 Mbp and is comprised of the chromosome (3,64 Mbp) harboring 3574 predicted genes and two megaplasmids, pSymA (1,42 Mbp) and pSymB (1,64 Mbp) with respectively 1481 and 1595 predicted genes. The average GC content of the genome is 61.93%. The FSM-MA genome structure is highly similar and co-linear to other E. meliloti strains in the chromosome and the pSymB megaplasmid while, in contrast, it shows high variability in the pSymA plasmid. The large number of strain-specific sequences in pSymA as well as strain-specific genes on pSymB involved in the biosynthesis of the lipopolysaccharide and capsular polysaccharide surface polysaccharides may encode novel symbiotic functions explaining the high symbiotic performance of FSM-MA.


July 7, 2019

Draft genome of Paraburkholderia caballeronis TNe-841T, a free-living, nitrogen-fixing, tomato plant-associated bacterium.

10.1601/nm.26956 caballeronis is a plant-associated bacterium. Strain TNe-841T was isolated from the rhizosphere of tomato (Solanum lycopersicum L. var. lycopersicum) growing in Nepantla Mexico State. Initially this bacterium was found to effectively nodulate Phaseolus vulgaris L. However, from an analysis of the genome of strain TNe-841T and from repeat inoculation experiments, we found that this strain did not nodulate bean and also lacked nodulation genes, suggesting that the genes were lost. The genome consists of 7,115,141 bp with a G?+?C content of 67.01%. The sequence includes 6251 protein-coding genes and 87 RNA genes.


July 7, 2019

Streptococcal toxic shock syndrome caused by the dissemination of an invasive emm3/ST15 strain of Streptococcus pyogenes.

Streptococcus pyogenes (group A Streptococcus [GAS]) is a major human pathogen that causes a wide spectrum of clinical manifestations. Although invasive GAS (iGAS) infections are relatively uncommon, emm3/ST15 GAS is a highly virulent, invasive, and pathogenic strain. Global molecular epidemiology analysis has suggested that the frequency of emm3 GAS has been recently increasing.A 14-year-old patient was diagnosed with streptococcal toxic shock syndrome and severe pneumonia, impaired renal function, and rhabdomyolysis. GAS was isolated from a culture of endotracheal aspirates and designated as KS030. Comparative genome analysis suggested that KS030 is classified as emm3 (emm-type) and ST15 (multilocus sequencing typing [MLST]), which is similar to iGAS isolates identified in the UK (2013) and Switzerland (2015).We conclude that the global dissemination of emm3/ST15 GAS strain has the potential to cause invasive disease.


July 7, 2019

HISEA: HIerarchical SEed Aligner for PacBio data.

The next generation sequencing (NGS) techniques have been around for over a decade. Many of their fundamental applications rely on the ability to compute good genome assemblies. As the technology evolves, the assembly algorithms and tools have to continuously adjust and improve. The currently dominant technology of Illumina produces reads that are too short to bridge many repeats, setting limits on what can be successfully assembled. The emerging SMRT (Single Molecule, Real-Time) sequencing technique from Pacific Biosciences produces uniform coverage and long reads of length up to sixty thousand base pairs, enabling significantly better genome assemblies. However, SMRT reads are much more expensive and have a much higher error rate than Illumina’s – around 10-15% – mostly due to indels. New algorithms are very much needed to take advantage of the long reads while mitigating the effect of high error rate and lowering the required coverage.An essential step in assembling SMRT data is the detection of alignments, or overlaps, between reads. High error rate and very long reads make this a much more challenging problem than for Illumina data. We present a new pairwise read aligner, or overlapper, HISEA (Hierarchical SEed Aligner) for SMRT sequencing data. HISEA uses a novel two-step k-mer search, employing consistent clustering, k-mer filtering, and read alignment extension.We compare HISEA against several state-of-the-art programs – BLASR, DALIGNER, GraphMap, MHAP, and Minimap – on real datasets from five organisms. We compare their sensitivity, precision, specificity, F1-score, as well as time and memory usage. We also introduce a new, more precise, evaluation method. Finally, we compare the two leading programs, MHAP and HISEA, for their genome assembly performance in the Canu pipeline.Our algorithm has the best alignment detection sensitivity among all programs for SMRT data, significantly higher than the current best. The currently best assembler for SMRT data is the Canu program which uses the MHAP aligner in its pipeline. We have incorporated our new HISEA aligner in the Canu pipeline and benchmarked it against the best pipeline for multiple datasets at two relevant coverage levels: 30x and 50x. Our assemblies are better than those using MHAP for both coverage levels. Moreover, Canu+HISEA assemblies for 30x coverage are comparable with Canu+MHAP assemblies for 50x coverage, while being faster and cheaper.The HISEA algorithm produces alignments with highest sensitivity compared with the current state-of-the-art algorithms. Integrated in the Canu pipeline, currently the best for assembling PacBio data, it produces better assemblies than Canu+MHAP.


July 7, 2019

Complete genome sequence of esterase-producing bacterium Croceicoccus marinus E4A9T.

Croceicoccus marinus E4A9Twas isolated from deep-sea sediment collected from the East Pacific polymetallic nodule area. The strain is able to produce esterase, which is widely used in the food, perfume, cosmetic, chemical, agricultural and pharmaceutical industries. Here we describe the characteristics of strain E4A9, including the genome sequence and annotation, presence of esterases, and metabolic pathways of the organism. The genome of strain E4A9T comprises 4,109,188 bp, with one chromosome (3,001,363 bp) and two large circular plasmids (761,621 bp and 346,204 bp, respectively). Complete genome contains 3653 coding sequences, 48 tRNAs, two operons of 16S-23S-5S rRNA gene and three ncRNAs. Strain E4A9T encodes 10 genes related to esterase, and three of the esterases (E3, E6 and E10) was successfully cloned and expressed in Escherichia coli Rosetta in a soluble form, revealing its potential application in biotechnological industry. Moreover, the genome provides clues of metabolic pathways of strain E4A9T, reflecting its adaptations to the ambient environment. The genome sequence of C. marinus E4A9T now provides the fundamental information for future studies.


July 7, 2019

A novel Tn1696-like composite transposon (Tn6404) harboring bla IMP-4 in a Klebsiella pneumoniae isolate carrying a rare ESBL gene bla SFO-1.

Genetic determinants of a clinical Klebsiella pneumoniae isolate (KP1814) coproducing IMP-4 and a rare ESBL gene SFO-1 was investigated. KP1814 belongs to a novel sequence type (ST) assigned to ST2270. WGS identified four circular DNA sequences in KP1814, including two multidrug-resistance (MDR) plasmids, one virulence plasmid, and one circular form. The MDR plasmid pKP1814-1 (299.9 Kb) is untypeable, and carries two large mosaic multiresistance regions (MRRs). bla SFO-1 and bla IMP-4 co-exists on MRR1, and bla SFO-1 is associated with an IS/Tn-independent genetic context. bla IMP-4 is carried by a novel In804-like integron (intlI-bla IMP-4-Kl.pn.I3-qacG2-aacA4-catB3?) associated with a novel Tn1696-like transposon (designed Tn6404) flanked by IS5075. The other MDR plasmid pKP1814-3 is a 95,701-bp IncFII plasmid, and is a hybrid of a Shigella flexneri plasmid pSF07201 and an E. coli plasmid pCA08. All resistance genes of pKP1814-3 were detected in a ~16-kb IS26-flanked composite transposon carried by a Tn5396 transposon. The circular form (18.3 Kb) was composed of two parts belonging to pKP1814-1 and pKP1814-3, respectively. The plasmid pKP1814-2, carrying multiple virulence factors, encodes IncFIBK and IncFIIK replicons with a size of 187,349?bp. The coexistence of MDR and virulence plasmids largely enhances the bacterial fitness in the host and environment.


July 7, 2019

Resequencing of the Leishmania infantum (strain JPCM5) genome and de novo assembly into 36 contigs.

Leishmania parasites are the causative of leishmaniasis, a group of potentially fatal human diseases. Control strategies for leishmaniasis can be enhanced by genome based investigations. The publication in 2005 of the Leishmania major genome sequence, and two years later the genomes for the species Leishmania braziliensis and Leishmania infantum were major milestones. Since then, the L. infantum genome, although highly fragmented and incomplete, has been used widely as the reference genome to address whole transcriptomics and proteomics studies. Here, we report the sequencing of the L. infantum genome by two NGS methodologies and, as a result, the complete genome assembly on 36 contigs (chromosomes). Regarding the present L. infantum genome-draft, 495 new genes have been annotated, a hundred have been corrected and 75 previous annotated genes have been discontinued. These changes are not only the result of an increase in the genome size, but a significant contribution derives from the existence of a large number of incorrectly assembled regions in current chromosomal scaffolds. Furthermore, an improved assembly of tandemly repeated genes has been obtained. All these analyses support that the de novo assembled L. infantum genome represents a robust assembly and should replace the currently available in the databases.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.