Menu
July 7, 2019

HALC: High throughput algorithm for long read error correction.

The third generation PacBio SMRT long reads can effectively address the read length issue of the second generation sequencing technology, but contain approximately 15% sequencing errors. Several error correction algorithms have been designed to efficiently reduce the error rate to 1%, but they discard large amounts of uncorrected bases and thus lead to low throughput. This loss of bases could limit the completeness of downstream assemblies and the accuracy of analysis.Here, we introduce HALC, a high throughput algorithm for long read error correction. HALC aligns the long reads to short read contigs from the same species with a relatively low identity requirement so that a long read region can be aligned to at least one contig region, including its true genome region’s repeats in the contigs sufficiently similar to it (similar repeat based alignment approach). It then constructs a contig graph and, for each long read, references the other long reads’ alignments to find the most accurate alignment and correct it with the aligned contig regions (long read support based validation approach). Even though some long read regions without the true genome regions in the contigs are corrected with their repeats, this approach makes it possible to further refine these long read regions with the initial insufficient short reads and correct the uncorrected regions in between. In our performance tests on E. coli, A. thaliana and Maylandia zebra data sets, HALC was able to obtain 6.7-41.1% higher throughput than the existing algorithms while maintaining comparable accuracy. The HALC corrected long reads can thus result in 11.4-60.7% longer assembled contigs than the existing algorithms.The HALC software can be downloaded for free from this site: https://github.com/lanl001/halc .


July 7, 2019

Determination of nucleopolyhedrovirus’ taxonomic position

To date , over 78 genomes of nucleopolyhedroviruses (NPVs) have been sequenced and deposited in NCBI. How to define a new virus from the infected larvae in the field is usually the first question. Two NPV strains, which were isolated from casuarina moth (L. xylina) and golden birdwing larvae (Troides aeacus), respectively, displayed the same question. Due to the identity of polyhedrin (polh) sequences of these two isolates to that of Lymantria dispar MNPV and Bombyx mori NPV, they are named LdMNPV-like virus and TraeNPV, provisionally. To further clarify the relationships of LdMNPV-like virus and TraeNPV to closely related NPVs, Kimura 2-parameter (K-2-P) analysis was performed. Apparently, the results of K-2-P analysis that showed LdMNPV-like virus is an LdMNPV isolate, while TraeNPV had an ambiguous relationship to BmNPV. Otherwise, MaviNPV, which is a mini-AcMNPV, also exhibited a different story by K-2-P analysis. Since K-2-P analysis could not cover all species determination issues, therefore, TraeNPV needs to be sequenced for defining its taxonomic position. For this purpose, different genomic sequencing technologies and bioinformatic analysis approaches will be discussed. We anticipated that these applications will help to exam nucleotide information of unknown species and give an insight and facilitate to this issue.


July 7, 2019

Novel plasmid-mediated colistin resistance gene mcr-3 in Escherichia coli.

The mobile colistin resistance gene mcr-1 has attracted global attention, as it heralds the breach of polymyxins, one of the last-resort antibiotics for the treatment of severe clinical infections caused by multidrug-resistant Gram-negative bacteria. To date, six slightly different variants of mcr-1, and a second mobile colistin resistance gene, mcr-2, have been reported or annotated in the GenBank database. Here, we characterized a third mobile colistin resistance gene, mcr-3 The gene coexisted with 18 additional resistance determinants in the 261-kb IncHI2-type plasmid pWJ1 from porcine Escherichia colimcr-3 showed 45.0% and 47.0% nucleotide sequence identity to mcr-1 and mcr-2, respectively, while the deduced amino acid sequence of MCR-3 showed 99.8 to 100% and 75.6 to 94.8% identity to phosphoethanolamine transferases found in other Enterobacteriaceae species and in 10 Aeromonas species, respectively. pWJ1 was mobilized to an E. coli recipient by conjugation and contained a plasmid backbone similar to those of other mcr-1-carrying plasmids, such as pHNSHP45-2 from the original mcr-1-harboring E. coli strain. Moreover, a truncated transposon element, TnAs2, which was characterized only in Aeromonas salmonicida, was located upstream of mcr-3 in pWJ1. This ?TnAs2-mcr-3 element was also identified in a shotgun genome sequence of a porcine E. coli isolate from Malaysia, a human Klebsiella pneumoniae isolate from Thailand, and a human Salmonella enterica serovar Typhimurium isolate from the United States. These results suggest the likelihood of a wide dissemination of the novel mobile colistin resistance gene mcr-3 among Enterobacteriaceae and aeromonads; the latter may act as a potential reservoir for mcr-3IMPORTANCE The emergence of the plasmid-mediated colistin resistance gene mcr-1 has attracted substantial attention worldwide. Here, we examined a colistin-resistant Escherichia coli isolate that was negative for both mcr-1 and mcr-2 and discovered a novel mobile colistin resistance gene, mcr-3 The amino acid sequence of MCR-3 aligned closely with phosphoethanolamine transferases from Enterobacteriaceae and Aeromonas species originating from both clinical infections and environmental samples collected in 12 countries on four continents. Due to the ubiquitous profile of aeromonads in the environment and the potential transfer of mcr-3 between Enterobacteriaceae and Aeromonas species, the wide spread of mcr-3 may be largely underestimated. As colistin has been and still is widely used in veterinary medicine and used at increasing frequencies in human medicine, the continuous monitoring of mobile colistin resistance determinants in colistin-resistant Gram-negative bacteria is imperative for understanding and tackling the dissemination of mcr genes in both the agricultural and health care sectors. Copyright © 2017 Yin et al.


July 7, 2019

Complete genome sequence of the cellulose-producing strain Komagataeibacter nataicola RZS01.

Komagataeibacter nataicola is an acetic acid bacterium (AAB) that can produce abundant bacterial cellulose and tolerate high concentrations of acetic acid. To globally understand its fermentation characteristics, we present a high-quality complete genome sequence of K. nataicola RZS01. The genome consists of a 3,485,191-bp chromosome and 6 plasmids, which encode 3,514 proteins and bear three cellulose synthase operons. Phylogenetic analysis at the genome level provides convincing evidence of the evolutionary position of K. nataicola with respect to related taxa. Genomic comparisons with other AAB revealed that RZS01 shares 36.1%~75.1% of sequence similarity with other AAB. The sequence data was also used for metabolic analysis of biotechnological substrates. Analysis of the resistance to acetic acid at the genomic level indicated a synergistic mechanism responsible for acetic acid tolerance. The genomic data provide a viable platform that can be used to understand and manipulate the phenotype of K. nataicola RZS01 to further improve bacterial cellulose production.


July 7, 2019

Non-toxin-producing Bacillus cereus strains belonging to the B. anthracis clade isolated from the International Space Station.

In an ongoing Microbial Observatory investigation of the International Space Station (ISS), 11 Bacillus strains (2 from the Kibo Japanese experimental module, 4 from the U.S. segment, and 5 from the Russian module) were isolated and their whole genomes were sequenced. A comparative analysis of the 16S rRNA gene sequences of these isolates showed the highest similarity (>99%) to the Bacillus anthracis-B. cereus-B. thuringiensis group. The fatty acid composition, polar lipid profile, peptidoglycan type, and matrix-assisted laser desorption ionization-time of flight profiles were consistent with the B. cereus sensu lato group. The phenotypic traits such as motile rods, enterotoxin production, lack of capsule, and resistance to gamma phage/penicillin observed in ISS isolates were not characteristics of B. anthracis. Whole-genome sequence characterizations showed that ISS strains had the plcR non-B. anthracis ancestral “C” allele and lacked anthrax toxin-encoding plasmids pXO1 and pXO2, excluding their identification as B. anthracis. The genetic identities of all 11 ISS isolates characterized via gyrB analyses arbitrarily identified them as members of the B. cereus group, but traditional DNA-DNA hybridization (DDH) showed that the ISS isolates are similar to B. anthracis (88% to 90%) but distant from the B. cereus (42%) and B. thuringiensis (48%) type strains. The DDH results were supported by average nucleotide identity (>98.5%) and digital DDH (>86%) analyses. However, the collective phenotypic traits and genomic evidence were the reasons to exclude the ISS isolates from B. anthracis. Nevertheless, multilocus sequence typing and whole-genome single nucleotide polymorphism analyses placed these isolates in a clade that is distinct from previously described members of the B. cereus sensu lato group but closely related to B. anthracis. IMPORTANCE The International Space Station Microbial Observatory (Microbial Tracking-1) study is generating a microbial census of the space station’s surfaces and atmosphere by using advanced molecular microbial community analysis techniques supported by traditional culture-based methods and modern bioinformatic computational modeling. This approach will lead to long-term, multigenerational studies of microbial population dynamics in a closed environment and address key questions, including whether microgravity influences the evolution and genetic modification of microorganisms. The spore-forming Bacillus cereus sensu lato group consists of pathogenic (B. anthracis), food poisoning (B. cereus), and biotechnologically useful (B. thuringiensis) microorganisms; their presence in a closed system such as the ISS might be a concern for the health of crew members. A detailed characterization of these potential pathogens would lead to the development of suitable countermeasures that are needed for long-term future missions and a better understanding of microorganisms associated with space missions.


July 7, 2019

Coping with living in the soil: the genome of the parthenogenetic springtail Folsomia candida.

Folsomia candida is a model in soil biology, belonging to the family of Isotomidae, subclass Collembola. It reproduces parthenogenetically in the presence of Wolbachia, and exhibits remarkable physiological adaptations to stress. To better understand these features and adaptations to life in the soil, we studied its genome in the context of its parthenogenetic lifestyle.We applied Pacific Bioscience sequencing and assembly to generate a reference genome for F. candida of 221.7 Mbp, comprising only 162 scaffolds. The complete genome of its endosymbiont Wolbachia, was also assembled and turned out to be the largest strain identified so far. Substantial gene family expansions and lineage-specific gene clusters were linked to stress response. A large number of genes (809) were acquired by horizontal gene transfer. A substantial fraction of these genes are involved in lignocellulose degradation. Also, the presence of genes involved in antibiotic biosynthesis was confirmed. Intra-genomic rearrangements of collinear gene clusters were observed, of which 11 were organized as palindromes. The Hox gene cluster of F. candida showed major rearrangements compared to arthropod consensus cluster, resulting in a disorganized cluster.The expansion of stress response gene families suggests that stress defense was important to facilitate colonization of soils. The large number of HGT genes related to lignocellulose degradation could be beneficial to unlock carbohydrate sources in soil, especially those contained in decaying plant and fungal organic matter. Intra- as well as inter-scaffold duplications of gene clusters may be a consequence of its parthenogenetic lifestyle. This high quality genome will be instrumental for evolutionary biologists investigating deep phylogenetic lineages among arthropods and will provide the basis for a more mechanistic understanding in soil ecology and ecotoxicology.


July 7, 2019

Complete genome sequences of two Salmonella enterica subsp. enterica serovar Enteritidis strains isolated from egg products in the United States.

Egg-associated salmonellosis is an important public health problem in many countries. Here, we report the genome sequences, including plasmids, of two strains of Salmonella enterica subsp. enterica serovar Enteritidis isolated from egg products in 2012 and 2013 in the United States. This will provide more information and insight into the research about egg-associated salmonellosis. Copyright © 2017 Hu et al.


July 7, 2019

Genetic and biochemical characterization of HMB-1, a novel subclass B1 metallo-ß-lactamase found in a Pseudomonas aeruginosa clinical isolate.

To characterize a novel subclass B1 metallo-ß-lactamase (MBL) found in an MDR Pseudomonas aeruginosa clinical isolate.The isolate P. aeruginosa NRZ-03096 was recovered in 2012 from an anal swab from a patient hospitalized in Northern Germany and showed high MICs of carbapenems. MBL production was analysed by several phenotypic tests. Genetic characterization of the novel bla gene and MLST was performed by WGS. The novel bla gene was expressed in Escherichia coli TOP10 and the enzyme was subjected to biochemical characterization to determine the kinetic parameters K m and k cat .P. aeruginosa NRZ-03096 was resistant to all tested ß-lactams and showed an MBL phenotype. Shotgun cloning experiments yielded a clone producing a novel subclass B1 enzyme with only 74.3% identity to the next nearest relative, KHM-1. The novel MBL was named HMB-1 (for Hamburg MBL). Analysis of WGS data showed that the bla HMB-1 gene was chromosomally located as part of a Tn 3 family transposon that was named Tn 6345 . Expression of bla HMB-1 in E. coli TOP10 led to increased resistance to ß-lactams. Determination of K m and k cat revealed that HMB-1 had different hydrolytic characteristics compared with KHM-1, with lower hydrolytic rates for cephalosporins and a higher rate for imipenem.The identification of HMB-1 further underlines the ongoing spread and diversification of carbapenemases in Gram-negative human pathogens and especially in P. aeruginosa .


July 7, 2019

Characterization and genome comparisons of three Achromobacter phages of the family Siphoviridae.

In this study, we present the characterization and genomic data of three Achromobacter phages belonging to the family Siphoviridae. Phages 83-24, JWX and JWF were isolated from sewage samples in Paris and Braunschweig, respectively, and infect Achromobacter xylosoxidans, an emerging nosocomial pathogen in cystic fibrosis patients. Analysis of morphology and growth parameters revealed that phages 83-24 and JWX have similar properties, both have nearly the same head and tail measurements, and both have a burst size between 85 and 100 pfu/cell. In regard to morphological properties, JWF had a much longer and more flexible tail compared to other phages. The linear double-stranded DNAs of all three phages are terminally redundant and not circularly permutated. The complete nucleotide sequences consist of 81,541 bp for JWF, 49,714 bp for JWX and 48,216 bp for 83-24. Analysis of the genome sequences showed again that phages JWX and 83-24 are quite similar. Comparison to the GenBank database via BLASTN revealed partial similarities to Roseobacter phage RDJL phi1 and Burkholderia phage BcepGomr. In contrast, BLASTN analysis of the genome sequence of phage JWF revealed only few similarities to non-annotated prophage regions in different strains of Burkholderia and Mesorhizobium.


July 7, 2019

Butterfly genomics: insights from the genome of Melitaea cinxia

The first lepidopteran genome (Bombyx mori) was published in 2004. Ten years later the genome of Melitaea cinxia came out as the third butterfly genome published, and the first eukaryotic genome sequenced in Finland. Owing to Ilkka Hanski, the M. cinxia system in the Åland Islands has become a famous model for metapopulation biology. More than 20 years of research on this system provides a strong ecological basis upon which a genetic framework could be built. Genetic knowledge is an essential addition for understanding eco-evolutionary dynamics and the genetic basis of variability in life history traits. Here we review the process of the M. cinxia genome project, its implications for lepidopteran genome evolution, and describe how the genome has been used for gene expression studies to identify genetic consequences of habitat fragmentation. Finally, we introduce some future possibilities and challenges for genomic research in M. cinxia and other Lepidoptera.


July 7, 2019

Genome sequence of Oxalobacter formigenes strain HC-1.

The lack of Oxalobacter formigenes colonization of the human gut has been correlated with the formation of calcium oxalate kidney stones and also with the number of recurrent kidney stone episodes. Here, we present the genome sequence of HC-1, a human strain isolated from an individual residing in Iowa, USA. Copyright © 2017 Hatch et al.


July 7, 2019

2015 epidemic of severe Streptococcus agalactiae sequence type 283 infections in Singapore associated with the consumption of raw freshwater fish: a detailed analysis of clinical, epidemiological, and bacterial sequencing data.

Streptococcus agalactiae (group B Streptococcus [GBS]) has not been described as a foodborne pathogen. However, in 2015, a large outbreak of severe invasive sequence type (ST) 283 GBS infections in adults epidemiologically linked to the consumption of raw freshwater fish occurred in Singapore. We attempted to determine the scale of the outbreak, define the clinical spectrum of disease, and link the outbreak to contaminated fish.Time-series analysis was performed on microbiology laboratory data. Food handlers and fishmongers were screened for enteric carriage of GBS. A retrospective cohort study was conducted to assess differences in demographic and clinical characteristics of patients with invasive ST283 and non-ST283 infections. Whole-genome sequencing was performed on human and fish ST283 isolates from Singapore, Thailand, and Hong Kong.The outbreak was estimated to have started in late January 2015. Within the study cohort of 408 patients, ST283 accounted for 35.8% of cases. Patients with ST283 infection were younger and had fewer comorbidities but were more likely to develop meningoencephalitis, septic arthritis, and spinal infection. Of 82 food handlers and fishmongers screened, none carried ST283. Culture of 43 fish samples yielded 13 ST283-positive samples. Phylogenomic analysis of 161 ST283 isolates from humans and fish revealed they formed a tight clade distinguished by 93 single-nucleotide polymorphisms.ST283 is a zoonotic GBS clone associated with farmed freshwater fish, capable of causing severe disease in humans. It caused a large foodborne outbreak in Singapore and poses both a regional and potentially more widespread threat.


July 7, 2019

Heterogeneous resistance to quizartinib in acute myeloid leukemia revealed by single-cell analysis.

Genomic studies have revealed significant branching heterogeneity in cancer. Studies of resistance to tyrosine kinase inhibitor therapy have not fully reflected this heterogeneity because resistance in individual patients has been ascribed to largely mutually exclusive on-target or off-target mechanisms in which tumors either retain dependency on the target oncogene or subvert it through a parallel pathway. Using targeted sequencing from single cells and colonies from patient samples, we demonstrate tremendous clonal diversity in the majority of acute myeloid leukemia (AML) patients with activating FLT3 internal tandem duplication mutations at the time of acquired resistance to the FLT3 inhibitor quizartinib. These findings establish that clinical resistance to quizartinib is highly complex and reflects the underlying clonal heterogeneity of AML.© 2017 by The American Society of Hematology.


July 7, 2019

Complete genome sequence of Microbulbifer sp. CCB-MM1, a halophile isolated from Matang Mangrove Forest, Malaysia.

Microbulbifer sp. CCB-MM1 is a halophile isolated from estuarine sediment of Matang Mangrove Forest, Malaysia. Based on 16S rRNA gene sequence analysis, strain CCB-MM1 is a potentially new species of genus Microbulbifer. Here we describe its features and present its complete genome sequence with annotation. The genome sequence is 3.86 Mb in size with GC content of 58.85%, harbouring 3313 protein coding genes and 92 RNA genes. A total of 71 genes associated with carbohydrate active enzymes were found using dbCAN. Ectoine biosynthetic genes, ectABC operon and ask_ect were detected using antiSMASH 3.0. Cell shape determination genes, mreBCD operon, rodA and rodZ were annotated, congruent with the rod-coccus cell cycle of the strain CCB-MM1. In addition, putative mreBCD operon regulatory gene, bolA was detected, which might be associated with the regulation of rod-coccus cell cycle observed from the strain.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.