Short read massive parallel sequencing has emerged as a standard diagnostic tool in the medical setting. However, short read technologies have inherent limitations such as GC bias, difficulties mapping to repetitive elements, trouble discriminating paralogous sequences, and difficulties in phasing alleles. Long read single molecule sequencers resolve these obstacles. Moreover, they offer higher consensus accuracies and can detect epigenetic modifications from native DNA. The first commercially available long read single molecule platform was the RS system based on PacBio’s single molecule real-time (SMRT) sequencing technology, which has since evolved into their RSII and Sequel systems. Here we capsulize how SMRT…
Pig-tailed macaques (Macaca nemestrina, Mane) are important models for human immunodeficiency virus (HIV) studies. Their infectability with minimally modified HIV makes them a uniquely valuable animal model to mimic human infection with HIV and progression to acquired immunodeficiency syndrome (AIDS). However, variation in the pig-tailed macaque major histocompatibility complex (MHC) and the impact of individual transcripts on the pathogenesis of HIV and other infectious diseases is understudied compared to that of rhesus and cynomolgus macaques. In this study, we used Pacific Biosciences single-molecule real-time circular consensus sequencing to describe full-length MHC class I (MHC-I) transcripts for 194 pig-tailed macaques from…
Mutations in the beta-subunit of bacterial RNA polymerase (RpoB) cause resistance to rifampin (Rifr), a critical antibiotic for treatment of multidrug-resistant Staphylococcus aureus. In vitro studies have shown that RpoB mutations confer decreased susceptibility to other antibiotics, but the clinical relevance is unknown. Here, by analyzing 7,099 S. aureus genomes, we demonstrate that the most prevalent RpoB mutations promote clinically relevant phenotypic plasticity resulting in the emergence of stable S. aureus lineages, associated with increased risk of therapeutic failure through generation of small-colony variants (SCVs) and coresistance to last-line antimicrobial agents. We found eight RpoB mutations that accounted for 93% (469/505) of the total number of Rifr mutations. The most…
Clostridium perfringens causes a range of diseases in animals and humans including necrotic enteritis in chickens and food poisoning and gas gangrene in humans. Necrotic enteritis is of concern in commercial chicken production due to the cost of the implementation of infection control measures and to productivity losses. This study has focused on the genomic analysis of a range of chicken-derived C. perfringens isolates, from around the world and from different years. The genomes were sequenced and compared with 20 genomes available from public databases, which were from a diverse collection of isolates from chickens, other animals, and humans. We…
Vibrio parahaemolyticus can cause gastrointestinal illness through consumption of seafood. Despite frequent food-borne outbreaks of V. parahaemolyticus, only 19 strains have been subjected to complete whole-genome analysis. In this study, a novel strain of V. parahaemolyticus, designated FORC_022 (Food-borne pathogen Omics Research Center_022), was isolated from soy sauce marinated crabs, and its genome and transcriptome were analyzed to elucidate the pathogenic mechanisms. FORC_022 did not include major virulence factors of thermostable direct hemolysin (tdh) and TDH-related hemolysin (trh). However, FORC_022 showed high cytotoxicity and had several V. parahaemolyticus islands (VPaIs) and other virulence factors, such as various secretion systems (types I,…
Resistance to last-line polymyxins mediated by the plasmid-borne mobile colistin resistance gene (mcr-1) represents a new threat to global human health. Here we present the complete genome sequence of an mcr-1-positive multidrug-resistant Escherichia coli strain (MS8345). We show that MS8345 belongs to serotype O2:K1:H4, has a large 241,164-bp IncHI2 plasmid that carries 15 other antibiotic resistance genes (including the extended-spectrum β-lactamase blaCTX-M-1) and 3 putative multidrug efflux systems, and contains 14 chromosomally encoded antibiotic resistance genes. MS8345 also carries a large ColV-like virulence plasmid that has been associated with E. coli bacteremia. Whole-genome phylogeny revealed that MS8345 clusters within a…
Non-typeable Haemophilus influenzae contains an N(6)-adenine DNA-methyltransferase (ModA) that is subject to phase-variable expression (random ON/OFF switching). Five modA alleles, modA2, modA4, modA5, modA9 and modA10, account for over two-thirds of clinical otitis media isolates surveyed. Here, we use single molecule, real-time (SMRT) methylome analysis to identify the DNA-recognition motifs for all five of these modA alleles. Phase variation of these alleles regulates multiple proteins including vaccine candidates, and key virulence phenotypes such as antibiotic resistance (modA2, modA5, modA10), biofilm formation (modA2) and immunoevasion (modA4). Analyses of a modA2 strain in the chinchilla model of otitis media show a clear…
Carbapenem resistant Enterobacteriaceae (CRE) pose an urgent risk to global human health. CRE that are non-susceptible to all commercially available antibiotics threaten to return us to the pre-antibiotic era. Using Single Molecule Real Time (SMRT) sequencing we determined the complete genome of a pandrug-resistant Klebsiella pneumoniae isolate, representing the first complete genome sequence of CRE resistant to all commercially available antibiotics. The precise locations of acquired antibiotic resistance elements, including mobile elements carrying genes for the OXA-181 carbapenemase, were defined. Intriguingly, we identified three chromosomal copies of an ISEcp1-blaOXA-181 mobile element, one of which has disrupted the mgrB regulatory gene,…
Amplification of DNA is required as a mandatory step during library preparation in most targeted sequencing protocols. This can be a critical limitation when targeting regions that are highly repetitive or with extreme guanine-cytosine (GC) content, including repeat expansions associated with human disease. Here, we used an amplification-free protocol for targeted enrichment utilizing the CRISPR/Cas9 system (No-Amp Targeted sequencing) in combination with single molecule, real-time (SMRT) sequencing for studying repeat elements in the huntingtin (HTT) gene, where an expanded CAG repeat is causative for Huntington disease. We also developed a robust data analysis pipeline for repeat element analysis that is…
Investigating how epigenetic information is transmitted through the mammalian germline is the key to understanding how this information impacts on health and disease susceptibility in offspring. EED is essential for regulating the repressive histone modification, histone 3 lysine 27 tri-methylation (H3K27me3) at many developmental genes. In this study, we used oocyte-specific Zp3-Cre recombinase (Zp3Cre) to delete Eed specifically in mouse growing oocytes, permitting the study of EED function in oocytes and the impact of depleting EED in oocytes on outcomes in offspring. As EED deletion occurred only in growing oocytes and females were mated to normal wild type males, this model…
Staphylococcus capitis is an opportunistic pathogen of the coagulase negative staphylococci (CoNS). Functional genomic studies of S. capitis have thus far been limited by a lack of available complete genome sequences. Here, we determined the closed S. capitis genome and methylome using Single Molecule Real Time (SMRT) sequencing. The strain, AYP1020, harbors a single circular chromosome of 2.44 Mb encoding 2304 predicted proteins, which is the smallest of all complete staphylococcal genomes sequenced to date. AYP1020 harbors two large mobile genetic elements: a plasmid designated pAYP1020 (59.6 Kb) and a prophage, ΦAYP1020 (48.5 Kb). Methylome analysis identified significant adenine methylation…
NDM-producing Klebsiella pneumoniae strains represent major clinical and infection control challenges, particularly in resource-limited settings with high rates of antimicrobial resistance. Determining whether transmission occurs at a gene, plasmid, or bacterial strain level and within hospital and/or the community has implications for monitoring and controlling spread. Whole-genome sequencing (WGS) is the highest-resolution typing method available for transmission epidemiology. We sequenced carbapenem-resistant K. pneumoniae isolates from 26 individuals involved in several infection case clusters in a Nepali neonatal unit and 68 other clinical Gram-negative isolates from a similar time frame, using Illumina and PacBio technologies. Within-outbreak chromosomal and closed-plasmid structures were…
The increasing prevalence of polymyxin-resistant bacteria has stimulated the search for improved polymyxin lipopeptides. Here we describe the sequence and product profile for polymyxin D nonribosomal peptide synthetase from Paenibacillus polymyxa ATCC 10401. The polymyxin D synthase gene cluster comprised five genes that encoded ABC transporters (pmxC and pmxD) and enzymes responsible for the biosynthesis of polymyxin D (pmxA, pmxB, and pmxE). Unlike polymyxins B and E, polymyxin D contains d-Ser at position 3 as opposed to l-α,γ-diaminobutyric acid and has an l-Thr at position 7 rather than l-Leu. Module 3 of pmxE harbored an auxiliary epimerization domain that catalyzes…
The evolution of land flora transformed the terrestrial environment. Land plants evolved from an ancestral charophycean alga from which they inherited developmental, biochemical, and cell biological attributes. Additional biochemical and physiological adaptations to land, and a life cycle with an alternation between multicellular haploid and diploid generations that facilitated efficient dispersal of desiccation tolerant spores, evolved in the ancestral land plant. We analyzed the genome of the liverwort Marchantia polymorpha, a member of a basal land plant lineage. Relative to charophycean algae, land plant genomes are characterized by genes encoding novel biochemical pathways, new phytohormone signaling pathways (notably auxin), expanded…
The Helicobacter pylori phase variable gene modH, typified by gene HP1522 in strain 26695, encodes an N6-adenosine type III DNA methyltransferase. Our previous studies identified multiple strain-specific modH variants (modH1 – modH19) and showed that phase variation of modH5 in H. pylori P12 influenced expression of motility-associated genes and outer membrane protein gene hopG. However, the ModH5 DNA recognition motif and the mechanism by which ModH5 controls gene expression were unknown. Here, using comparative single molecule real-time sequencing, we identify the DNA site methylated by ModH5 as 5′-Gm6ACC-3′. This motif is vastly underrepresented in H. pylori genomes, but overrepresented in…