Menu
September 22, 2019

Novel full-length major histocompatibility complex class I allele discovery and haplotype definition in pig-tailed macaques.

Pig-tailed macaques (Macaca nemestrina, Mane) are important models for human immunodeficiency virus (HIV) studies. Their infectability with minimally modified HIV makes them a uniquely valuable animal model to mimic human infection with HIV and progression to acquired immunodeficiency syndrome (AIDS). However, variation in the pig-tailed macaque major histocompatibility complex (MHC) and the impact of individual transcripts on the pathogenesis of HIV and other infectious diseases is understudied compared to that of rhesus and cynomolgus macaques. In this study, we used Pacific Biosciences single-molecule real-time circular consensus sequencing to describe full-length MHC class I (MHC-I) transcripts for 194 pig-tailed macaques from three breeding centers. We then used the full-length sequences to infer Mane-A and Mane-B haplotypes containing groups of MHC-I transcripts that co-segregate due to physical linkage. In total, we characterized full-length open reading frames (ORFs) for 313 Mane-A, Mane-B, and Mane-I sequences that defined 86 Mane-A and 106 Mane-B MHC-I haplotypes. Pacific Biosciences technology allows us to resolve these Mane-A and Mane-B haplotypes to the level of synonymous allelic variants. The newly defined haplotypes and transcript sequences containing full-length ORFs provide an important resource for infectious disease researchers as certain MHC haplotypes have been shown to provide exceptional control of simian immunodeficiency virus (SIV) replication and prevention of AIDS-like disease in nonhuman primates. The increased allelic resolution provided by Pacific Biosciences sequencing also benefits transplant research by allowing researchers to more specifically match haplotypes between donors and recipients to the level of nonsynonymous allelic variation, thus reducing the risk of graft-versus-host disease.


September 22, 2019

High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete-many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.


September 22, 2019

High resolution annotation of zebrafish transcriptome using long-read sequencing.

With the emergence of zebrafish as an important model organism, a concerted effort has been made to study its transcriptome. This effort is limited, however, by gaps in zebrafish annotation, which are especially pronounced concerning transcripts dynamically expressed during zygotic genome activation (ZGA). To date, short-read sequencing has been the principal technology for zebrafish transcriptome annotation. In part because these sequence reads are too short for assembly methods to resolve the full complexity of the transcriptome, the current annotation is rudimentary. By providing direct observation of full-length transcripts, recently refined long-read sequencing platforms can dramatically improve annotation coverage and accuracy. Here, we leveraged the SMRT platform to study the transcriptome of zebrafish embryos before and after ZGA. Our analysis revealed additional novelty and complexity in thehttps://www.ncbi.nlm.nih.gov/pubmed/nfidence novel transcripts that originated from previously unannotated loci and 1835 high-confidence new isoforms in previously annotated genes. We validated these findings using a suite of computational approaches including structural prediction, sequence homology, and functional conservation analyses, as well as by confirmatory transcript quantification with short-read sequencing data. Our analyses provided insight into new homologs and paralogs of functionally important proteins and noncoding RNAs, isoform switching occurrences, and different classes of novel splicing events. Several novel isoforms representing distinct splicing events were validated through PCR experiments, including the discovery and validation of a novel 8-kb transcript spanning multiple mir-430 elements, an important driver of early development. Our study provides a significantly improved zebrafish transcriptome annotation resource.© 2018 Nudelman et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Diversified microbiota of meconium is affected by maternal diabetes status.

This study was aimed to assess the diversity of the meconium microbiome and determine if the bacterial community is affected by maternal diabetes status.The first intestinal discharge (meconium) was collected from 23 newborns stratified by maternal diabetes status: 4 mothers had pre-gestational type 2 diabetes mellitus (DM) including one mother with dizygotic twins, 5 developed gestational diabetes mellitus (GDM) and 13 had no diabetes. The meconium microbiome was profiled using multi-barcode 16S rRNA sequencing followed by taxonomic assignment and diversity analysis.All meconium samples were not sterile and contained diversified microbiota. Compared with adult feces, the meconium showed a lower species diversity, higher sample-to-sample variation, and enrichment of Proteobacteria and reduction of Bacteroidetes. Among the meconium samples, the taxonomy analyses suggested that the overall bacterial content significantly differed by maternal diabetes status, with the microbiome of the DM group showing higher alpha-diversity than that of no-diabetes or GDM groups. No global difference was found between babies delivered vaginally versus via Cesarean-section. Regression analysis showed that the most robust predictor for the meconium microbiota composition was the maternal diabetes status that preceded pregnancy. Specifically, Bacteroidetes (phyla) and Parabacteriodes (genus) were enriched in the meconium in the DM group compared to the no-diabetes group.Our study provides evidence that meconium contains diversified microbiota and is not affected by the mode of delivery. It also suggests that the meconium microbiome of infants born to mothers with DM is enriched for the same bacterial taxa as those reported in the fecal microbiome of adult DM patients.


September 22, 2019

Nasopharyngeal microbiome in premature infants and stability during rhinovirus infection.

The nasopharyngeal (NP) microbiota of newborns and infants plays a key role in modulating airway inflammation and respiratory symptoms during viral infections. Premature (PM) birth modifies the early NP environment and is a major risk factor for severe viral respiratory infections. However, it is currently unknown if the NP microbiota of PM infants is altered relative to full-term (FT) individuals.To characterize the NP microbiota differences in preterm and FT infants during rhinovirus (RV) infection.We determined the NP microbiota of infants 6 months to =2 years of age born FT (n=6) or severely PM<32 weeks gestation (n=7). We compared microbiota composition in healthy NP samples and performed a longitudinal analysis during naturally occurring RV infections to contrast the microbiota dynamics in PM versus FT infants.We observed significant differences in the NP bacterial community of PM versus FT. NP from PM infants had higher within-group dissimilarity (heterogeneity) relative to FT infants. Bacterial composition of NP samples from PM infants showed increased Proteobacteria and decreased in Firmicutes. There were also differences in the major taxonomic groups identified, including Streptococcus, Moraxella, and Haemophilus. Longitudinal data showed that these prematurity-related microbiota features persisted during RV infection.PM is associated with NP microbiota changes beyond the neonatal stage. PM infants have an NP microbiota with high heterogeneity relative to FT infants. These prematurity-related microbiota features persisted during RV infection, suggesting that the NP microbiota of PM may play an important role in modulating airway inflammatory and immune responses in this vulnerable group. Copyright © 2017 American Federation for Medical Research.


September 22, 2019

Identification of a biosynthetic gene cluster for the polyene macrolactam sceliphrolactam in a Streptomyces strain isolated from mangrove sediment.

Streptomyces are a genus of Actinobacteria capable of producing structurally diverse natural products. Here we report the isolation and characterization of a biosynthetically talented Streptomyces (Streptomyces sp. SD85) from tropical mangrove sediments. Whole-genome sequencing revealed that Streptomyces sp. SD85 harbors at least 52 biosynthetic gene clusters (BGCs), which constitute 21.2% of the 8.6-Mb genome. When cultivated under lab conditions, Streptomyces sp. SD85 produces sceliphrolactam, a 26-membered polyene macrolactam with unknown biosynthetic origin. Genome mining yielded a putative sceliphrolactam BGC (sce) that encodes a type I modular polyketide synthase (PKS) system, several ß-amino acid starter biosynthetic enzymes, transporters, and transcriptional regulators. Using the CRISPR/Cas9-based gene knockout method, we demonstrated that the sce BGC is essential for sceliphrolactam biosynthesis. Unexpectedly, the PKS system encoded by sce is short of one module required for assembling the 26-membered macrolactam skeleton according to the collinearity rule. With experimental data disfavoring the involvement of a trans-PKS module, the biosynthesis of sceliphrolactam seems to be best rationalized by invoking a mechanism whereby the PKS system employs an iterative module to catalyze two successive chain extensions with different outcomes. The potential violation of the collinearity rule makes the mechanism distinct from those of other polyene macrolactams.


September 22, 2019

Egg case silk gene sequences from Argiope spiders: Evidence for multiple loci and a loss of function between paralogs.

Spiders swath their eggs with silk to protect developing embryos and hatchlings. Egg case silks, like other fibrous spider silks, are primarily composed of proteins called spidroins (spidroin = spider-fibroin). Silks, and thus spidroins, are important throughout the lives of spiders, yet the evolution of spidroin genes has been relatively understudied. Spidroin genes are notoriously difficult to sequence because they are typically very long (= 10 kb of coding sequence) and highly repetitive. Here, we investigate the evolution of spider silk genes through long-read sequencing of Bacterial Artificial Chromosome (BAC) clones. We demonstrate that the silver garden spiderArgiope argentatahas multiple egg case spidroin loci with a loss of function at one locus. We also use degenerate PCR primers to search the genomic DNA of congeneric species and find evidence for multiple egg case spidroin loci in otherArgiopespiders. Comparative analyses show that these multiple loci are more similar at the nucleotide level within a species than between species. This pattern is consistent with concerted evolution homogenizing gene copies within a genome. More complicated explanations include convergent evolution or recent independent gene duplications within each species. Copyright © 2018 Chaw et al.


September 22, 2019

Translating genomics into practice for real-time surveillance and response to carbapenemase-producing Enterobacteriaceae: evidence from a complex multi-institutional KPC outbreak.

Until recently, Klebsiella pneumoniae carbapenemase (KPC)-producing Enterobacteriaceae were rarely identified in Australia. Following an increase in the number of incident cases across the state of Victoria, we undertook a real-time combined genomic and epidemiological investigation. The scope of this study included identifying risk factors and routes of transmission, and investigating the utility of genomics to enhance traditional field epidemiology for informing management of established widespread outbreaks.All KPC-producing Enterobacteriaceae isolates referred to the state reference laboratory from 2012 onwards were included. Whole-genome sequencing was performed in parallel with a detailed descriptive epidemiological investigation of each case, using Illumina sequencing on each isolate. This was complemented with PacBio long-read sequencing on selected isolates to establish high-quality reference sequences and interrogate characteristics of KPC-encoding plasmids.Initial investigations indicated that the outbreak was widespread, with 86 KPC-producing Enterobacteriaceae isolates (K. pneumoniae 92%) identified from 35 different locations across metropolitan and rural Victoria between 2012 and 2015. Initial combined analyses of the epidemiological and genomic data resolved the outbreak into distinct nosocomial transmission networks, and identified healthcare facilities at the epicentre of KPC transmission. New cases were assigned to transmission networks in real-time, allowing focussed infection control efforts. PacBio sequencing confirmed a secondary transmission network arising from inter-species plasmid transmission. Insights from Bayesian transmission inference and analyses of within-host diversity informed the development of state-wide public health and infection control guidelines, including interventions such as an intensive approach to screening contacts following new case detection to minimise unrecognised colonisation.A real-time combined epidemiological and genomic investigation proved critical to identifying and defining multiple transmission networks of KPC Enterobacteriaceae, while data from either investigation alone were inconclusive. The investigation was fundamental to informing infection control measures in real-time and the development of state-wide public health guidelines on carbapenemase-producing Enterobacteriaceae surveillance and management.


September 22, 2019

Genetic basis of emerging vancomycin, linezolid, and daptomycin heteroresistance in a case of persistent Enterococcus faecium bacteremia.

Whole-genome sequencing was used to examine a persistent Enterococcus faecium bacteremia that acquired heteroresistance to three antibiotics in response to prolonged multidrug therapy. A comparison of the complete genomes before and after each change revealed the emergence of known resistance determinants for vancomycin and linezolid and suggested that a novel mutation in fabF, encoding a fatty acid synthase, was responsible for daptomycin nonsusceptibility. Plasmid recombination contributed to the progressive loss of vancomycin resistance after withdrawal of the drug. Copyright © 2018 Chacko et al.


September 22, 2019

Screening and genomic characterization of filamentous hemagglutinin-deficient Bordetella pertussis.

Despite high vaccine coverage, pertussis cases in the United States have increased over the last decade. Growing evidence suggests that disease resurgence results, in part, from genetic divergence of circulating strain populations away from vaccine references. The United States employs acellular vaccines exclusively, and current Bordetella pertussis isolates are predominantly deficient in at least one immunogen, pertactin (Prn). First detected in the United States retrospectively in a 1994 isolate, the rapid spread of Prn deficiency is likely vaccine driven, raising concerns about whether other acellular vaccine immunogens experience similar pressures, as further antigenic changes could potentially threaten vaccine efficacy. We developed an electrochemiluminescent antibody capture assay to monitor the production of the acellular vaccine immunogen filamentous hemagglutinin (Fha). Screening 722 U.S. surveillance isolates collected from 2010 to 2016 identified two that were both Prn and Fha deficient. Three additional Fha-deficient laboratory strains were also identified from a historic collection of 65 isolates dating back to 1935. Whole-genome sequencing of deficient isolates revealed putative, underlying genetic changes. Only four isolates harbored mutations to known genes involved in Fha production, highlighting the complexity of its regulation. The chromosomes of two Fha-deficient isolates included unexpected structural variation that did not appear to influence Fha production. Furthermore, insertion sequence disruption of fhaB was also detected in a previously identified pertussis toxin-deficient isolate that still produced normal levels of Fha. These results demonstrate the genetic potential for additional vaccine immunogen deficiency and underscore the importance of continued surveillance of circulating B. pertussis evolution in response to vaccine pressure. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Characterization of plasmids harboring blaCTX-M and blaCMY genes in E. coli from French broilers.

Resistance to extended-spectrum cephalosporins (ESC) is a global health issue. The aim of this study was to analyze and compare plasmids coding for resistance to ESC isolated from 16 avian commensal and 17 avian pathogenic Escherichia coli (APEC) strains obtained respectively at slaughterhouse or from diseased broilers in 2010-2012. Plasmid DNA was used to transform E. coli DH5alpha, and the resistances of the transformants were determined. The sequences of the ESC-resistance plasmids prepared from transformants were obtained by Illumina (33 plasmids) or PacBio (1 plasmid). Results showed that 29 of these plasmids contained the blaCTX-M-1 gene and belonged to the IncI1/ST3 type, with 27 and 20 of them carrying the sul2 or tet(A) genes respectively. Despite their diverse origins, several plasmids showed very high percentages of identity. None of the blaCTX-M-1-containing plasmid contained APEC virulence genes, although some of them were detected in the parental strains. Three plasmids had the blaCMY-2 gene, but no other resistance gene. They belonged to IncB/O/K/Z-like or IncFIA/FIB replicon types. The blaCMY-2 IncFIA/FIB plasmid was obtained from a strain isolated from a diseased broiler and also containing a blaCTX-M-1 IncI1/ST3 plasmid. Importantly APEC virulence genes (sitA-D, iucA-D, iutA, hlyF, ompT, etsA-C, iss, iroB-E, iroN, cvaA-C and cvi) were detected on the blaCMY-2 plasmid. In conclusion, our results show the dominance and high similarity of blaCTX-M-1 IncI1/ST3 plasmids, and the worrying presence of APEC virulence genes on a blaCMY-2 plasmid.


September 22, 2019

Convergent evolution driven by rifampin exacerbates the global burden of drug-resistant Staphylococcus aureus.

Mutations in the beta-subunit of bacterial RNA polymerase (RpoB) cause resistance to rifampin (Rifr), a critical antibiotic for treatment of multidrug-resistantStaphylococcus aureus.In vitrostudies have shown that RpoB mutations confer decreased susceptibility to other antibiotics, but the clinical relevance is unknown. Here, by analyzing 7,099S. aureusgenomes, we demonstrate that the most prevalent RpoB mutations promote clinically relevant phenotypic plasticity resulting in the emergence of stableS. aureuslineages, associated with increased risk of therapeutic failure through generation of small-colony variants (SCVs) and coresistance to last-line antimicrobial agents. We found eight RpoB mutations that accounted for 93% (469/505) of the total number of Rifrmutations. The most frequently selected amino acid substitutions affecting residue 481 (H481N/Y) were associated with worldwide expansions of Rifrclones spanning decades. Recreating the H481N/Y mutations confirmed no impact onS. aureusgrowth, but the H481N mutation promoted the emergence of a subpopulation of stable RifrSCVs with reduced susceptibility to vancomycin and daptomycin. Recreating the other frequent RpoB mutations showed similar impacts on resistance to these last-line agents. We found that 86% of all Rifrisolates in our global sample carried the mutations promoting cross-resistance to vancomycin and 52% to both vancomycin and daptomycin. As four of the most frequent RpoB mutations confer only low-level Rifr, equal to or below some international breakpoints, we recommend decreasing these breakpoints and reconsidering the appropriate use of rifampin to reduce the fixation and spread of these clinically deleterious mutations. IMPORTANCE Increasing antibiotic resistance in the major human pathogenStaphylococcus aureusis threatening the ability to treat patients with these infections. Recent laboratory studies suggest that mutations in the gene commonly associated with rifampin resistance may also impact susceptibility to other last-line antibiotics inS. aureus; however, the overall frequency and clinical impact of these mutations are unknown. By mining a global collection of clinicalS. aureusgenomes and by mutagenesis experiments, this work reveals that common rifampin-inducedrpoBmutations promote phenotypic plasticity that has led to the global emergence of stable, multidrug-resistantS. aureuslineages that are associated with increased risk of therapeutic failure through coresistance to other last-line antimicrobials. We recommend decreasing susceptibility breakpoints for rifampin to allow phenotypic detection of criticalrpoBmutations conferring low resistance to rifampin and reconsidering the appropriate use of rifampin to reduce the fixation and spread of these deleterious mutations globally.


September 22, 2019

In situ analyses directly in diarrheal stool reveal large variations in bacterial load and active toxin expression of enterotoxigenic Escherichia coli and Vibrio cholerae.

The bacterial pathogens enterotoxigenicEscherichia coli(ETEC) andVibrio choleraeare major causes of diarrhea. ETEC causes diarrhea by production of the heat-labile toxin (LT) and heat-stable toxins (STh and STp), whileV. choleraeproduces cholera toxin (CT). In this study, we determined the occurrence and bacterial doses of the two pathogens and their respective toxin expression levels directly in liquid diarrheal stools of patients in Dhaka, Bangladesh. By quantitative culture and real-time quantitative PCR (qPCR) detection of the toxin genes, the two pathogens were found to coexist in several of the patients, at concentrations between 102and 108bacterial gene copies per ml. Even in culture-negative samples, gene copy numbers of 102to 104of either ETEC orV. choleraetoxin genes were detected by qPCR. RNA was extracted directly from stool, and gene expression levels, quantified by reverse transcriptase qPCR (RT-qPCR), of the genes encoding CT, LT, STh, and STp showed expression of toxin genes. Toxin enzyme-linked immunosorbent assay (ELISA) confirmed active toxin secretion directly in the liquid diarrhea. Analysis of ETEC isolates by multiplex PCR, dot blot analysis, and genome sequencing suggested that there are genetic ETEC profiles that are more commonly found as dominating single pathogens and others that are coinfectants with lower bacterial loads. The ETEC genomes, including assembled genomes of dominating ETEC isolates expressing LT/STh/CS5/CS6 and LT/CS7, are provided. In addition, this study highlights an emerging important ETEC strain expressing LT/STp and the novel colonization factor CS27b. These findings have implications for investigations of pathogenesis as well as for vaccine development. IMPORTANCEThe cause of diarrheal disease is usually determined by screening for several microorganisms by various methods, and sole detection is used to assign the agent as the cause of disease. However, it has become increasingly clear that many infections are caused by coinfections with several pathogens and that the dose of the infecting pathogen is important. We quantified the absolute numbers of enterotoxigenicE. coli(ETEC) andVibrio choleraedirectly in diarrheal fluid. We noted several events where both pathogens were found but also a large dose dependency. In three samples, we found ETEC as the only pathogen sought for. These isolates belonged to globally distributed ETEC clones and were the dominating species in stool with active toxin expression. This suggests that certain superior virulent ETEC lineages are able to outcompete the gut microbiota and be the sole cause of disease and hence need to be specifically monitored.


September 22, 2019

The genome of the Hi5 germ cell line from Trichoplusia ni, an agricultural pest and novel model for small RNA biology.

We report a draft assembly of the genome of Hi5 cells from the lepidopteran insect pest,Trichoplusia ni, assigning 90.6% of bases to one of 28 chromosomes and predicting 14,037 protein-coding genes. Chemoreception and detoxification gene families revealT. ni-specific gene expansions that may explain its widespread distribution and rapid adaptation to insecticides. Transcriptome and small RNA data from thorax, ovary, testis, and the germline-derived Hi5 cell line show distinct expression profiles for 295 microRNA- and >393 piRNA-producing loci, as well as 39 genes encoding small RNA pathway proteins. Nearly all of the W chromosome is devoted to piRNA production, andT. nisiRNAs are not 2´-O-methylated. To enable use of Hi5 cells as a model system, we have established genome editing and single-cell cloning protocols. TheT. nigenome provides insights into pest control and allows Hi5 cells to become a new tool for studying small RNAs ex vivo.© 2018, Fu et al.


September 22, 2019

Comparative genomics of completely sequenced Lactobacillus helveticus genomes provides insights into strain-specific genes and resolves metagenomics data down to the strain level.

Although complete genome sequences hold particular value for an accurate description of core genomes, the identification of strain-specific genes, and as the optimal basis for functional genomics studies, they are still largely underrepresented in public repositories. Based on an assessment of the genome assembly complexity for all lactobacilli, we used Pacific Biosciences’ long read technology to sequence and de novo assemble the genomes of three Lactobacillus helveticus starter strains, raising the number of completely sequenced strains to 12. The first comparative genomics study for L. helveticus-to our knowledge-identified a core genome of 988 genes and sets of unique, strain-specific genes ranging from about 30 to more than 200 genes. Importantly, the comparison of MiSeq- and PacBio-based assemblies uncovered that not only accessory but also core genes can be missed in incomplete genome assemblies based on short reads. Analysis of the three genomes revealed that a large number of pseudogenes were enriched for functional Gene Ontology categories such as amino acid transmembrane transport and carbohydrate metabolism, which is in line with a reductive genome evolution in the rich natural habitat of L. helveticus. Notably, the functional Clusters of Orthologous Groups of proteins categories “cell wall/membrane biogenesis” and “defense mechanisms” were found to be enriched among the strain-specific genes. A genome mining effort uncovered examples where an experimentally observed phenotype could be linked to the underlying genotype, such as for cell envelope proteinase PrtH3 of strain FAM8627. Another possible link identified for peptidoglycan hydrolases will require further experiments. Of note, strain FAM22155 did not harbor a CRISPR/Cas system; its loss was also observed in other L. helveticus strains and lactobacillus species, thus questioning the value of the CRISPR/Cas system for diagnostic purposes. Importantly, the complete genome sequences proved to be very useful for the analysis of natural whey starter cultures with metagenomics, as a larger percentage of the sequenced reads of these complex mixtures could be unambiguously assigned down to the strain level.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.