July 7, 2019

Filling in the gap of human chromosome 4: Single Molecule Real Time sequencing of macrosatellite repeats in the facioscapulohumeral muscular dystrophy locus.

A majority of facioscapulohumeral muscular dystrophy (FSHD) is caused by contraction of macrosatellite repeats called D4Z4 that are located in the subtelomeric region of human chromosome 4q35. Sequencing the FSHD locus has been technically challenging due to its long size and nearly identical nature of repeat elements. Here we report sequencing and partial assembly of a BAC clone carrying an entire FSHD locus by a single molecule real time (SMRT) sequencing technology which could produce long reads up to about 18 kb containing D4Z4 repeats. De novo assembly by Hierarchical Genome Assembly Process 1 (HGAP.1) yielded a contig of 41 kb containing all but a part of the most distal D4Z4 element. The validity of the sequence model was confirmed by an independent approach employing anchored multiple sequence alignment by Kalign using reads containing unique flanking sequences. Our data will provide a basis for further optimization of sequencing and assembly conditions of D4Z4.


July 7, 2019

High quality maize centromere 10 sequence reveals evidence of frequent recombination events.

The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR) has presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here, we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 × 10⁻⁶ and 5 × 10⁻⁵ for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb from the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length CR from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB) repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. In many cases examined here, DSB repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to efficiently repair frequent DSBs in centromeres.


July 7, 2019

BAC-pool sequencing and assembly of 19 Mb of the complex sugarcane genome.

Sequencing plant genomes is often challenging because of their complex architecture and high content of repetitive sequences. Sugarcane has one of the most complex genomes. It is highly polyploid, preserves intact homeologous chromosomes from its parental species and contains >55% repetitive sequences. Although bacterial artificial chromosome (BAC) libraries have emerged as an alternative for accessing the sugarcane genome, sequencing individual clones is laborious and expensive. Here, we present a strategy for sequencing and assembling reads produced from the DNA of pooled BAC clones. A set of 178 BAC clones, randomly sampled from the SP80-3280 sugarcane BAC library, was pooled and sequenced using the Illumina HiSeq2000 and PacBio platforms. A hybrid assembly strategy was used to generate 2,451 scaffolds comprising 19.2 Mb of assembled genome sequence. Scaffolds of ≥20 kb corresponded to 80% of the assembled sequences, and the full sequences of forty BACs were recovered in one or two contigs. Alignment of the BAC scaffolds with the chromosome sequences of sorghum showed a high degree of collinearity and gene order. The alignment of the BAC scaffolds to the 10 sorghum chromosomes suggests that the genome of the SP80-3280 sugarcane variety is ~19% contracted in relation to the sorghum genome. In conclusion, our data show that sequencing pools composed of high numbers of BAC clones may help to construct a reference scaffold map of the sugarcane genome.


July 7, 2019

Single-locus enrichment without amplification for sequencing and direct detection of epigenetic modifications.

A gene-level targeted enrichment method for direct detection of epigenetic modifications is described. The approach is demonstrated on the CGG-repeat region of the FMR1 gene, for which large repeat expansions, hitherto refractory to sequencing, are known to cause fragile X syndrome. In addition to achieving a single-locus enrichment of nearly 700,000-fold, the elimination of all amplification steps removes PCR-induced bias in the repeat count and preserves the native epigenetic modifications of the DNA. In conjunction with the single-molecule real-time sequencing approach, this enrichment method enables direct readout of the methylation status and the CGG repeat number of the FMR1 allele(s) for a clonally derived cell line. The current method avoids potential biases introduced through chemical modification and/or amplification methods for indirect detection of CpG methylation events.


July 7, 2019

A carbapenem-resistant Pseudomonas aeruginosa isolate harboring two copies of blaIMP-34 encoding a metallo-β-lactamase.

A carbapenem-resistant strain of Pseudomonas aeruginosa, NCGM1984, was isolated in 2012 from a hospitalized patient in Japan. Immunochromatographic assay showed that the isolate was positive for IMP-type metallo-β-lactamase. Complete genome sequencing revealed that NCGM1984 harbored two copies of blaIMP-34, located at different sites on the chromosome. Each blaIMP-34 was present in the same class 1 integron structure, tnpA(ISPa7)-intI1-qacG-blaIMP-34-aac(6′)-Ib-qacEdelta1-sul1-orf5-tniBdelta-tniA. The isolate belonged to multilocus sequence type ST235, one of the international high-risk clones. IMP-34, with an amino acid substitution (Glu126Gly) compared with IMP-1, hydrolyzed all β-lactams tested except aztreonam, and its catalytic activities were similar to those of IMP-1. This is the first report of a clinical isolate of an IMP-34-producing P. aeruginosa harboring two copies of blaIMP-34 on its chromosome.


July 7, 2019

Characterization of VCC-1, a novel Ambler class A carbapenemase from Vibrio cholerae isolated from imported retail shrimp sold in Canada.

One of the core goals of the Canadian Integrated Program for Antimicrobial Resistance Surveillance (CIPARS) is to monitor major meat commodities for antimicrobial resistance. Targeted studies with methodologies based on core surveillance protocols are used to examine other foods, e.g., seafood, for antimicrobial resistance to detect resistances of concern to public health. Here we report the discovery of a novel Ambler class A carbapenemase that was identified in a nontoxigenic strain of Vibrio cholerae (N14-02106) isolated from shrimp that was sold for human consumption in Canada. V. cholerae N14-02106 was resistant to penicillins, carbapenems, and monobactam antibiotics; however, PCR did not detect common β-lactamases. Bioinformatic analysis of the whole-genome sequence of V. cholerae N14-02106 revealed on the large chromosome a novel carbapenemase (referred to here as VCC-1, for Vibrio cholerae carbapenemase 1) with sequence similarity to class A enzymes. Two copies of blaVCC-1 separated and flanked by ISVch9 (i.e., 3 copies of ISVch9) were found in an acquired 8.5-kb region inserted into a VrgG family protein gene. Cloned blaVCC-1 conferred a β-lactam resistance profile similar to that in V. cholerae N14-02106 when it was transformed into a susceptible laboratory strain of Escherichia coli. Purified VCC-1 was found to hydrolyze penicillins, 1st-generation cephalosporins, aztreonam, and carbapenems, whereas 2nd- and 3rd-generation cephalosporins were poor substrates. Using nitrocefin as a reporter substrate, VCC-1 was moderately inhibited by clavulanic acid and tazobactam but not EDTA. In this report, we present the discovery of a novel class A carbapenemase from the food supply. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini.

Tardigrades are meiofaunal ecdysozoans that are key to understanding the origins of Arthropoda. Many species of Tardigrada can survive extreme conditions through cryptobiosis. In a recent paper [Boothby TC, et al. (2015) Proc Natl Acad Sci USA 112(52):15976-15981], the authors concluded that the tardigrade Hypsibius dujardini had an unprecedented proportion (17%) of genes originating through functional horizontal gene transfer (fHGT) and speculated that fHGT was likely formative in the evolution of cryptobiosis. We independently sequenced the genome of H. dujardini. As expected from whole-organism DNA sampling, our raw data contained reads from nontarget genomes. Filtering using metagenomics approaches generated a draft H. dujardini genome assembly of 135 Mb with superior assembly metrics to the previously published assembly. Additional microbial contamination likely remains. We found no support for extensive fHGT. Among 23,021 gene predictions we identified 0.2% strong candidates for fHGT from bacteria and 0.2% strong candidates for fHGT from nonmetazoan eukaryotes. Cross-comparison of assemblies showed that the overwhelming majority of HGT candidates in the Boothby et al. genome derived from contaminants. We conclude that fHGT into H. dujardini accounts for at most 1-2% of genes and that the proposal that one-sixth of tardigrade genes originate from functional HGT events is an artifact of undetected contamination.


July 7, 2019

Horizontal gene acquisitions, mobile element proliferation, and genome decay in the host-restricted plant pathogen Erwinia tracheiphila.

Modern industrial agriculture depends on high-density cultivation of genetically similar crop plants, creating favorable conditions for the emergence of novel pathogens with increased fitness in managed compared with ecologically intact settings. Here, we present the genome sequence of six strains of the cucurbit bacterial wilt pathogen Erwinia tracheiphila (Enterobacteriaceae) isolated from infected squash plants in New York, Pennsylvania, Kentucky, and Michigan. These genomes exhibit a high proportion of recent horizontal gene acquisitions, invasion and remarkable amplification of mobile genetic elements, and pseudogenization of approximately 20% of the coding sequences. These genome attributes indicate that E. tracheiphila recently emerged as a host-restricted pathogen. Furthermore, chromosomal rearrangements associated with phage and transposable element proliferation contribute to substantial differences in gene content and genetic architecture between the six E. tracheiphila strains and other Erwinia species. Together, these data lead us to hypothesize that E. tracheiphila has undergone recent evolution through both genome decay (pseudogenization) and genome expansion (horizontal gene transfer and mobile element amplification). Despite evidence of dramatic genomic changes, the six strains are genetically monomorphic, suggesting a recent population bottleneck and emergence into E. tracheiphila’s current ecological niche. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Campylobacter fetus subspecies contain conserved type IV secretion systems on multiple genomic islands and plasmids.

The features contributing to differences in pathogenicity of the Campylobacter fetus subspecies are unknown. Putative factors involved in pathogenesis are located in genomic islands that encode a type IV secretion system (T4SS) and fic domain (filamentation induced by cyclic AMP) proteins, which may disrupt host cell processes. In the genomes of 27 C. fetus strains, three phylogenetically different T4SS-encoding regions (T4SSs) were identified: one was located both in the chromosome and in extra-chromosomal plasmids; one was located exclusively in the chromosome; and one exclusively in extra-chromosomal plasmids. We observed that C. fetus strains can contain multiple T4SSs and that homologous T4SSs can be present both in chromosomal genomic islands (GI) and on plasmids in the C. fetus strains. The GIs of the chromosomally located T4SS differed mainly by the presence of fic genes, insertion sequence elements and phage-related or hypothetical proteins. Comparative analysis showed that T4SS sequences, inserted in the same locations, were conserved in the studied C. fetus genomes. Using phylogenetic analysis of the T4SSs, it was shown that C. fetus may have acquired the T4SS regions from other Campylobacter species by horizontal gene transfer. The identified T4SSs and fic genes were found in Cff and Cfv strains, although the presence of T4SSs and fic genes was significantly associated with Cfv strains. The T4SSs and fic genes could not be associated with S-layer serotypes or geographical origin of the strains.


July 7, 2019

Fully closed genome sequences of five type strains of the genus Cronobacter and one Cronobacter sakazakii strain.

Cronobacter is associated with infant infections and the consumption of reconstituted infant formula. Here we sequenced and closed six genomes of C. condimenti(T), C. muytjensii(T), C. universalis(T), C. malonaticus(T), C. dublinensis(T), and C. sakazakii that can be used as reference genomes in single nucleotide polymorphism (SNP)-based next-generation sequencing (NGS) analysis for source tracking investigations. Copyright © 2016 Moine et al.


July 7, 2019

A strange endocytobiont revealed as largest virus.

A lot of endocytobionts (or endosymbionts) have been discovered within free-living amoebae in recent years. In this article the results of a long lasting effort to derive valuable data about an extraordinary spore-like infectious microorganism (endocytobiont, endosymbiont) within host amoebae (Acanthamoeba sp.) recently isolated from the contact lens case of a patient with keratitis, are presented. It took some time until this endocytobiont could be attributed to the genus Pandoravirus following a publication of two other pandoraviruses isolated from aquatic environments. Consequently the molecular biological investigation led to the taxonomic affiliation of the endocytobiont with the genus Pandoravirus and to the description of a new Pandoravirus species, Pandoravirus inopinatum after whole-genome sequencing in 2015. The fact that it was isolated from a contact lens container of a keratitis patient gives another dimension to these findings showing paradigmatically, how readily these ‘new’ giant viruses get to humans. Copyright © 2016 Elsevier Ltd. All rights reserved.


July 7, 2019

A time- and cost-effective strategy to sequence mammalian Y Chromosomes: an application to the de novo assembly of gorilla Y.

The mammalian Y Chromosome sequence, critical for studying male fertility and dispersal, is enriched in repeats and palindromes, and thus, is the most difficult component of the genome to assemble. Previously, expensive and labor-intensive BAC-based techniques were used to sequence the Y for a handful of mammalian species. Here, we present a much faster and more affordable strategy for sequencing and assembling mammalian Y Chromosomes of sufficient quality for most comparative genomics analyses and for conservation genetics applications. The strategy combines flow sorting, short- and long-read genome and transcriptome sequencing, and droplet digital PCR with novel and existing computational methods. It can be used to reconstruct sex chromosomes in a heterogametic sex of any species. We applied our strategy to produce a draft of the gorilla Y sequence. The resulting assembly allowed us to refine gene content, evaluate copy number of ampliconic gene families, locate species-specific palindromes, examine the repetitive element content, and produce sequence alignments with human and chimpanzee Y Chromosomes. Our results inform the evolution of the hominine (human, chimpanzee, and gorilla) Y Chromosomes. Surprisingly, we found the gorilla Y Chromosome to be similar to the human Y Chromosome, but not to the chimpanzee Y Chromosome. Moreover, we have utilized the assembled gorilla Y Chromosome sequence to design genetic markers for studying the male-specific dispersal of this endangered species. © 2016 Tomaszkiewicz et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Metabolomics-guided analysis of isocoumarin production by Streptomyces species MBT76 and biotransformation of flavonoids and phenylpropanoids.

Actinomycetes produce the majority of the antibiotics currently in clinical use. The efficiency of antibiotic production is affected by multiple factors such as nutrients, pH, temperature and growth phase. Finding the optimal harvesting time is crucial for successful isolation of the desired bioactive metabolites from actinomycetes, but for this conventional chemical analysis has limitations due to the metabolic complexity. This study explores the utility of NMR-based metabolomics for (1) optimizing fermentation time for the production of known and/or unknown bioactive compounds produced by actinomycetes; (2) elucidating the biosynthetic pathway for microbial natural products; and (3) facilitating the biotransformation of nature-abundant chemicals.


July 7, 2019

Genome sequence and analysis of Escherichia coli MRE600, a colicinogenic, nonmotile strain that lacks RNase I and the type I methyltransferase, EcoKI.

Escherichia coli strain MRE600 was originally identified for its low RNase I activity and has therefore been widely adopted by the biomedical research community as a preferred source for the expression and purification of transfer RNAs and ribosomes. Despite its widespread use, surprisingly little information about its genome or genetic content exists. Here, we present the first de novo assembly and description of the MRE600 genome and epigenome. To provide context to these studies of MRE600, we include comparative analyses with E. coli K-12 MG1655 (K12). Pacific Biosciences Single Molecule, Real-Time sequencing reads were assembled into one large chromosome (4.83 Mb) and three smaller plasmids (89.1, 56.9, and 7.1 kb). Interestingly, the 7.1-kb plasmid possesses genes encoding a colicin E1 protein and its associated immunity protein. The MRE600 genome has a G + C content of 50.8% and contains a total of 5,181 genes, including 4,913 protein-encoding genes and 268 RNA genes. We identified 41,469 modified DNA bases (0.83% of total) and found that MRE600 lacks the gene for type I methyltransferase, EcoKI. Phylogenetic, taxonomic, and genetic analyses demonstrate that MRE600 is a divergent E. coli strain that displays features of the closely related genus, Shigella. Nevertheless, comparative analyses between MRE600 and E. coli K12 show that these two strains exhibit nearly identical ribosomal proteins, ribosomal RNAs, and highly homologous tRNA species. Substantiating prior suggestions that MRE600 lacks RNase I activity, the RNase I-encoding gene, rna, contains a single premature stop codon early in its open-reading frame. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Comparative genomic analyses of the Moraxella catarrhalis serosensitive and seroresistant lineages demonstrate their independent evolution.

The bacterial species Moraxella catarrhalis has been hypothesized as being composed of two distinct lineages (referred to as the seroresistant [SR] and serosensitive [SS]) with separate evolutionary histories based on several molecular typing methods, whereas 16S ribotyping has suggested an additional split within the SS lineage. Previously, we characterized whole-genome sequences of 12 SR-lineage isolates, which revealed a relatively small supragenome when compared with other opportunistic nasopharyngeal pathogens, suggestive of a relatively short evolutionary history. Here, we performed whole-genome sequencing on 18 strains from both ribotypes of the SS lineage, an additional SR strain, as well as four previously identified highly divergent strains based on multilocus sequence typing analyses. All 35 strains were subjected to a battery of comparative genomic analyses which clearly show that there are three lineages: the SR, the SS, and the divergent. The SR and SS lineages are closely related, but distinct from each other based on three different methods of comparison: allelic differences observed among core genes; possession of lineage-specific sets of core and distributed genes; and an alignment of concatenated core sequences irrespective of gene annotation. All these methods show that the SS lineage has much longer interstrain branches than the SR lineage, indicating that this lineage has likely been evolving either longer or faster than the SR lineage. There is evidence of extensive horizontal gene transfer (HGT) within both of these lineages, and to a lesser degree between them. In particular, we identified very high rates of HGT between these two lineages for β-lactamase genes. The four divergent strains are sui generis, being much more distantly related to both the SR and SS groups than these other two groups are to each other. Based on average nucleotide identities, gene content, GC content, and genome size, this group could be considered as a separate taxonomic group. The SR and SS lineages, although distinct, clearly form a single species based on multiple criteria including a large common core genome, average nucleotide identity values, GC content, and genome size. Although neither of these lineages arose from within the other based on phylogenetic analyses, the question of how and when these lineages split and then subsequently reunited in the human nasopharynx is explored. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

