Menu
September 22, 2019

Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.

Recent developments in third-gen long read sequencing and diploid-aware assemblers have resulted in the rapid release of numerous reference-quality assemblies for diploid genomes. However, assembly of highly heterozygous genomes is still problematic when regional heterogeneity is so high that haplotype homology is not recognised during assembly. This results in regional duplication rather than consolidation into allelic variants and can cause issues with downstream analysis, for example variant discovery, or haplotype reconstruction using the diploid assembly with unpaired allelic contigs.A new pipeline-Purge Haplotigs-was developed specifically for third-gen sequencing-based assemblies to automate the reassignment of allelic contigs, and to assist in the manual curation of genome assemblies. The pipeline uses a draft haplotype-fused assembly or a diploid assembly, read alignments, and repeat annotations to identify allelic variants in the primary assembly. The pipeline was tested on a simulated dataset and on four recent diploid (phased) de novo assemblies from third-generation long-read sequencing, and compared with a similar tool. After processing with Purge Haplotigs, haploid assemblies were less duplicated with minimal impact on genome completeness, and diploid assemblies had more pairings of allelic contigs.Purge Haplotigs improves the haploid and diploid representations of third-gen sequencing based genome assemblies by identifying and reassigning allelic contigs. The implementation is fast and scales well with large genomes, and it is less likely to over-purge repetitive or paralogous elements compared to alignment-only based methods. The software is available at https://bitbucket.org/mroachawri/purge_haplotigs under a permissive MIT licence.


September 22, 2019

Unexpected patterns of segregation distortion at a selfish supergene in the fire ant Solenopsis invicta.

The Sb supergene in the fire ant Solenopsis invicta determines the form of colony social organization, with colonies whose inhabitants bear the element containing multiple reproductive queens and colonies lacking it containing only a single queen. Several features of this supergene – including suppressed recombination, presence of deleterious mutations, association with a large centromere, and “green-beard” behavior – suggest that it may be a selfish genetic element that engages in transmission ratio distortion (TRD), defined as significant departures in progeny allele frequencies from Mendelian inheritance ratios. We tested this possibility by surveying segregation ratios in embryo progenies of 101 queens of the “polygyne” social form (3512 embryos) using three supergene-linked markers and twelve markers outside the supergene.Significant departures from Mendelian ratios were observed at the supergene loci in 3-5 times more progenies than expected in the absence of TRD and than found, on average, among non-supergene loci. Also, supergene loci displayed the greatest mean deviations from Mendelian ratios among all study loci, although these typically were modest. A surprising feature of the observed inter-progeny variation in TRD was that significant deviations involved not only excesses of supergene alleles but also similarly frequent excesses of the alternate alleles on the homologous chromosome. As expected given the common occurrence of such “drive reversal” in this system, alleles associated with the supergene gain no consistent transmission advantage over their alternate alleles at the population level. Finally, we observed low levels of recombination and incomplete gametic disequilibrium across the supergene, including between adjacent markers within a single inversion.Our data confirm the prediction that the Sb supergene is a selfish genetic element capable of biasing its own transmission during reproduction, yet counterselection for suppressor loci evidently has produced an evolutionary stalemate in TRD between the variant homologous haplotypes on the “social chromosome”. Evidence implicates prezygotic segregation distortion as responsible for the TRD we document, with “true” meiotic drive the most likely mechanism. Low levels of recombination and incomplete gametic disequilibrium across the supergene suggest that selection does not preserve a single uniform supergene haplotype responsible for inducing polygyny.


September 22, 2019

Antibiotic-resistant indicator bacteria in irrigation water: High prevalence of extended-spectrum beta-lactamase (ESBL)-producing Escherichia coli.

Irrigation water is a major source of fresh produce contamination with undesired microorganisms including antibiotic-resistant bacteria (ARB), and contaminated fresh produce can transfer ARB to the consumer especially when consumed raw. Nevertheless, no legal guidelines exist so far regulating quality of irrigation water with respect to ARB. We therefore examined irrigation water from major vegetable growing areas for occurrence of antibiotic-resistant indicator bacteria Escherichia coli and Enterococcus spp., including extended-spectrum ß-lactamase (ESBL)-producing E. coli and vancomycin-resistant Enterococcus spp. Occurrence of ARB strains was compared to total numbers of the respective species. We categorized water samples according to total numbers and found that categories with higher total E. coli or Enterococcus spp. numbers generally had an increased proportion of respective ARB-positive samples. We further detected high prevalence of ESBL-producing E. coli with eight positive samples of thirty-six (22%), while two presumptive vancomycin-resistant Enterococcus spp. were vancomycin-susceptible in confirmatory tests. In disk diffusion assays all ESBL-producing E. coli were multidrug-resistant (n = 21) and whole-genome sequencing of selected strains revealed a multitude of transmissible resistance genes (ARG), with blaCTX-M-1 (4 of 11) and blaCTX-M-15 (3 of 11) as the most frequent ESBL genes. Overall, the increased occurrence of indicator ARB with increased total indicator bacteria suggests that the latter might be a suitable estimate for presence of respective ARB strains. Finally, the high prevalence of ESBL-producing E. coli with transmissible ARG emphasizes the need to establish legal critical values and monitoring guidelines for ARB in irrigation water.


September 22, 2019

An improved genome assembly for Larimichthys crocea reveals hepcidin gene expansion with diversified regulation and function.

Larimichthys crocea (large yellow croaker) is a type of perciform fish well known for its peculiar physiological properties and economic value. Here, we constructed an improved version of the L. crocea genome assembly, which contained 26,100 protein-coding genes. Twenty-four pseudo-chromosomes of L. crocea were also reconstructed, comprising 90% of the genome assembly. This improved assembly revealed several expansions in gene families associated with olfactory detection, detoxification, and innate immunity. Specifically, six hepcidin genes (LcHamps) were identified in L. crocea, possibly resulting from lineage-specific gene duplication. All LcHamps possessed similar genomic structures and functional domains, but varied substantially with respect to expression pattern, transcriptional regulation, and biological function. LcHamp1 was associated specifically with iron metabolism, while LcHamp2s were functionally diverse, involving in antibacterial activity, antiviral activity, and regulation of intracellular iron metabolism. This functional diversity among gene copies may have allowed L. crocea to adapt to diverse environmental conditions.


September 22, 2019

Improved reference genome for the domestic horse increases assembly contiguity and composition.

Recent advances in genomic sequencing technology and computational assembly methods have allowed scientists to improve reference genome assemblies in terms of contiguity and composition. EquCab2, a reference genome for the domestic horse, was released in 2007. Although of equal or better quality compared to other first-generation Sanger assemblies, it had many of the shortcomings common to them. In 2014, the equine genomics research community began a project to improve the reference sequence for the horse, building upon the solid foundation of EquCab2 and incorporating new short-read data, long-read data, and proximity ligation data. Here, we present EquCab3. The count of non-N bases in the incorporated chromosomes is improved from 2.33?Gb in EquCab2 to 2.41?Gb in EquCab3. Contiguity has also been improved nearly 40-fold with a contig N50 of 4.5?Mb and scaffold contiguity enhanced to where all but one of the 32 chromosomes is comprised of a single scaffold.


September 22, 2019

Nonmutational mechanism of inheritance in the Archaeon Sulfolobus solfataricus.

Epigenetic phenomena have not yet been reported in archaea, which are presumed to use a classical genetic process of heritability. Here, analysis of independent lineages of Sulfolobus solfataricus evolved for enhanced fitness implicated a non-Mendelian basis for trait inheritance. The evolved strains, called super acid-resistant Crenarchaeota (SARC), acquired traits of extreme acid resistance and genome stability relative to their wild-type parental lines. Acid resistance was heritable because it was retained regardless of extensive passage without selection. Despite the hereditary pattern, in one strain, it was impossible for these SARC traits to result from mutation because its resequenced genome had no mutation. All strains also had conserved, heritable transcriptomes implicated in acid resistance. In addition, they had improved genome stability with absent or greatly decreased mutation and transposition relative to a passaged control. A mechanism that would confer these traits without DNA sequence alteration could involve posttranslationally modified archaeal chromatin proteins. To test this idea, homologous recombination with isogenic DNA was used to perturb native chromatin structure. Recombination at up-regulated loci from the heritable SARC transcriptome reduced acid resistance and gene expression in the majority of recombinants. In contrast, recombination at a control locus that was not part of the heritable transcriptome changed neither acid resistance nor gene expression. Variation in the amount of phenotypic and expression changes across individuals was consistent with Rad54-dependent chromatin remodeling that dictated crossover location and branch migration. These data support an epigenetic model implicating chromatin structure as a contributor to heritable traits.


September 22, 2019

Cryptocurrencies and Zero Mode Wave guides: An unclouded path to a more contiguous Cannabis sativa L. genome assembly

We describe the use ofa Decentralized Autonomous Organization (DAO) to crypto- fund the single molecule sequencing and publication ofa Type ll Cannabis plant. This resulted in the construction of the most contiguous Cannabis genome assembly to date. The combined use of the Dash cryptocurrency, DAOs, and Pacific Biosciences sequencing delivered a 1.03 Gb genome with a N50 of 665Kb in 77 days from funding to public upload. This represents a 230 fold improvement in the contiguity of the first cannabis assemblies in 2011 and a 4 fold improvement over all cannabis assemblies to date. 34Gb ofadditional sequencing pushed the assembly to a N50 of 3.8Mb. Hi-C data from Phase Genomics further scaffolded the assembly to 35 contigs at an N50 of 74Mb but requires additional curation. The genome is partially phased and larger than previously reported (2N : 1.33Gb). The CBCA, THCA and CBDA synthase gene clusters have been phased onto respective contigs demonstrating tandem repeat expansions.


September 22, 2019

Complete genome sequencing of Lactobacillus plantarum ZLP001, a potential probiotic that enhances intestinal epithelial barrier function and defense against pathogens in pigs.

The mammalian gastrointestinal tract is a heterogeneous ecosystem with the most abundant, and one of the most diverse, microbial communities. The gut microbiota, which may contain more than 100 times the number of genes in the human genome, endows the host with beneficial functional features, including colonization resistance, nutrient metabolism, and immune tolerance (Bäckhed, 2005). Dysbiosis of gut microbiota may result in serious adverse consequences for the host, such as neurological disorders, cancer, obesity, malnutrition, inflammatory dysregulation, and susceptibility to pathogens


September 22, 2019

Out in the cold: Identification of genomic regions associated with cold tolerance in the biocontrol fungus Clonostachys rosea through genome-wide association mapping.

There is an increasing importance for using biocontrol agents in combating plant diseases sustainably and in the long term. As large scale genomic sequencing becomes economically viable, the impact of single nucleotide polymorphisms (SNPs) on biocontrol-associated phenotypes can be easily studied across entire genomes of fungal populations. Here, we improved a previously reported genome assembly of the biocontrol fungus Clonostachys rosea strain IK726 using the PacBio sequencing platform, which resulted in a total genome size of 70.7 Mbp and 21,246 predicted genes. We further performed whole-genome re-sequencing of 52 additional C. rosea strains isolated globally using Illumina sequencing technology, in order to perform genome-wide association studies in conditions relevant for biocontrol activity. One such condition is the ability to grow at lower temperatures commonly encountered in cryic or frigid soils in temperate regions, as these will be prevalent for protecting growing crops in temperate climates. Growth rates at 10°C on potato dextrose agar of the 53 sequenced strains of C. rosea were measured and ranged between 0.066 and 0.413 mm/day. Performing a genome wide association study, a total of 1,478 SNP markers were significantly associated with the trait and located in 227 scaffolds, within or close to (< 1000 bp distance) 265 different genes. The predicted gene products included several chaperone proteins, membrane transporters, lipases, and proteins involved in chitin metabolism with possible roles in cold tolerance. The data reported in this study provides a foundation for future investigations into the genetic basis for cold tolerance in fungi, with important implications for biocontrol.


September 22, 2019

Genomic analysis of consecutive Acinetobacter baumannii strains from a single patient.

Acinetobacter baumannii is one of the most important nosocomial pathogens, and thus it is required to investigate how it disseminate in hospitals and infect patients. We performed whole genome sequencing for 24 A. baumannii strains isolated successively from the blood of a single patient to evaluate whether repeated infections were due to re-infection or relapse infection and to investigate within-host evolution. The whole genome of the first strain, BL1, was sequenced de novo using the PacBio RSII system. BL2-BL24, were sequenced with an Illumina Hiseq4000 and mapped to the genome sequences of BL1. We identified 42 single-nucleotide variations among the strains. The SNVs differentiated the strains into three groups, BL1, BL2-BL16, and BL17-BL24, indicating that the patient suffered from re-infections or co-infections by similar, but different strains. The results also showed that A. baumannii strains in each group were rather stable at the genomic level. Our study emphasizes the importance of intensive infection control.


September 22, 2019

Microevolution of Neisseria lactamica during nasopharyngeal colonisation induced by controlled human infection.

Neisseria lactamica is a harmless coloniser of the infant respiratory tract, and has a mutually-excluding relationship with the pathogen Neisseria meningitidis. Here we report controlled human infection with genomically-defined N. lactamica and subsequent bacterial microevolution during 26 weeks of colonisation. We find that most mutations that occur during nasopharyngeal carriage are transient indels within repetitive tracts of putative phase-variable loci associated with host-microbe interactions (pgl and lgt) and iron acquisition (fetA promotor and hpuA). Recurrent polymorphisms occurred in genes associated with energy metabolism (nuoN, rssA) and the CRISPR-associated cas1. A gene encoding a large hypothetical protein was often mutated in 27% of the subjects. In volunteers who were naturally co-colonised with meningococci, recombination altered allelic identity in N. lactamica to resemble meningococcal alleles, including loci associated with metabolism, outer membrane proteins and immune response activators. Our results suggest that phase variable genes are often mutated during carriage-associated microevolution.


September 22, 2019

The enterococcus cassette chromosome, a genomic variation enabler in enterococci.

Enterococcus faecium has a highly variable genome prone to recombination and horizontal gene transfer. Here, we have identified a novel genetic island with an insertion locus and mobilization genes similar to those of staphylococcus cassette chromosome elements SCCmec This novel element termed the enterococcus cassette chromosome (ECC) element was located in the 3′ region of rlmH and encoded large serine recombinases ccrAB similar to SCCmec Horizontal transfer of an ECC element termed ECC::cat containing a knock-in cat chloramphenicol resistance determinant occurred in the presence of a conjugative reppLG1 plasmid. We determined the ECC::cat insertion site in the 3′ region of rlmH in the E. faecium recipient by long-read sequencing. ECC::cat also mobilized by homologous recombination through sequence identity between flanking insertion sequence (IS) elements in ECC::cat and the conjugative plasmid. The ccrABEnt genes were found in 69 of 516 E. faecium genomes in GenBank. Full-length ECC elements were retrieved from 32 of these genomes. ECCs were flanked by attR and attL sites of approximately 50?bp. The attECC sequences were found by PCR and sequencing of circularized ECCs in three strains. The genes in ECCs contained an amalgam of common and rare E. faecium genes. Taken together, our data imply that ECC elements act as hot spots for genetic exchange and contribute to the large variation of accessory genes found in E. faeciumIMPORTANCEEnterococcus faecium is a bacterium found in a great variety of environments, ranging from the clinic as a nosocomial pathogen to natural habitats such as mammalian intestines, water, and soil. They are known to exchange genetic material through horizontal gene transfer and recombination, leading to great variability of accessory genes and aiding environmental adaptation. Identifying mobile genetic elements causing sequence variation is important to understand how genetic content variation occurs. Here, a novel genetic island, the enterococcus cassette chromosome, is shown to contain a wealth of genes, which may aid E. faecium in adapting to new environments. The transmission mechanism involves the only two conserved genes within ECC, ccrABEnt, large serine recombinases that insert ECC into the host genome similarly to SCC elements found in staphylococci. Copyright © 2018 Sivertsen et al.


September 22, 2019

3D molecular cytology of Hop (Humulus lupulus) meiotic chromosomes reveals non-disomic pairing and segregation, aneuploidy, and genomic structural variation.

Hop (Humulus lupulus L.) is an important crop worldwide, known as the main flavoring ingredient in beer. The diversifying brewing industry demands variation in flavors, superior process properties, and sustainable agronomics, which are the focus of advanced molecular breeding efforts in hops. Hop breeders have been limited in their ability to create strains with desirable traits, however, because of the unusual and unpredictable inheritance patterns and associated non-Mendelian genetic marker segregation. Cytogenetic analysis of meiotic chromosome behavior has also revealed conspicuous and prevalent occurrences of multiple, atypical, non-disomic chromosome complexes, including those involving autosomes in late prophase. To explore the role of meiosis in segregation distortion, we undertook 3D cytogenetic analysis of hop pollen mother cells stained with DAPI and FISH. We used telomere FISH to demonstrate that hop exhibits a normal telomere clustering bouquet. We also identified and characterized a new sub-terminal 180 bp satellite DNA tandem repeat family called HSR0, located proximal to telomeres. Highly variable 5S rDNA FISH patterns within and between plants, together with the detection of anaphase chromosome bridges, reflect extensive departures from normal disomic signal composition and distribution. Subsequent FACS analysis revealed variable DNA content in a cultivated pedigree. Together, these findings implicate multiple phenomena, including aneuploidy, segmental aneuploidy, or chromosome rearrangements, as contributing factors to segregation distortion in hop.


September 22, 2019

Genomic surveillance of Enterococcus faecium reveals limited sharing of strains and resistance genes between livestock and humans in the United Kingdom.

Vancomycin-resistant Enterococcus faecium (VREfm) is a major cause of nosocomial infection and is categorized as high priority by the World Health Organization global priority list of antibiotic-resistant bacteria. In the past, livestock have been proposed as a putative reservoir for drug-resistant E. faecium strains that infect humans, and isolates of the same lineage have been found in both reservoirs. We undertook cross-sectional surveys to isolate E. faecium (including VREfm) from livestock farms, retail meat, and wastewater treatment plants in the United Kingdom. More than 600 isolates from these sources were sequenced, and their relatedness and antibiotic resistance genes were compared with genomes of almost 800 E. faecium isolates from patients with bloodstream infection in the United Kingdom and Ireland. E. faecium was isolated from 28/29 farms; none of these isolates were VREfm, suggesting a decrease in VREfm prevalence since the last UK livestock survey in 2003. However, VREfm was isolated from 1% to 2% of retail meat products and was ubiquitous in wastewater treatment plants. Phylogenetic comparison demonstrated that the majority of human and livestock-related isolates were genetically distinct, although pig isolates from three farms were more genetically related to human isolates from 2001 to 2004 (minimum of 50?single-nucleotide polymorphisms [SNPs]). Analysis of accessory (variable) genes added further evidence for distinct niche adaptation. An analysis of acquired antibiotic resistance genes and their variants revealed limited sharing between humans and livestock. Our findings indicate that the majority of E. faecium strains infecting patients are largely distinct from those from livestock in this setting, with limited sharing of strains and resistance genes.IMPORTANCE The rise in rates of human infection caused by vancomycin-resistant Enterococcus faecium (VREfm) strains between 1988 to the 2000s in Europe was suggested to be associated with acquisition from livestock. As a result, the European Union banned the use of the glycopeptide drug avoparcin as a growth promoter in livestock feed. While some studies reported a decrease in VREfm in livestock, others reported no reduction. Here, we report the first livestock VREfm prevalence survey in the UK since 2003 and the first large-scale study using whole-genome sequencing to investigate the relationship between E. faecium strains in livestock and humans. We found a low prevalence of VREfm in retail meat and limited evidence for recent sharing of strains between livestock and humans with bloodstream infection. There was evidence for limited sharing of genes encoding antibiotic resistance between these reservoirs, a finding which requires further research. Copyright © 2018 Gouliouris et al.


September 22, 2019

Spread of the florfenicol resistance floR gene among clinical Klebsiella pneumoniae isolates in China.

Florfenicol is a derivative of chloramphenicol that is used only for the treatment of animal diseases. A key resistance gene for florfenicol, floR, can spread among bacteria of the same and different species or genera through horizontal gene transfer. To analyze the potential transmission of resistance genes between animal and human pathogens, we investigated floR in Klebsiella pneumoniae isolates from patient samples. floR in human pathogens may originate from animal pathogens and would reflect the risk to human health of using antimicrobial agents in animals.PCR was used to identify floR-positive strains. The floR genes were cloned, and the minimum inhibitory concentrations (MICs) were determined to assess the relative resistance levels of the genes and strains. Sequencing and comparative genomics methods were used to analyze floR gene-related sequence structure as well as the molecular mechanism of resistance dissemination.Of the strains evaluated, 20.42% (67/328) were resistant to florfenicol, and 86.96% (20/23) of the floR-positive strains demonstrated high resistance to florfenicol with MICs =512 µg/mL. Conjugation experiments showed that transferrable plasmids carried the floR gene in three isolates. Sequencing analysis of a plasmid approximately 125 kb in size (pKP18-125) indicated that the floR gene was flanked by multiple copies of mobile genetic elements. Comparative genomics analysis of a 9-kb transposon-like fragment of pKP18-125 showed that an approximately 2-kb sequence encoding lysR-floR-virD2 was conserved in the majority (79.01%, 83/105) of floR sequences collected from NCBI nucleotide database. Interestingly, the most similar sequence was a 7-kb fragment of plasmid pEC012 from an Escherichia coli strain isolated from a chicken.Identified on a transferable plasmid in the human pathogen K. pneumoniae, the floR gene may be disseminated through horizontal gene transfer from animal pathogens. Studies on the molecular mechanism of resistance gene dissemination in different bacterial species of animal origin could provide useful information for preventing or controlling the spread of resistance between animal and human pathogens.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.