Menu
September 22, 2019

Genomic analysis of Picochlorum species reveals how microalgae may adapt to variable environments.

Understanding how microalgae adapt to rapidly changing environments is not only important to science but can help clarify the potential impact of climate change on the biology of primary producers. We sequenced and analyzed the nuclear genome of multiple Picochlorum isolates (Chlorophyta) to elucidate strategies of environmental adaptation. It was previously found that coordinated gene regulation is involved in adaptation to salinity stress, and here we show that gene gain and loss also play key roles in adaptation. We determined the extent of horizontal gene transfer (HGT) from prokaryotes and their role in the origin of novel functions in the Picochlorum clade. HGT is an ongoing and dynamic process in this algal clade with adaptation being driven by transfer, divergence, and loss. One HGT candidate that is differentially expressed under salinity stress is indolepyruvate decarboxylase that is involved in the production of a plant auxin that mediates bacteria-diatom symbiotic interactions. Large differences in levels of heterozygosity were found in diploid haplotypes among Picochlorum isolates. Biallelic divergence was pronounced in P. oklahomensis (salt plains environment) when compared with its closely related sister taxon Picochlorum SENEW3 (brackish water environment), suggesting a role of diverged alleles in response to environmental stress. Our results elucidate how microbial eukaryotes with limited gene inventories expand habitat range from mesophilic to halophilic through allelic diversity, and with minor but important contributions made by HGT. We also explore how the nature and quality of genome data may impact inference of nuclear ploidy.


September 22, 2019

Reassessment of the evolution of wheat chromosomes 4A, 5A, and 7B.

Comparison of genome sequences of wild emmer wheat and Aegilops tauschii suggests a novel scenario of the evolution of rearranged wheat chromosomes 4A, 5A, and 7B. Past research suggested that wheat chromosome 4A was subjected to a reciprocal translocation T(4AL;5AL)1 that occurred in the diploid progenitor of the wheat A subgenome and to three major rearrangements that occurred in polyploid wheat: pericentric inversion Inv(4AS;4AL)1, paracentric inversion Inv(4AL;4AL)1, and reciprocal translocation T(4AL;7BS)1. Gene collinearity along the pseudomolecules of tetraploid wild emmer wheat (Triticum turgidum ssp. dicoccoides, subgenomes AABB) and diploid Aegilops tauschii (genomes DD) was employed to confirm these rearrangements and to analyze the breakpoints. The exchange of distal regions of chromosome arms 4AS and 4AL due to pericentric inversion Inv(4AS;4AL)1 was detected, and breakpoints were validated with an optical Bionano genome map. Both breakpoints contained satellite DNA. The breakpoints of reciprocal translocation T(4AL;7BS)1 were also found. However, the breakpoints that generated paracentric inversion Inv(4AL;4AL)1 appeared to be collocated with the 4AL breakpoints that had produced Inv(4AS;4AL)1 and T(4AL;7BS)1. Inv(4AS;4AL)1, Inv(4AL;4AL)1, and T(4AL;7BS)1 either originated sequentially, and Inv(4AL;4AL)1 was produced by recurrent chromosome breaks at the same breakpoints that generated Inv(4AS;4AL)1 and T(4AL;7BS)1, or Inv(4AS;4AL)1, Inv(4AL;4AL)1, and T(4AL;7BS)1 originated simultaneously. We prefer the latter hypothesis since it makes fewer assumptions about the sequence of events that produced these chromosome rearrangements.


September 22, 2019

Genomic evidence for asymmetric introgression by sexual selection in the common wall lizard.

Strongly selected characters can be transferred from one lineage to another with limited genetic exchange, resulting in asymmetric introgression and a mosaic genome in the receiving population. However, systems are rarely sufficiently well studied to link the pattern of introgression to its underlying process. Male common wall lizards in western Italy exhibit exaggeration of a suite of sexually selected characters that make them outcompete males from a distantly related lineage that lack these characters. This results in asymmetric hybridization and adaptive introgression of the suite of characters following secondary contact. We developed genomewide markers to infer the demographic history of gene flow between different genetic lineages, identify the spread of the sexually selected syndrome, and test the prediction that introgression should be asymmetric and heterogeneous across the genome. Our results show that secondary contact was accompanied by gene flow in both directions across most of the genome, but with approximately 3% of the genome showing highly asymmetric introgression in the predicted direction. Demographic simulations reveal that this asymmetric gene flow is more recent than the initial secondary contact, and the data suggest that the exaggerated male sexual characters originated within the Italian lineage and subsequently spread throughout this lineage before eventually reaching the contact zone. These results demonstrate that sexual selection can cause a suite of characters to spread throughout both closely and distantly related lineages with limited gene flow across the genome at large.© 2018 John Wiley & Sons Ltd.


September 22, 2019

Evaluation of bacterial contamination in goat milk powder using PacBio Single Molecule Real-Time Sequencing and Droplet Digital PCR.

Goat milk powder is a nutritious and easy-to-store product that is highly favored by consumers. However, the presence of contaminating bacteria and their metabolites may significantly affect the flavor, solubility, shelf life, and safety of the product. To comprehensively and accurately understand the sanitary conditions in the goat milk powder production process and potential threats from bacterial contamination, a combination of Pacific Biosciences single molecule real-time sequencing and droplet digital PCR was used to evaluate bacterial contamination in seven goat milk powder samples from three dairies. Ten phyla, 119 genera, and 249 bacterial species were identified. Bacillus, Paenibacillus, Lactococcus, and Cronobacter were the primary genera. Bacillus cereus, Lactococcus lactis, Alkaliphilus oremlandii, and Cronobacter sakazakii were the dominant species. With droplet digital PCR, 6.3 × 104 copies per g of Bacillus cereus and 1.0 × 104 copies per g of Cronobacter spp. were quantified, which may increase the risk of food spoilage and the probability of foodborne illness and should be monitored and controlled. This study offers a new approach for evaluating bacterial contamination in goat milk powder and supplies a reference for the assessment of food safety and control of potential risk, which will be of interest to the dairy industry.


September 22, 2019

Genomic discovery of the hypsin gene and biosynthetic pathways for terpenoids in Hypsizygus marmoreus.

Hypsizygus marmoreus (Beech mushroom) is a popular ingredient in Asian cuisine. The medicinal effects of its bioactive compounds such as hypsin and hypsiziprenol have been reported, but the genetic basis or biosynthesis of these components is unknown.In this study, we sequenced a reference strain of H. marmoreus (Haemi 51,987-8). We evaluated various assembly strategies, and as a result the Allpaths and PBJelly produced the best assembly. The resulting genome was 42.7 Mbp in length and annotated with 16,627 gene models. A putative gene (Hypma_04324) encoding the antifungal and antiproliferative hypsin protein with 75% sequence identity with the previously known N-terminal sequence was identified. Carbohydrate active enzyme analysis displayed the typical feature of white-rot fungi where auxiliary activity and carbohydrate-binding modules were enriched. The genome annotation revealed four terpene synthase genes responsible for terpenoid biosynthesis. From the gene tree analysis, we identified that terpene synthase genes can be classified into six clades. Four terpene synthase genes of H. marmoreus belonged to four different groups that implies they may be involved in the synthesis of different structures of terpenes. A terpene synthase gene cluster was well-conserved in Agaricomycetes genomes, which contained known biosynthesis and regulatory genes.Genome sequence analysis of this mushroom led to the discovery of the hypsin gene. Comparative genome analysis revealed the conserved gene cluster for terpenoid biosynthesis in the genome. These discoveries will further our understanding of the biosynthesis of medicinal bioactive molecules in this edible mushroom.


September 22, 2019

Comparative genomic and methylome analysis of non-virulent D74 and virulent Nagasaki Haemophilus parasuis isolates.

Haemophilus parasuis is a respiratory pathogen of swine and the etiological agent of Glässer’s disease. H. parasuis isolates can exhibit different virulence capabilities ranging from lethal systemic disease to subclinical carriage. To identify genomic differences between phenotypically distinct strains, we obtained the closed whole-genome sequence annotation and genome-wide methylation patterns for the highly virulent Nagasaki strain and for the non-virulent D74 strain. Evaluation of the virulence-associated genes contained within the genomes of D74 and Nagasaki led to the discovery of a large number of toxin-antitoxin (TA) systems within both genomes. Five predicted hemolysins were identified as unique to Nagasaki and seven putative contact-dependent growth inhibition toxin proteins were identified only in strain D74. Assessment of all potential vtaA genes revealed thirteen present in the Nagasaki genome and three in the D74 genome. Subsequent evaluation of the predicted protein structure revealed that none of the D74 VtaA proteins contain a collagen triple helix repeat domain. Additionally, the predicted protein sequence for two D74 VtaA proteins is substantially longer than any predicted Nagasaki VtaA proteins. Fifteen methylation sequence motifs were identified in D74 and fourteen methylation sequence motifs were identified in Nagasaki using SMRT sequencing analysis. Only one of the methylation sequence motifs was observed in both strains indicative of the diversity between D74 and Nagasaki. Subsequent analysis also revealed diversity in the restriction-modification systems harbored by D74 and Nagasaki. The collective information reported in this study will aid in the development of vaccines and intervention strategies to decrease the prevalence and disease burden caused by H. parasuis.


September 22, 2019

Characterization of Streptococcus pluranimalium from a cattle with mastitis by whole genome sequencing and functional validation.

Streptococcus pluranimalium is a new member of the Streptococcus genus isolated from multiple different animal hosts. It has been identified as a pathogen associated with subclinical mastitis, valvular endocarditis and septicaemia in animals. Moreover, this bacterium has emerged as a new pathogen for human infective endocarditis and brain abscess. However, the patho-biological properties of S. pluranimalium remain virtually unknown. The aim of this study was to determine the complete genome sequence of S. pluranimalium strain TH11417 isolated from a cattle with mastitis, and to characterize its antimicrobial resistance, virulence, and carbon catabolism.The genome of S. pluranimalium TH11417, determined by single-molecule real-time (SMRT) sequencing, consists of 2,065,522 base pair (bp) with a G?+?C content of 38.65%, 2,007 predicted coding sequence (CDS), 58 transfer RNA (tRNA) genes and five ribosome RNA (rRNA) operons. It contains a novel ISSpl1 element (a memeber of the IS3 family) and a ?11417.1 prophage that carries the mef(A), msr(D) and lnu(C) genes. Consistently, our antimicrobial susceptibility test confirmed that S. pluranimalium TH11417 was resistant to erythromycin and lincomycin. However, this strain did not show virulence in murine pneumonia (intranasal inoculation, 107 colony forming unit – CFU) and sepsis (intraperitoneal inoculation, 107 CFU) models. Additionally, this strain is able to grow with glucose, lactose or galactose as the sole carbon source, and possesses a lactose-specific phosphoenolpyruvate-dependent phosphotransferase system (PTS).We reported the first whole genome sequence of S. pluranimalium isolated from a cattle with mastitis. It harbors a prophage carrying the mef(A), msr(D) and lnu(C) genes, and is avirulent in the murine infection model.


September 22, 2019

Comprehensive profiling of four base overhang ligation fidelity by T4 DNA Ligase and application to DNA assembly.

Synthetic biology relies on the manufacture of large and complex DNA constructs from libraries of genetic parts. Golden Gate and other Type IIS restriction enzyme-dependent DNA assembly methods enable rapid construction of genes and operons through one-pot, multifragment assembly, with the ordering of parts determined by the ligation of Watson-Crick base-paired overhangs. However, ligation of mismatched overhangs leads to erroneous assembly, and low-efficiency Watson Crick pairings can lead to truncated assemblies. Using sets of empirically vetted, high-accuracy junction pairs avoids this issue but limits the number of parts that can be joined in a single reaction. Here, we report the use of comprehensive end-joining ligation fidelity and bias data to predict high accuracy junction sets for Golden Gate assembly. The ligation profile accurately predicted junction fidelity in ten-fragment Golden Gate assembly reactions and enabled accurate and efficient assembly of a lac cassette from up to 24-fragments in a single reaction.


September 22, 2019

Streptococcus suis contains multiple phase-variable methyltransferases that show a discrete lineage distribution.

Streptococcus suis is a major pathogen of swine, responsible for a number of chronic and acute infections, and is also emerging as a major zoonotic pathogen, particularly in South-East Asia. Our study of a diverse population of S. suis shows that this organism contains both Type I and Type III phase-variable methyltransferases. In all previous examples, phase-variation of methyltransferases results in genome wide methylation differences, and results in differential regulation of multiple genes, a system known as the phasevarion (phase-variable regulon). We hypothesized that each variant in the Type I and Type III systems encoded a methyltransferase with a unique specificity, and could therefore control a distinct phasevarion, either by recombination-driven shuffling between different specificities (Type I) or by biphasic on-off switching via simple sequence repeats (Type III). Here, we present the identification of the target specificities for each Type III allelic variant from S. suis using single-molecule, real-time methylome analysis. We demonstrate phase-variation is occurring in both Type I and Type III methyltransferases, and show a distinct association between methyltransferase type and presence, and population clades. In addition, we show that the phase-variable Type I methyltransferase was likely acquired at the origin of a highly virulent zoonotic sub-population.


September 22, 2019

Unraveling microbial communities associated with methylmercury production in paddy soils.

Rice consumption is now recognized as an important pathway of human exposure to the neurotoxin methylmercury (MeHg), particularly in countries where rice is a staple food. Although the discovery of a two-gene cluster hgcAB has linked Hg methylation to several phylogenetically diverse groups of anaerobic microorganisms converting inorganic mercury (Hg) to MeHg, the prevalence and diversity of Hg methylators in microbial communities of rice paddy soils remain unclear. We characterized the abundance and distribution of hgcAB genes using third-generation PacBio long-read sequencing and Illumina short-read metagenomic sequencing, in combination with quantitative PCR analyses in several mine-impacted paddy soils from southwest China. Both Illumina and PacBio sequencing analyses revealed that Hg methylating communities were dominated by iron-reducing bacteria (i.e., Geobacter) and methanogens, with a relatively low abundance of hgcA + sulfate-reducing bacteria in the soil. A positive correlation was observed between the MeHg content in soil and the relative abundance of Geobacter carrying the hgcA gene. Phylogenetic analysis also uncovered some hgcAB sequences closely related to three novel Hg methylators, Geobacter anodireducens, Desulfuromonas sp. DDH964, and Desulfovibrio sp. J2, among which G. anodireducens was validated for its ability to methylate Hg. These findings shed new light on microbial community composition and major clades likely driving Hg methylation in rice paddy soils.


September 22, 2019

Whole-genome sequencing of Chinese yellow catfish provides a valuable genetic resource for high-throughput identification of toxin genes.

Naturally derived toxins from animals are good raw materials for drug development. As a representative venomous teleost, Chinese yellow catfish (Pelteobagrus fulvidraco) can provide valuable resources for studies on toxin genes. Its venom glands are located in the pectoral and dorsal fins. Although with such interesting biologic traits and great value in economy, Chinese yellow catfish is still lacking a sequenced genome. Here, we report a high-quality genome assembly of Chinese yellow catfish using a combination of next-generation Illumina and third-generation PacBio sequencing platforms. The final assembly reached 714 Mb, with a contig N50 of 970 kb and a scaffold N50 of 3.65 Mb, respectively. We also annotated 21,562 protein-coding genes, in which 97.59% were assigned at least one functional annotation. Based on the genome sequence, we analyzed toxin genes in Chinese yellow catfish. Finally, we identified 207 toxin genes and classified them into three major groups. Interestingly, we also expanded a previously reported sex-related region (to ˜6 Mb) in the achieved genome assembly, and localized two important toxin genes within this region. In summary, we assembled a high-quality genome of Chinese yellow catfish and performed high-throughput identification of toxin genes from a genomic view. Therefore, the limited number of toxin sequences in public databases will be remarkably improved once we integrate multi-omics data from more and more sequenced species.


September 22, 2019

The chromosome-level quality genome provides insights into the evolution of the biosynthesis genes for aroma compounds of Osmanthus fragrans.

Sweet osmanthus (Osmanthus fragrans) is a very popular ornamental tree species throughout Southeast Asia and USA particularly for its extremely fragrant aroma. We constructed a chromosome-level reference genome of O. fragrans to assist in studies of the evolution, genetic diversity, and molecular mechanism of aroma development. A total of over 118?Gb of polished reads was produced from HiSeq (45.1?Gb) and PacBio Sequel (73.35?Gb), giving 100× depth coverage for long reads. The combination of Illumina-short reads, PacBio-long reads, and Hi-C data produced the final chromosome quality genome of O. fragrans with a genome size of 727?Mb and a heterozygosity of 1.45 %. The genome was annotated using de novo and homology comparison and further refined with transcriptome data. The genome of O. fragrans was predicted to have?45,542 genes, of which 95.68 % were functionally annotated. Genome annotation found 49.35 % as the repetitive sequences, with long terminal repeats (LTR) being the richest (28.94 %). Genome evolution analysis indicated the evidence of whole-genome duplication 15 million years ago, which contributed to the current content of 45,242 genes. Metabolic analysis revealed that linalool, a monoterpene is the main aroma compound. Based on the genome and transcriptome, we further demonstrated the direct connection between terpene synthases (TPSs) and the rich aromatic molecules in O. fragrans. We identified three new flower-specific TPS genes, of which the expression coincided with the production of linalool. Our results suggest that the high number of TPS genes and the flower tissue- and stage-specific TPS genes expressions might drive the strong unique aroma production of O. fragrans.


September 22, 2019

A strain of an emerging Indian Xanthomonas oryzae pv. oryzae pathotype defeats the rice bacterial blight resistance gene xa13 without inducing a clade III SWEET gene and is nearly identical to a recent Thai isolate.

The rice bacterial blight pathogen Xanthomonas oryzae pv. oryzae (Xoo) injects transcription activator-like effectors (TALEs) that bind and activate host “susceptibility” (S) genes important for disease. Clade III SWEET genes are major S genes for bacterial blight. The resistance genes xa5, which reduces TALE activity generally, and xa13, a SWEET11 allele not recognized by the cognate TALE, have been effectively deployed. However, strains that defeat both resistance genes individually were recently reported in India and Thailand. To gain insight into the mechanism(s), we completely sequenced the genome of one such strain from each country and examined the encoded TALEs. Strikingly, the two strains are clones, sharing nearly identical TALE repertoires, including a TALE known to activate SWEET11 strongly enough to be effective even when diminished by xa5. We next investigated SWEET gene induction by the Indian strain. The Indian strain induced no clade III SWEET in plants harboring xa13, indicating a pathogen adaptation that relieves dependence on these genes for susceptibility. The findings open a door to mechanistic understanding of the role SWEET genes play in susceptibility and illustrate the importance of complete genome sequence-based monitoring of Xoo populations in developing varieties with effective disease resistance.


September 22, 2019

Phenotypic and genomic comparison of Photorhabdus luminescens subsp. laumondii TT01 and a widely used rifampicin-resistant Photorhabdus luminescens laboratory strain.

Photorhabdus luminescens is an enteric bacterium, which lives in mutualistic association with soil nematodes and is highly pathogenic for a broad spectrum of insects. A complete genome sequence for the type strain P. luminescens subsp. laumondii TT01, which was originally isolated in Trinidad and Tobago, has been described earlier. Subsequently, a rifampicin resistant P. luminescens strain has been generated with superior possibilities for experimental characterization. This strain, which is widely used in research, was described as a spontaneous rifampicin resistant mutant of TT01 and is known as TT01-RifR.Unexpectedly, upon phenotypic comparison between the rifampicin resistant strain and its presumed parent TT01, major differences were found with respect to bioluminescence, pigmentation, biofilm formation, haemolysis as well as growth. Therefore, we renamed the strain TT01-RifR to DJC. To unravel the genomic basis of the observed differences, we generated a complete genome sequence for strain DJC using the PacBio long read technology. As strain DJC was supposed to be a spontaneous mutant, only few sequence differences were expected. In order to distinguish these from potential sequencing errors in the published TT01 genome, we re-sequenced a derivative of strain TT01 in parallel, also using the PacBio technology. The two TT01 genomes differed at only 30 positions. In contrast, the genome of strain DJC varied extensively from TT01, showing 13,000 point mutations, 330 frameshifts, and 220 strain-specific regions with a total length of more than 300 kb in each of the compared genomes.According to the major phenotypic and genotypic differences, the rifampicin resistant P. luminescens strain, now named strain DJC, has to be considered as an independent isolate rather than a derivative of strain TT01. Strains TT01 and DJC both belong to P. luminescens subsp. laumondii.


September 22, 2019

Correcting palindromes in long reads after whole-genome amplification.

Next-generation sequencing requires sufficient DNA to be available. If limited, whole-genome amplification is applied to generate additional amounts of DNA. Such amplification often results in many chimeric DNA fragments, in particular artificial palindromic sequences, which limit the usefulness of long sequencing reads.Here, we present Pacasus, a tool for correcting such errors. Two datasets show that it markedly improves read mapping and de novo assembly, yielding results similar to these that would be obtained with non-amplified DNA.With Pacasus long-read technologies become available for sequencing targets with very small amounts of DNA, such as single cells or even single chromosomes.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.