In the past two decades, Chinese scientists have achieved significant progress on three aspects of wheat genetic transformation. First, the wheat transformation platform has been established and optimized to improve the transformation efficiency, shorten the time required from starting of transformation procedure to the fertile transgenic wheat plants obtained as well as to overcome the problem of genotype-dependent for wheat genetic transformation in wide range of wheat elite varieties. Second, with the help of many emerging techniques such as CRISPR/cas9 function of over 100 wheat genes has been investigated. Finally, modern technology has been combined with the traditional breeding technique such as crossing to accelerate the application of wheat transformation. Overall, the wheat end-use quality and the characteristics of wheat stress tolerance have been improved by wheat genetic engineering technique. So far, wheat transgenic lines integrated with quality-improved genes and stress tolerant genes have been on the way of Production Test stage in the field. The debates and the future studies on wheat transformation have been discussed, and the brief summary of Chinese wheat breeding research history has also been provided in this review.
Endogenous sequence patterns predispose the repair modes of CRISPR/Cas9-induced DNA double-stranded breaks in Arabidopsis thaliana.
The possibility to predict the outcome of targeted DNA double-stranded break (DSB) repair would be desirable for genome editing. Furthermore the consequences of mis-repair of potentially cell-lethal DSBs and the underlying pathways are not yet fully understood. Here we study the clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-induced mutation spectra at three selected endogenous loci in Arabidopsis thaliana by deep sequencing of long amplicon libraries. Notably, we found sequence-dependent genomic features that affected the DNA repair outcome. Deletions of 1-bp to <1000-bp size and/or very short insertions, deletions >1 kbp (all due to NHEJ) and deletions combined with insertions between 5-bp to >100 bp [caused by a synthesis-dependent strand annealing (SDSA)-like mechanism] occurred most frequently at all three loci. The appearance of single-stranded annealing events depends on the presence and distance between repeats flanking the DSB. The frequency and size of insertions is increased if a sequence with high similarity to the target site was available in cis. Most deletions were linked to pre-existing microhomology. Deletion and/or insertion mutations were blunt-end ligated or via de novo generated microhomology. While most mutation types and, to some degree, their predictability are comparable with animal systems, the broad range of deletion mutations seems to be a peculiar feature of the plant A. thaliana.© 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
A Gram-stain-positive, catalase-positive and pleomorphic rod organism was isolated from malted barley in Finland, classified initially by partial 16S rRNA gene sequencing and originally deposited in the VTT Culture Collection as a strain of Propionibacterium acidipropionici (currently Acidipropionibacterium acidipropionici). The subsequent comparison of the whole 16S rRNA gene with other representatives of the genus Acidipropionibacterium revealed that the strain belongs to a novel species, most closely related to Acidipropionibacterium microaerophilum and Acidipropionibacterium acidipropionici, with similarity values of 98.46 and 98.31?%, respectively. The whole genome sequencing using PacBio RS II platform allowed further comparison of the genome with all of the other DNA sequences available for the type strains of the Acidipropionibacterium species. Those comparisons revealed the highest similarity of strain JS278T to A. acidipropionici, which was confirmed by the average nucleotide identity analysis. The genome of strain JS278T is intermediate in size compared to the A. acidipropionici and Acidipropionibacterium jensenii at 3?432?872?bp, the G+C?content is 68.4?mol%. The strain fermented a wide range of carbon sources, and produced propionic acid as the major fermentation product. Besides its poor ability to grow at 37?°C and positive catalase reaction, the observed phenotype was almost indistinguishable from those of A. acidipropionici and A. jensenii. Based on our findings, we conclude that the organism represents a novel member of the genus Acidipropionibacterium, for which we propose the name Acidipropionibacteriumvirtanenii sp. nov. The type strain is JS278T (=VTT E-113202T=DSM 106790T).
Cereal grasses of the Triticeae tribe have been the major food source in temperate regions since the dawn of agriculture. Their large genomes are characterized by a high content of repetitive elements and large pericentromeric regions that are virtually devoid of meiotic recombination. Here we present a high-quality reference genome assembly for barley (Hordeum vulgare L.). We use chromosome conformation capture mapping to derive the linear order of sequences across the pericentromeric space and to investigate the spatial organization of chromatin in the nucleus at megabase resolution. The composition of genes and repetitive elements differs between distal and proximal regions. Gene family analyses reveal lineage-specific duplications of genes involved in the transport of nutrients to developing seeds and the mobilization of carbohydrates in grains. We demonstrate the importance of the barley reference sequence for breeding by inspecting the genomic partitioning of sequence variation in modern elite germplasm, highlighting regions vulnerable to genetic erosion.
Cow, yak, and camel milk diets differentially modulated the systemic immunity and fecal microbiota of rats
Cow milk is most widely consumed; however, non-cattle milk has gained increasing interest because of added nutritive values. We compared the health effects of yak, cow, and camel milk in rats. By measuring several plasma immune factors, significantly more interferon-? was detected in the camel than the yak (P=0.0020) or cow (P=0.0062) milk group. Significantly more IgM was detected in the yak milk than the control group (P=0.0071). The control group had significantly less interleukin 6 than the yak (P=0.0499) and cow (P=0.0248) milk groups. The fecal microbiota of the 144 samples comprised mainly of the Firmicutes (76.70±11.03%), Bacteroidetes (15.27±7.79%), Proteobacteria (3.61±4.34%), and Tenericutes (2.61±2.53%) phyla. Multivariate analyses revealed a mild shift in the fecal microbiota along the milk treatment. We further identified the differential microbes across the four groups. At day 14, 22 and 28 differential genera and species were identified (P=0.0000–0.0462), while 8 and 11 differential genera and species (P=0.0000–0.0013) were found at day 28. Some short-chain fatty acid and succinate producers increased, while certain health-concerned bacteria (Prevotella copri, Phascolarctobacterium faecium, and Bacteroides uniformis) decreased after 14days of yak or camel milk treatment. We demonstrated that different animal milk could confer distinctive nutritive value to the host.
The genome of an underwater architect, the caddisfly Stenopsyche tienmushanensis Hwang (Insecta: Trichoptera).
Caddisflies (Insecta: Trichoptera) are a highly adapted freshwater group of insects split from a common ancestor with Lepidoptera. They are the most diverse (>16,000 species) of the strictly aquatic insect orders and are widely employed as bio-indicators in water quality assessment and monitoring. Among the numerous adaptations to aquatic habitats, caddisfly larvae use silk and materials from the environment (e.g., stones, sticks, leaf matter) to build composite structures such as fixed retreats and portable cases. Understanding how caddisflies have adapted to aquatic habitats will help explain the evolution and subsequent diversification of the group.We sequenced a retreat-builder caddisfly Stenopsyche tienmushanensis Hwang and assembled a high-quality genome from both Illumina and Pacific Biosciences (PacBio) sequencing. In total, 601.2 M Illumina reads (90.2 Gb) and 16.9 M PacBio subreads (89.0 Gb) were generated. The 451.5 Mb assembled genome has a contig N50 of 1.29 M, has a longest contig of 4.76 Mb, and covers 97.65% of the 1,658 insect single-copy genes as assessed by Benchmarking Universal Single-Copy Orthologs. The genome comprises 36.76% repetitive elements. A total of 14,672 predicted protein-coding genes were identified. The genome revealed gene expansions in specific groups of the cytochrome P450 family and olfactory binding proteins, suggesting potential genomic features associated with pollutant tolerance and mate finding. In addition, the complete gene complex of the highly repetitive H-fibroin, the major protein component of caddisfly larval silk, was assembled.We report the draft genome of Stenopsyche tienmushanensis, the highest-quality caddisfly genome so far. The genome information will be an important resource for the study of caddisflies and may shed light on the evolution of aquatic insects.
Gut microbiota is a determining factor in human physiological functions and health. It is commonly accepted that diet has a major influence on the gut microbial community, however, the effects of diet is not fully understood. The typical Mongolian diet is characterized by high and frequent consumption of fermented dairy products and red meat, and low level of carbohydrates. In this study, the gut microbiota profile of 26 Mongolians whom consumed wheat, rice and oat as the sole carbohydrate staple food for a week each consecutively was determined. It was observed that changes in staple carbohydrate rapidly (within a week) altered gut microbial community structure and metabolic pathway of the subjects. Wheat and oat favored bifidobacteria (Bifidobacterium catenulatum, Bifodobacteriumbifidum, Bifidobacterium adolescentis); whereas rice suppressed bifidobacteria (Bifidobacterium longum, Bifidobacterium adolescentis) and wheat suppresses Lactobaciilus, Ruminococcus and Bacteroides. The study exhibited two gut microbial clustering patterns with the preference of fucosyllactose utilization linking to fucosidase genes (glycoside hydrolase family classifications: GH95 and GH29) encoded by Bifidobacterium, and xylan and arabinoxylan utilization linking to xylanase and arabinoxylanase genes encoded by Bacteroides. There was also a correlation between Lactobacillus ruminis and sialidase, as well as Butyrivibrio crossotus and xylanase/xylosidase. Meanwhile, a strong concordance was found between the gastrointestinal bacterial microbiome and the intestinal virome. Present research will contribute to understanding the impacts of the dietary carbohydrate on human gut microbiome, which will ultimately help understand relationships between dietary factor, microbial populations, and the health of global humans.
A report on the International Plant and Animal Genomes (PAG) conference held in San Diego, USA, 13-17 January 2018.
An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations.
Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop. © 2017 Clavijo et al.; Published by Cold Spring Harbor Laboratory Press.
Forty years ago the advent of Sanger sequencing was revolutionary as it allowed complete genome sequences to be deciphered for the first time. A second revolution came when next-generation sequencing (NGS) technologies appeared, which made genome sequencing much cheaper and faster. However, NGS methods have several drawbacks and pitfalls, most notably their short reads. Recently, third-generation/long-read methods appeared, which can produce genome assemblies of unprecedented quality. Moreover, these technologies can directly detect epigenetic modifications on native DNA and allow whole-transcript sequencing without the need for assembly. This marks the third revolution in sequencing technology. Here we review and compare the various long-read methods. We discuss their applications and their respective strengths and weaknesses and provide future perspectives. Copyright © 2018 Elsevier Ltd. All rights reserved.
Community profiling of Fusarium in combination with other plant associated fungi in different crop species using SMRT Sequencing.
Fusarium head blight, caused by fungi from the genus Fusarium, is one of the most harmful cereal diseases, resulting not only in severe yield losses but also in mycotoxin contaminated and health-threatening grains. Fusarium head blight is caused by a diverse set of species that have different host ranges, mycotoxin profiles and responses to agricultural practices. Thus, understanding the composition of Fusarium communities in the field is crucial for estimating their impact and also for the development of effective control measures. Up to now, most molecular tools that monitor Fusarium communities on plants are limited to certain species and do not distinguish other plant associated fungi. To close these gaps, we developed a sequencing-based community profiling methodology for crop-associated fungi with a focus on the genus Fusarium. By analyzing a 1600 bp long amplicon spanning the highly variable segments ITS and D1-D3 of the ribosomal operon by PacBio SMRT sequencing, we were able to robustly quantify Fusarium down to species level through clustering against reference sequences. The newly developed methodology was successfully validated in mock communities and provided similar results as the culture-based assessment of Fusarium communities by seed health tests in grain samples from different crop species. Finally, we exemplified the newly developed methodology in a field experiment with a wheat-maize crop sequence under different cover crop and tillage regimes. We analyzed wheat straw residues, cover crop shoots and maize grains and we could reveal that the cover crop hairy vetch (Vicia villosa) acts as a potent alternative host for Fusarium (OTU F.ave/tri) showing an eightfold higher relative abundance compared with other cover crop treatments. Moreover, as the newly developed methodology also allows to trace other crop-associated fungi, we found that vetch and green fallow hosted further fungal plant pathogens including Zymoseptoria tritici. Thus, besides their beneficial traits, cover crops can also entail phytopathological risks by acting as alternative hosts for Fusarium and other noxious plant pathogens. The newly developed sequencing based methodology is a powerful diagnostic tool to trace Fusarium in combination with other fungi associated to different crop species.
Genome-wide identification and analysis of the ALTERNATIVE OXIDASE gene family in diploid and hexaploid wheat.
A comprehensive understanding of wheat responses to environmental stress will contribute to the long-term goal of feeding the planet. ALERNATIVE OXIDASE (AOX) genes encode proteins involved in a bypass of the electron transport chain and are also known to be involved in stress tolerance in multiple species. Here, we report the identification and characterization of the AOX gene family in diploid and hexaploid wheat. Four genes each were found in the diploid ancestors Triticum urartu, and Aegilops tauschii, and three in Aegilops speltoides. In hexaploid wheat (Triticum aestivum), 20 genes were identified, some with multiple splice variants, corresponding to a total of 24 proteins for those with observed transcription and translation. These proteins were classified as AOX1a, AOX1c, AOX1e or AOX1d via phylogenetic analysis. Proteins lacking most or all signature AOX motifs were assigned to putative regulatory roles. Analysis of protein-targeting sequences suggests mixed localization to the mitochondria and other organelles. In comparison to the most studied AOX from Trypanosoma brucei, there were amino acid substitutions at critical functional domains indicating possible role divergence in wheat or grasses in general. In hexaploid wheat, AOX genes were expressed at specific developmental stages as well as in response to both biotic and abiotic stresses such as fungal pathogens, heat and drought. These AOX expression patterns suggest a highly regulated and diverse transcription and expression system. The insights gained provide a framework for the continued and expanded study of AOX genes in wheat for stress tolerance through breeding new varieties, as well as resistance to AOX-targeted herbicides, all of which can ultimately be used synergistically to improve crop yield.
Biogas production from hydrothermal liquefaction wastewater (HTLWW): Focusing on the microbial communities as revealed by high-throughput sequencing of full-length 16S rRNA genes.
Hydrothermal liquefaction (HTL) is an emerging and promising technology for the conversion of wet biomass into bio-crude, however, little attention has been paid to the utilization of hydrothermal liquefaction wastewater (HTLWW) with high concentration of organics. The present study investigated biogas production from wastewater obtained from HTL of straw for bio-crude production, with focuses on the analysis of the microbial communities and characterization of the organics. Batch experiments showed the methane yield of HTLWW (R-HTLWW) was 184 mL/g COD, while HTLWW after petroleum ether extraction (PE-HTLWW), to extract additional bio-crude, had higher methane yield (235 mL/g COD) due to the extraction of recalcitrant organic compounds. Sequential batch experiments further demonstrated the higher methane yield of PE-HTLWW. LC-TOF-MS, HPLC and gel filtration chromatography showed organics with molecular weight (MW) < 1000 were well degraded. Results from the high-throughput sequencing of full-length 16S rRNA genes analysis showed similar microbial community compositions were obtained for the reactors fed with either R-HTLWW or PE-HTLWW. The degradation of fatty acids were related with Mesotoga infera, Syntrophomonas wolfei et al. by species level identification. However, the species related to the degradation of other compounds (e.g. phenols) were not found, which could be due to the presence of uncharacterized microorganisms. It was also found previously proposed criteria (97% and 98.65% similarity) for species identification of 16S rRNA genes were not suitable for a fraction of 16S rRNA genes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Full-length transcriptome survey and expression analysis of Cassia obtusifolia to discover putative genes related to aurantio-obtusin biosynthesis, seed formation and development, and stress response.
The seed is the pharmaceutical and breeding organ of Cassia obtusifolia, a well-known medical herb containing aurantio-obtusin (a kind of anthraquinone), food, and landscape. In order to understand the molecular mechanism of the biosynthesis of aurantio-obtusin, seed formation and development, and stress response of C. obtusifolia, it is necessary to understand the genomics information. Although previous seed transcriptome of C. obtusifolia has been carried out by short-read next-generation sequencing (NGS) technology, the vast majority of the resulting unigenes did not represent full-length cDNA sequences and supply enough gene expression profile information of the various organs or tissues. In this study, fifteen cDNA libraries, which were constructed from the seed, root, stem, leaf, and flower (three repetitions with each organ) of C. obtusifolia, were sequenced using hybrid approach combining single-molecule real-time (SMRT) and NGS platform. More than 4,315,774 long reads with 9.66 Gb sequencing data and 361,427,021 short reads with 108.13 Gb sequencing data were generated by SMRT and NGS platform, respectively. 67,222 consensus isoforms were clustered from the reads and 81.73% (61,016) of which were longer than 1000 bp. Furthermore, the 67,222 consensus isoforms represented 58,106 nonredundant transcripts, 98.25% (57,092) of which were annotated and 25,573 of which were assigned to specific metabolic pathways by KEGG. CoDXS and CoDXR genes were directly used for functional characterization to validate the accuracy of sequences obtained from transcriptome. A total of 658 seed-specific transcripts indicated their special roles in physiological processes in seed. Analysis of transcripts which were involved in the early stage of anthraquinone biosynthesis suggested that the aurantio-obtusin in C. obtusifolia was mainly generated from isochorismate and Mevalonate/methylerythritol phosphate (MVA/MEP) pathway, and three reactions catalyzed by Menaquinone-specific isochorismate synthase (ICS), 1-deoxy-d-xylulose-5-phosphate synthase (DXS) and isopentenyl diphosphate (IPPS) might be the limited steps. Several seed-specific CYPs, SAM-dependent methyltransferase, and UDP-glycosyltransferase (UDPG) supplied promising candidate genes in the late stage of anthraquinone biosynthesis. In addition, four seed-specific transcriptional factors including three MYB Transcription Factor (MYB) and one MADS-box Transcription Factor (MADS) transcriptional factors) and alternative splicing might be involved with seed formation and development. Meanwhile, most members of Hsp20 genes showed high expression level in seed and flower; seven of which might have chaperon activities under various abiotic stresses. Finally, the expressional patterns of genes with particular interests showed similar trends in both transcriptome assay and qRT-PCR. In conclusion, this is the first full-length transcriptome sequencing reported in Caesalpiniaceae family, and thus providing a more complete insight into aurantio-obtusin biosynthesis, seed formation and development, and stress response as well in C. obtusifolia.
Lentinula edodes is a popular, cultivated edible and medicinal mushroom. Lentinula edodes is susceptible to postharvest problems, such as gill browning, fruiting body softening, and lentinan degradation. We constructed a de novo assembly draft genome sequence and performed gene prediction for Lentinula edodesDe novo assembly was carried out using short reads from paired-end and mate-paired libraries and by using long reads by PacBio, resulting in a contig number of 1,951 and an N50 of 1 Mb. Furthermore, we predicted genes by Augustus using transcriptome sequencing (RNA-seq) data from the whole life cycle of Lentinula edodes, resulting in 12,959 predicted genes. This analysis revealed that Lentinula edodes lacks lignin peroxidase. To reveal genes involved in the loss of quality of Lentinula edodes postharvest fruiting bodies, transcriptome analysis was carried out using serial analysis of gene expression (SuperSAGE). This analysis revealed that many cell wall-related enzymes are upregulated after harvest, such as ß-1,3-1,6-glucan-degrading enzymes in glycoside hydrolase (GH) families GH5, GH16, GH30, GH55, and GH128, and thaumatin-like proteins. In addition, we found that several chitin-related genes are upregulated, such as putative chitinases in GH family 18, exochitinases in GH20, and a putative chitosanase in GH family 75. The results suggest that cell wall-degrading enzymes synergistically cooperate for rapid fruiting body autolysis. Many putative transcription factor genes were upregulated postharvest, such as genes containing high-mobility-group (HMG) domains and zinc finger domains. Several cell death-related proteins were also upregulated postharvest.IMPORTANCE Our data collectively suggest that there is a rapid fruiting body autolysis system in Lentinula edodes The genes for the loss of postharvest quality newly found in this research will be targets for the future breeding of strains that keep fresh longer than present strains. De novoLentinula edodes genome assembly data will be used for the construction of a complete Lentinula edodes chromosome map for future breeding. Copyright © 2017 American Society for Microbiology.