Menu
April 21, 2020

A chromosome-scale genome assembly of cucumber (Cucumis sativus L.).

Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential.We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (~211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants.This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

The genome assembly and annotation of yellowhorn (Xanthoceras sorbifolium Bunge).

Yellowhorn (Xanthoceras sorbifolium Bunge), a deciduous shrub or small tree native to north China, is of great economic value. Seeds of yellowhorn are rich in oil containing unsaturated long-chain fatty acids that have been used for producing edible oil and nervonic acid capsules. However, the lack of a high-quality genome sequence hampers the understanding of its evolution and gene functions.In this study, a whole genome of yellowhorn was sequenced and assembled by integration of Illumina sequencing, Pacific Biosciences single-molecule real-time sequencing, 10X Genomics linked reads, Bionano optical maps, and Hi-C. The yellowhorn genome assembly was 439.97 Mb, which comprised 15 pseudo-chromosomes covering 95.42% (419.84 Mb) of the assembled genome. The repetitive fractions accounted for 56.39% of the yellowhorn genome. The genome contained 21,059 protein-coding genes. Of them, 18,503 (87.86%) genes were found to be functionally annotated with =1 “annotation” term by searching against other databases. Transcriptomic analysis showed that 341, 135, 125, 113, and 100 genes were specifically expressed in hermaphrodite flower, staminate flower, young fruit, leaf, and shoot, respectively. Phylogenetic analysis suggested that yellowhorn and Dimocarpus longan diverged from their most recent common ancestor ~46 million years ago.The availability and subsequent annotation of the yellowhorn genome, as well as the identification of tissue-specific functional genes, provides a valuable reference for plant comparative genomics, evolutionary studies, and molecular design breeding. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Pseudomolecule-level assembly of the Chinese oil tree yellowhorn (Xanthoceras sorbifolium) genome.

Yellowhorn (Xanthoceras sorbifolium) is a species of the Sapindaceae family native to China and is an oil tree that can withstand cold and drought conditions. A pseudomolecule-level genome assembly for this species will not only contribute to understanding the evolution of its genes and chromosomes but also bring yellowhorn breeding into the genomic era.Here, we generated 15 pseudomolecules of yellowhorn chromosomes, on which 97.04% of scaffolds were anchored, using the combined Illumina HiSeq, Pacific Biosciences Sequel, and Hi-C technologies. The length of the final yellowhorn genome assembly was 504.2 Mb with a contig N50 size of 1.04 Mb and a scaffold N50 size of 32.17 Mb. Genome annotation revealed that 68.67% of the yellowhorn genome was composed of repetitive elements. Gene modelling predicted 24,672 protein-coding genes. By comparing orthologous genes, the divergence time of yellowhorn and its close sister species longan (Dimocarpus longan) was estimated at ~33.07 million years ago. Gene cluster and chromosome synteny analysis demonstrated that the yellowhorn genome shared a conserved genome structure with its ancestor in some chromosomes.This genome assembly represents a high-quality reference genome for yellowhorn. Integrated genome annotations provide a valuable dataset for genetic and molecular research in this species. We did not detect whole-genome duplication in the genome. The yellowhorn genome carries syntenic blocks from ancient chromosomes. These data sources will enable this genome to serve as an initial platform for breeding better yellowhorn cultivars. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Genome Analysis of Hypomyces perniciosus, the Causal Agent of Wet Bubble Disease of Button Mushroom (Agaricus bisporus).

The mycoparasitic fungus Hypomyces perniciosus causes wet bubble disease of mushrooms, particularly Agaricus bisporus. The genome of a highly virulent strain of H. perniciosus HP10 was sequenced and compared to three other fungi from the order Hypocreales that cause disease on A. bisporus. H. perniciosus genome is ~44 Mb, encodes 10,077 genes and enriched with transposable elements up to 25.3%. Phylogenetic analysis revealed that H. perniciosus is closely related to Cladobotryum protrusum and diverged from their common ancestor ~156.7 million years ago. H. perniciosus has few secreted proteins compared to C. protrusum and Trichoderma virens, but significantly expanded protein families of transporters, protein kinases, CAZymes (GH 18), peptidases, cytochrome P450, and SMs that are essential for mycoparasitism and adaptation to harsh environments. This study provides insights into H. perniciosus evolution and pathogenesis and will contribute to the development of effective disease management strategies to control wet bubble disease.


April 21, 2020

Transcriptome Analysis Reveals the Accumulation Mechanism of Anthocyanins in Buckwheat (Fagopyrum esculentum Moench) Cotyledons and Flowers.

Buckwheat (Fagopyrum esculentum) is a valuable crop which can produce multiple human beneficial secondary metabolites, for example, the anthocyanins in sprouts and flowers. However, as the predominant group of visible polyphenols in pigmentation, little is known about the molecular mechanisms underlying the anthocyanin biosynthesis within buckwheat. In this study, a comparative transcriptome analysis of green and red common buckwheat cultivars was carried out through RNA sequencing. Overall, 3727 and 5323 differently expressed genes (DEGs) were identified in flowers and cotyledons, respectively. Through GO and KEGG analysis, we revealed that DEGs in flowers and cotyledons are predominately involved in biosynthesis of anthocyanin. A total of 42 unigenes encoding 11 structural enzymes of the anthocyanin biosynthesis were identified as DEGs. We also identified some transcription factor families involved in the regulation of anthocyanin biosynthesis. Real-time qPCR validation of candidate genes was performed in flowers and cotyledons, and the results suggested that the high expression level of structural genes involved in anthocyanin biosynthetic pathway promotes anthocyanin accumulation. Our results provide the insight understanding for coloration of red common buckwheat.


April 21, 2020

A chromosome-scale assembly of the major African malaria vector Anopheles funestus.

Anopheles funestus is one of the 3 most consequential and widespread vectors of human malaria in tropical Africa. However, the lack of a high-quality reference genome has hindered the association of phenotypic traits with their genetic basis in this important mosquito.Here we present a new high-quality A. funestus reference genome (AfunF3) assembled using 240× coverage of long-read single-molecule sequencing for contigging, combined with 100× coverage of short-read Hi-C data for chromosome scaffolding. The assembled contigs total 446 Mbp of sequence and contain substantial duplication due to alternative alleles present in the sequenced pool of mosquitos from the FUMOZ colony. Using alignment and depth-of-coverage information, these contigs were deduplicated to a 211 Mbp primary assembly, which is closer to the expected haploid genome size of 250 Mbp. This primary assembly consists of 1,053 contigs organized into 3 chromosome-scale scaffolds with an N50 contig size of 632 kbp and an N50 scaffold size of 93.811 Mbp, representing a 100-fold improvement in continuity versus the current reference assembly, AfunF1.This highly contiguous and complete A. funestus reference genome assembly will serve as an improved basis for future studies of genomic variation and organization in this important disease vector. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Full-Length Multi-Barcoding: DNA Barcoding from Single Ingredient to Complex Mixtures.

DNA barcoding has been used for decades, although it has mostly been applied to somesingle-species. Traditional Chinese medicine (TCM), which is mainly used in the form ofcombination-one type of the multi-species, identification is crucial for clinical usage.Next-generation Sequencing (NGS) has been used to address this authentication issue for the pastfew years, but conventional NGS technology is hampered in application due to its short sequencingreads and systematic errors. Here, a novel method, Full-length multi-barcoding (FLMB) vialong-read sequencing, is employed for the identification of biological compositions in herbalcompound formulas in adequate and well controlled studies. By directly sequencing the full-lengthamplicons of ITS2 and psbA-trnH through single-molecule real-time (SMRT) technology, thebiological composition of a classical prescription Sheng-Mai-San (SMS) was analyzed. At the sametime, clone-dependent Sanger sequencing was carried out as a parallel control. Further, anotherformula-Sanwei-Jili-San (SJS)-was analyzed with genes of ITS2 and CO1. All the ingredients inthe samples of SMS and SJS were successfully authenticated at the species level, and 11 exogenousspecies were also checked, some of which were considered as common contaminations in theseproducts. Methodology analysis demonstrated that this method was sensitive, accurate andreliable. FLMB, a superior but feasible approach for the identification of biological complexmixture, was established and elucidated, which shows perfect interpretation for DNA barcodingthat could lead its application in multi-species mixtures.


April 21, 2020

The genomes of pecan and Chinese hickory provide insights into Carya evolution and nut nutrition.

Pecan (Carya illinoinensis) and Chinese hickory (C. cathayensis) are important commercially cultivated nut trees in the genus Carya (Juglandaceae), with high nutritional value and substantial health benefits.We obtained >187.22 and 178.87 gigabases of sequence, and ~288× and 248× genome coverage, to a pecan cultivar (“Pawnee”) and a domesticated Chinese hickory landrace (ZAFU-1), respectively. The total assembly size is 651.31 megabases (Mb) for pecan and 706.43 Mb for Chinese hickory. Two genome duplication events before the divergence from walnut were found in these species. Gene family analysis highlighted key genes in biotic and abiotic tolerance, oil, polyphenols, essential amino acids, and B vitamins. Further analyses of reduced-coverage genome sequences of 16 Carya and 2 Juglans species provide additional phylogenetic perspective on crop wild relatives.Cooperative characterization of these valuable resources provides a window to their evolutionary development and a valuable foundation for future crop improvement. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Impact of Chromosomal Rearrangements on the Interpretation of Lupin Karyotype Evolution.

Plant genome evolution can be very complex and challenging to describe, even within a genus. Mechanisms that underlie genome variation are complex and can include whole-genome duplications, gene duplication and/or loss, and, importantly, multiple chromosomal rearrangements. Lupins (Lupinus) diverged from other legumes approximately 60 mya. In contrast to New World lupins, Old World lupins show high variability not only for chromosome numbers (2n = 32?52), but also for the basic chromosome number (x = 5?9, 13) and genome size. The evolutionary basis that underlies the karyotype evolution in lupins remains unknown, as it has so far been impossible to identify individual chromosomes. To shed light on chromosome changes and evolution, we used comparative chromosome mapping among 11 Old World lupins, with Lupinusangustifolius as the reference species. We applied set of L.angustifolius-derived bacterial artificial chromosome clones for fluorescence in situ hybridization. We demonstrate that chromosome variations in the species analyzed might have arisen from multiple changes in chromosome structure and number. We hypothesize about lupin karyotype evolution through polyploidy and subsequent aneuploidy. Additionally, we have established a cytogenomic map of L.angustifolius along with chromosome markers that can be used for related species to further improve comparative studies of crops and wild lupins.


April 21, 2020

De novo genome assembly of the white-spotted flower chafer (Protaetia brevitarsis).

Protaetia brevitarsis, commonly known as the white-spotted flower chafer, is an important Scarabaeidae insect that is distributed in most Asian countries. Recently, research on the insect’s harmfulness to crops, usefulness in agricultural waste utilization, edibility, medicinal value, and usability in insect immunology has provided sufficient impetus to demonstrate the need for a detailed study of its biology. Herein, we sequenced the whole genome of this species to improve our understanding and study of P. brevitarsis.We developed a highly reliable genome resource for P. brevitarsis (Lewis, 1879; Coleoptera: Cetoniinae) using Illumina and PacBio sequencing platforms. A total of 135.75 gigabases (Gb) was generated, providing 150-fold coverage based on the 810-megabases (Mb) estimated genome size. The assembled P. brevitarsis genome was 751 Mb (including the scaffolds longer than 2 kilobases (kb)) with 327 scaffolds, and the N50 length of the assembly was 2.94 Mb. A total of 34,110 (22,229 in scaffolds and 11,881 located in alleles) genes were identified using Evidence Modeler, which was based on the gene prediction results obtained from 3 different methods (ab initio, RNA sequencing based, and known gene based).We assembled a high-quality P. brevitarsis genome, which will not only provide insight into the biology of the species but also provide a wealth of information that will inform researchers on the evolution, control, and utilization of P. brevitarsis. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Chromosome-scale genome assembly of kiwifruit Actinidia eriantha with single-molecule sequencing and chromatin interaction mapping.

Kiwifruit (Actinidia spp.) is a dioecious plant with fruits containing abundant vitamin C and minerals. A handful of kiwifruit species have been domesticated, among which Actinidiaeriantha is increasingly favored in breeding owing to its superior commercial traits. Recently, elite cultivars from A. eriantha have been successfully selected and further studies on their biology and breeding potential require genomic information, which is currently unavailable.We assembled a chromosome-scale genome sequence of A. eriantha cultivar White using single-molecular sequencing and chromatin interaction map-based scaffolding. The assembly has a total size of 690.6 megabases and an N50 of 21.7 megabases. Approximately 99% of the assembly were in 29 pseudomolecules corresponding to the 29 kiwifruit chromosomes. Forty-three percent of the A. eriantha genome are repetitive sequences, and the non-repetitive part encodes 42,988 protein-coding genes, of which 39,075 have homologues from other plant species or protein domains. The divergence time between A. eriantha and its close relative Actinidia chinensis is estimated to be 3.3 million years, and after diversification, 1,727 and 1,506 gene families are expanded and contracted in A. eriantha, respectively.We provide a high-quality reference genome for kiwifruit A. eriantha. This chromosome-scale genome assembly is substantially better than 2 published kiwifruit assemblies from A. chinensis in terms of genome contiguity and completeness. The availability of the A. eriantha genome provides a valuable resource for facilitating kiwifruit breeding and studies of kiwifruit biology. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

A draft nuclear-genome assembly of the acoel flatworm Praesagittifera naikaiensis.

Acoels are primitive bilaterians with very simple soft bodies, in which many organs, including the gut, are not developed. They provide platforms for studying molecular and developmental mechanisms involved in the formation of the basic bilaterian body plan, whole-body regeneration, and symbiosis with photosynthetic microalgae. Because genomic information is essential for future research on acoel biology, we sequenced and assembled the nuclear genome of an acoel, Praesagittifera naikaiensis.To avoid sequence contamination derived from symbiotic microalgae, DNA was extracted from embryos that were free of algae. More than 290x sequencing coverage was achieved using a combination of Illumina (paired-end and mate-pair libraries) and PacBio sequencing. RNA sequencing and Iso-Seq data from embryos, larvae, and adults were also obtained. First, a preliminary ~17-kilobase pair (kb) mitochondrial genome was assembled, which was deleted from the nuclear sequence assembly. As a result, a draft nuclear genome assembly was ~656 Mb in length, with a scaffold N50 of 117 kb and a contig N50 of 57 kb. Although ~70% of the assembled sequences were likely composed of repetitive sequences that include DNA transposons and retrotransposons, the draft genome was estimated to contain 22,143 protein-coding genes, ~99% of which were substantiated by corresponding transcripts. We could not find horizontally transferred microalgal genes in the acoel genome. Benchmarking Universal Single-Copy Orthologs analyses indicated that 77% of the conserved single-copy genes were complete. Pfam domain analyses provided a basic set of gene families for transcription factors and signaling molecules.Our present sequencing and assembly of the P. naikaiensis nuclear genome are comparable to those of other metazoan genomes, providing basic information for future studies of genic and genomic attributes of this animal group. Such studies may shed light on the origins and evolution of simple bilaterians. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Pacbio Sequencing Reveals Identical Organelle Genomes between American Cranberry (Vaccinium macrocarpon Ait.) and a Wild Relative.

Breeding efforts in the American cranberry (Vaccinium macrocarpon Ait.), a North American perennial fruit crop of great importance, have been hampered by the limited genetic and phenotypic variability observed among cultivars and experimental materials. Most of the cultivars commercially used by cranberry growers today were derived from a few wild accessions bred in the 1950s. In different crops, wild germplasm has been used as an important genetic resource to incorporate novel traits and increase the phenotypic diversity of breeding materials. Vaccinium microcarpum (Turcz. ex Rupr.) Schmalh. and V. oxycoccos L., two closely related species, may be cross-compatible with the American cranberry, and could be useful to improve fruit quality such as phytochemical content. Furthermore, given their northern distribution, they could also help develop cold hardy cultivars. Although these species have previously been analyzed in diversity studies, genomic characterization and comparative studies are still lacking. In this study, we sequenced and assembled the organelle genomes of the cultivated American cranberry and its wild relative, V. microcarpum. PacBio sequencing technology allowed us to assemble both mitochondrial and plastid genomes at very high coverage and in a single circular scaffold. A comparative analysis revealed that the mitochondrial genome sequences were identical between both species and that the plastids presented only two synonymous single nucleotide polymorphisms (SNPs). Moreover, the Illumina resequencing of additional accessions of V. microcarpum and V. oxycoccos revealed high genetic variation in both species. Based on these results, we provided a hypothesis involving the extension and dynamics of the last glaciation period in North America, and how this could have shaped the distribution and dispersal of V. microcarpum. Finally, we provided important data regarding the polyploid origin of V. oxycoccos.


April 21, 2020

Cellular Dynamics and Genomic Identity of Centromeres in Cereal Blast Fungus.

Precise kinetochore-microtubule interactions ensure faithful chromosome segregation in eukaryotes. Centromeres, identified as scaffolding sites for kinetochore assembly, are among the most rapidly evolving chromosomal loci in terms of the DNA sequence and length and organization of intrinsic elements. Neither the centromere structure nor the kinetochore dynamics is well studied in plant-pathogenic fungi. Here, we sought to understand the process of chromosome segregation in the rice blast fungus Magnaporthe oryzae High-resolution imaging of green fluorescent protein (GFP)-tagged inner kinetochore proteins CenpA and CenpC revealed unusual albeit transient declustering of centromeres just before anaphase separation of chromosomes in M. oryzae Strikingly, the declustered centromeres positioned randomly at the spindle midzone without an apparent metaphase plate per se Using CenpA chromatin immunoprecipitation followed by deep sequencing, all seven centromeres in M. oryzae were found to be regional, spanning 57-kb to 109-kb transcriptionally poor regions. Highly AT-rich and heavily methylated DNA sequences were the only common defining features of all the centromeres in rice blast. Lack of centromere-specific DNA sequence motifs or repetitive elements suggests an epigenetic specification of centromere function in M. oryzae PacBio genome assemblies and synteny analyses facilitated comparison of the centromeric/pericentromeric regions in distinct isolates of rice blast and wheat blast and in Magnaporthiopsis poae Overall, this study revealed unusual centromere dynamics and precisely identified the centromere loci in the top model fungal pathogens that belong to Magnaporthales and cause severe losses in the global production of food crops and turf grasses.IMPORTANCEMagnaporthe oryzae is an important fungal pathogen that causes a loss of 10% to 30% of the annual rice crop due to the devastating blast disease. In most organisms, kinetochores are clustered together or arranged at the metaphase plate to facilitate synchronized anaphase separation of sister chromatids in mitosis. In this study, we showed that the initially clustered kinetochores separate and position randomly prior to anaphase in M. oryzae Centromeres in M. oryzae occupy large genomic regions and form on AT-rich DNA without any common sequence motifs. Overall, this study identified atypical kinetochore dynamics and mapped functional centromeres in M. oryzae to define the roles of centromeric and pericentric boundaries in kinetochore assembly on epigenetically specified centromere loci. This study should pave the way for further understanding of the contribution of heterochromatin in genome stability and virulence of the blast fungus and its related species of high economic importance.Copyright © 2019 Yadav et al.


April 21, 2020

Genome sequence of the corn leaf aphid (Rhopalosiphum maidis Fitch).

The corn leaf aphid (Rhopalosiphum maidis Fitch) is the most economically damaging aphid pest on maize (Zea mays), one of the world’s most important grain crops. In addition to causing direct damage by removing photoassimilates, R. maidis transmits several destructive maize viruses, including maize yellow dwarf virus, barley yellow dwarf virus, sugarcane mosaic virus, and cucumber mosaic virus.The genome of a parthenogenetically reproducing R. maidis clone was assembled with a combination of Pacific Biosciences (207-fold coverage) and Illumina (83-fold coverage) sequencing. The 689 assembled contigs, which have an N50 size of 9.0 megabases (Mb) and a low level of heterozygosity, were clustered using Phase Genomics Hi-C interaction maps. Consistent with the commonly observed 2n = 8 karyotype of R. maidis, most of the contigs (473 spanning 321 Mb) were successfully oriented into 4 scaffolds. The genome assembly captured the full length of 95.8% of the core eukaryotic genes, indicating that it is highly complete. Repetitive sequences accounted for 21.2% of the assembly, and a total of 17,629 protein-coding genes were predicted with integrated evidence from ab initio and homology-based gene predictions and transcriptome sequences generated with both Pacific Biosciences and Illumina. An analysis of likely horizontally transferred genes identified 2 from bacteria, 7 from fungi, 2 from protozoa, and 9 from algae. Repeat elements, transposons, and genes encoding likely detoxification enzymes (cytochrome P450s, glutathione S-transferases, carboxylesterases, uridine diphosphate-glucosyltransferases, and ABC transporters) were identified in the genome sequence. Other than Buchnera aphidicola (642,929 base pairs, 602 genes), no endosymbiont bacteria were found in R. maidis.A high-quality R. maidis genome was assembled at the chromosome level. This genome sequence will enable further research related to ecological interactions, virus transmission, pesticide resistance, and other aspects of R. maidis biology. It also serves as a valuable resource for comparative investigation of other aphid species. © The Author(s) 2019. Published by Oxford University Press.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.