Menu
July 7, 2019  |  

Insights into land plant evolution garnered from the Marchantia polymorpha genome.

The evolution of land flora transformed the terrestrial environment. Land plants evolved from an ancestral charophycean alga from which they inherited developmental, biochemical, and cell biological attributes. Additional biochemical and physiological adaptations to land, and a life cycle with an alternation between multicellular haploid and diploid generations that facilitated efficient dispersal of desiccation tolerant spores, evolved in the ancestral land plant. We analyzed the genome of the liverwort Marchantia polymorpha, a member of a basal land plant lineage. Relative to charophycean algae, land plant genomes are characterized by genes encoding novel biochemical pathways, new phytohormone signaling pathways (notably auxin), expanded repertoires of signaling pathways, and increased diversity in some transcription factor families. Compared with other sequenced land plants, M. polymorpha exhibits low genetic redundancy in most regulatory pathways, with this portion of its genome resembling that predicted for the ancestral land plant. PAPERCLIP. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.


July 7, 2019  |  

Contributions of Zea mays subspecies mexicana haplotypes to modern maize.

Maize was domesticated from lowland teosinte (Zea mays ssp. parviglumis), but the contribution of highland teosinte (Zea mays ssp. mexicana, hereafter mexicana) to modern maize is not clear. Here, two genomes for Mo17 (a modern maize inbred) and mexicana are assembled using a meta-assembly strategy after sequencing of 10 lines derived from a maize-teosinte cross. Comparative analyses reveal a high level of diversity between Mo17, B73, and mexicana, including three Mb-size structural rearrangements. The maize spontaneous mutation rate is estimated to be 2.17?×?10-8 ~3.87?×?10-8 per site per generation with a nonrandom distribution across the genome. A higher deleterious mutation rate is observed in the pericentromeric regions, and might be caused by differences in recombination frequency. Over 10% of the maize genome shows evidence of introgression from the mexicana genome, suggesting that mexicana contributed to maize adaptation and improvement. Our data offer a rich resource for constructing the pan-genome of Zea mays and genetic improvement of modern maize varieties.


July 7, 2019  |  

Assembly of an early-matured japonica (Geng) rice genome, Suijing18, based on PacBio and Illumina sequencing.

The early-matured japonica (Geng) rice variety, Suijing18 (SJ18), carries multiple elite traits including durable blast resistance, good grain quality, and high yield. Using PacBio SMRT technology, we produced over 25?Gb of long-read sequencing raw data from SJ18 with a coverage of 62×. Using Illumina paired-end whole-genome shotgun sequencing technology, we generated 59?Gb of short-read sequencing data from SJ18 (23.6?Gb from a 200?bp library with a coverage of 59× and 35.4?Gb from an 800?bp library with a coverage of 88×). With these data, we assembled a single SJ18 genome and then generated a set of annotation data. These data sets can be used to test new programs for variation deep mining, and will provide new insights into the genome structure, function, and evolution of SJ18, and will provide essential support for biological research in general.


July 7, 2019  |  

HISEA: HIerarchical SEed Aligner for PacBio data.

The next generation sequencing (NGS) techniques have been around for over a decade. Many of their fundamental applications rely on the ability to compute good genome assemblies. As the technology evolves, the assembly algorithms and tools have to continuously adjust and improve. The currently dominant technology of Illumina produces reads that are too short to bridge many repeats, setting limits on what can be successfully assembled. The emerging SMRT (Single Molecule, Real-Time) sequencing technique from Pacific Biosciences produces uniform coverage and long reads of length up to sixty thousand base pairs, enabling significantly better genome assemblies. However, SMRT reads are much more expensive and have a much higher error rate than Illumina’s – around 10-15% – mostly due to indels. New algorithms are very much needed to take advantage of the long reads while mitigating the effect of high error rate and lowering the required coverage.An essential step in assembling SMRT data is the detection of alignments, or overlaps, between reads. High error rate and very long reads make this a much more challenging problem than for Illumina data. We present a new pairwise read aligner, or overlapper, HISEA (Hierarchical SEed Aligner) for SMRT sequencing data. HISEA uses a novel two-step k-mer search, employing consistent clustering, k-mer filtering, and read alignment extension.We compare HISEA against several state-of-the-art programs – BLASR, DALIGNER, GraphMap, MHAP, and Minimap – on real datasets from five organisms. We compare their sensitivity, precision, specificity, F1-score, as well as time and memory usage. We also introduce a new, more precise, evaluation method. Finally, we compare the two leading programs, MHAP and HISEA, for their genome assembly performance in the Canu pipeline.Our algorithm has the best alignment detection sensitivity among all programs for SMRT data, significantly higher than the current best. The currently best assembler for SMRT data is the Canu program which uses the MHAP aligner in its pipeline. We have incorporated our new HISEA aligner in the Canu pipeline and benchmarked it against the best pipeline for multiple datasets at two relevant coverage levels: 30x and 50x. Our assemblies are better than those using MHAP for both coverage levels. Moreover, Canu+HISEA assemblies for 30x coverage are comparable with Canu+MHAP assemblies for 50x coverage, while being faster and cheaper.The HISEA algorithm produces alignments with highest sensitivity compared with the current state-of-the-art algorithms. Integrated in the Canu pipeline, currently the best for assembling PacBio data, it produces better assemblies than Canu+MHAP.


July 7, 2019  |  

Draft sequencing of the heterozygous diploid genome of Satsuma (Citrus unshiu Marc.) using a hybrid assembly approach.

Satsuma (Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma (“Miyagawa Wase”) was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome.


July 7, 2019  |  

Observations on bipolar disjunctions of moonwort ferns (Botrychium, Ophioglossaceae).

Peter Raven, in 1963, included two fern taxa of the genus Botrychium in his list of plant species exhibiting American amphitropical bipolar disjunctions. He attributed the southern hemisphere occurrences to post-Pleistocene long-distance dispersal from counterparts in the northern hemisphere, probably assisted by annual bird migrations between the disjunct areas. Using genetic evidence gathered through worldwide analyses of phylogenetic relationship in Botrychium, we now review and reconsider Raven’s conclusions. Genetic similarities indicate that South American Botrychium dusenii is an allotetraploid taxon closely related to B. spathulatum, a North American endemic, and that B. lunaria in New Zealand possesses a genotype identical to that of a taxon in North America derived through introgressive hybridization between B. lunaria and an endemic North American species, B. neolunaria. Both North American counterparts exhibit Raven’s characteristics of bipolar disjuncts in their occurrence in mountain and coastal meadows, copious production of small propagules (spores in Botrychium), occurrence in habitats frequented by transpolar bird migrants, and ability to found new colonies through inbreeding. We discuss these characteristics in Botrychium and relative to other ferns and suggest further studies on Botrychium and related taxa to address questions of time, number, and mode of bipolar dispersals.© 2017 Botanical Society of America.


July 7, 2019  |  

Post genomics era for orchid research.

Among 300,000 species in angiosperms, Orchidaceae containing 30,000 species is one of the largest families. Almost every habitats on earth have orchid plants successfully colonized, and it indicates that orchids are among the plants with significant ecological and evolutionary importance. So far, four orchid genomes have been sequenced, including Phalaenopsis equestris, Dendrobium catenatum, Dendrobium officinale, and Apostaceae shengen. Here, we review the current progress and the direction of orchid research in the post genomics era. These include the orchid genome evolution, genome mapping (genome-wide association analysis, genetic map, physical map), comparative genomics (especially receptor-like kinase and terpene synthase), secondary metabolomics, and genome editing.


July 7, 2019  |  

Diversity in grain amaranths and relatives distinguished by genotyping by sequencing (GBS).

The genotyping by sequencing (GBS) method has become a molecular marker technology of choice for many crop plants because of its simultaneous discovery and evaluation of a large number of single nucleotide polymorphisms (SNPs) and utility for germplasm characterization. Genome representation and complexity reduction are the basis for GBS fingerprinting and can vary by species based on genome size and other sequence characteristics. Grain amaranths are a set of three species that were domesticated in the New World to be high protein, pseudo-cereal grain crops. The goal of this research was to employ the GBS technique for diversity evaluation in grain amaranth accessions and close relatives from sixAmaranthusspecies and determine genetic differences and similarities between groupings. A total of 10,668 SNPs were discovered in 94 amaranth accessions withApeKI complexity reduction and 10X genome coverage Illumina sequencing. The majority of the SNPs were species specific with 4,568 and 3,082 for the two grain amaranths originating in Central AmericaAmaranthus cruentus and A. hypochondriacusand 3,284 found amongst bothA. caudatus, originally domesticated in South America, and its close relative,A. quitensis. The distance matrix based on shared alleles provided information on the close relationships of the two cultivated Central American species with each other and of the wild and cultivated South American species with each other, as distinguished from the outgroup with two wild species,A. powelliiandA. retroflexus. The GBS data also distinguished admixture between each pair of species and the geographical origins and seed colors of the accessions. The SNPs we discovered here can be used for marker development for future amaranth study.


July 7, 2019  |  

An update on bioinformatics resources for plant genomics research

Next-generation sequencing and traditional Sanger sequencing methods are of great significance in unraveling the complexity of plant genomes. These are constantly generating heaps of sequence data to be analyzed, annotated and stored. This has created a revolutionary demand for bioinformatics tools and software that can perform these functions. A large number of potentially useful bioinformatics tools and plant genome databases are created that have greatly simplified the analysis and storage of vast amounts of sequence data. The information garnered using the available bioinformatics methods have greatly helped in understanding the plant genome structure. Despite the availability of a good number of such tools, the information pouring from single gene-sequencing, and various whole-genome sequencing projects is overwhelming; thus, further innovations and improved methods are needed to sift through this sequence data, and assemble genomes. The current review focuses on diverse bioinformatics approaches and methods developed to systematically analyze and store plant sequence data. Finally, it outlines the bottlenecks in plant genome analysis, and some possible solutions that could be utilized to overcome the problems associated with plant genome analysis.


July 7, 2019  |  

The plastid genome in Cladophorales green algae is encoded by hairpin chromosomes.

Virtually all plastid (chloroplast) genomes are circular double-stranded DNA molecules, typically between 100 and 200 kb in size and encoding circa 80-250 genes. Exceptions to this universal plastid genome architecture are very few and include the dinoflagellates, where genes are located on DNA minicircles. Here we report on the highly deviant chloroplast genome of Cladophorales green algae, which is entirely fragmented into hairpin chromosomes. Short- and long-read high-throughput sequencing of DNA and RNA demonstrated that the chloroplast genes of Boodlea composita are encoded on 1- to 7-kb DNA contigs with an exceptionally high GC content, each containing a long inverted repeat with one or two protein-coding genes and conserved non-coding regions putatively involved in replication and/or expression. We propose that these contigs correspond to linear single-stranded DNA molecules that fold onto themselves to form hairpin chromosomes. The Boodlea chloroplast genes are highly divergent from their corresponding orthologs, and display an alternative genetic code. The origin of this highly deviant chloroplast genome most likely occurred before the emergence of the Cladophorales, and coincided with an elevated transfer of chloroplast genes to the nucleus. A chloroplast genome that is composed only of linear DNA molecules is unprecedented among eukaryotes, and highlights unexpected variation in plastid genome architecture. Copyright © 2017 Elsevier Ltd. All rights reserved.


July 7, 2019  |  

The complete mitochondrial genome of Wonwhang (Pyrus pyrifolia)

This is a de novo assembly and annotation of a complete mitochondrial genome from Pyrus pyrifolia in the family Rosaceae. The complete mitochondrial genome of P. pyrifolia was assembled from PacBio RSII P6-C4 sequencing reads. The circular genome was 458,873?bp in length, containing 39 protein-coding genes, 23 tRNA genes and three rRNA genes. The nucleotide composition was A (27.5%), T (27.3%), G (22.6%) and C (22.6%) with GC content of 45.2%. Most of protein-coding genes use the canonical start codon ATG, whereas nad1, cox1, matR and rps4 use ACG, mttB uses ATT, rpl16 and rps19 uses GTG. The stop codon is also common in all mitochondrial genes. The phylogenetic analysis showed that P. pyrifolia was clustered with the Malus of Rosaceae family. Maximum-likelihood analysis suggests a clear relationship of Rosids and Asterids, which support the traditional classification.


July 7, 2019  |  

Genomic clues to the parental origin of the wild flowering cherry Prunus yedoensis var. nudiflora (Rosaceae)

Prunus yedoensis Matsumura is one of the popular ornamental flowering cherry trees native to northeastern Asia, and its wild populations have only been found on Jeju Island, Korea. Previous studies suggested that wild P. yedoensis (P. yedoensis var. nudiflora) is a hybrid species; however, there is no solid evidence on its exact parental origin and genomic organization. In this study, we developed a total of 38 nuclear gene-based DNA markers that can be universally amplifiable in the Prunus species using 586 Prunus Conserved Orthologous Gene Set (Prunus COS). Using the Prunus COS markers, we investigated the genetic structure of wild P. yedoensis populations and evaluated the putative parental species of wild P. yedoensis. Population structure and phylogenetic analysis of 73 wild P. yedoensis accessions and 54 accessions of other Prunus species revealed that the wild P. yedoensis on Jeju Island is a natural homoploid hybrid. Sequence-level comparison of Prunus COS markers between species suggested that wild P. yedoensis might originate from a cross between maternal P. pendula f. ascendens and paternal P. jamasakura. Moreover, approximately 81% of the wild P. yedoensis accessions examined were likely F1 hybrids, whereas the remaining 19% were backcross hybrids resulting from additional asymmetric introgression of parental genotypes. These findings suggest that complex hybridization of the Prunus species on Jeju Island can produce a range of variable hybrid offspring. Overall, this study makes a significant contribution to address issues of the origin, nomenclature, and genetic relationship of ornamental P. yedoensis.


July 7, 2019  |  

Glaucophyta

The Glaucophyta is by far the least species-rich phylum of the Archaeplastida comprising only four described genera, Glaucocystis, Cyanophora, Gloeochaete, and Cyanoptyche, and 15 species. However, recent molecular and morphological analyses reveal that glaucophytes are not as species poor as hitherto assumed with many novel lineages existing in natural environments. Glaucophytes are freshwater phototrophs of moderate to low abundance and retain many ancestral plastid traits derived from the cyanobacterial donor of this organelle, including the remnant peptidoglycan wall in their envelope. These plastids were originally named “cyanelles,” which was later changed to “muroplasts” when their shared ancestry with other Archaeplastida was recognized. The model glaucophyte, Cyanophora paradoxa, is well studied with respect to biochemistry, proteomics, and the gene content of the nuclear and organelle genomes. Investigation of the biosynthesis of cytosolic starch led to a model for the transition from glycogen to starch storage during plastid endosymbiosis. The photosynthetic apparatus, including phycobilisome antennae, resembles that of cyanobacteria. However, the carbon-concentrating mechanism is algal in nature and based on pyrenoids. Studies on protein import into muroplasts revealed a primordial Toc/Tic translocon. The peptidoglycan wall was elucidated with respect to composition, biosynthesis, and involvement of nuclear genes. The muroplast genome is distinct, not due to the number of encoded genes but, rather, because of the presence of unique genes not present on other plastid genomes. The mosaic nature of the gene-rich (27,000) nuclear genome came as a surprise, considering the relatively small genomes of unicellular red algae.


July 7, 2019  |  

Complete chloroplast genome sequence of Fritillaria unibracteata var. wabuensis based on SMRT Sequencing Technology.

Fritillaria unibracteata var. wabuensis is an important medicinal plant used for the treatment of cough symptoms related to the respiratory system. The chloroplast genome of F. unibracteata var. wabuensis (GenBank accession no. KF769142) was assembled using the PacBio RS platform (Pacific Biosciences, Beverly, MA) as a circle sequence with 151 009?bp. The assembled genome contains 133 genes, including 88 protein-coding, 37 tRNA, and eight rRNA genes. This genome sequence will provide important resource for further studies on the evolution of Fritillaria genus and molecular identification of Fritillaria herbs and their adulterants. This work suggests that PacBio RS is a powerful tool to sequence and assemble chloroplast genomes.


July 7, 2019  |  

Effects of genome structure variation, homeologous genes and repetitive DNA on polyploid crop research in the age of genomics.

Compared to diploid species, allopolyploid crop species possess more complex genomes, higher productivity, and greater adaptability to changing environments. Next generation sequencing techniques have produced high-density genetic maps, whole genome sequences, transcriptomes and epigenomes for important polyploid crops. However, several problems interfere with the full application of next generation sequencing techniques to these crops. Firstly, different types of genomic variation affect sequence assembly and QTL mapping. Secondly, duplicated or homoeologous genes can diverge in function and then lead to emergence of many minor QTL, which increases difficulties in fine mapping, cloning and marker assisted selection. Thirdly, repetitive DNA sequences arising in polyploid crop genomes also impact sequence assembly, and are increasingly being shown to produce small RNAs to regulate gene expression and hence phenotypic traits. We propose that these three key features should be considered together when analyzing polyploid crop genomes. It is apparent that dissection of genomic structural variation, elucidation of the function and mechanism of interaction of homoeologous genes, and investigation of the de novo roles of repeat sequences in agronomic traits are necessary for genomics-based crop breeding in polyploids. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.