Menu
July 7, 2019

LOGAN: A framework for LOssless Graph-based ANalysis of high throughput sequence data

Recent massive growth in the production of sequencing data necessitates matching improvements in bioinformatics tools to effectively utilize it. Existing tools suffer from limitations in both scalability and applicability which are inherent to their underlying algorithms and data structures. We identify the key requirements for the ideal data structure for sequence analyses: it should be informationally lossless, locally updatable, and memory efficient; requirements which are not met by data structures underlying the major assembly strategies Overlap Layout Consensus and De Bruijn Graphs. We therefore propose a new data structure, the LOGAN graph, which is based on a memory efficient Sparse De Bruijn Graph with routing information. Innovations in storing routing information and careful implementation allow sequence datasets for Escherichia coli (4.6Mbp, 117x coverage), Arabidopsis thaliana (135Mbp, 17.5x coverage) and Solanum pennellii (1.2Gbp, 47x coverage) to be loaded into memory on a desktop computer in seconds, minutes, and hours respectively. Memory consumption is competitive with state of the art alternatives, while losslessly representing the reads in an indexed and updatable form. Both Second and Third Generation Sequencing reads are supported. Thus, the LOGAN graph is positioned to be the backbone for major breakthroughs in sequence analysis such as integrated hybrid assembly, assembly of exceptionally large and repetitive genomes, as well as assembly and representation of pan-genomes.


July 7, 2019

Multiple hybrid de novo genome assembly of finger millet, an orphan allotetraploid crop.

Finger millet (Eleusine coracana (L.) Gaertn) is an important crop for food security because of its tolerance to drought, which is expected to be exacerbated by global climate changes. Nevertheless, it is often classified as an orphan/underutilized crop because of the paucity of scientific attention. Among several small millets, finger millet is considered as an excellent source of essential nutrient elements, such as iron and zinc; hence, it has potential as an alternate coarse cereal. However, high-quality genome sequence data of finger millet are currently not available. One of the major problems encountered in the genome assembly of this species was its polyploidy, which hampers genome assembly compared with a diploid genome. To overcome this problem, we sequenced its genome using diverse technologies with sufficient coverage and assembled it via a novel multiple hybrid assembly workflow that combines next-generation with single-molecule sequencing, followed by whole-genome optical mapping using the Bionano Irys® system. The total number of scaffolds was 1,897 with an N50 length?>2.6?Mb and detection of 96% of the universal single-copy orthologs. The majority of the homeologs were assembled separately. This indicates that the proposed workflow is applicable to the assembly of other allotetraploid genomes.© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

Is sex irreplaceable? Towards the molecular regulation of apomixis

Apomixis, defined as the asexual plant reproduction through seeds that results in the production of genetically uniform progeny and a natural way of cloning. Currently there are more than 400 plant species known to use apomixis as a strategy for their propagation. The primary fundamental aspects of apomixis are the bypassing of meiosis and parthenogenetic development of the embryo without fertilization. Apomixis attracts special attention because of its potential value for agriculture, as it could be harnessed for plant breeding programs enabling the permanent fixation of heterosis in crop plants. A better understanding of the molecular and genetic regulation of apomixis is important for developmental and evolutionary perspectives but also for implementation of engineering of apomixis traits into agricultural crop plants. Despite apomixis is considered as one of the key technologies for the improving agriculture, but currently how genetic and molecular regulation of this important trait occurs is not fully known. Recent information on the biology of apomixis and genes and genetic loci associated with the regulation of different components of apomixis is provided in the present review.


July 7, 2019

A high-quality genome assembly of quinoa provides insights into the molecular basis of salt bladder-based salinity tolerance and the exceptional nutritional value.

Chenopodium quinoa is a halophytic pseudocereal crop that is being cultivated in an ever-growing number of countries. Because quinoa is highly resistant to multiple abiotic stresses and its seed has a better nutritional value than any other major cereals, it is regarded as a future crop to ensure global food security. We generated a high-quality genome draft using an inbred line of the quinoa cultivar Real. The quinoa genome experienced one recent genome duplication about 4.3 million years ago, likely reflecting the genome fusion of two Chenopodium parents, in addition to the ? paleohexaploidization reported for most eudicots. The genome is highly repetitive (64.5% repeat content) and contains 54 438 protein-coding genes and 192 microRNA genes, with more than 99.3% having orthologous genes from glycophylic species. Stress tolerance in quinoa is associated with the expansion of genes involved in ion and nutrient transport, ABA homeostasis and signaling, and enhanced basal-level ABA responses. Epidermal salt bladder cells exhibit similar characteristics as trichomes, with a significantly higher expression of genes related to energy import and ABA biosynthesis compared with the leaf lamina. The quinoa genome sequence provides insights into its exceptional nutritional value and the evolution of halophytes, enabling the identification of genes involved in salinity tolerance, and providing the basis for molecular breeding in quinoa.


July 7, 2019

Gene losses and partial deletion of small single-copy regions of the chloroplast genomes of two hemiparasitic Taxillus species.

Numerous variations are known to occur in the chloroplast genomes of parasitic plants. We determined the complete chloroplast genome sequences of two hemiparasitic species, Taxillus chinensis and T. sutchuenensis, using Illumina and PacBio sequencing technologies. These species are the first members of the family Loranthaceae to be sequenced. The complete chloroplast genomes of T. chinensis and T. sutchuenensis comprise circular 121,363 and 122,562 bp-long molecules with quadripartite structures, respectively. Compared with the chloroplast genomes of Nicotiana tabacum and Osyris alba, all ndh genes as well as three ribosomal protein genes, seven tRNA genes, four ycf genes, and the infA gene of these two species have been lost. The results of the maximum likelihood and neighbor-joining phylogenetic trees strongly support the theory that Loranthaceae and Viscaceae are monophyletic clades. This research reveals the effect of a parasitic lifestyle on the chloroplast structure and genome content of T. chinensis and T. sutchuenensis, and enhances our understanding of the discrepancies in terms of assembly results between Illumina and PacBio.


July 7, 2019

Echinochloa crus-galli genome analysis provides insight into its adaptation and invasiveness as a weed.

Barnyardgrass (Echinochloa crus-galli) is a pernicious weed in agricultural fields worldwide. The molecular mechanisms underlying its success in the absence of human intervention are presently unknown. Here we report a draft genome sequence of the hexaploid species E. crus-galli, i.e., a 1.27?Gb assembly representing 90.7% of the predicted genome size. An extremely large repertoire of genes encoding cytochrome P450 monooxygenases and glutathione S-transferases associated with detoxification are found. Two gene clusters involved in the biosynthesis of an allelochemical 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA) and a phytoalexin momilactone A are found in the E. crus-galli genome, respectively. The allelochemical DIMBOA gene cluster is activated in response to co-cultivation with rice, while the phytoalexin momilactone A gene cluster specifically to infection by pathogenic Pyricularia oryzae. Our results provide a new understanding of the molecular mechanisms underlying the extreme adaptation of the weed.


July 7, 2019

The Tartary buckwheat genome provides insights into rutin biosynthesis and abiotic stress tolerance.

Tartary buckwheat (Fagopyrum tataricum) is an important pseudocereal crop that is strongly adapted to growth in adverse environments. Its gluten-free grain contains complete proteins with a well-balanced composition of essential amino acids and is a rich source of beneficial phytochemicals that provide significant health benefits. Here, we report a high-quality, chromosome-scale Tartary buckwheat genome sequence of 489.3 Mb that is assembled by combining whole-genome shotgun sequencing of both Illumina short reads and single-molecule real-time long reads, sequence tags of a large DNA insert fosmid library, Hi-C sequencing data, and BioNano genome maps. We annotated 33 366 high-confidence protein-coding genes based on expression evidence. Comparisons of the intra-genome with the sugar beet genome revealed an independent whole-genome duplication that occurred in the buckwheat lineage after they diverged from the common ancestor, which was not shared with rosids or asterids. The reference genome facilitated the identification of many new genes predicted to be involved in rutin biosynthesis and regulation, aluminum stress resistance, and in drought and cold stress responses. Our data suggest that Tartary buckwheat’s ability to tolerate high levels of abiotic stress is attributed to the expansion of several gene families involved in signal transduction, gene regulation, and membrane transport. The availability of these genomic resources will facilitate the discovery of agronomically and nutritionally important genes and genetic improvement of Tartary buckwheat. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.


July 7, 2019

Complete genome sequence of endophyte Bacillus flexus KLBMP 4941 reveals its plant growth promotion mechanism and genetic basis for salt tolerance.

Bacillus flexus KLBMP 4941 is a halotolerant endophyte isolated from the halophyte Limonium sinense. This strain can improve host seedling growth under salt stress conditions. We here report the complete genome information of endophyte KLBMP 4941. It has a circular chromosome and two plasmids for a total genome 4,104,242 bp in size with a G+C content of 38.09%. Genes related to plant growth promotion (PGP), such as those associated with nitrogen fixation, siderophore, spermidine, and acetoin synthesis were found in the KLBMP 4941 genome. Some genes responsible for high salinity tolerance, like genes associated with the Na(+)/H(+) antiporter, glycine betaine transporter, and betaine-aldehyde dehydrogenase were also found in the KLBMP 4941 genome. The genome analysis will provide better understanding of the mechanisms underlying the promotion of plant growth in strain KLBMP 4941 under salt stress conditions and its ability to adapt to coastal salt marsh habitats, and provide a basis for its further biotechnological applications in agriculture. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Genome architecture and evolution of a unichromosomal asexual nematode.

Asexual reproduction in animals, though rare, is the main or exclusive mode of reproduction in some long-lived lineages. The longevity of asexual clades may be correlated with the maintenance of heterozygosity by mechanisms that rearrange genomes and reduce recombination. Asexual species thus provide an opportunity to gain insight into the relationship between molecular changes, genome architecture, and cellular processes. Here we report the genome sequence of the parthenogenetic nematode Diploscapter pachys with only one chromosome pair. We show that this unichromosomal architecture is shared by a long-lived clade of asexual nematodes closely related to the genetic model organism Caenorhabditis elegans. Analysis of the genome assembly reveals that the unitary chromosome arose through fusion of six ancestral chromosomes, with extensive rearrangement among neighboring regions. Typical nematode telomeres and telomeric protection-encoding genes are lacking. Most regions show significant heterozygosity; homozygosity is largely concentrated to one region and attributed to gene conversion. Cell-biological and molecular evidence is consistent with the absence of key features of meiosis I, including synapsis and recombination. We propose that D. pachys preserves heterozygosity and produces diploid embryos without fertilization through a truncated meiosis. As a prelude to functional studies, we demonstrate that D. pachys is amenable to experimental manipulation by RNA interference. Copyright © 2017 Elsevier Ltd. All rights reserved.


July 7, 2019

Sunflower leaf senescence: A complex genetic process with economic impact on crop production

Leaf senescence is a complex process controlled by multiple genetic and environmental variables. In different crops, a delay in leaf senescence has an important impact on grain yield trough the maintenance of the photosynthetic leaf area during the reproductive stage. In sunflower (Helianthus annuus L.), the fourth largest oil crop worldwide, senescence reduces the capacity of plants to maintain their green leaf area for longer periods, especially during the grain filling phase, leading to important economic losses. In crop species, taking into account the temporal gap between the onset and the phenotypic detection of senescence, identification of both, candidate genes and functional stay-green are indispensable to enable the early detection of senescence, the elucidation of molecular mechanisms and the development of tools for breeding applications. In this chapter a comprehensive literature revision of leaf senescence process not only in model plant species but also in agronomical relevant crops is presented. Results derived from system biology approaches integrating transcriptomic, metabolomic and physiological data as well as those leading to the selection and characterization of stay green sunflower genotypes are included, making an important contribution to the knowledge of leaf senescence process and providing a valuable tool to assist in crop breeding.


July 7, 2019

New insights into structural organization and gene duplication in a 1.75-Mb genomic region harboring the a-gliadin gene family in Aegilops tauschii, the source of wheat D genome.

Among the wheat prolamins important for its end-use traits, a-gliadins are the most abundant, and are also a major cause of food-related allergies and intolerances. Previous studies of various wheat species estimated that between 25 and 150 a-gliadin genes reside in the Gli-2 locus regions. To better understand the evolution of this complex gene family, the DNA sequence of a 1.75-Mb genomic region spanning the Gli-2 locus was analyzed in the diploid grass, Aegilops tauschii, the ancestral source of D genome in hexaploid bread wheat. Comparison with orthologous regions from rice, sorghum, and Brachypodium revealed rapid and dynamic changes only occurring to the Ae. tauschii Gli-2 region, including insertions of high numbers of non-syntenic genes and a high rate of tandem gene duplications, the latter of which have given rise to 12 copies of a-gliadin genes clustered within a 550-kb region. Among them, five copies have undergone pseudogenization by various mutation events. Insights into the evolutionary relationship of the duplicated a-gliadin genes were obtained from their genomic organization, transcription patterns, transposable element insertions and phylogenetic analyses. An ancestral glutamate-like receptor (GLR) gene encoding putative amino acid sensor in all four grass species has duplicated only in Ae. tauschii and generated three more copies that are interspersed with the a-gliadin genes. Phylogenetic inference and different gene expression patterns support functional divergence of the Ae. tauschii GLR copies after duplication. Our results suggest that the duplicates of a-gliadin and GLR genes have likely taken different evolutionary paths; conservation for the former and neofunctionalization for the latter.© 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.


July 7, 2019

The unusual S locus of Leavenworthia is composed of two sets of paralogous loci.

The Leavenworthia self-incompatibility locus (S locus) consists of paralogs (Lal2, SCRL) of the canonical Brassicaceae S locus genes (SRK, SCR), and is situated in a genomic position that differs from the ancestral one in the Brassicaceae. Unexpectedly, in a small number of Leavenworthia alabamica plants examined, sequences closely resembling exon 1 of SRK have been found, but the function of these has remained unclear. BAC cloning and expression analyses were employed to characterize these SRK-like sequences. An SRK-positive Bacterial Artificial Chromosome clone was found to contain complete SRK and SCR sequences located close by one another in the derived genomic position of the Leavenworthia S locus, and in place of the more typical Lal2 and SCRL sequences. These sequences are expressed in stigmas and anthers, respectively, and crossing data show that the SRK/SCR haplotype is functional in self-incompatibility. Population surveys indicate that < 5% of Leavenworthia S loci possess such alleles. An ancestral translocation or recombination event involving SRK/SCR and Lal2/SCRL likely occurred, together with neofunctionalization of Lal2/SCRL, and both haplotype groups now function as Leavenworthia S locus alleles. These findings suggest that S locus alleles can have distinctly different evolutionary origins.© 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.


July 7, 2019

Complete genome sequence of Paenibacillus yonginensis DCY84(T), a novel plant symbiont that promotes growth via induced systemic resistance.

This article reports the full genome sequence of Paenibacillus yonginensis DCY84(T) (KCTC33428, JCM19885), which is a Gram-positive rod-shaped bacterium isolated from humus soil of Yongin Forest in Gyeonggi Province, South Korea. The genome sequence of strain DCY84(T) provides greater understanding of the Paenibacillus species for practical use. This bacterium displays plant growth promotion via induced systemic resistance of abiotic stresses.


July 7, 2019

The asparagus genome sheds light on the origin and evolution of a young Y chromosome.

Sex chromosomes evolved from autosomes many times across the eukaryote phylogeny. Several models have been proposed to explain this transition, some involving male and female sterility mutations linked in a region of suppressed recombination between X and Y (or Z/W, U/V) chromosomes. Comparative and experimental analysis of a reference genome assembly for a double haploid YY male garden asparagus (Asparagus officinalis L.) individual implicates separate but linked genes as responsible for sex determination. Dioecy has evolved recently within Asparagus and sex chromosomes are cytogenetically identical with the Y, harboring a megabase segment that is missing from the X. We show that deletion of this entire region results in a male-to-female conversion, whereas loss of a single suppressor of female development drives male-to-hermaphrodite conversion. A single copy anther-specific gene with a male sterile Arabidopsis knockout phenotype is also in the Y-specific region, supporting a two-gene model for sex chromosome evolution.


July 7, 2019

Complete genome sequence of Pseudomonas corrugata strain RM1-1-4, a stress protecting agent from the rhizosphere of an oilseed rape bait plant

Pseudomonas corrugata strain RM1-1-4 is a rhizosphere colonizer of oilseed rape. A previous study has shown that this motile, Gram-negative, non-sporulating bacterium is an effective stress protecting and biocontrol agent, which protects their hosts against abiotic and biotic stresses. Here, we announce and describe the complete genome sequence of P. corrugata RM1-1-4 consisting of a single 6.1 Mb circular chromosome that encodes 5189 protein coding genes and 85 RNA-only encoding genes. Genome analysis revealed genes predicting functions such as detoxifying mechanisms, stress inhibitors, exoproteases, lipoproteins or volatile components as well as rhizobactin siderophores and spermidine. Further analysis of its genome will help to identify traits promising for stress protection, biocontrol and plant growth promotion properties.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.