Menu
April 21, 2020  |  

Updated assembly resource of Phytophthora ramorum Pr102 isolate incorporating long reads from PacBio sequencing.

The NA1 clonal lineage of Phytophthora ramorum is responsible for Sudden Oak Death, an epidemic that has devastated California’s coastal forest ecosystems. An NA1 isolate Pr102 derived from coast live oak in California was previously sequenced and reported with 65 Mb assembly containing 12 Mb gaps in 2576 scaffolds. Here we report an improved 70 Mb genome in 1512 scaffolds with 6752 bp gaps after incorporating PacBio P5-C3 longreads. This assembly contains 19494 gene models (average gene length 2515 bp) compared to 16134 genes (average gene length of 1673 bp) in the previous version. We predicted 29 new RXLRs and 76 new paralogs of a total 392 RXLRs from this assembly. We predicted 35 CRNs compared to 19 in earlier version with six paralogs. Our lncRNAs prediction identified 255 candidates. This new resource will be invaluable for future evolution studies on the invasive plant pathogen.


April 21, 2020  |  

Large-scale ruminant genome sequencing provides insights into their evolution and distinct traits.

The ruminants are one of the most successful mammalian lineages, exhibiting morphological and habitat diversity and containing several key livestock species. To better understand their evolution, we generated and analyzed de novo assembled genomes of 44 ruminant species, representing all six Ruminantia families. We used these genomes to create a time-calibrated phylogeny to resolve topological controversies, overcoming the challenges of incomplete lineage sorting. Population dynamic analyses show that population declines commenced between 100,000 and 50,000 years ago, which is concomitant with expansion in human populations. We also reveal genes and regulatory elements that possibly contribute to the evolution of the digestive system, cranial appendages, immune system, metabolism, body size, cursorial locomotion, and dentition of the ruminants. Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


April 21, 2020  |  

Sensitivity to the two peptide bacteriocin plantaricin EF is dependent on CorC, a membrane-bound, magnesium/cobalt efflux protein.

Lactic acid bacteria produce a variety of antimicrobial peptides known as bacteriocins. Most bacteriocins are understood to kill sensitive bacteria through receptor-mediated disruptions. Here, we report on the identification of the Lactobacillus plantarum plantaricin EF (PlnEF) receptor. Spontaneous PlnEF-resistant mutants of the PlnEF-indicator strain L. plantarum NCIMB 700965 (LP965) were isolated and confirmed to maintain cellular ATP levels in the presence of PlnEF. Genome comparisons resulted in the identification of a single mutated gene annotated as the membrane-bound, magnesium/cobalt efflux protein CorC. All isolates contained a valine (V) at position 334 instead of a glycine (G) in a cysteine-ß-synthase domain at the C-terminal region of CorC. In silico template-based modeling of this domain indicated that the mutation resides in a loop between two ß-strands. The relationship between PlnEF, CorC, and metal homeostasis was supported by the finding that PlnEF-resistance was lost when PlnEF was applied together with high concentrations of Mg2+ , Co2+ , Zn2+ , or Cu2+ . Lastly, PlnEF sensitivity was increased upon heterologous expression of LP965 corC but not the G334V CorC mutant in the PlnEF-resistant strain Lactobacillus casei BL23. These results show that PlnEF kills sensitive bacteria by targeting CorC. © 2019 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.


April 21, 2020  |  

Iso-Seq Allows Genome-Independent Transcriptome Profiling of Grape Berry Development.

Transcriptomics has been widely applied to study grape berry development. With few exceptions, transcriptomic studies in grape are performed using the available genome sequence, PN40024, as reference. However, differences in gene content among grape accessions, which contribute to phenotypic differences among cultivars, suggest that a single reference genome does not represent the species’ entire gene space. Though whole genome assembly and annotation can reveal the relatively unique or “private” gene space of any particular cultivar, transcriptome reconstruction is a more rapid, less costly, and less computationally intensive strategy to accomplish the same goal. In this study, we used single molecule-real time sequencing (SMRT) to sequence full-length cDNA (Iso-Seq) and reconstruct the transcriptome of Cabernet Sauvignon berries during berry ripening. In addition, short reads from ripening berries were used to error-correct low-expression isoforms and to profile isoform expression. By comparing the annotated gene space of Cabernet Sauvignon to other grape cultivars, we demonstrate that the transcriptome reference built with Iso-Seq data represents most of the expressed genes in the grape berries and includes 1,501 cultivar-specific genes. Iso-Seq produced transcriptome profiles similar to those obtained after mapping on a complete genome reference. Together, these results justify the application of Iso-Seq to identify cultivar-specific genes and build a comprehensive reference for transcriptional profiling that circumvents the necessity of a genome reference with its associated costs and computational weight.Copyright © 2019 Minio et al.


April 21, 2020  |  

Origin and evolution of the octoploid strawberry genome.

Cultivated strawberry emerged from the hybridization of two wild octoploid species, both descendants from the merger of four diploid progenitor species into a single nucleus more than 1 million years ago. Here we report a near-complete chromosome-scale assembly for cultivated octoploid strawberry (Fragaria?×?ananassa) and uncovered the origin and evolutionary processes that shaped this complex allopolyploid. We identified the extant relatives of each diploid progenitor species and provide support for the North American origin of octoploid strawberry. We examined the dynamics among the four subgenomes in octoploid strawberry and uncovered the presence of a single dominant subgenome with significantly greater gene content, gene expression abundance, and biased exchanges between homoeologous chromosomes, as compared with the other subgenomes. Pathway analysis showed that certain metabolomic and disease-resistance traits are largely controlled by the dominant subgenome. These findings and the reference genome should serve as a powerful platform for future evolutionary studies and enable molecular breeding in strawberry.


April 21, 2020  |  

An Annotated Genome for Haliotis rufescens (Red Abalone) and Resequenced Green, Pink, Pinto, Black, and White Abalone Species.

Abalone are one of the few marine taxa where aquaculture production dominates the global market as a result of increasing demand and declining natural stocks from overexploitation and disease. To better understand abalone biology, aid in conservation efforts for endangered abalone species, and gain insight into sustainable aquaculture, we created a draft genome of the red abalone (Haliotis rufescens). The approach to this genome draft included initial assembly using raw Illumina and PacBio sequencing data with MaSuRCA, before scaffolding using sequencing data generated from Chicago library preparations with HiRise2. This assembly approach resulted in 8,371 scaffolds and total length of 1.498?Gb; the N50 was 1.895?Mb, and the longest scaffold was 13.2?Mb. Gene models were predicted, using MAKER2, from RNA-Seq data and all related expressed sequence tags and proteins from NCBI; this resulted in 57,785 genes with an average length of 8,255?bp. In addition, single nucleotide polymorphisms were called on Illumina short-sequencing reads from five other eastern Pacific abalone species: the green (H. fulgens), pink (H. corrugata), pinto (H. kamtschatkana), black (H. cracherodii), and white (H. sorenseni) abalone. Phylogenetic relationships largely follow patterns detected by previous studies based on 1,784,991 high-quality single nucleotide polymorphisms. Among the six abalone species examined, the endangered white abalone appears to harbor the lowest levels of heterozygosity. This draft genome assembly and the sequencing data provide a foundation for genome-enabled aquaculture improvement for red abalone, and for genome-guided conservation efforts for the other five species and, in particular, for the endangered white and black abalone.


April 21, 2020  |  

Multi-platform discovery of haplotype-resolved structural variation in human genomes.

The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50?bp) and 27,622 SVs (=50?bp) per genome. We also discover 156 inversions per genome and 58 of the inversions intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community allowing us to make recommendations for maximizing structural variation sensitivity for future genome sequencing studies.


April 21, 2020  |  

The transcriptome of Darwin’s bark spider silk glands predicts proteins contributing to dragline silk toughness.

Darwin’s bark spider (Caerostris darwini) produces giant orb webs from dragline silk that can be twice as tough as other silks, making it the toughest biological material. This extreme toughness comes from increased extensibility relative to other draglines. We show C. darwini dragline-producing major ampullate (MA) glands highly express a novel silk gene transcript (MaSp4) encoding a protein that diverges markedly from closely related proteins and contains abundant proline, known to confer silk extensibility, in a unique GPGPQ amino acid motif. This suggests C. darwini evolved distinct proteins that may have increased its dragline’s toughness, enabling giant webs. Caerostris darwini’s MA spinning ducts also appear unusually long, potentially facilitating alignment of silk proteins into extremely tough fibers. Thus, a suite of novel traits from the level of genes to spinning physiology to silk biomechanics are associated with the unique ecology of Darwin’s bark spider, presenting innovative designs for engineering biomaterials.


April 21, 2020  |  

Genome and proteome of the chlorophyll f-producing cyanobacterium Halomicronema hongdechloris: adaptative proteomic shifts under different light conditions.

Halomicronema hongdechloris was the first cyanobacterium to be identified that produces chlorophyll (Chl) f. It contains Chl a and uses phycobiliproteins as its major light-harvesting components under white light conditions. However, under far-red light conditions H. hongdechloris produces Chl f and red-shifted phycobiliprotein complexes to absorb and use far-red light. In this study, we report the genomic sequence of H. hongdechloris and use quantitative proteomic approaches to confirm the deduced metabolic pathways as well as metabolic and photosynthetic changes in response to different photo-autotrophic conditions.The whole genome of H. hongdechloris was sequenced using three different technologies and assembled into a single circular scaffold with a genome size of 5,577,845?bp. The assembled genome has 54.6% GC content and encodes 5273 proteins covering 83.5% of the DNA sequence. Using Tandem Mass Tag labelling, the total proteome of H. hongdechloris grown under different light conditions was analyzed. A total of 1816 proteins were identified, with photosynthetic proteins accounting for 24% of the total mass spectral readings, of which 35% are phycobiliproteins. The proteomic data showed that essential cellular metabolic reactions remain unchanged under shifted light conditions. The largest differences in protein content between white and far-red light conditions reflect the changes to photosynthetic complexes, shifting from a standard phycobilisome and Chl a-based light harvesting system under white light, to modified, red-shifted phycobilisomes and Chl f-containing photosystems under far-red light conditions.We demonstrate that essential cellular metabolic reactions under different light conditions remain constant, including most of the enzymes in chlorophyll biosynthesis and photosynthetic carbon fixation. The changed light conditions cause significant changes in the make-up of photosynthetic protein complexes to improve photosynthetic light capture and reaction efficiencies. The integration of the global proteome with the genome sequence highlights that cyanobacterial adaptation strategies are focused on optimizing light capture and utilization, with minimal changes in other metabolic pathways. Our quantitative proteomic approach has enabled a deeper understanding of both the stability and the flexibility of cellular metabolic networks of H. hongdechloris in response to changes in its environment.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.