Menu
April 21, 2020

Complete genome sequence of marine Bacillus sp. Y-01, isolated from the plastics contamination in the Yellow Sea

Plastics contamination in the environment has been an increasing ecological problem. Here we present the complete genome sequence of Bacillus sp. Y-01, isolated from plastic contamination samples in the Yellow Sea, which can utilize the polypropylene as the sole carbon and energy source. The strain has one circular chromosome of 5,130,901?bp in 8 contigs with a 38.24% GC content, consisting of 4996 protein-coding genes, 118 tRNA genes, as well as 40 rRNA operons as 5S-16S-23S rRNA. The complete genome sequence of Bacillus sp. Y-01 will provide useful genetic information to further detect the molecular mechanisms behind marine microplastics degradation.


April 21, 2020

Genome of Crucihimalaya himalaica, a close relative of Arabidopsis, shows ecological adaptation to high altitude.

Crucihimalaya himalaica, a close relative of Arabidopsis and Capsella, grows on the Qinghai-Tibet Plateau (QTP) about 4,000 m above sea level and represents an attractive model system for studying speciation and ecological adaptation in extreme environments. We assembled a draft genome sequence of 234.72 Mb encoding 27,019 genes and investigated its origin and adaptive evolutionary mechanisms. Phylogenomic analyses based on 4,586 single-copy genes revealed that C. himalaica is most closely related to Capsella (estimated divergence 8.8 to 12.2 Mya), whereas both species form a sister clade to Arabidopsis thaliana and Arabidopsis lyrata, from which they diverged between 12.7 and 17.2 Mya. LTR retrotransposons in C. himalaica proliferated shortly after the dramatic uplift and climatic change of the Himalayas from the Late Pliocene to Pleistocene. Compared with closely related species, C. himalaica showed significant contraction and pseudogenization in gene families associated with disease resistance and also significant expansion in gene families associated with ubiquitin-mediated proteolysis and DNA repair. We identified hundreds of genes involved in DNA repair, ubiquitin-mediated proteolysis, and reproductive processes with signs of positive selection. Gene families showing dramatic changes in size and genes showing signs of positive selection are likely candidates for C. himalaica’s adaptation to intense radiation, low temperature, and pathogen-depauperate environments in the QTP. Loss of function at the S-locus, the reason for the transition to self-fertilization of C. himalaica, might have enabled its QTP occupation. Overall, the genome sequence of C. himalaica provides insights into the mechanisms of plant adaptation to extreme environments.Copyright © 2019 the Author(s). Published by PNAS.


April 21, 2020

Complete Genome Sequence of the Wolbachia wAlbB Endosymbiont of Aedes albopictus.

Wolbachia, an alpha-proteobacterium closely related to Rickettsia, is a maternally transmitted, intracellular symbiont of arthropods and nematodes. Aedes albopictus mosquitoes are naturally infected with Wolbachia strains wAlbA and wAlbB. Cell line Aa23 established from Ae. albopictus embryos retains only wAlbB and is a key model to study host-endosymbiont interactions. We have assembled the complete circular genome of wAlbB from the Aa23 cell line using long-read PacBio sequencing at 500× median coverage. The assembled circular chromosome is 1.48 megabases in size, an increase of more than 300 kb over the published draft wAlbB genome. The annotation of the genome identified 1,205 protein coding genes, 34 tRNA, 3 rRNA, 1 tmRNA, and 3 other ncRNA loci. The long reads enabled sequencing over complex repeat regions which are difficult to resolve with short-read sequencing. Thirteen percent of the genome comprised insertion sequence elements distributed throughout the genome, some of which cause pseudogenization. Prophage WO genes encoding some essential components of phage particle assembly are missing, while the remainder are found in five prophage regions/WO-like islands or scattered around the genome. Orthology analysis identified a core proteome of 535 orthogroups across all completed Wolbachia genomes. The majority of proteins could be annotated using Pfam and eggNOG analyses, including ankyrins and components of the Type IV secretion system. KEGG analysis revealed the absence of five genes in wAlbB which are present in other Wolbachia. The availability of a complete circular chromosome from wAlbB will enable further biochemical, molecular, and genetic analyses on this strain and related Wolbachia. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Tools and Strategies for Long-Read Sequencing and De Novo Assembly of Plant Genomes.

The commercial release of third-generation sequencing technologies (TGSTs), giving long and ultra-long sequencing reads, has stimulated the development of new tools for assembling highly contiguous genome sequences with unprecedented accuracy across complex repeat regions. We survey here a wide range of emerging sequencing platforms and analytical tools for de novo assembly, provide background information for each of their steps, and discuss the spectrum of available options. Our decision tree recommends workflows for the generation of a high-quality genome assembly when used in combination with the specific needs and resources of a project.Copyright © 2019 Elsevier Ltd. All rights reserved.


April 21, 2020

Characterizing the major structural variant alleles of the human genome.

In order to provide a comprehensive resource for human structural variants (SVs), we generated long-read sequence data and analyzed SVs for fifteen human genomes. We sequence resolved 99,604 insertions, deletions, and inversions including 2,238 (1.6 Mbp) that are shared among all discovery genomes with an additional 13,053 (6.9 Mbp) present in the majority, indicating minor alleles or errors in the reference. Genotyping in 440 additional genomes confirms the most common SVs in unique euchromatin are now sequence resolved. We report a ninefold SV bias toward the last 5 Mbp of human chromosomes with nearly 55% of all VNTRs (variable number of tandem repeats) mapping to this portion of the genome. We identify SVs affecting coding and noncoding regulatory loci improving annotation and interpretation of functional variation. These data provide the framework to construct a canonical human reference and a resource for developing advanced representations capable of capturing allelic diversity. Copyright © 2018 Elsevier Inc. All rights reserved.


April 21, 2020

The smut fungus Ustilago esculenta has a bipolar mating system with three idiomorphs larger than 500?kb.

Zizania latifolia Turcz., which is mainly distributed in Asia, has had a long cultivation history as a cereal and vegetable crop. On infection with the smut fungus Ustilago esculenta, Z. latifolia becomes an edible vegetable, water bamboo. Two main cultivars, with a green shell and red shell, are cultivated for commercial production in Taiwan. Previous studies indicated that cultivars of Z. latifolia may be related to the infected U. esculenta isolates. However, related research is limited. The infection process of the corn smut fungus Ustilago maydis is coupled with sexual development and under control of the mating type locus. Thus, we aimed to use the knowledge of U. maydis to reveal the mating system of U. esculenta. We collected water bamboo samples and isolated 145 U. esculenta strains from Taiwan’s major production areas. By using PCR and idiomorph screening among meiotic offspring and field isolates, we identified three idiomorphs of the mating type locus and found no sequence recombination between them. Whole-genome sequencing (Illumina and PacBio) suggested that the mating system of U. esculenta was bipolar. Mating type locus 1 (MAT-1) was 552,895?bp and contained 44% repeated sequences. Sequence comparison revealed that U. esculenta MAT-1 shared high gene synteny with Sporisorium reilianum and many repeats with Ustilago hordei MAT-1. These results can be utilized to further explore the genomic diversity of U. esculenta isolates and their application for water bamboo breeding. Copyright © 2019 Elsevier Inc. All rights reserved.


April 21, 2020

De novo assembly of white poplar genome and genetic diversity of white poplar population in Irtysh River basin in China.

The white poplar (Populus alba) is widely distributed in Central Asia and Europe. There are natural populations of white poplar in Irtysh River basin in China. It also can be cultivated and grown well in northern China. In this study, we sequenced the genome of P. alba by single-molecule real-time technology. De novo assembly of P. alba had a genome size of 415.99 Mb with a contig N50 of 1.18 Mb. A total of 32,963 protein-coding genes were identified. 45.16% of the genome was annotated as repetitive elements. Genome evolution analysis revealed that divergence between P. alba and Populus trichocarpa (black cottonwood) occurred ~5.0 Mya (3.0, 7.1). Fourfold synonymous third-codon transversion (4DTV) and synonymous substitution rate (ks) distributions supported the occurrence of the salicoid WGD event (~ 65 Mya). Twelve natural populations of P. alba in the Irtysh River basin in China were sequenced to explore the genetic diversity. Average pooled heterozygosity value of P. alba populations was 0.170±0.014, which was lower than that in Italy (0.271±0.051) and Hungary (0.264±0.054). Tajima’s D values showed a negative distribution, which might signify an excess of low frequency polymorphisms and a bottleneck with later expansion of P. alba populations examined.


April 21, 2020

A New Species of the ?-Proteobacterium Francisella, F. adeliensis Sp. Nov., Endocytobiont in an Antarctic Marine Ciliate and Potential Evolutionary Forerunner of Pathogenic Species.

The study of the draft genome of an Antarctic marine ciliate, Euplotes petzi, revealed foreign sequences of bacterial origin belonging to the ?-proteobacterium Francisella that includes pathogenic and environmental species. TEM and FISH analyses confirmed the presence of a Francisella endocytobiont in E. petzi. This endocytobiont was isolated and found to be a new species, named F. adeliensis sp. nov.. F. adeliensis grows well at wide ranges of temperature, salinity, and carbon dioxide concentrations implying that it may colonize new organisms living in deeply diversified habitats. The F. adeliensis genome includes the igl and pdp gene sets (pdpC and pdpE excepted) of the Francisella pathogenicity island needed for intracellular growth. Consistently with an F. adeliensis ancient symbiotic lifestyle, it also contains a single insertion-sequence element. Instead, it lacks genes for the biosynthesis of essential amino acids such as cysteine, lysine, methionine, and tyrosine. In a genome-based phylogenetic tree, F. adeliensis forms a new early branching clade, basal to the evolution of pathogenic species. The correlations of this clade with the other clades raise doubts about a genuine free-living nature of the environmental Francisella species isolated from natural and man-made environments, and suggest to look at F. adeliensis as a pioneer in the Francisella colonization of eukaryotic organisms.


April 21, 2020

Finding Nemo’s Genes: A chromosome-scale reference assembly of the genome of the orange clownfish Amphiprion percula.

The iconic orange clownfish, Amphiprion percula, is a model organism for studying the ecology and evolution of reef fishes, including patterns of population connectivity, sex change, social organization, habitat selection and adaptation to climate change. Notably, the orange clownfish is the only reef fish for which a complete larval dispersal kernel has been established and was the first fish species for which it was demonstrated that antipredator responses of reef fishes could be impaired by ocean acidification. Despite its importance, molecular resources for this species remain scarce and until now it lacked a reference genome assembly. Here, we present a de novo chromosome-scale assembly of the genome of the orange clownfish Amphiprion percula. We utilized single-molecule real-time sequencing technology from Pacific Biosciences to produce an initial polished assembly comprised of 1,414 contigs, with a contig N50 length of 1.86 Mb. Using Hi-C-based chromatin contact maps, 98% of the genome assembly were placed into 24 chromosomes, resulting in a final assembly of 908.8 Mb in length with contig and scaffold N50s of 3.12 and 38.4 Mb, respectively. This makes it one of the most contiguous and complete fish genome assemblies currently available. The genome was annotated with 26,597 protein-coding genes and contains 96% of the core set of conserved actinopterygian orthologs. The availability of this reference genome assembly as a community resource will further strengthen the role of the orange clownfish as a model species for research on the ecology and evolution of reef fishes. © 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.


April 21, 2020

Transmission of ESBL-producing Escherichia coli between broilers and humans on broiler farms.

ESBL and AmpC ß-lactamases are an increasing concern for public health. Studies suggest that ESBL/pAmpC-producing Escherichia coli and their plasmids carrying antibiotic resistance genes can spread from broilers to humans working or living on broiler farms. These studies used traditional typing methods, which may not have provided sufficient resolution to reliably assess the relatedness of these isolates.Eleven suspected transmission events among broilers and humans living/working on eight broiler farms were investigated using whole-genome short-read (Illumina) and long-read sequencing (PacBio). Core genome MLST (cgMLST) was performed to investigate the occurrence of strain transmission. Horizontal plasmid and gene transfer were analysed using BLAST.Of eight suspected strain transmission events, six were confirmed. The isolate pairs had identical ESBL/AmpC genes and fewer than eight allelic differences according to the cgMLST, and five had an almost identical plasmid composition. On one of the farms, cgMLST revealed that the isolate pairs belonging to ST10 from a broiler and a household member of the farmer had 475 different alleles, but that the plasmids were identical, indicating horizontal transfer of mobile elements rather than strain transfer. Of three suspected horizontal plasmid transmission events, one was confirmed. In addition, gene transfer between plasmids was found.The present study confirms transmission of strains as well as horizontal plasmid and gene transfer between broilers and farmers and household members on the same farm. WGS is an important tool to confirm suspected zoonotic strain and resistance gene transmission. © The Author(s) 2019. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For permissions, please email: journals.permissions@oup.com.


April 21, 2020

Complete genome sequence of an IMP-8, CTX-M-14, CTX-M-3 and QnrS1 co-producing Enterobacter asburiae isolate from a patient with wound infection.

The aim of this study was to investigate the characteristics and complete genome sequence of an IMP-8, CTX-M-14, CTX-M-3 and QnrS1 co-producing multidrug-resistant Enterobacter asburiae isolate (EN3600) from a patient with wound infection.Species identification was confirmed by matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry (MALDI-TOF/MS). Carbapenemase genes were identified by PCR and Sanger sequencing. The complete genome sequence of E. asburiae EN3600 was obtained using a PacBio RS II platform. Genome annotation was done by Rapid Annotation using Subsystem Technology (RAST) server. Acquired antimicrobial resistance genes (ARGs) and plasmid replicons were detected using ResFinder 2.1 and PlasmidFinder 1.3, respectively.The genome of E. asburiae EN3600 consists of a 4.8-Mbp chromosome and five plasmids. The annotated genome contains various ARGs conferring resistance to aminoglycosides, ß-lactams, fluoroquinolones, fosfomycin, macrolides, phenicols, rifampicin and sulfonamides. In addition, plasmids of incompatibility (Inc) groups IncHI2A, IncFIB(pECLA), IncFIB(pQil) and IncP1 were identified. The genes blaIMP-8, blaCTX-M-14 and blaCTX-M-3 were located on different plasmids. The blaIMP-8 gene was carried by an 86-kb IncFIB(pQil) plasmid. The blaCTX-M-3 and qnrS1 genes were co-harboured by an IncP1 plasmid. In addition, blaCTX-M-14 was associated with blaTEM-1B, blaOXA-1, catB3 and sul1 genes in a 116-kb non-typeable plasmid.To our knowledge, this is the first complete genome sequence of an E. asburiae isolate co-producing IMP-8, CTX-M-14, CTX-M-3 and QnrS1. This genome may facilitate the understanding of the resistome, pathogenesis and genomic features of Enterobacter cloacae complex (ECC) and will provide valuable information for accurate identification of ECC.Copyright © 2019 International Society for Antimicrobial Chemotherapy. Published by Elsevier Ltd. All rights reserved.


April 21, 2020

Genome sequence of Jatropha curcas L., a non-edible biodiesel plant, provides a resource to improve seed-related traits.

Jatropha curcas (physic nut), a non-edible oilseed crop, represents one of the most promising alternative energy sources due to its high seed oil content, rapid growth and adaptability to various environments. We report ~339 Mbp draft whole genome sequence of J. curcas var. Chai Nat using both the PacBio and Illumina sequencing platforms. We identified and categorized differentially expressed genes related to biosynthesis of lipid and toxic compound among four stages of seed development. Triacylglycerol (TAG), the major component of seed storage oil, is mainly synthesized by phospholipid:diacylglycerol acyltransferase in Jatropha, and continuous high expression of homologs of oleosin over seed development contributes to accumulation of high level of oil in kernels by preventing the breakdown of TAG. A physical cluster of genes for diterpenoid biosynthetic enzymes, including casbene synthases highly responsible for a toxic compound, phorbol ester, in seed cake, was syntenically highly conserved between Jatropha and castor bean. Transcriptomic analysis of female and male flowers revealed the up-regulation of a dozen family of TFs in female flower. Additionally, we constructed a robust species tree enabling estimation of divergence times among nine Jatropha species and five commercial crops in Malpighiales order. Our results will help researchers and breeders increase energy efficiency of this important oil seed crop by improving yield and oil content, and eliminating toxic compound in seed cake for animal feed. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

The Genome of Cucurbita argyrosperma (Silver-Seed Gourd) Reveals Faster Rates of Protein-Coding Gene and Long Noncoding RNA Turnover and Neofunctionalization within Cucurbita.

Whole-genome duplications are an important source of evolutionary novelties that change the mode and tempo at which genetic elements evolve within a genome. The Cucurbita genus experienced a whole-genome duplication around 30 million years ago, although the evolutionary dynamics of the coding and noncoding genes in this genus have not yet been scrutinized. Here, we analyzed the genomes of four Cucurbita species, including a newly assembled genome of Cucurbita argyrosperma, and compared the gene contents of these species with those of five other members of the Cucurbitaceae family to assess the evolutionary dynamics of protein-coding and long intergenic noncoding RNA (lincRNA) genes after the genome duplication. We report that Cucurbita genomes have a higher protein-coding gene birth-death rate compared with the genomes of the other members of the Cucurbitaceae family. C. argyrosperma gene families associated with pollination and transmembrane transport had significantly faster evolutionary rates. lincRNA families showed high levels of gene turnover throughout the phylogeny, and 67.7% of the lincRNA families in Cucurbita showed evidence of birth from the neofunctionalization of previously existing protein-coding genes. Collectively, our results suggest that the whole-genome duplication in Cucurbita resulted in faster rates of gene family evolution through the neofunctionalization of duplicated genes. Copyright © 2019 The Author. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Whole genome sequence of Auricularia heimuer (Basidiomycota, Fungi), the third most important cultivated mushroom worldwide.

Heimuer, Auricularia heimuer, is one of the most famous traditional Chinese foods and medicines, and it is the third most important cultivated mushroom worldwide. The aim of this study is to develop genomic resources for A. heimuer to furnish tools that can be used to study its secondary metabolite production capability, wood degradation ability and biosynthesis of polysaccharides. The genome was obtained from single spore mycelia of the strain Dai 13782 by using combined high-throughput Illumina HiSeq 4000 system with the PacBio RSII long-read sequencing platform. Functional annotation was accomplished by blasting protein sequences with different public available databases to obtain their corresponding annotations. It is 49.76Mb in size with a N50 scaffold size of 1,350,668bp and encodes 16,244 putative predicted genes. This is the first genome-scale assembly and annotation for A. heimuer, which is the third sequenced species in Auricularia. Copyright © 2018 Elsevier Inc. All rights reserved.


April 21, 2020

Complete Genome Sequence of Lactic Acid Bacterium Pediococcus acidilactici Strain ATCC 8042, an Autolytic Anti-bacterial Peptidoglycan Hydrolase Producer

Pediococcus acidilactici is a probiotic bacterium that is industrially utilized in the food industry and antibiotics development. Here, we determine the complete nucleotide sequence of the genome of Pediococcus acidilactici ATCC 8042. The genome was sequenced by the PacBio RSII to generate a single contig consisting of circular chromosome sequence. Illumina MiniSeq sequencing platform and Sanger sequencing method were additionally utilized to correct errors resulting from the long-read sequencing platform. The sequence consists of 2,009,598 bp with a G + C content of 42.1% and contains 1,865 protein-coding sequences. Based on the sequence information, we could confirm and predict the presence of four peptidoglycan hydrolases by HyPe software. This work, therefore, provides the complete genomic information of P. acidilactici ATCC 8042 with a profitable potential of genome-scale comprehension of anti-pathogenic activity, which can be applied in nutraceutical and pharmaceutical biotechnology field.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.