Menu
July 7, 2019

HISEA: HIerarchical SEed Aligner for PacBio data.

The next generation sequencing (NGS) techniques have been around for over a decade. Many of their fundamental applications rely on the ability to compute good genome assemblies. As the technology evolves, the assembly algorithms and tools have to continuously adjust and improve. The currently dominant technology of Illumina produces reads that are too short to bridge many repeats, setting limits on what can be successfully assembled. The emerging SMRT (Single Molecule, Real-Time) sequencing technique from Pacific Biosciences produces uniform coverage and long reads of length up to sixty thousand base pairs, enabling significantly better genome assemblies. However, SMRT reads are much more expensive and have a much higher error rate than Illumina’s – around 10-15% – mostly due to indels. New algorithms are very much needed to take advantage of the long reads while mitigating the effect of high error rate and lowering the required coverage.An essential step in assembling SMRT data is the detection of alignments, or overlaps, between reads. High error rate and very long reads make this a much more challenging problem than for Illumina data. We present a new pairwise read aligner, or overlapper, HISEA (Hierarchical SEed Aligner) for SMRT sequencing data. HISEA uses a novel two-step k-mer search, employing consistent clustering, k-mer filtering, and read alignment extension.We compare HISEA against several state-of-the-art programs – BLASR, DALIGNER, GraphMap, MHAP, and Minimap – on real datasets from five organisms. We compare their sensitivity, precision, specificity, F1-score, as well as time and memory usage. We also introduce a new, more precise, evaluation method. Finally, we compare the two leading programs, MHAP and HISEA, for their genome assembly performance in the Canu pipeline.Our algorithm has the best alignment detection sensitivity among all programs for SMRT data, significantly higher than the current best. The currently best assembler for SMRT data is the Canu program which uses the MHAP aligner in its pipeline. We have incorporated our new HISEA aligner in the Canu pipeline and benchmarked it against the best pipeline for multiple datasets at two relevant coverage levels: 30x and 50x. Our assemblies are better than those using MHAP for both coverage levels. Moreover, Canu+HISEA assemblies for 30x coverage are comparable with Canu+MHAP assemblies for 50x coverage, while being faster and cheaper.The HISEA algorithm produces alignments with highest sensitivity compared with the current state-of-the-art algorithms. Integrated in the Canu pipeline, currently the best for assembling PacBio data, it produces better assemblies than Canu+MHAP.


July 7, 2019

Efficient transgenesis and annotated genome sequence of the regenerative flatworm model Macrostomum lignano.

Regeneration-capable flatworms are informative research models to study the mechanisms of stem cell regulation, regeneration, and tissue patterning. However, the lack of transgenesis methods considerably hampers their wider use. Here we report development of a transgenesis method for Macrostomum lignano, a basal flatworm with excellent regeneration capacity. We demonstrate that microinjection of DNA constructs into fertilized one-cell stage eggs, followed by a low dose of irradiation, frequently results in random integration of the transgene in the genome and its stable transmission through the germline. To facilitate selection of promoter regions for transgenic reporters, we assembled and annotated the M. lignano genome, including genome-wide mapping of transcription start regions, and show its utility by generating multiple stable transgenic lines expressing fluorescent proteins under several tissue-specific promoters. The reported transgenesis method and annotated genome sequence will permit sophisticated genetic studies on stem cells and regeneration using M. lignano as a model organism.


July 7, 2019

Complete genome sequences of two plant-associated Pseudomonas putida isolates with increased heavy-metal tolerance.

We report here the complete genome sequences of two Pseudomonas putida isolates recovered from surfac e-sterilized roots of Sida hermaphrodita The two isolates were characterized by an increased tolerance to zinc, cadmium, and lead. Furthermore, the strains showed typical plant growth-promoting properties, such as the production of indole acetic acid, cellulolytic enzymes, and siderophores. Copyright © 2017 Nesme et al.


July 7, 2019

Chromosome evolution in the free-living flatworms: first evidence of intrachromosomal rearrangements in karyotype evolution of Macrostomum lignano (Platyhelminthes, Macrostomida).

The free-living flatworm Macrostomum lignano is a hidden tetraploid. Its genome was formed by a recent whole genome duplication followed by chromosome fusions. Its karyotype (2n = 8) consists of a pair of large chromosomes (MLI1), which contain regions of all other chromosomes, and three pairs of small metacentric chromosomes. Comparison of MLI1 with metacentrics was performed by painting with microdissected DNA probes and fluorescent in situ hybridization of unique DNA fragments. Regions of MLI1 homologous to small metacentrics appeared to be contiguous. Besides the loss of DNA repeat clusters (pericentromeric and telomeric repeats and the 5S rDNA cluster) from MLI1, the difference between small metacentrics MLI2 and MLI4 and regions homologous to them in MLI1 were revealed. Abnormal karyotypes found in the inbred DV1/10 subline were analyzed, and structurally rearranged chromosomes were described with the painting technique, suggesting the mechanism of their origin. The revealed chromosomal rearrangements generate additional diversity, opening the way toward massive loss of duplicated genes from a duplicated genome. Our findings suggest that the karyotype of M. lignano is in the early stage of genome diploidization after whole genome duplication, and further studies on M. lignano and closely related species can address many questions about karyotype evolution in animals.


July 7, 2019

Draft sequencing of the heterozygous diploid genome of Satsuma (Citrus unshiu Marc.) using a hybrid assembly approach.

Satsuma (Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma (“Miyagawa Wase”) was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome.


July 7, 2019

Scaffolding of long read assemblies using long range contact information.

Long read technologies have revolutionized de novo genome assembly by generating contigs orders of magnitude longer than that of short read assemblies. Although assembly contiguity has increased, it usually does not reconstruct a full chromosome or an arm of the chromosome, resulting in an unfinished chromosome level assembly. To increase the contiguity of the assembly to the chromosome level, different strategies are used which exploit long range contact information between chromosomes in the genome.We develop a scalable and computationally efficient scaffolding method that can boost the assembly contiguity to a large extent using genome-wide chromatin interaction data such as Hi-C.we demonstrate an algorithm that uses Hi-C data for longer-range scaffolding of de novo long read genome assemblies. We tested our methods on the human and goat genome assemblies. We compare our scaffolds with the scaffolds generated by LACHESIS based on various metrics.Our new algorithm SALSA produces more accurate scaffolds compared to the existing state of the art method LACHESIS.


July 7, 2019

Observations on bipolar disjunctions of moonwort ferns (Botrychium, Ophioglossaceae).

Peter Raven, in 1963, included two fern taxa of the genus Botrychium in his list of plant species exhibiting American amphitropical bipolar disjunctions. He attributed the southern hemisphere occurrences to post-Pleistocene long-distance dispersal from counterparts in the northern hemisphere, probably assisted by annual bird migrations between the disjunct areas. Using genetic evidence gathered through worldwide analyses of phylogenetic relationship in Botrychium, we now review and reconsider Raven’s conclusions. Genetic similarities indicate that South American Botrychium dusenii is an allotetraploid taxon closely related to B. spathulatum, a North American endemic, and that B. lunaria in New Zealand possesses a genotype identical to that of a taxon in North America derived through introgressive hybridization between B. lunaria and an endemic North American species, B. neolunaria. Both North American counterparts exhibit Raven’s characteristics of bipolar disjuncts in their occurrence in mountain and coastal meadows, copious production of small propagules (spores in Botrychium), occurrence in habitats frequented by transpolar bird migrants, and ability to found new colonies through inbreeding. We discuss these characteristics in Botrychium and relative to other ferns and suggest further studies on Botrychium and related taxa to address questions of time, number, and mode of bipolar dispersals.© 2017 Botanical Society of America.


July 7, 2019

Complete genome sequence of Bacillus velezensis YJ11-1-4, a strain with broad-spectrum antimicrobial activity, isolated from traditional Korean fermented soybean paste.

Bacillus velezensis YJ11-1-4 is a strain that exhibits broad-spectrum antimicrobial activity against various pathogens. It was isolated from doenjang, a traditional Korean fermented soybean paste. The genome comprises a single circular chromosome of 4,006,637 bp with 46.42% G+C content without plasmids. Copyright © 2017 Lee et al.


July 7, 2019

Complete genome sequence of Bacillus vallismortis NBIF-001, a novel strain from Shangri-La, China, that has high activity against Fusarium oxysporum.

Bacillus vallismortis NBIF-001, a Gram-positive bacterium, was isolated from soil in Shangri-La, China. Here, we provide the complete genome sequence of this bacterium, which has a 3,929,787-bp-long genome, including 4,030 protein-coding genes and 195 RNA genes. This strain possesses a number of genes encoding virulence factors of pathogens. Copyright © 2017 Liu et al.


July 7, 2019

Complete genome sequence of Bacillus velezensis L-1, which has antagonistic activity against pear diseases.

Bacillus velezensis L-1 is an effective biocontrol agent against pear diseases. Here, we report the complete genome sequence of B. velezensis L-1 in which clusters related to the biosynthesis of secondary metabolites were predicted. This genome provides insights into the possible biocontrol mechanisms and furthers application of this specific bacterium. Copyright © 2017 Sun et al.


July 7, 2019

Phenotypic and genotypic features of a Salmonella Heidelberg strain isolated in broilers in Brazil and their possible association to antibiotics and short-chain organic acids resistance and susceptibility.

Salmonella enterica serovar Heidelberg is a human pathogen also found in broilers. A strain (UFPR1) has been associated with field reports of resistance to short-chain organic acids (SCOA) in broilers in the South of Brazil, but was susceptible to aBacillus subtilis-based probiotic added in feed in a related study. This work aimed to (i) report clinical symptoms caused by SH UFPR1 in broilers, (ii) study its susceptibility to some antibioticsin vitro, and (iii) SCOAin vivo; and (iv) relate these phenotypic observations with its genome characteristics. Twoin vivotrials used 1-day-old chicks housed for 21?days in 8 sterilized isolated negative pressure rooms with 4 battery cages of 12 birds each. Birds were challenged or not with 107?CFU/bird of SH UFPR1 orally and exposed or not to SCOA in a 2?×?2 factorial design. Zootechnical parameters were unaffected (P?>?0.05), no clinical signs were observed, and few cecal and hepatic histologic and immune-related alterations were seen, in birds challenged with SH. Formic and propionic acids added together in drinking water, fumaric and benzoic acid in feed (Trial 1), and coated calcium butyrate in feed (Trial 2) did not reduce the SH isolation frequencies seen in cecum and liver in broilers after SH challenge (P?>?0.05). SH UFPR1 was susceptible to amikacin, amoxicillin?+?clavulanate, ceftiofur, cephalexin, doxycycline and oxytetracycline; and mildly susceptible to ampicillin?+?sulbactam, cephalothin, ciprofloxacin, enrofloxacin, and gentamycin in anin vitrominimum inhibitory concentration model using Mueller-Hinton agar. The whole genome of SH UFPR1 was sequenced and consisted of a circular chromosome, spanning 4,760,321?bp with 52.18% of GC-content encoding 84 tRNA, 22 rRNA, and 4,427 protein-coding genes. The comparison between SH UFPR1 genome and a multidrug-resistant SL476 strain revealed 11 missing genomic fragments and 5 insertions related tobgt, bgr, andrpoSgenes. The deleted genes codify proteins associated with cell cycle regulation, virulence, drug resistance, cellular adhesion, and salt efflux which collectively reveal key aspects of the evolution and adaptation of SH strains such as organic acids resistance and antibiotic sensitivity and provide information relevant to the control of SH in poultry.


July 7, 2019

The genome of an intranuclear parasite, Paramicrosporidium saccamoebae, reveals alternative adaptations to obligate intracellular parasitism.

Intracellular parasitism often results in gene loss, genome reduction, and dependence upon the host for cellular functioning. Rozellomycota is a clade comprising many such parasites and is related to the diverse, highly reduced, animal parasites, Microsporidia. We sequenced the nuclear and mitochondrial genomes ofParamicrosporidium saccamoebae[Rozellomycota], an intranuclear parasite of amoebae. A canonical fungal mitochondrial genome was recovered fromP. saccamoebaethat encodes genes necessary for the complete oxidative phosphorylation pathway including Complex I, differentiating it from most endoparasites including its sequenced relatives in Rozellomycota and Microsporidia. Comparative analysis revealed thatP. saccamoebaeshares more gene content with distantly related Fungi than with its closest relatives, suggesting that genome evolution in Rozellomycota and Microsporidia has been affected by repeated and independent gene losses, possibly as a result of variation in parasitic strategies (e.g. host and subcellular localization) or due to multiple transitions to parasitism.


July 7, 2019

Post genomics era for orchid research.

Among 300,000 species in angiosperms, Orchidaceae containing 30,000 species is one of the largest families. Almost every habitats on earth have orchid plants successfully colonized, and it indicates that orchids are among the plants with significant ecological and evolutionary importance. So far, four orchid genomes have been sequenced, including Phalaenopsis equestris, Dendrobium catenatum, Dendrobium officinale, and Apostaceae shengen. Here, we review the current progress and the direction of orchid research in the post genomics era. These include the orchid genome evolution, genome mapping (genome-wide association analysis, genetic map, physical map), comparative genomics (especially receptor-like kinase and terpene synthase), secondary metabolomics, and genome editing.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.