Animals in the phylum Hemichordata have provided key understanding of the origins and development of body patterning and nervous system organization. However, efforts to sequence and assemble the genomes of highly heterozygous non-model organisms have proven difficult with traditional short-read approaches. Long repetitive DNA structures, extensive structural variation between haplotypes in polyploid species, and large genome sizes are limiting factors in achieving highly contiguous genome assemblies. Here we present the highly contiguous de novo assembly and preliminary annotation of the genome of an indirect-developing hemichordate, Schizocardium californicum, using SMRT Sequencing long reads.
Recent improvements in sequencing chemistry and instrument performance combine to create a new PacBio data type, Single Molecule High-Fidelity reads (HiFi reads). Increased read length and improvements in library construction enable average read lengths of 10-20 kb with average sequence identity greater than 99% from raw single-molecule reads. The resulting reads have accuracy comparable to short-read NGS but are 50-100 times longer. Here we benchmark the performance of this data type by sequencing and genotyping the Genome in a Bottle (GIAB) HG002 human reference sample from the National Institute of Standards and Technology (NIST). We…
Background: Long-read sequencing presents several potential advantages for providing more complete gene profiling of metagenomic samples. Long reads can capture multiple genes in a single read, and longer reads typically result in assemblies with better contiguity, especially for higher abundance organisms. However, a major challenge with using long reads has been the higher cost per base, which may lead to insufficient coverage of low-abundance species. Additionally, lower single-pass accuracy can make gene discovery for low-abundance organisms difficult. Methods: To evaluate the pros and cons of long reads for metagenomics, we directly compared PacBio and Illumina sequencing on a soil-derived sample,…
Highly repetitive satellite DNA (satDNA) repeats are found in most eukaryotic genomes. SatDNAs are rapidly evolving and have roles in genome stability and chromosome segregation. Their repetitive nature poses a challenge for genome assembly and makes progress on the detailed study of satDNA structure difficult. Here, we use single-molecule sequencing long reads from Pacific Biosciences (PacBio) to determine the detailed structure of all major autosomal complex satDNA loci in Drosophila melanogaster, with a particular focus on the 260-bp and Responder satellites. We determine the optimal de novo assembly methods and parameter combinations required to produce a high-quality assembly of these…
Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ~7.9 million base pairs (Mb), representing a ~300-fold improvement over the previous version. The vast majority (>99.8%) of the assembly was anchored to…
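The contig N50 quoted in the strawberry abstract above is a standard contiguity metric: the length of the contig at which the cumulative length of the contigs, sorted from longest to shortest, first reaches half of the total assembly length. The following is a minimal sketch of that calculation, not the authors' pipeline. The function name `contig_n50` and the toy contig lengths are hypothetical and chosen only for illustration.

```python
def contig_n50(lengths):
    """Return the N50 of a collection of contig lengths (in bp).

    N50 is the length L such that contigs of length >= L together
    cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)  # longest contigs first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly (lengths in bp), not data from any abstract here.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(contig_n50(toy_lengths))
```

Because N50 depends only on the length distribution, a fold improvement such as the one reported above is simply the ratio of the new N50 to the old one. It does not by itself measure assembly correctness.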
Streptomyces sp. H-KF8 is a fjord-derived marine actinobacterium that exhibits antimicrobial activity. Streptomyces sp. H-KF8 was isolated from sediments of the Comau fjord, located in northern Chilean Patagonia. Here, we report the 7.7-Mb genome assembly, which represents the first genome of a Chilean marine actinobacterium. Copyright © 2017 Undabarrena et al.
Edwardsiella hoshinae is a Gram-negative facultative anaerobe that has primarily been isolated from avians and reptiles. We report here the complete and annotated genome sequence of an isolate from a monitor lizard (Varanus sp.), which contains a chromosome of 3,811,650 bp and no plasmids. Copyright © 2017 Reichley et al.
Staphylococcus aureus causes a variety of human diseases ranging in severity. The pathogenicity of S. aureus can be partially attributed to the acquisition of mobile genetic elements. In this report, we provide two complete genome sequences from human clinical S. aureus isolates. Copyright © 2017 Hau et al.
Legionella is a highly diverse genus of intracellular bacterial pathogens that cause Legionnaires' disease (LD), an often severe form of pneumonia. Two L. micdadei clinical isolates, obtained from patients hospitalized with LD in geographically distinct areas, were sequenced using PacBio SMRT cell technology, identifying incomplete phage regions, which may impact virulence. Copyright © 2017 Osborne et al.
We report here the complete genome sequence of the livestock-associated methicillin-resistant Staphylococcus aureus strain 08S00974 from sequence type 398 (ST398 LA-MRSA), isolated from a fattening pig at a farm in Germany. Copyright © 2017 Makarova et al.
Zinc resistance in livestock-associated methicillin-resistant Staphylococcus aureus (LA-MRSA) sequence type (ST) 398 is primarily mediated by the czrC gene, which is co-located with the mecA gene, encoding methicillin resistance, within the type V SCCmec element. Because czrC and mecA are located within the same mobile genetic element, it has been suggested that the use of in-feed zinc as an antidiarrheal agent has the potential to contribute to the emergence and spread of MRSA in swine through increased selection pressure to maintain the SCCmec element in isolates obtained from pigs. In this study, we report the prevalence of the czrC gene…
We report here the complete genome sequence of the methicillin-sensitive Staphylococcus aureus subsp. aureus strain ATCC 6538 (FDA 209, DSM 799, WDCM 00032, and NCTC 10788). Copyright © 2017 Makarova et al.
Polyporus brumalis is able to synthesize several sesquiterpenes during fungal growth. Using a single-molecule real-time sequencing platform, we present the 53-Mb draft genome of P. brumalis, which contains 6,231 protein-coding genes. Gene annotation and isolation provide genetic information that can increase the understanding of sesquiterpene metabolism in P. brumalis. Copyright © 2017 Lee et al.
The draft whole-genome sequence of the Spodoptera frugiperda Sf9 insect cell line was obtained using long-read PacBio sequencing technology and Canu assembly. The final assembled genome consisted of 451 Mbp in 4,577 contigs, with 12,716× mean coverage and a G+C content of 36.53%.
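Summary statistics like the G+C content reported for the Sf9 assembly above are computed across all contigs of an assembly. The sketch below shows one common way to do this, assuming the contigs are already available as plain strings and that ambiguous bases (such as N) are excluded from the denominator. The function name `assembly_gc_percent` and the example sequences are hypothetical and are not taken from the cited work.

```python
def assembly_gc_percent(contigs):
    """Return the G+C content (percent) across all contigs.

    Only unambiguous bases (A, C, G, T) are counted, so runs of N
    or other ambiguity codes do not dilute the estimate.
    """
    gc = 0
    counted = 0
    for seq in contigs:
        seq = seq.upper()  # tolerate soft-masked (lowercase) sequence
        gc += seq.count("G") + seq.count("C")
        counted += sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("no unambiguous bases found")
    return 100.0 * gc / counted


# Hypothetical toy contigs, for illustration only.
toy_contigs = ["ACGT", "ggcc", "ATNN"]
print(round(assembly_gc_percent(toy_contigs), 2))
```

Pooling counts over all contigs, rather than averaging per-contig percentages, weights each contig by its length, which is the usual convention for a genome-wide figure.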