June 1, 2021

A low DNA input protocol for high-quality PacBio de novo genome assemblies from single invertebrate individuals

A high-quality reference genome is an essential tool for studies of plant and animal genomics. PacBio Single Molecule, Real-Time (SMRT) Sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. PacBio is the core technology for many large genome initiatives; however, relatively high DNA input requirements (5 µg for the standard library protocol) have placed PacBio out of reach for many projects on small, non-inbred organisms that may have lower DNA content. Here we present high-quality de novo genome assemblies from single invertebrate individuals for two different species: the Anopheles coluzzii mosquito and the Schistosoma mansoni parasitic flatworm. A modified SMRTbell library construction protocol without DNA shearing or size selection was used to generate a SMRTbell library from just 50-100 ng of starting genomic DNA. The libraries were run on the Sequel System with chemistry v3.0 and software v6.0, generating 21-32 Gb of sequence per SMRT Cell with 20-hour movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting assemblies had high contiguity (contig N50s over 3 Mb for both species) and completeness (as determined by conserved BUSCO gene analysis). We were also able to resolve maternal and paternal haplotypes for one third of the genome in both cases. Because the sequenced material comes from a single diploid individual, only two haplotypes are present, which simplifies the assembly process compared to samples from multiple pooled individuals. This new low-input approach puts PacBio-based assemblies in reach for small, highly heterozygous organisms that comprise much of the diversity of life. The method presented here can be applied to samples with starting DNA amounts around 100 ng per 250 Mb – 1 Gb genome size.
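
For readers unfamiliar with the contiguity metric quoted above, the following is a minimal sketch of how a contig N50 is commonly computed. The function name and the contig lengths are hypothetical illustrations, not values or code from the study.

```python
def contig_n50(lengths):
    """Return the contig N50 for a list of contig lengths (in bp).

    N50 is the length L of the contig at which, going from the longest
    contig downward, the running total first reaches at least half of
    the total assembly size.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    half_total = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths in bp (not from the assemblies described above).
example_contigs = [4_000_000, 3_000_000, 2_000_000, 1_000_000]
print(contig_n50(example_contigs))  # 3000000
```

In this toy assembly of 10 Mb, the two longest contigs together cover 7 Mb, which passes the 5 Mb halfway point, so the N50 is the length of the second contig.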


June 1, 2021

A low DNA input protocol for high-quality PacBio de novo genome assemblies

A high-quality reference genome is an essential tool for studying the genetics of traits and disease, organismal, comparative and conservation biology, and population genomics. PacBio Single Molecule, Real-Time (SMRT) Sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives. However, relatively high DNA input requirements (3 µg for the standard library protocol) have placed PacBio out of reach for many projects on small organisms that may have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a modified SMRTbell library construction protocol without DNA shearing or size selection that can be used to generate a SMRTbell library from just 150 ng of starting genomic DNA. Remarkably, the protocol enables high-quality de novo assemblies from single invertebrate individuals and is applied to taxonomically diverse samples. Because the sequenced material comes from a single diploid individual, only two haplotypes are present, which simplifies the assembly process compared to samples from multiple pooled individuals. The libraries were run on the Sequel System with chemistry v3.0 and software v6.0, generating ~11 Gb of sequence per SMRT Cell with 10-hour movies, followed by de novo genome assembly with FALCON. The resulting assemblies had high contiguity (contig N50s over 1 Mb) and completeness (as determined by conserved BUSCO gene analysis) when at least 30-fold unique molecular coverage was obtained. This new low-input approach now puts PacBio-based assemblies in reach for small, highly heterozygous organisms that comprise much of the diversity of life. The method presented here is scalable and can be applied to samples with starting DNA amounts of 150 ng per 300 Mb genome size.
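
To give a sense of scale for the 150 ng per 300 Mb figure, here is a rough sketch that estimates how many haploid genome copies a given DNA mass contains. It assumes an average mass of about 650 g/mol per double-stranded base pair, a common rule of thumb that is not stated in the abstract, and the function name is hypothetical. It counts copies in the input only; the unique molecular coverage actually obtained depends on library yield and loading, which this sketch does not model.

```python
AVOGADRO = 6.022e23              # molecules per mole
AVG_BP_MASS_G_PER_MOL = 650.0    # assumption: approximate mass of one double-stranded base pair


def haploid_genome_copies(dna_ng, genome_size_bp):
    """Estimate the number of haploid genome copies in dna_ng nanograms of DNA."""
    genome_mass_g = genome_size_bp * AVG_BP_MASS_G_PER_MOL / AVOGADRO
    return (dna_ng * 1e-9) / genome_mass_g


# Input amounts from the abstract: 150 ng of DNA for a 300 Mb genome.
copies = haploid_genome_copies(150, 300e6)
print(f"{copies:.2e} haploid genome copies")
```

Under this assumption, the input holds on the order of several hundred thousand genome copies, far above the 30-fold unique coverage threshold, which leaves room for losses during library preparation.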


June 1, 2021

Every species can be a model: Reference-quality PacBio genomes from single insects

A high-quality reference genome is an essential resource for primary and applied research across the tree of life. Genome projects for small-bodied, non-model organisms such as insects face several unique challenges, including limited DNA input quantities, high heterozygosity, and difficulty of culturing or inbreeding in the lab. Recent progress in PacBio library preparation protocols, sequencing throughput, and read accuracy addresses these challenges. We present several case studies, including the Red Admiral (Vanessa atalanta), Monarch Butterfly (Danaus plexippus), and Anopheles malaria mosquitoes, that highlight the benefits of sequencing single individuals for de novo genome assembly projects and the ease with which these projects can be conducted by individual research labs. Sampled individuals may originate from lab colonies of interest to the research community or be sourced from the wild to better capture natural variation in a focal population. Where genomic DNA quantities are limited, the PacBio Low DNA Input Protocol requires ~100 ng of input DNA. Low DNA input samples with a genome size of 500 Mb or less can be multiplexed on a single SMRT Cell 8M on the Sequel II System. For samples with more abundant DNA, size-selected libraries may be constructed to maximize sequencing yield. Both low DNA input and size-selected libraries can be used to generate HiFi reads, whose quality is Q20 or above (1% error or less) and whose lengths range from 10 – 25 kb. With HiFi reads, de novo assembly computation is greatly simplified relative to long-read methods due to smaller sequence file sizes and more rapid analysis, resulting in highly accurate, contiguous, complete, and haplotype-resolved assemblies.
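
The equivalence between Q20 and a 1% error rate follows from the standard Phred scale, Q = -10 · log10(p). A minimal sketch of the conversion in both directions is shown below; the function names are hypothetical.

```python
import math


def phred_to_error(q):
    """Convert a Phred quality score to an error probability."""
    return 10 ** (-q / 10)


def error_to_phred(p):
    """Convert an error probability (0 < p <= 1) to a Phred quality score."""
    if not 0 < p <= 1:
        raise ValueError("error probability must be in (0, 1]")
    return -10 * math.log10(p)


for q in (20, 30, 40):
    # Each step of 10 in Q corresponds to a tenfold drop in error probability.
    print(f"Q{q}: error probability {phred_to_error(q):.4%}")
```

On this scale Q20 corresponds to 1 error in 100 bases, Q30 to 1 in 1,000, and Q40 to 1 in 10,000.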


June 1, 2021

A high-quality PacBio insect genome from 5 ng of input DNA

High-quality insect genomes are essential resources for understanding insect biology and for combating insects that act as disease vectors and agricultural pests. It is desirable to sequence a single individual for a reference genome to avoid complications from multiple alleles during de novo assembly. However, the small body size of many insects poses a challenge for long-read sequencing technologies, which often have high DNA input requirements. The previously described PacBio Low DNA Input Protocol starts with ~100 ng of DNA and allows high-quality assemblies of single mosquitoes, among other organisms, which represents a significant step toward reducing such requirements. Here, we describe a new library protocol with a further 20-fold reduction in the DNA input quantity. Starting with just 5 ng of high molecular weight DNA, we describe the successful sequencing and de novo genome assembly of a single male sandfly (Phlebotomus papatasi, the main vector of Old World cutaneous leishmaniasis), using HiFi data generated on the PacBio Sequel II System and assembled with FALCON. The assembly shows a high degree of completeness (>97% of BUSCO genes are complete), contiguity (contig N50 of 1 Mb), and sequence accuracy (>98% of BUSCO genes without frameshift errors). This workflow has general utility for small-bodied insects and other plant and animal species, both for focused research studies and in conjunction with large-scale genome projects.

