BGI is a recognized leader in De Novo Genome Sequencing and has extensive experience from the de novo sequencing and assembly of more than 400 species’ genomes. Contact us today with any questions or for a no obligation quote.
De Novo Sequencing
De novo sequencing refers to the sequencing of a novel genome without a reference sequence for alignment. The process of de novo genome sequencing involves the sequencing of DNA fragments, assembling the reads into longer sequences (contigs) and finally ordering the contigs to obtain the entire genome sequence.
BGI has been providing de novo whole genome sequencing for over 20 years and has published countless de novo genomes across a huge variety of species in the world’s leading journals.
We offer a complete suite of technologies to support your de novo sequencing projects, along with expert assistance with the planning of optimal sequencing and bioinformatics options, to assure your project is a success.
- Library preparations (DNBSEQ™/Illumina, Nanopore PromethION,
PacBio Sequel II etc) - Various sequencing mode
- Guaranteed ≥90% of DNBSEQ™ clean bases with quality score of Q20
- Guaranteed ≥100Gb Nanopore pass data with Read length >20kb
- Guaranteed ≥100Gb PacBio Sequel II CLR data with Read length N50>20kb
- Guaranteed ≥20Gb PacBio Sequel II CCS (HiFi library) data with accuracy greater than 99%
- DNBSEQ,350 bp library,≥0.2 μg (recommend≥0.4 μg),8 ng/μL,The band shown on gel electrophoresis has little degradation, or of fragment size greater than 20 kb.
- Nanopore,20-50 Kb Normal long library,≥2 μg,50 ng/μL,OD260/280: 1.8-2.0 OD260/230:≥1.5,The band shown on gel electrophoresis has no or little degradation.
- Nanopore,Ultra long library,≥10 μg,50 ng/μLOD260/280: 1.8-2.0 OD260/230:≥1.5,The band shown on gel electrophoresis has no or little degradation.
- PacBio Revio,15-20 Kb HIFI library,≥14 μg,80 ng/μL,OD260/280: 1.6-2.2 OD260/230: 1.6-2.5,The band shown on gel electrophoresis has no or little degradation. (The main peak is around 40 Kb with smear no smaller than 20 Kb).
- Genome Survey: 1-Kmer estimation (Jellyfish + GenomeScope), 2-External pollution Analysis (BWA);
- Genome Assembly (Pacbio HiFi data): 1- Assembly,2- Assessment by short reads alignment,3- BUSCO assessment
- Genome Assembly (Pacbio CLR /Nanopore data+NGS): 1- Reads correction,2- Assembly,3- Assembly result correction using long reads,4- Assembly result correction using short reads,5- BUSCO assessment
- stLFR Assembly: Genome assembly by stLFR data
- Gene Annotation: 1- Repeat annotation,2- Gene structure annotation,3- Gene function Annotation,4- Transcrip factors (plant)
- Evolution: Deliver published genome and allied species (less than 10 species) 1. Gene family identification (Animal TreeFam; Plant OrthoMCL; ≤10 species);2. Phylogenetic tree construction; 3. Estimation of divergence time; 4. Genome synteny analysis; 5. Whole genome duplication analysis; 6. long fragments duplication (Animal LASTZ), 7. Gene family expansion and contraction analysis
- Auxiliary Assembly: Hi-C data auxiliary assembly
De Novo Sequencing Technology
BGI provides various combinations of sequencing platforms, sequencing read-lengths and paired-end library options for De Novo sequencing applications for plant and animal. Services are performed on DNBSEQ™, Illumina, Nanopore PromethION, PacBio Sequel II or Sequel platforms.
Besides clean data output, BGI offers a range of standard and customized bioinformatics pipelines for your plant and animal de novo sequencing project covering: GENOME SURVEY, GENE ANNOTATION, EVOLUTION ANALYSIS and AUXILIARY ASSEMBLY.
Reports and output data files are delivered in industry standard file formats: BAM, .xls, .png. Raw FASTQ and FASTA data is available.
