Table of Contents
Introduction
Novoalign and NovoalignCS are highly accurate programs for mapping next-generation sequencing reads to a reference database. The purpose of this guide is to provide quick, usable examples for everyday scenarios of working with short read data and aligning these to a reference genome sequence database.
For more in-depth understanding of the various options available please consult the Novoalign & NovoalignCS reference manual available from our free Downloads area.
The specially formatted code text contains UNIX commands that may be copied and pasted in your environment to accomplish the tasks e.g.
<br /><span style="color:rgb(0,0,255);">This is an example of unix command formatting</span><br />
Dependencies
Ensure that you have these executables in your $PATH environment variable
- Novocraft alignment suite (novoalign, novoalignCS, novoalignMPI, novobarcode, novoindex,etc).
- BedTools command line programs.
- Samtools version 0.1.8 or highter command line programs.
- Picard java command line programs
- Genome Analysis Toolkit java programs.
References
- Li H.*, Handsaker B.*, Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. and 1000 Genome Project Data Processing Subgroup (2009) The Sequence alignment/map (SAM) format and SAMtools. Bioinformatics, 25, 2078-9. [PMID: 1950594]
- Quinlan, AR and Hall, IM, 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 26, 6, pp. 841–842.
- The Picard library. http://picard.sf.net
- The Genome Analysis toolkit. http://www.broadinstitute.org/gsa/wiki/index.php/The_Genome_Analysis_Toolkit.