Manual Reference Pages - bw a (1) NAME bwa - Burrows-Wheeler Alignment Tool CONTENTS Synopsis Description Commands And Options Sam Alignment Format Notes On Short-read Alignment Alignment Accuracy Estimating Insert Size Distribution Memory Requirement Speed Notes On Long-read Alignment See Also Author License And Citation History SYNOPSIS bwa index -a bwtsw database.fasta bwa aln database.fasta short_read.fastq > aln_sa.sai bwa samse database.fasta aln_sa.sai short_read.fastq > aln.sam bwa sampe database.fasta aln_sa1.sai aln_sa2.sai read1.fq read2.fq > aln.sam bwa bwasw database.fasta long_read.fastq > aln.sam DESCRIPTION BWA is a fast light-weighted tool that aligns relatively short sequences (queries) to a sequence database (targe), such as the human reference genome. It implements two different algorithms, both based on Burrows-Wheeler Transform (BWT). The first algorithm is designed for short queries up to ~200bp with low error rate (<3%). It does gapped global alignment w.r.t. queries, supports paired-end reads, and is one of the fastest short read alignment algorithms to date while also visiting suboptimal hits. The second algorithm, BWA-SW, is designed for long reads with more errors. It performs heuristic Smith-Waterman-like alignment to find high-scoring local hits (and thus chimera). On low-error short queries, BWA-SW is slower and less accurate than the first algorithm, but on long queries, it is better. For both algorithms, the database file in the FASTA format must be first indexed with the ‘index’ command, which typically takes a few hours. The first algorithm is implemented via the ‘aln’ command, which finds the suffix array (SA) coordinates of good hits of each individual read, and the ‘samse/...