bio-rainbow(1)
clustering and assembling short reads for bioinformatics
Description
BIO-RAINBOW
NAME
bio-rainbow - clustering and assembling short reads for bioinformatics
SYNOPSIS
rainbow <cmd> [options]
DESCRIPTION
rainbow 2.0.4 -- <ruanjue@gmail.com, chongzechen@gmail.com>
cluster
Input File Format: paired fasta/fastq file(s) Output File Format: <seqid:int>\t<cluster_id:int>\t<read1:string>\t<read2:string>
|
-1 <string> Input fasta/fastq file, supports multiple ’-1’ |
||
|
-2 <string> Input fasta/fastq file, supports multiple ’-2’ [null] |
-l <int>
Read length, default: 0 variable
-m <int>
Maximum mismatches [4]
-e <int>
Exactly matching threshold [2000]
|
-L |
Low level of polymorphism |
div
Input File Format: <seqid:int>\t<cluster_id:int>\t<read1:string>\t<read2:string> Output File Format: <seqid:int>\t<cluster_id:int>\t<read1:string>\t<read2:string>[\t<pre_cluster_id:int>]
|
-i <string> Input file [stdin] |
||
|
-o <string> Output file [stdout] |
-k <int>
K_allele, min variants to create a new group [2]
-K <int>
K_allele, divide regardless of frequency when num of variants exceed this value [50]
-f <float>
Frequency, min variant frequency to create a new group [0.2]
merge
Input File Format: <seqid:int>\t<cluster_id:int>\t<read1:string>\t<read2:string>[\t<pre_cluster_id:int>]
|
-i <string> Input rbasm output file [stdin] |
||
|
-a |
output assembly
|
-o <string> Output file for merged contigs, one line per cluster [stdout] |
-N <int>
Maximum number of divided clusters to merge [300]
-l <int>
Minimum overlap when assemble two reads (valid only when ’-a’ is opened) [5]
-f <float>
Minimum fraction of similarity when assembly (valid only when ’-a’ is opened) [0.90]
-r <int>
Minimum number of reads to assemble (valid only when ’-a’ is opened) [5]
-R <int>
Maximum number of reads to assemble (valid only when ’-a’ is opened) [300]
AUTHOR
This manpage was written by Andreas Tille for the Debian distribution and can be used for any other usage of the program.