Run STACKS
cstacks
module inside R! The function runs a summary of the log file automatically
at the end (summary_cstacks
).
run_cstacks( P = "06_ustacks_2_gstacks", o = "06_ustacks_2_gstacks", M = "02_project_info/population.map.catalog.tsv", catalog.path = NULL, n = 1, parallel.core = parallel::detectCores() - 1, max.gaps = 2, min.aln.len = 0.8, disable.gapped = FALSE, k.len = NULL, report.mmatches = FALSE, split.catalog = 20 )
P | path to the directory containing STACKS files.
Default:
|
---|---|
o | Output path to write catalog.
Default: |
M | path to a population map file (Required when P is used).
Default: |
catalog.path | This is for the "Catalog editing" part in cstacks where
you can provide the path to an existing catalog.
cstacks will add data to this existing catalog.
With default: |
n | number of mismatches allowed between sample loci when build the catalog.
Default: |
parallel.core | Enable parallel execution with num_threads threads.
Default: |
max.gaps | The number of gaps allowed between stacks before merging.
Default: |
min.aln.len | The minimum length of aligned sequence in a gapped
alignment.
Default: |
disable.gapped | Disable gapped alignments between stacks.
Default: |
k.len | Specify k-mer size for matching between between catalog loci
(automatically calculated by default).
Advice: don't modify.
Default: |
report.mmatches | Report query loci that match more than one catalog locus.
Advice: don't modify.
Default: |
split.catalog | (integer) In how many samples you want to split the
catalog population map. This allows to have backup catalog every
|
sstacks
returns a .matches.tsv.gz file for each sample
Computer or server problem during the cstacks ? Look in the log file to see which individuals remains to be included. Create a new list of individuals to include and use the catalog.path argument to point to the catalog created before the problem.
Catchen JM, Amores A, Hohenlohe PA et al. (2011) Stacks: Building and Genotyping Loci De Novo From Short-Read Sequences. G3, 1, 171-182.
Catchen JM, Hohenlohe PA, Bassham S, Amores A, Cresko WA (2013) Stacks: an analysis tool set for population genomics. Molecular Ecology, 22, 3124-3140.