Used internally in radiator and might be of interest for users. The function generate a tidy dataset of DArT markers and associated metadata. Usefull to filter before importing the actual dataset.
tidy_dart_metadata(
data,
filename = NULL,
verbose = FALSE,
parallel.core = parallel::detectCores() - 1
)
DArT output file. Note that most popular formats used by DArT are
recognised (1- and 2- row format, also called binary, and count data.).
If you encounter a problem, sent me your data so that I can update
the function. The function can import .csv
or .tsv
files.
(optional) The function uses write.fst
,
to write the tidy data frame in
the working directory. The file extension appended to
the filename
provided is .rad
.
With default: filename = NULL
, the tidy data frame is
in the global environment only (i.e. not written in the working directory...).
(optional, logical) When verbose = TRUE
the function is a little more chatty during execution.
Default: verbose = TRUE
.
(optional) The number of core used for parallel
execution during import.
Default: parallel.core = parallel::detectCores() - 1
.
A tidy dataframe with these columns:
MARKERS: generated by radiator and correspond to CHROM + LOCUS + POS separated by 2 underscores.
CHROM: the chromosome, for de novo: CHROM_1.
LOCUS: the locus.
POS: the SNP id on the LOCUS.
REF: the reference allele.
ALT: the alternate allele.
CALL_RATE: call rate output specific of DArT.
AVG_COUNT_REF: the coverage for the reference allele, output specific of DArT.
AVG_COUNT_SNP: the coverage for the alternate allele, output specific of DArT.
REP_AVG: the reproducibility average, output specific of DArT.
if (FALSE) { # \dontrun{
clownfish.dart.tidy <- radiator::tidy_dart_metadata(
data = "clownfish.dart.tsv",
verbose = TRUE)
} # }