Used internally in radiator and might be of interest for users. The function generate a tidy dataset of DArT markers and associated metadata. Usefull to filter before importing the actual dataset.
tidy_dart_metadata(
data,
filename = NULL,
verbose = FALSE,
parallel.core = parallel::detectCores() - 1
)DArT output file. Note that most popular formats used by DArT are
recognised (1- and 2- row format, also called binary, and count data.).
If you encounter a problem, sent me your data so that I can update
the function. The function can import .csv or .tsv files.
(optional) The function uses write.fst,
to write the tidy data frame in
the working directory. The file extension appended to
the filename provided is .rad.
With default: filename = NULL, the tidy data frame is
in the global environment only (i.e. not written in the working directory...).
(optional, logical) When verbose = TRUE
the function is a little more chatty during execution.
Default: verbose = TRUE.
(optional) The number of core used for parallel
execution during import.
Default: parallel.core = parallel::detectCores() - 1.
A tidy dataframe with these columns:
MARKERS: generated by radiator and correspond to CHROM + LOCUS + POS separated by 2 underscores.
CHROM: the chromosome, for de novo: CHROM_1.
LOCUS: the locus.
POS: the SNP id on the LOCUS.
REF: the reference allele.
ALT: the alternate allele.
CALL_RATE: call rate output specific of DArT.
AVG_COUNT_REF: the coverage for the reference allele, output specific of DArT.
AVG_COUNT_SNP: the coverage for the alternate allele, output specific of DArT.
REP_AVG: the reproducibility average, output specific of DArT.
if (FALSE) { # \dontrun{
clownfish.dart.tidy <- radiator::tidy_dart_metadata(
data = "clownfish.dart.tsv",
verbose = TRUE)
} # }