Fast read of .gds, .rad, .gds.rad, .arrow.parquet
files.
The function uses read_parquet
or
CoreArray Genomic Data Structure (GDS)
file system.
Used internally in radiator and assigner and might be of interest for users.
read_rad(
data,
columns = NULL,
allow.dup = FALSE,
check = TRUE,
verbose = FALSE
)
A file in the working directory ending with .arrow.parquet or .gds, and produced by radiator, assigner or grur.
(optional) For arrow.parquet file. Column names to read. The default is to read all all columns.
Default: columns = NULL
.
(optional, logical) To allow the opening of a GDS file with
read-only mode when it has been opened in the same R session.
Default: allow.dup = FALSE
.
(optional, logical) Verify that GDS number of samples and markers
match.
Default: check = TRUE
.
(optional, logical) verbose = TRUE
to be chatty
during execution.
Default: verbose = FALSE
.
A radiator tidy data frame or GDS object (with read/write permissions) in the global environment.
For GDS file system, read_rad will open the GDS connection file set the filters (variants and samples) based on the info found in the file.