Fast read of .gds, .rad, .gds.rad, .arrow.parquet files. The function uses read_parquet or CoreArray Genomic Data Structure (GDS) file system.

Used internally in radiator and assigner and might be of interest for users.

read_rad(
  data,
  columns = NULL,
  allow.dup = FALSE,
  check = TRUE,
  verbose = FALSE
)

Arguments

data

A file in the working directory ending with .arrow.parquet or .gds, and produced by radiator, assigner or grur.

columns

(optional) For arrow.parquet file. Column names to read. The default is to read all all columns. Default: columns = NULL.

allow.dup

(optional, logical) To allow the opening of a GDS file with read-only mode when it has been opened in the same R session. Default: allow.dup = FALSE.

check

(optional, logical) Verify that GDS number of samples and markers match. Default: check = TRUE.

verbose

(optional, logical) verbose = TRUE to be chatty during execution. Default: verbose = FALSE.

Value

A radiator tidy data frame or GDS object (with read/write permissions) in the global environment.

Details

For GDS file system, read_rad will open the GDS connection file set the filters (variants and samples) based on the info found in the file.

See also

arrow GDS read_rad

Author

Thierry Gosselin thierrygosselin@icloud.com

Examples

if (FALSE) { # \dontrun{
require(SeqArray)
shark <- radiator::read_rad(data = "data.shark.gds")
turtle <- radiator::read_rad(data = "data.turtle.arrow.parquet")
} # }