Process a single dataset with `dataset_id` using the associated `data.csv` and `metadata.yml` files. Adds a unique observation id for each row of observation, trait names are formatted using AusTraits accepted names and trait substitutions are added. `parse data` is used in the core workflow pipeline (i.e. in `load study`).

process_parse_data(data, dataset_id, metadata, contexts, schema)

Arguments

data

Tibble or dataframe containing the study data

dataset_id

Identifier for a particular study in the AusTraits database

metadata

Yaml file with metadata

contexts

Dataframe of contexts for this study

schema

Schema for traits.build

Value

Tibble in long format with AusTraits formatted trait names, trait substitutions and unique observation id added