Process a single dataset — process_parse

Process a single dataset with dataset_id using the associated data.csv and metadata.yml files. Adds a unique observation id for each row of observation, trait names are formatted using AusTraits accepted names and trait substitutions are added. parse data is used in the core workflow pipeline (i.e. in load study).

process_parse_data(data, dataset_id, metadata, contexts, schema, identifiers)

Arguments

data: Tibble or dataframe containing the study data
dataset_id: Identifier for a particular study in the AusTraits database
metadata: Yaml file with metadata
contexts: Dataframe of contexts for this study
schema: Schema for traits.build

Value

Tibble in long format with AusTraits formatted trait names, trait substitutions and unique observation id added