Processes a single dataset, identified by dataset_id, using its associated data.csv and metadata.yml files. The function adds a unique observation id to each row of observations, formats trait names using AusTraits accepted names, and adds trait substitutions. process_parse_data is used in the core workflow pipeline (i.e. in load study).

process_parse_data(data, dataset_id, metadata, contexts, schema)

Arguments

data

Tibble or dataframe containing the study data

dataset_id

Identifier for a particular study in the AusTraits database

metadata

YAML file with metadata

contexts

Dataframe of contexts for this study

schema

Schema for traits.build

Value

Tibble in long format, with AusTraits-formatted trait names, trait substitutions, and a unique observation id added
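
Example (illustrative sketch)

The following base R sketch illustrates the kind of transformation described above: assigning an observation id to each input row, mapping study column names to accepted trait names, and reshaping to long format. It is not the package's implementation and does not call process_parse_data. The input columns, the trait_map lookup, the dataset id, the observation id format, and the output column names are all assumptions made for illustration.

```r
# Hypothetical wide-format study data; column names are illustrative only.
data <- data.frame(
  species  = c("Acacia aneura", "Banksia serrata"),
  LMA      = c(120.5, 210.3),
  height_m = c(4.2, 8.0),
  stringsAsFactors = FALSE
)

# Hypothetical lookup from the study's column names to accepted trait names.
# In the real workflow this information would come from the metadata.
trait_map <- c(LMA = "leaf_mass_per_area", height_m = "plant_height")

# Hypothetical dataset identifier.
dataset_id <- "Example_2020"

# Assign an observation id to each input row before reshaping, so that all
# trait values measured on the same row share one id in the long table.
# The id format used here is an assumption, not the package's format.
data$observation_id <- sprintf("%s_%02d", dataset_id, seq_len(nrow(data)))

# Reshape from wide to long: one output row per (observation, trait) pair,
# with the study's column name replaced by the accepted trait name.
long <- do.call(rbind, lapply(names(trait_map), function(col) {
  data.frame(
    dataset_id     = dataset_id,
    observation_id = data$observation_id,
    taxon_name     = data$species,
    trait_name     = unname(trait_map[col]),
    value          = as.character(data[[col]]),
    stringsAsFactors = FALSE
  )
}))

print(long)
```

Assigning the id before reshaping is the design point worth noting: it lets every trait value in the long table be traced back to the row of data.csv it came from.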