Validate submissions for the All of Us EHR data
- Python >=3.6 (download and install)
- pip (download get-pip.py and run python get-pip.py)
- Install requirements by running:
    pip install -r requirements.txt
- Update _settings.py and rename it to settings.py
- Create a folder and place all the EHR submission files to be validated into it
- Set the "csv_dir" parameter in settings.py to the full path of the folder created above
- Ensure the resources folder is also downloaded and is located in the same directory as omop_file_validator.py
- Execute the following at the command line:
    python omop_file_validator.py [-h] [-r RESTRICT]

    Evaluate OMOP files for formatting issues before AoU submission.

    optional arguments:
      -h, --help            Show this help message and exit
      -r RESTRICT, --restrict RESTRICT
                            Where RESTRICT is replaced with the number of rows
                            to restrict for validation per file. e.g.
                            --restrict 1000 for only validating the first
                            1000 lines

- File names must follow the naming convention {table}.csv, where table is an OMOP CDM table listed in resources/omop
- Files must be in CSV format (comma-delimited) as specified by RFC 4180
- Column names and types must follow the conventions in resources/omop
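
The file requirements above can be spot-checked before running the validator. The following is a minimal sketch, not the logic of omop_file_validator.py: the function name precheck_file, the EXAMPLE_TABLES set, and the max_rows parameter (loosely mirroring --restrict) are all hypothetical, and the table names are an illustrative subset rather than the contents of resources/omop. It only checks the file name and that every row has as many fields as the header; it does not check column names or types.

```python
import csv
from pathlib import Path

# Hypothetical subset of OMOP CDM table names, for illustration only.
# The authoritative list is whatever resources/omop defines.
EXAMPLE_TABLES = {"person", "visit_occurrence", "condition_occurrence"}


def precheck_file(path, tables=EXAMPLE_TABLES, max_rows=None):
    """Return a list of problems found in one submission file (empty if none)."""
    path = Path(path)
    problems = []

    # Naming convention: {table}.csv, with table drawn from the known tables.
    if path.suffix != ".csv" or path.stem not in tables:
        problems.append(f"{path.name}: name does not match {{table}}.csv")

    # newline="" lets the csv module handle line endings inside quoted fields.
    with path.open(newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle, strict=True)
        try:
            header = next(reader, None)
            if header is None:
                problems.append(f"{path.name}: file is empty")
                return problems
            for data_row, row in enumerate(reader, start=1):
                # Stop once max_rows data rows have been checked.
                if max_rows is not None and data_row > max_rows:
                    break
                if len(row) != len(header):
                    problems.append(
                        f"{path.name}: data row {data_row} has {len(row)} "
                        f"fields, header has {len(header)}"
                    )
        except csv.Error as error:
            problems.append(f"{path.name}: malformed CSV ({error})")
    return problems
```

For example, precheck_file("/path/to/folder/person.csv", max_rows=1000) would inspect only the first 1000 data rows of that file and return a list of human-readable problems, which is empty when nothing was flagged.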