Consistently found across datasets:
[WARNING] TOO_FEW_AUTHORS The 'Authors' field of 'dataset_description.json' should contain an array of values -
with one author per value.
This was triggered based on the presence of only one author field.
Please ignore if all contributors are already properly listed.
/dataset_description.json
Please visit https://neurostars.org/search?q=TOO_FEW_AUTHORS for existing conversations about this issue.
[WARNING] JSON_KEY_RECOMMENDED A JSON file is missing a key listed as recommended.
GeneratedBy
/dataset_description.json - Field description: Used to specify provenance of the dataset.
SourceDatasets
/dataset_description.json - Field description: Used to specify the locations and relevant attributes of all source datasets.
Valid keys in each object include `"URL"`, `"DOI"` (see
[URI](SPEC_ROOT/common-principles.md#uniform-resource-indicator)), and
`"Version"` with
[string](https://www.w3schools.com/js/js_json_datatypes.asp)
values.
Please visit https://neurostars.org/search?q=JSON_KEY_RECOMMENDED for existing conversations about this issue.
[WARNING] NO_AUTHORS The Authors field of dataset_description.json should contain an array of fields -
with one author per field. This was triggered because there are no authors, which
will make DOI registration from dataset metadata impossible.
Authors
/dataset_description.json - Field description: List of individuals who contributed to the creation/curation of the dataset.
Suggested fixes:
- Authors: copy the author information over from Physionet, or keep it in a single place that both can read from quickly
- Source datasets: the raw dataset has no source datasets but the derivatives do, so the data should be formatted as a study dataset with the appropriate files and structure
- Generated by: unsure how to fix this for the raw dataset
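For the authors fix, a minimal sketch of writing the author list into dataset_description.json could look like the following. The function name `add_authors` and the author names in the usage note are hypothetical; where the real list comes from (for example, a Physionet export) is not covered here.

```python
import json
from pathlib import Path


def add_authors(description_path, authors):
    """Set the 'Authors' field of a dataset_description.json file.

    `authors` is any iterable of name strings; it is stored as a JSON
    array with one author per element, which is what the validator asks for.
    """
    path = Path(description_path)
    description = json.loads(path.read_text(encoding="utf-8"))
    description["Authors"] = list(authors)
    # Rewrite the file in place, keeping all other keys untouched.
    path.write_text(json.dumps(description, indent=2) + "\n", encoding="utf-8")
    return description
```

Calling `add_authors("dataset_description.json", ["Author One", "Author Two"])` (placeholder names) would replace any existing Authors value with a two-element array.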
Participants Issues
participant.tsv (which should be named participants.tsv) is not at the top level; it is in phenotype/enrollment
[ERROR] PARTICIPANT_ID_MISMATCH Subject directories found in this dataset did not match the values in
the participant_id column found in the participants.tsv file.
/participants.tsv
Please visit https://neurostars.org/search?q=PARTICIPANT_ID_MISMATCH for existing conversations about this issue.
[ERROR] TSV_VALUE_INCORRECT_TYPE A value in a column did not match the acceptable type for that column headers specified format.
participant_id
/participants.tsv - '005009'
Please visit https://neurostars.org/search?q=TSV_VALUE_INCORRECT_TYPE for existing conversations about this issue.
Suggested fixes:
- Rename the file to participants.tsv and move it to the top level of the dataset
- Add the sub- prefix to every ID
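Both fixes can be applied in one pass. The sketch below is a rough illustration, assuming the source file is tab-separated and has a column named participant_id; the function name `fix_participants` and the paths passed to it are hypothetical.

```python
import csv
from pathlib import Path


def fix_participants(src, dst):
    """Copy the TSV at `src` to `dst`, prefixing every participant_id with 'sub-'.

    Values are handled as strings, so leading zeros (e.g. '005009') survive.
    Returns the list of rewritten IDs.
    """
    src, dst = Path(src), Path(dst)
    with src.open(newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f, delimiter="\t")
        fieldnames = reader.fieldnames
        rows = list(reader)

    for row in rows:
        pid = row["participant_id"]
        # Leave IDs that already carry the prefix unchanged.
        if not pid.startswith("sub-"):
            row["participant_id"] = f"sub-{pid}"

    with dst.open("w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(
            f, fieldnames=fieldnames, delimiter="\t", lineterminator="\n"
        )
        writer.writeheader()
        writer.writerows(rows)
    return [row["participant_id"] for row in rows]
```

The destination would be participants.tsv at the dataset root; the original file is left in place so it can be removed once the output has been checked.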
Non-compliant format
Files in a format not yet covered by the specification are being flagged. The schema should be updated to include the suggested audio changes.
[ERROR] NOT_INCLUDED Files with such naming scheme are not part of BIDS specification. This error is most commonly caused by typos in file names that make them not BIDS compatible. Please consult the specification and make sure your files are named correctly. If this is not a file naming issue (for example when including files not yet covered by the BIDS specification) you should include a ".bidsignore" file in your dataset (see https://github.com/bids-standard/bids-validator#bidsignore for details). Please note that derived (processed) data should be placed in /derivatives folder and source data (such as DICOMS or behavioural logs in proprietary formats) should be placed in the /sourcedata folder.
/sub-628119/ses-2912dba1/audio/
/sub-628119/ses-2912dba1/audio/sub-628119_ses-2912dba1_task-breath-sounds.wav
91364 more files with the same issue
Suggested fixes:
- Update the schema that the code is run with to include the changes that have been proposed for audio data