Skip to content

Add raw data fields. #2048

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 13 commits into
base: main
Choose a base branch
from

Conversation

aaronweeden
Copy link
Contributor

@aaronweeden aaronweeden commented May 30, 2025

WORK IN PROGRESS

Description

This PR adds the following fields to the raw data export:

Field Jobs realm Cloud realm Gateways realm
Principal Investigator ID Added Added Added
Principal Investigator Already exists Already exists Added
PI Institution ID Added Added Added
PI Institution Already exists Already exists Added
User ID Added Added Added as "Gateway ID"
User Institution ID Added Added Added as "Organization ID"

This PR also refactors the raw statistics configuration files to move common objects into a reference file.

Motivation and Context

This PR enhances the Data Analytics Framework by helping with deduplicating people and institutions that have the same name.

Tests performed

Checklist:

  • The pull request description is suitable for a Changelog entry
  • The milestone is set correctly on the pull request
  • The appropriate labels have been added to the pull request

@aaronweeden aaronweeden changed the title Refactor raw data configuration. Add raw data fields. May 30, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant