Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Report for contributor without name but having role in descriptive metadata.adminMetadata #4541

Open
ndushay opened this issue Aug 4, 2023 · 4 comments
Assignees
Labels
cocina reports request for report on metadata content

Comments

@ndushay
Copy link
Contributor

ndushay commented Aug 4, 2023

These cause indexing errors akin to https://app.honeybadger.io/projects/49898/faults/98834813

We will avoid importing such data in Argo via csv in future: sul-dlss/argo#4075

@ndushay ndushay self-assigned this Aug 4, 2023
@ndushay ndushay added the cocina reports request for report on metadata content label Aug 4, 2023
@ndushay
Copy link
Contributor Author

ndushay commented Aug 4, 2023

want for both collections and dros.

@ndushay
Copy link
Contributor Author

ndushay commented Aug 7, 2023

I ended up just searching for contributors without a name.

NOTE: I think for the adminMetadata.contributor errors found, we got the "name" value in "role" and vice version.

Collection objects in prod without contributor name value

  • 1344 contributors without a name value; ALL of these were found under adminMetadata. (None were top level contributors or under event or under relatedResource)

contrib_no_names_in_adminMetadata_prod_collections.csv

Collection objects in stage without contributor name value

  • 22 contributors without a name value; ALL of these were found under adminMetadata. (None were top level contributors or under event or under relatedResource)

contrib_no_names_in_adminMetadata_stage_collections.csv

Collection objects in qa without contributor name value

  • 2 contributors without a name value; ALL of these were found under adminMetadata. (None were top level contributors or under event or under relatedResource)
collection_druid,catalog_record_id,collection_name,name_value,role_code,role_value,role_uri
druid:qn165qb7188,222,"Aria",,,,
druid:wr455zs4178,73456,"Research projects and education and training facilities in the field of human relations",,,,

contrib_no_names_in_adminMetadata_qa_collections.csv

@ndushay
Copy link
Contributor Author

ndushay commented Aug 7, 2023

NOTE: I think for the adminMetadata.contributor errors found, we got the "name" value in "role" and vice version.

DRO objects in prod without contributor name value

  • 118 contributor at top level without a name value
    contrib_no_names_top_level_prod_dros.csv

  • 4,159,045 contributors without a name value found under adminMetadata. (278M file)

  • None were found under event or under relatedResource

DRO objects in stage without contributor name value

  • 26061 contributors without a name value; ALL of these were found under adminMetadata. (None were top level contributors or under event or under relatedResource)

contrib_no_names_in_adminMetadata_stage_dros.csv

DRO objects in qa without contributor name value

  • 139 contributors without a name value; ALL of these were found under adminMetadata. (None were top level contributors or under event or under relatedResource)

contrib_no_names_in_adminMetadata_qa_dros.csv

@ndushay
Copy link
Contributor Author

ndushay commented Aug 30, 2023

@arcadiafalcone I think I might have mentioned this ticket to you 3 weeks ago???

@ndushay ndushay changed the title Report for contributor without name but having role in descriptive metadata Report for contributor without name but having role in descriptive metadata.adminMetadata Aug 30, 2023
@mjgiarlo mjgiarlo moved this to In Review (by Reporter, PO, SDR Manager, ...) in Infrastructure Portfolio Production Priorities Oct 12, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cocina reports request for report on metadata content
Projects
Status: Under Review (Unordered)
1 participant