Adding data curation tool to Harvard Dataverse #296

Merged
stevenwinship merged 4 commits into master from 32-add-data-curation-to-harvard-dataverse
Jul 21, 2025

Conversation

@stevenwinship
Contributor

Adding Data Curation Tool to https://dataverse.harvard.edu/ and https://demo.dataverse.org/

Closes #32

@stevenwinship stevenwinship self-assigned this Jul 23, 2024
@stevenwinship stevenwinship added the Feature: External Tool, Size: 10, NIH CAFE, and FY25 Sprint 2 labels Jul 23, 2024
@stevenwinship stevenwinship removed their assignment Jul 23, 2024
@pdurbin
Member

pdurbin commented Aug 5, 2024

As I mentioned in Slack, there is Borealis-specific language in the tool (I checked on our demo server):

Screenshot 2024-07-25 at 2 33 49 PM
Screenshot 2024-07-25 at 2 36 53 PM

@stevenwinship
Contributor Author

Removed the Data Curation tool from Demo. Putting this issue on hold until a new release is available.

@stevenwinship stevenwinship added the Status: Needs Input label Aug 5, 2024
@stevenwinship
Contributor Author

Message From Nana Boateng:
We have a beta release planned for Tuesday August 27, 2024. We are setting up a test instance at demo.borealisdata.ca for testers.

@pdurbin
Member

pdurbin commented Sep 25, 2024

@nana-boateng @bikramj any news on the release? I just checked https://github.com/scholarsportal/Dataverse-Data-Explorer/releases . We're just trying to figure out when it makes sense to pull this item into sprint. No rush. Thanks.

@nana-boateng

@stevenwinship sincere apologies again for the late response, I just saw your email. Please reach me at my work email [email protected] ([email protected] is my personal email)

@pdurbin We are planning on releasing this at the end of November (the week of the 25th).

I have also updated the app to replace hardcoded references to Borealis with 'Dataverse'. This will be further customizable with language files on release.

@cmbz
Collaborator

cmbz commented Oct 9, 2024

Placed on hold to check back in about the potential new release in November. Assigning to @stevenwinship and @cmbz for monitoring.

@cmbz
Collaborator

cmbz commented Dec 2, 2024

2024/12/02: No release as of this date. Will check again on 2025/01/15.

@nana-boateng

Hi all,

Sincere apologies for the delay. The new Data Explorer is now in pre-release and will be officially released later today.

@cmbz
Collaborator

cmbz commented Jan 27, 2025

That's great news @nana-boateng, thanks for letting us know! :)

@cmbz
Collaborator

cmbz commented Jan 28, 2025

2025/01/27: Moving back into Sprint Ready now that release 2.0 is available.

@cmbz cmbz removed the Status: Needs Input label Mar 27, 2025
@stevenwinship stevenwinship self-assigned this Jul 17, 2025
@stevenwinship stevenwinship moved this from SPRINT READY to In Progress 💻 in IQSS Dataverse Project Jul 17, 2025
@stevenwinship
Contributor Author

stevenwinship commented Jul 17, 2025

@nana-boateng Could you let me know whether this error needs to be looked at or whether it's OK to ignore? I see the tool making the call to Dataverse, and the data in the response looks good, but I don't see anything other than a stripe of color on the screen. The file named in the error is definitely missing. I'm running in Docker.

2025/07/17 18:10:45 [error] 36#36: *9 open() "/usr/share/nginx/html/matomo.js" failed (2: No such file or directory),

image

EDIT: got it working. Nothing to do with that error.

@cmbz cmbz added the FY26 Sprint 2 (2025-07-16 - 2025-07-30) label Jul 17, 2025
@stevenwinship
Contributor Author

@nana-boateng I have the tool running in demo.dataverse.org. I'm currently getting this error and I would like help fixing this.

Error fetching dataset: Bad signed URL
image

@stevenwinship
Contributor Author

This error is coming from Dataverse. It is unable to get the user from the signed URL. This works locally in Docker. On demo it fails whether the user is logged in with username/password or Shib (Harvard Account).

@nana-boateng

@stevenwinship looks like you figured it out. But let me know if you find anything else I can fix!

@qqmyers
Member

qqmyers commented Jul 18, 2025

FWIW: Doing bin/asadmin set-log-levels edu.harvard.iq.dataverse.util.UrlSignerUtil=FINE (which is still on on demo), I see that Dataverse is trying to match the hash with String to hash: http://demo.dataverse.org/api/v1/files/2664681/metadata/2360434/toolparams/69?until=2025-07-18T16:29:47.724&user=qqmyers&method=GET&token=<key>]] - starting with http. Since demo is set up on https, this fails.

My guess, based on a problem with a TurboCurator demo around DCM2025, is that the Anubis configuration in /etc/httpd/conf.d/ssl.conf is still not sending the correct URL for Dataverse to pick up. I'm not sure yet how to fix that.
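
To illustrate why a scheme mismatch alone is enough to break validation, here is a minimal Python sketch. It assumes the token is a SHA-512 hex digest of the URL string concatenated with a secret key; that hashing scheme, the host demo.example.org, the key, and the function names are all assumptions for illustration and are not taken from UrlSignerUtil.

```python
import hashlib
import hmac
from urllib.parse import urlsplit


def sign_url(base_url: str, key: str) -> str:
    """Append a token computed over the exact URL string plus a secret key.

    Assumption: SHA-512 over (url + key); the real signer may differ.
    """
    token = hashlib.sha512((base_url + key).encode("utf-8")).hexdigest()
    return f"{base_url}&token={token}"


def is_valid(signed_url: str, key: str) -> bool:
    """Recompute the token from the URL as received and compare."""
    url_part, sep, token = signed_url.rpartition("&token=")
    if not sep:
        return False
    expected = hashlib.sha512((url_part + key).encode("utf-8")).hexdigest()
    return hmac.compare_digest(expected, token)


def downgrade_scheme(url: str) -> str:
    """Simulate a proxy that hands the backend an http:// URL."""
    return urlsplit(url)._replace(scheme="http").geturl()


# Hypothetical URL and key, shaped like the one in the log above.
KEY = "not-a-real-key"
BASE = (
    "https://demo.example.org/api/v1/files/1/metadata/2/toolparams/3"
    "?until=2025-07-18T16:29:47.724&user=alice&method=GET"
)

signed = sign_url(BASE, KEY)
as_seen_by_backend = downgrade_scheme(signed)

print(is_valid(signed, KEY))              # URL exactly as it was signed
print(is_valid(as_seen_by_backend, KEY))  # same URL, but starting with http
```

Because the digest covers the whole string, including the scheme, the second check fails even though the path, query parameters, and token are unchanged. Under this assumption, the fix belongs in whatever layer decides which scheme the backend sees, not in the tool.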

@stevenwinship
Contributor Author

@landreev Here is the issue

@stevenwinship
Contributor Author

Everything is working now. The tool is configured on qa.dataverse.org, demo.dataverse.org, and both nodes of https://dataverse.harvard.edu/

@stevenwinship stevenwinship merged commit 1f3f853 into master Jul 21, 2025
@github-project-automation github-project-automation bot moved this from In Progress 💻 to Merged 🚀 in IQSS Dataverse Project Jul 21, 2025
@stevenwinship stevenwinship deleted the 32-add-data-curation-to-harvard-dataverse branch July 21, 2025 20:51
@stevenwinship stevenwinship moved this from Merged 🚀 to Done 🧹 in IQSS Dataverse Project Jul 21, 2025
@stevenwinship stevenwinship removed their assignment Jul 21, 2025
@pdurbin pdurbin moved this from Done 🧹 to QA ✅ in IQSS Dataverse Project Jul 22, 2025
@pdurbin pdurbin moved this from QA ✅ to In Review 🔎 in IQSS Dataverse Project Jul 22, 2025
@pdurbin pdurbin moved this from In Review 🔎 to Done 🧹 in IQSS Dataverse Project Jul 22, 2025
@cmbz cmbz added the FY26 Sprint 4 (2025-08-13 - 2025-08-27) label Aug 16, 2025

Labels

Feature: External Tool
FY25 Sprint 2
FY26 Sprint 2 (2025-07-16 - 2025-07-30)
FY26 Sprint 4 (2025-08-13 - 2025-08-27)
NIH CAFE (Issues associated with the NIH CAFE project)
Size: 10 (A percentage of a sprint.)

Projects

Status: Done 🧹

Development

Successfully merging this pull request may close these issues.

Add Data Curation Tool to Harvard Dataverse Repository

5 participants