[INVESTIGATION]: Generate Example summaries using ChatGPT #2
Comments
Updated doc with expected formatting
@hartwickma, @lisa-mml, @rshabman, @sudvenk We have produced example ChatGPT-generated summaries. A selection from NIAID priority resources can be found here: https://docs.google.com/document/d/1KfJg5R-28mhUmcUwAHybuGv4sha6Pg3-8mqTPnCqFYw/edit The Ask: we seek approval to initiate work on descriptive augmentation, starting with identifying the optimal summary length.
Per discussions on 2024.09.23, we will forgo conducting user studies to determine the optimal length of generated summaries and instead use the ~170-word count used by Scientific Data (as suggested by Lilliana). The summary should include 1-2 sentences detailing the method and experimental conditions. We will proceed with generating mock-ups that visually prioritize the summary information while clearly indicating that it was generated using genAI. This information will be stored in a separate metadata field, and the original description field will be kept unaltered. @ZubairQazi, can you edit the prompt according to these requirements: 170 words max, of which 1-2 sentences should detail the method and experimental conditions (only if available; do not make them up if they are not). Then run the prompt on ClinEpiDB records, as ClinEpiDB is likely to have longer description fields (with method/experimental condition info in the description).
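For reference, the prompt requirements above could be sketched roughly as follows. This is only an illustration, not the prompt actually used: the function names (`build_prompt`, `within_limit`) and the prompt wording are hypothetical, and the word count is a simple whitespace split.

```python
MAX_WORDS = 170  # limit taken from the 2024.09.23 discussion


def build_prompt(name: str, description: str, max_words: int = MAX_WORDS) -> str:
    """Assemble a summarization prompt (hypothetical wording)."""
    return (
        f"Summarize the dataset record below in at most {max_words} words. "
        "Include 1-2 sentences on the method and experimental conditions "
        "only if the description states them; do not invent details "
        "that are not present.\n\n"
        f"Name: {name}\n"
        f"Description: {description}"
    )


def word_count(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def within_limit(summary: str, max_words: int = MAX_WORDS) -> bool:
    """Return True if the generated summary respects the word limit."""
    return word_count(summary) <= max_words
```

A check like `within_limit` could be run over each generated summary before it is written to the separate metadata field, so that over-length outputs are flagged for regeneration.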
Generated summaries for ClinEpiDB and posted here: https://docs.google.com/spreadsheets/d/1vtxtJrG4qSbrSlaqp_4RhtQw2jRUQXc38G3z7ZZFS3A/edit?usp=sharing
The requested email draft for ClinEpiDB can be found in its own issue here: https://github.com/NIAID-Data-Ecosystem/niaid-feedback/issues/161
Issue Name
Generate Example summaries using ChatGPT
Issue Description
This is a preliminary/quick test for NIAID-Data-Ecosystem/nde-crawlers#159
To demonstrate the value of description length normalization, perform the following:
Please save the results/examples to the following Google documents:
Examples from each repository: https://docs.google.com/document/d/1pX0CTaDyQmH-XqvHX-l13ZKT0zB0MRKeHl-B-RTndSg/edit
Key examples: https://docs.google.com/document/d/1KfJg5R-28mhUmcUwAHybuGv4sha6Pg3-8mqTPnCqFYw/edit
Use the format:
Name: record name
Description: record description
ID: record ID in the NDE
SEO abstract: 140-160 length generated result
Tweet abstract: 240-280 length generated result
3 sentence abstract: 3 sentence result
5 sentence abstract: 5 sentence result
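A quick way to check that a saved example follows the format above is sketched here. It assumes that "length" for the SEO and Tweet abstracts means characters (the issue does not state the unit) and uses a naive sentence splitter that will miscount abbreviations such as "e.g."; the function and constant names are hypothetical.

```python
import re

# Assumed character ranges for the length-limited fields.
CHAR_RANGES = {"SEO abstract": (140, 160), "Tweet abstract": (240, 280)}
# Expected sentence counts for the sentence-limited fields.
SENTENCE_COUNTS = {"3 sentence abstract": 3, "5 sentence abstract": 5}


def count_sentences(text: str) -> int:
    """Naively count sentences by splitting after ., ! or ?"""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return len([p for p in parts if p])


def check_example(example: dict) -> list:
    """Return a list of format problems; an empty list means none were found."""
    problems = []
    for field, (low, high) in CHAR_RANGES.items():
        n = len(example.get(field, ""))
        if not low <= n <= high:
            problems.append(f"{field}: {n} characters, expected {low}-{high}")
    for field, expected in SENTENCE_COUNTS.items():
        n = count_sentences(example.get(field, ""))
        if n != expected:
            problems.append(f"{field}: {n} sentences, expected {expected}")
    return problems
```

Each example would be passed as a dict keyed by the field labels in the format above, e.g. `check_example({"SEO abstract": "...", "Tweet abstract": "...", ...})`.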
Issue Discussion
This issue was discussed at the bi-weekly meeting dated 2024.09.04
Request Type
Examples (generate examples for evaluation, decision-making, etc.)
Material URL
https://docs.google.com/document/d/1pX0CTaDyQmH-XqvHX-l13ZKT0zB0MRKeHl-B-RTndSg/edit
Related WBS task
https://github.com/NIAID-Data-Ecosystem/nde-roadmap/issues/13
For internal use only. Assignee, please select the status of this issue
Status Description
No response
Request status check list