Data Portal - backend support GO facet search via mappings - Pfam #1388

Open
Tracked by #1462
aclum opened this issue Sep 12, 2024 · 8 comments
@aclum
Contributor

aclum commented Sep 12, 2024

Searching for a GO term in the data portal should look the term up through a mapping file, since GO terms aren't stored directly in the data files.

The Pfam -> GO mapping file is https://current.geneontology.org/ontology/external2go/pfam2go

The code should be able to handle multiple mapping files; there will be a related ticket for the KEGG to GO mapping file.
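
A minimal sketch of what handling multiple mapping files could look like, assuming each file has already been parsed into (source ID, GO ID) pairs. The class name `GoMappingIndex` and its methods are hypothetical and are not taken from the portal code base; the idea is only that a GO search term gets expanded into the Pfam, KEGG, or other IDs that the data files actually contain.

```python
from collections import defaultdict


class GoMappingIndex:
    """Hypothetical reverse index: GO ID -> source name -> set of source IDs."""

    def __init__(self):
        self._by_go = defaultdict(lambda: defaultdict(set))

    def add_source(self, source, pairs):
        """Register one mapping file, given as (source_id, go_id) pairs."""
        for source_id, go_id in pairs:
            self._by_go[go_id][source].add(source_id)

    def expand(self, go_id):
        """Return the IDs to search for, grouped by mapping source."""
        # .get() avoids creating empty entries for unknown GO IDs.
        sources = self._by_go.get(go_id, {})
        return {source: sorted(ids) for source, ids in sources.items()}


index = GoMappingIndex()
index.add_source("pfam", [("PF00001", "GO:0004930")])
index.add_source("kegg", [("K00001", "GO:0004022")])
print(index.expand("GO:0004930"))
```

Adding a further mapping file then only requires another `add_source` call, which keeps the search path independent of how many sources exist.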

@ssarrafan

@naglepuff This doesn't appear to be active. Can I move this to the next sprint or backlog @aclum ?

@naglepuff
Collaborator

@aclum

To clarify, here's a row from the GO/Pfam mapping file:

Pfam:PF00001 7tm_1 > GO:G protein-coupled receptor activity ; GO:0004930

and here is a row from the GO/KEGG mapping file:

K00001	[GO:0004022 0004023 0004024 0004025]

It looks to me like there's less information in the GO/KEGG mapping file, specifically the descriptions are missing.
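
For illustration, the two row formats quoted above could be parsed roughly as follows. This is a sketch based only on those two example rows: the regular expressions, the function names, and the assumption that lines starting with `!` are comments are not verified against the full files.

```python
import re

# Matches rows shaped like the Pfam example above:
#   Pfam:PF00001 7tm_1 > GO:<label> ; GO:0004930
PFAM_ROW = re.compile(r"^Pfam:(PF\d+)\s+(\S+)\s+>\s+GO:(.+?)\s+;\s+(GO:\d{7})$")

# Matches rows shaped like the KEGG example above:
#   K00001<TAB>[GO:0004022 0004023 0004024 0004025]
KEGG_ROW = re.compile(r"^(K\d{5})\s+\[GO:([\d\s]+)\]$")


def parse_pfam_line(line):
    """Return (pfam_id, go_id, go_label), or None if the line is not a mapping row."""
    line = line.strip()
    if not line or line.startswith("!"):  # assumed comment marker
        return None
    match = PFAM_ROW.match(line)
    if match is None:
        return None
    pfam_id, _short_name, label, go_id = match.groups()
    return pfam_id, go_id, label


def parse_kegg_line(line):
    """Return (ko_id, [go_ids]), or None if the line is not a mapping row."""
    match = KEGG_ROW.match(line.strip())
    if match is None:
        return None
    ko_id, numbers = match.groups()
    # Only the first number carries the "GO:" prefix in the file, so re-add it.
    return ko_id, [f"GO:{number}" for number in numbers.split()]
```

Note that only the Pfam parser can return a label; the KEGG rows yield bare GO IDs, which is the gap described above.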

Is there a file that maps GO terms (by that I mean IDs of the form GO:9999999) to their descriptions? I guess I can get many of them from the Pfam/GO mapping file, but my concern is that I might miss some if there are GO terms that map to KEGG terms but not to Pfam terms.

@aclum
Contributor Author

aclum commented Oct 8, 2024

looping in @sierra-moxon

@sierra-moxon
Member

https://purl.obolibrary.org/obo/go/go-basic.json <-- this should have the terms, definitions, etc needed for our use cases.
https://geneontology.org/docs/download-ontology/ defines the 'go-basic' file contents. The PURL above is for the JSON representation of the go-basic content.
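
A rough sketch of pulling GO IDs and labels out of that JSON file, assuming it follows the OBO Graphs layout (a top-level `graphs` list whose `nodes` carry an `id` IRI and an `lbl` label). The function names are hypothetical, and the layout should be checked against the downloaded file before relying on it.

```python
import json

# Assumed IRI prefix for GO term nodes in the OBO Graphs JSON.
GO_IRI_PREFIX = "http://purl.obolibrary.org/obo/GO_"


def go_labels(obograph):
    """Map 'GO:0004930'-style IDs to labels from a parsed OBO Graphs document."""
    labels = {}
    for graph in obograph.get("graphs", []):
        for node in graph.get("nodes", []):
            node_id = node.get("id", "")
            # Skip non-GO nodes (relations, other ontologies) and unlabeled nodes.
            if node_id.startswith(GO_IRI_PREFIX) and "lbl" in node:
                labels["GO:" + node_id[len(GO_IRI_PREFIX):]] = node["lbl"]
    return labels


def load_go_labels(path):
    """Read a local copy of go-basic.json and return the ID -> label mapping."""
    with open(path, encoding="utf-8") as handle:
        return go_labels(json.load(handle))
```

With a mapping like this, GO IDs that appear only in the KEGG mapping file can still be given a description.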

(As an aside, I'm working on a generic ontology loader that will take ontology files and move them into MongoDB so that term names, descriptions, cross references, and synonym metadata will be available directly for ontologies like GO, ENVO, and ChEBI. Here it is: https://github.com/sierra-moxon/ontology_loader. It's just in my own org for ease of dev, but I'm happy to move/demo/etc. when it's far enough along.)

@ssarrafan

@aclum @naglepuff moving to next sprint but please let me know if you can't work on it in the next 2 weeks.

@ssarrafan

Mike mentioned this would be next on his list after COG/PFAM so I'll move to the next sprint. @naglepuff @aclum

@ssarrafan

Assuming this is still Mike's next priority? @naglepuff @aclum let me know if this should be in the backlog instead

@aclum
Contributor Author

aclum commented Nov 15, 2024

Yes, this is the next priority. It is tied to milestone 2.12.2, which is due this quarter (end of Dec), so we need to work on it for it to be part of the December release.
