Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Source]: BioStudies ArrayExpress #140

Open
5 of 17 tasks
gtsueng opened this issue May 3, 2024 · 1 comment
Open
5 of 17 tasks

[Source]: BioStudies ArrayExpress #140

gtsueng opened this issue May 3, 2024 · 1 comment
Assignees
Labels
api-harvester source New source suggestion

Comments

@gtsueng
Copy link
Contributor

gtsueng commented May 3, 2024

Source Name

BioStudies ArrayExpress

Source URL

https://www.ebi.ac.uk/biostudies/arrayexpress

Source Description

The functional genomics data collection (ArrayExpress), stores data from high-throughput functional genomics experiments, and provides data for reuse to the research community. In line with community guidelines, a study typically contains metadata such as detailed sample annotations, protocols, processed data and raw data. Raw sequence reads from high-throughput sequencing studies are brokered to the European Nucleotide Archive (ENA), and links are provided to download the sequence reads from ENA. Data can be submitted to the ArrayExpress collection through its dedicated submission tool, Annotare.

Short description

ArrayExpress is an NIH supported repository that includes high-throughput genomics data in the biomedical domain.

Source Access

No access issue, account not needed

Source Funding

EMBL

Source Relevance

NIAID medium priority resource

Related WBS task

For internal use only. Assignee, please select the status of this issue

  • Not yet started
  • In process
  • Blocked
  • Will not include

Status Description

No response

Source to-do list

  • License check- Can this source be included?
  • Class check- Does this source have the right class of research output for inclusions?
  • NIAID Review- Does NIAID approve of the inclusion of this source?
  • Data access check- Have data access issues been resolved?
  • Structured data check- Does this source have structured data?
  • Mapping check- Has the properties from this source been mapped to the schema?
  • Parser check- Has the parser been written?
  • Parsed data check- Has a sanity check on the data obtained from this source been performed?
  • Merge data- Has the crawler/plugin been successfully integrated with the system?
  • Issues could not be resolved - Add as a single record via the DDE
@gtsueng
Copy link
Contributor Author

gtsueng commented Jun 12, 2024

This repository has been evaluated as a medium-priority repository for integration. With Dylan's work on MassIVE and MalariaGEN, Jason's work on VEuPathDB collections, and my manual curation to create ResourceCatalogs, we've pretty much covered all of the high-priority resources, and can now start on the medium priority ones.

Note that records from BioStudies-ArrayExpress may potentially be aggregated by OMICs-DI, we should keep an eye out for duplication of records.

@gtsueng gtsueng added the source New source suggestion label Jul 2, 2024
@gtsueng gtsueng assigned jal347 and unassigned DylanWelzel Jul 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
api-harvester source New source suggestion
Projects
None yet
Development

No branches or pull requests

3 participants