Welcome to the OSAI Ecosystem Components List! This repository hosts a community-curated list of AI-relevant components useful for AI research. Our aim is to connect researchers developing AI models and datasets with resources that are FAIR, Open, and environmentally sustainable.
Our goal is to provide a centrally curated list for discovering relevant AI resources such as tools, registries, metadata standards, AI best practice frameworks, and more. We have used this collection to map components to our OSAI Recommendations, thereby helping to connect researchers with practical resources for AI model and dataset sharing. We understand that the OSAI Recommendations designed for life sciences may not be adopted by all organizations, but we believe this list of ecosystem components itself will be highly valuable. Therefore, this list is intended for researchers, developers, policymakers, educators, students, and anyone interested in the field of Open and Sustainable AI to reuse and adapt for their own needs, such as by remapping the components to different frameworks.
This list originates from an upcoming publication by members of the ELIXIR Machine Learning Focus Group. It represents a collective effort to identify and map key components supporting the OSAI Recommendations within the life sciences.
Digital Object Identifier (DOI): Preprint available on aRxiv
The primary goal of this list is to serve as a dynamic, community-maintained inventory. It is explicitly designed for reuse and adaptation. We envision this list feeding into various resources, frameworks, and platforms, providing a long-term, accessible collection of OSAI ecosystem components.
While the initial curation may align with specific OSAI principles or OSAI Recommendations, the structure allows the components listed here to be mapped or repurposed according to different sets of guidelines or perspectives. We acknowledge that there is no single, perfect set of AI best practice recommendations, and diverse global viewpoints exist. This list aims to support these varied efforts by providing a central, collaboratively curated foundation of AI sharing relevant tools, resgistries, metadata, and practices.
The list data is maintained in human-readable Tab-Separated Values (.tsv
) files within the /data
directory. These files are automatically converted to JSON format for easy consumption by websites and applications.
Note
This list is under active development and curation. Contributions and suggestions are highly encouraged!
We welcome contributions to help grow and maintain this resource. The OSAI Ecosystem Components List relies on community input and updates to remain valuable.
The contribution process is designed to be straightforward:
- Edit the .yml file: Locate the
ecosystem_components.yml
file in the/data
directory. Add a new component using the template and correct fields or modify existing lines. - Submit a Pull Request (PR): Propose your changes via a GitHub Pull Request. Please use the provided PR template to describe your contribution.
- Review: The project maintainers will review your submission for relevance, accuracy, and formatting.
- Merge & Convert: Once approved, your changes will be merged into the main
ecosystem_components.tsv
file. The updated JSON file will be automatically generated shortly thereafter.
After contributing:
- Hosted Online: The JSON will be used to populate the OSAI Ecosystem Components List on https://dome-ml.org/ai-ecosystem.
Please see our Contribution Guidelines for detailed instructions on formatting entries and the PR process.
The OSAI project is maintained by the project maintainers, who review contributions and manage the repository.
The content of this list (the data in TSV and generated JSON files) is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
Information about the license can be found in LICENSE.md. By contributing, you agree that your contributions will be licensed under CC BY 4.0.
If you use this list in your work, please cite it. We recommend using the CITATION.cff
file in the repository root for standardized citation information.
Recommended Citation:
See CITATION.cff
All participants are expected to adhere to the project's Code of Conduct. Please ensure you are familiar with its contents.
For questions, suggestions, or to report issues, please open an Issue in this repository, or contact the primary maintainer at [email protected].
This repository's structure and documentation approach draw inspiration from, and adapt elements of, the excellent work done by the Research Software Quality Kit (RSQKit) and ELIXIR's Research Data Management Kit (RDMKit).