-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Research and design pipeline integration methods #27
Comments
Might need to update this to fit with the updated roadmap. |
@adamkells I updated the description for this issue, are you up for taking this |
Yeah I can take this. I think the updated version of the issue is much cleaner than previously. |
Summary of FindingsI looked at the four packages suggested (sklearn, spaCy, langchain, hugging-face). These package all introduce a concept of pipelines. However each is slightly different.
If I had to rank these in terms of how neatly they fit into our framework:
ProposalTo add methods to instantiate pipelines from spaCy, langchain and hugging-face and to omit sklearn for the moment. The api can be as simple as |
This is great 🌟 Let's focus on text pipelines and park the others for future. Are you happy to also create an issue and work on the implementation @adamkells ? Also what did you mean by:
|
Yeah happy to work on it. With sklearn, I just meant that the components of an sklearn pipeline which a user may want to use make more sense to be just added manually as a part of the sandbox than to be an initialiser for the healthchain pipeline. |
Add pipeline integrations. Examples usage:
Integrations to start:
spacy
huggingface
langchain
sklearn
These libraries all have some form of pipelines. Some time may be needed to think about how to import / export whole pipelines from these libraries.
[Future]
haystack
llama-index
The text was updated successfully, but these errors were encountered: