universe-pipeline-solipCysme-french (#13627) [ci skip] #13627

Merged: 1 commit, Oct 11, 2024

Conversation

thjbdvlt
Contributor

Add a pipeline (solipcysme) for the French language, designed for the analysis of fiction and first-person texts, and primarily focused on personal pronouns. It aims to fill a gap in French models: the currently available pipelines for French are trained on news data, a kind of data in which some personal pronouns are missing (for example, 'tu', which means 'you', is not in the data), as are some verb moods (e.g. the imperative) that are very common in fiction.

PS: The pipeline integrates two small projects that are already in the spaCy universe (presque, a normalizer for French, and quelquhui, a tokenizer for French). Maybe I should remove these projects, so that I only have one entry there?

@ines added the docs (Documentation and website) and universe (Changes to the Universe directory of third-party spaCy code) labels on Oct 11, 2024
Member

ines commented Oct 11, 2024

Thank you for all the universe contributions! 💙 And if the two other projects can also be used independently, it's fine to keep them separate – but if you want people to just use the new project instead, feel free to submit another PR that removes the others!

@ines changed the title from universe-pipeline-solipCysme-french to universe-pipeline-solipCysme-french (#13627) [ci skip] on Oct 11, 2024
@ines merged commit 0d7e57f into explosion:master on Oct 11, 2024
1 check passed
2 participants