GitHub · Where software is built

Milestones

Clean up models and vectors
Also see #3052. We didn't want to cram this all into the initial v2.1 release, since it'd introduce another layer of complexity. But now that the new models are out, we can focus on cleaning up the vocab and vectors, and resolving various issues around the model data.
No due date
•2/2 issues closed
100% complete0 open 2 closed
v1.7.0
No due date
•3/3 issues closed
100% complete0 open 3 closed
Debug parser transition system
There are a few issues that may indicate bugs in the `Parser` transition system, including how spaces and punctuation are attached and how sentences are broken.
No due date
•7/7 issues closed
100% complete0 open 7 closed
Update lemmatizer and morphology
The current `Lemmatizer` currently only works for English and assumes WordNet-formatted input files. The `Morphology` class does not currently add attributes correctly. We also need a simple lookup-based `Lemmatizer` class.
No due date
•8/8 issues closed
100% complete0 open 8 closed
Improve training API
Fix and extend NER, tagging and dependency parsing training examples. Improve model saving and loading process, especially for vocabulary.
No due date
•3/3 issues closed
100% complete0 open 3 closed
Reorganise language data
As more languages are added, we need a better way to share information between them. For instance, most languages will need the same set of emoticons, some abbreviations etc. We also need to refactor the `Defaults` class, which provides access to the data to the `Language` class.
No due date
•5/5 issues closed
100% complete0 open 5 closed
Workflows for new docs
Write usage workflows for new and improved docs.
No due date
•14/14 issues closed
100% complete0 open 14 closed
Version 1.0 Release
Bug fixes, enhancements, documentation for spaCy 1.0.
Due by October 19, 2016
•28/28 issues closed
100% complete0 open 28 closed