Skip to content

Commit 9b50b62

Browse files
committed
release 2.1.0
1 parent d1c5134 commit 9b50b62

File tree

9 files changed

+26
-54
lines changed

9 files changed

+26
-54
lines changed

.readthedocs.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ sphinx:
1212
build:
1313
os: ubuntu-22.04
1414
tools:
15-
python: "3.9"
15+
python: "3.12"
1616

1717
# Build documentation with MkDocs
1818
#mkdocs:

README.md

Lines changed: 3 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ representation of the texts and documents are needed before they are put into
1818
any classification algorithm. In this package, it facilitates various types
1919
of these representations, including topic modeling and word-embedding algorithms.
2020

21-
The package `shorttext` runs on Python 3.8, 3.9, 3.10, and 3.11.
21+
The package `shorttext` runs on Python 3.9, 3.10, 3.11, and 3.12.
2222
Characteristics:
2323

2424
- example data provided (including subject keywords and NIH RePORT);
@@ -31,8 +31,7 @@ Characteristics:
3131
- maximum entropy classification;
3232
- metrics of phrases differences, including soft Jaccard score (using Damerau-Levenshtein distance), and Word Mover's distance (WMD);
3333
- character-level sequence-to-sequence (seq2seq) learning;
34-
- spell correction;
35-
- API for word-embedding algorithm for one-time loading; and
34+
- spell correction; and
3635
- Sentence encodings and similarities based on BERT.
3736

3837
## Documentation
@@ -84,6 +83,7 @@ If you would like to contribute, feel free to submit the pull requests. You can
8483

8584
## News
8685

86+
* 12/14/2024: `shorttext` 2.1.0 released.
8787
* 07/12/2024: `shorttext` 2.0.0 released.
8888
* 12/21/2023: `shorttext` 1.6.1 released.
8989
* 08/26/2023: `shorttext` 1.6.0 released.
@@ -159,8 +159,3 @@ If you would like to contribute, feel free to submit the pull requests. You can
159159
* 12/21/2016: `shorttext` 0.2.0 released.
160160
* 11/25/2016: `shorttext` 0.1.2 released.
161161
* 11/21/2016: `shorttext` 0.1.1 released.
162-
163-
## Possible Future Updates
164-
165-
- [ ] Dividing components to other packages;
166-
- [ ] More available corpus.

docs/codes.rst

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -65,7 +65,13 @@ Module `shorttext.metrics.dynprog`
6565
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
6666

6767
.. automodule:: shorttext.metrics.dynprog.jaccard
68-
:members: soft_intersection_list
68+
:members:
69+
70+
.. automodule:: shorttext.metrics.dynprog.dldist
71+
:members:
72+
73+
.. automodule:: shorttext.metrics.dynprog.lcp
74+
:members:
6975

7076
Module `shorttext.metrics.wassersterin`
7177
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

docs/install.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -41,7 +41,7 @@ you may try one (or more) of the following:
4141

4242
::
4343

44-
pip install -U python3-dev
44+
pip install python3-dev
4545

4646

4747

docs/intro.rst

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,6 @@ Characteristics:
2323
- metrics of phrases differences, including soft Jaccard score (using Damerau-Levenshtein distance), and Word Mover's distance (WMD); (see :doc:`tutorial_metrics`)
2424
- character-level sequence-to-sequence (seq2seq) learning; (see :doc:`tutorial_charbaseseq2seq`)
2525
- spell correction; (see :doc:`tutorial_spell`)
26-
- API for word-embedding algorithm for one-time loading; (see :doc:`tutorial_wordembedAPI`) and
2726
- Sentence encodings and similarities based on BERT (see :doc:`tutorial_wordembed` and :doc:`tutorial_metrics`).
2827

2928
Author: Kwan Yuet Stephen Ho (LinkedIn_, ResearchGate_, Twitter_)

docs/news.rst

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,7 @@
11
News
22
====
33

4+
* 12/14/2024: `shorttext` 2.1.0 released.
45
* 07/12/2024: `shorttext` 2.0.0 released.
56
* 12/21/2023: `shorttext` 1.6.1 released.
67
* 08/26/2023: `shorttext` 1.6.0 released.
@@ -81,6 +82,13 @@ News
8182
What's New
8283
----------
8384

85+
Released 2.1.0 (December 14, 2024)
86+
------------------------------
87+
88+
* Use of `pyproject.toml` for package distribution.
89+
* Removed Cython components.
90+
* Huge relative import refactoring.
91+
8492
Released 2.0.0 (July 13, 2024)
8593
------------------------------
8694

docs/scripts.rst

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,14 +12,15 @@ ShortTextCategorizerConsole
1212

1313
usage: ShortTextCategorizerConsole [-h] [--wv WV] [--vecsize VECSIZE]
1414
[--topn TOPN] [--inputtext INPUTTEXT]
15+
[--type TYPE]
1516
model_filepath
1617

1718
Perform prediction on short text with a given trained model.
1819

1920
positional arguments:
2021
model_filepath Path of the trained (compact) model.
2122

22-
optional arguments:
23+
options:
2324
-h, --help show this help message and exit
2425
--wv WV Path of the pre-trained Word2Vec model. (None if not
2526
needed)
@@ -28,6 +29,9 @@ ShortTextCategorizerConsole
2829
--inputtext INPUTTEXT
2930
single input text for classification. Run console if
3031
set to None. (Default: None)
32+
--type TYPE Type of word-embedding model (default: "word2vec";
33+
other options: "fasttext", "poincare",
34+
"word2vec_nonbinary", "poincare_binary")
3135

3236

3337
ShortTextWordEmbedSimilarity

docs/tutorial_wordembedAPI.rst

Lines changed: 0 additions & 40 deletions
This file was deleted.

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
44

55
[project]
66
name = "shorttext"
7-
version = "2.1.0a1"
7+
version = "2.1.0"
88
authors = [
99
{name = "Kwan Yuet Stephen Ho", email = "[email protected]"}
1010
]

0 commit comments

Comments
 (0)