Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: stephenhky/PyShortTextCategorization
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: 1.0.7
Choose a base ref
...
head repository: stephenhky/PyShortTextCategorization
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: master
Choose a head ref
Loading
Showing with 5,350 additions and 35,103 deletions.
  1. +61 −0 .circleci/config.yml
  2. +828 −65 .gitignore
  3. +31 −0 .readthedocs.yml
  4. +0 −33 .travis.yml
  5. +1 −1 LICENSE
  6. +3 −6 MANIFEST.in
  7. +52 −21 README.md
  8. +0 −192 apidocs/Makefile
  9. +0 −173 apidocs/source/codes.rst
  10. +0 −287 apidocs/source/conf.py
  11. +0 −37 apidocs/source/index.rst
  12. +0 −46 apidocs/source/install.rst
  13. +0 −34 bin/switch_kerasbackend
  14. BIN data/USInaugural.zip
  15. BIN data/nih_full.csv.zip
  16. +13 −90 docs/codes.rst
  17. +5 −5 docs/conf.py
  18. +10 −65 docs/faq.rst
  19. +1 −0 docs/index.rst
  20. +17 −40 docs/install.rst
  21. +5 −10 docs/intro.rst
  22. +269 −1 docs/news.rst
  23. +12 −0 docs/requirements.txt
  24. +6 −2 docs/scripts.rst
  25. +0 −3 docs/tutorial.rst
  26. +7 −2 docs/tutorial_charbaseonehot.rst
  27. +14 −1 docs/tutorial_charbaseseq2seq.rst
  28. +13 −24 docs/tutorial_dataprep.rst
  29. +6 −0 docs/tutorial_dtm.rst
  30. +5 −0 docs/tutorial_maxent.rst
  31. +36 −1 docs/tutorial_metrics.rst
  32. +13 −1 docs/tutorial_nnlib.rst
  33. +10 −6 docs/tutorial_spell.rst
  34. +5 −0 docs/tutorial_stacking.rst
  35. +4 −0 docs/tutorial_sumvec.rst
  36. +8 −2 docs/tutorial_textpreprocessing.rst
  37. +24 −0 docs/tutorial_topic.rst
  38. +56 −7 docs/tutorial_wordembed.rst
  39. +85 −0 pyproject.toml
  40. +0 −7 setup.cfg
  41. +0 −90 setup.py
  42. +4 −12 shorttext/__init__.py
  43. +7 −25 shorttext/classifiers/bow/maxent/MaxEntClassification.py
  44. +6 −6 shorttext/classifiers/bow/topic/SkLearnClassification.py
  45. +4 −4 shorttext/classifiers/bow/topic/TopicVectorDistanceClassification.py
  46. +23 −13 shorttext/classifiers/embed/nnlib/VarNNEmbedVecClassification.py
  47. +9 −16 shorttext/classifiers/embed/nnlib/frameworks.py
  48. +6 −5 shorttext/classifiers/embed/sumvec/SumEmbedVecClassification.py
  49. +10 −14 shorttext/classifiers/embed/sumvec/VarNNSumEmbedVecClassification.py
  50. +3 −3 shorttext/classifiers/embed/sumvec/frameworks.py
  51. 0 shorttext/cli/__init__.py
  52. +34 −30 bin/ShortTextCategorizerConsole → shorttext/cli/categorization.py
  53. +22 −23 bin/ShortTextWordEmbedSimilarity → shorttext/cli/wordembedsim.py
  54. +7 −34 shorttext/data/data_retrieval.py
  55. +11 −10 shorttext/generators/bow/AutoEncodingTopicModeling.py
  56. +5 −5 shorttext/generators/bow/GensimTopicModeling.py
  57. +10 −4 shorttext/generators/bow/LatentTopicModeling.py
  58. +2 −1 shorttext/generators/bow/__init__.py
  59. +1 −1 shorttext/generators/charbase/char2vec.py
  60. +1 −2 shorttext/generators/seq2seq/charbaseS2S.py
  61. +14 −14 shorttext/generators/seq2seq/s2skeras.py
  62. +2 −1 shorttext/metrics/__init__.py
  63. +3 −6 shorttext/metrics/dynprog/__init__.py
  64. +0 −23,881 shorttext/metrics/dynprog/dldist.c
  65. +41 −0 shorttext/metrics/dynprog/dldist.py
  66. +0 −32 shorttext/metrics/dynprog/dldist.pyx
  67. +0 −6,562 shorttext/metrics/dynprog/lcp.c
  68. +22 −0 shorttext/metrics/dynprog/lcp.py
  69. +0 −19 shorttext/metrics/dynprog/lcp.pyx
  70. +3 −3 shorttext/metrics/embedfuzzy/jaccard.py
  71. +2 −0 shorttext/metrics/transformers/__init__.py
  72. +112 −0 shorttext/metrics/transformers/bertscore.py
  73. +1 −1 shorttext/metrics/wasserstein/__init__.py
  74. +42 −27 shorttext/metrics/wasserstein/wordmoverdist.py
  75. +9 −4 shorttext/spell/basespellcorrector.py
  76. +2 −2 shorttext/spell/binarize.py
  77. +14 −2 shorttext/spell/editor.py
  78. +0 −2,909 shorttext/spell/edits1_comb.c
  79. +0 −17 shorttext/spell/edits1_comb.pyx
  80. +4 −4 shorttext/spell/sakaguchi.py
  81. +10 −6 shorttext/stack/stacking.py
  82. +5 −5 shorttext/utils/__init__.py
  83. +1 −1 shorttext/utils/compactmodel_io.py
  84. +0 −14 shorttext/utils/deprecation.py
  85. +1 −1 shorttext/utils/dtm.py
  86. +1 −1 shorttext/utils/gensim_corpora.py
  87. +2 −2 shorttext/utils/kerasmodel_io.py
  88. +17 −0 shorttext/utils/misc.py
  89. +2,778 −0 shorttext/utils/nonneg_stopwords.txt
  90. +62 −44 shorttext/utils/textpreprocessing.py
  91. +153 −0 shorttext/utils/transformers.py
  92. +153 −35 shorttext/utils/wordembed.py
  93. +45 −0 test/test_bertrepresentations.py
  94. +3 −7 test/test_charonehot.py
  95. +8 −8 test/test_dtm.py
  96. +13 −13 test/test_fuzzylogic.py
  97. +3 −8 test/test_norvigspell.py
  98. +7 −0 test/test_textpreprocessing.py
  99. +12 −14 test/test_var_nn_embedded_vec_classifier.py
  100. +40 −0 test/test_wmd.py
61 changes: 61 additions & 0 deletions .circleci/config.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,61 @@
version: 2


shared: &shared
working_directory: ~/shorttext

steps:
- checkout

- run:
name: Apt Install
command: |
sudo apt-get update
sudo apt-get install libc6
sudo apt-get install python3-dev
sudo apt-get install -y g++
- run:
name: Installing Miniconda and Packages
command: |
pip install --upgrade --user pip
pip install --upgrade --user google-compute-engine
pip install --user .
- run:
name: Run Unit Tests
command: |
pip install --user .[test]
pytest
jobs:
py39:
<<: *shared
docker:
- image: cimg/python:3.9

py310:
<<: *shared
docker:
- image: cimg/python:3.10

py311:
<<: *shared
docker:
- image: cimg/python:3.11

py312:
<<: *shared
docker:
- image: cimg/python:3.12


workflows:
version: 2
build:
jobs:
- py39
- py310
- py311
- py312
Loading