Open
1632 commits
63fda5c
Merge pull request #547 from stephenhky/develop
stephenhky Mar 2, 2026
8396313
initiate development of 3.1.1
stephenhky Mar 2, 2026
49956d1
Update transformers from 5.2.0 to 5.3.0
pyup-bot Mar 4, 2026
9481806
Merge pull request #548 from stephenhky/pyup-update-transformers-5.2.…
stephenhky Mar 5, 2026
c8e2bb5
Merge pull request #549 from stephenhky/pyup-update-transformers-5.2.…
stephenhky Mar 5, 2026
e7b2db1
Update tensorflow from 2.20.0 to 2.21.0
pyup-bot Mar 6, 2026
0109846
Merge pull request #550 from stephenhky/pyup-update-tensorflow-2.20.0…
stephenhky Mar 6, 2026
a3dcf4c
Merge pull request #551 from stephenhky/pyup-update-tensorflow-2.20.0…
stephenhky Mar 6, 2026
449d0f5
Update numpy from 2.4.2 to 2.4.3
pyup-bot Mar 9, 2026
9291051
Merge pull request #553 from stephenhky/pyup-update-numpy-2.4.2-to-2.4.3
stephenhky Mar 9, 2026
30389c5
Merge pull request #552 from stephenhky/pyup-update-numpy-2.4.2-to-2.4.3
stephenhky Mar 9, 2026
586f028
Update npdict from 0.0.8 to 0.0.9
pyup-bot Mar 21, 2026
33e9f0d
updated Github workflow
stephenhky Mar 21, 2026
b473586
Merge pull request #555 from stephenhky/pyup-update-npdict-0.0.8-to-0…
stephenhky Mar 21, 2026
1ae80c5
Merge remote-tracking branch 'origin/develop' into develop
stephenhky Mar 21, 2026
8a8f594
update attributes in workflow
stephenhky Mar 21, 2026
a157e4d
release 3.1.1
stephenhky Mar 21, 2026
33b74cc
updated documentation
stephenhky Mar 23, 2026
45a0584
Merge pull request #556 from stephenhky/develop
stephenhky Mar 23, 2026
3c453f0
initiate development of 3.1.2
stephenhky Mar 23, 2026
dec68bc
Update torch from 2.10.0 to 2.11.0
pyup-bot Mar 23, 2026
05bfdc1
Merge pull request #558 from stephenhky/pyup-update-torch-2.10.0-to-2…
stephenhky Mar 24, 2026
a0aaeda
Merge pull request #557 from stephenhky/pyup-update-torch-2.10.0-to-2…
stephenhky Mar 24, 2026
f0b4127
Update transformers from 5.3.0 to 5.4.0
pyup-bot Mar 27, 2026
b3e491a
Merge pull request #560 from stephenhky/pyup-update-transformers-5.3.…
stephenhky Mar 27, 2026
dbec04b
Merge pull request #559 from stephenhky/pyup-update-transformers-5.3.…
stephenhky Mar 27, 2026
cb09fd8
Update numpy from 2.4.3 to 2.4.4
pyup-bot Mar 29, 2026
1ea8417
Merge pull request #562 from stephenhky/pyup-update-numpy-2.4.3-to-2.4.4
stephenhky Mar 29, 2026
8b5a2aa
Update pandas from 3.0.1 to 3.0.2
pyup-bot Mar 31, 2026
e30ee70
Merge pull request #564 from stephenhky/pyup-update-pandas-3.0.1-to-3…
stephenhky Mar 31, 2026
bd93a97
removed support for Python 3.9
stephenhky Mar 31, 2026
5bf355a
reimplemented smartload.py
stephenhky Mar 31, 2026
cd18d0f
removed deprecated functions
stephenhky Mar 31, 2026
e05c3ad
branching off for type hinting and advanced modern updates
stephenhky Mar 31, 2026
e6ac110
make CompactIOMachine an abstract class
stephenhky Mar 31, 2026
7eb388a
updating codes and unit tests for DTM
stephenhky Mar 31, 2026
aba6db0
updating codes and unit tests for DTM
stephenhky Mar 31, 2026
43d27bf
removed deleted imports
stephenhky Mar 31, 2026
26a3060
regular expression updated
stephenhky Mar 31, 2026
4dc9cd5
updated test_dtm.py to use pytest
stephenhky Mar 31, 2026
4baea7e
debug for pytest
stephenhky Mar 31, 2026
d5032d7
updated gensim_corpora.py
stephenhky Mar 31, 2026
df16fe0
Update numba from 0.64.0 to 0.65.0
pyup-bot Apr 1, 2026
8a57184
Merge pull request #567 from stephenhky/pyup-update-numba-0.64.0-to-0…
stephenhky Apr 1, 2026
9c28aaa
updated requirements for npdict
stephenhky Apr 1, 2026
379fa5d
Merge pull request #566 from stephenhky/pyup-update-numba-0.64.0-to-0…
stephenhky Apr 1, 2026
5564df0
added IO for NPDTM
stephenhky Apr 1, 2026
3024d02
removed unused import
stephenhky Apr 1, 2026
9f416b6
removed typing-extensions dependence and support for Python 3.10
stephenhky Apr 1, 2026
eb8ac42
debugging
stephenhky Apr 1, 2026
1d059f9
updated tokenizing functions
stephenhky Apr 1, 2026
3e097e1
refactoring
stephenhky Apr 1, 2026
c33e6f2
updated requirements for npdict
stephenhky Apr 1, 2026
78ceb13
Merge pull request #565 from stephenhky/pyup-update-numba-0.64.0-to-0…
stephenhky Apr 1, 2026
bfa4d07
updated gensim_corpora.py
stephenhky Apr 1, 2026
5cbb520
kerasmodel_io.py updated
stephenhky Apr 1, 2026
25c510b
beautification
stephenhky Apr 1, 2026
d15ddc5
more type hinting
stephenhky Apr 1, 2026
ca2c6c9
do not split corpus by spaces
stephenhky Apr 1, 2026
402f99f
updated regular expression
stephenhky Apr 2, 2026
f0ebd31
split the text again
stephenhky Apr 2, 2026
92b43e3
use Counter
stephenhky Apr 2, 2026
036e8b3
numba optimization
stephenhky Apr 2, 2026
87a0f45
not using numba
stephenhky Apr 2, 2026
edec034
changed typing from TextWrapperIO to TextIO
stephenhky Apr 2, 2026
ecf9b74
fixing bugs concerning methods and class methods
stephenhky Apr 2, 2026
e208511
Update transformers from 5.4.0 to 5.5.0
pyup-bot Apr 2, 2026
35f9c59
removed unused codes
stephenhky Apr 2, 2026
45e21e8
Merge pull request #573 from stephenhky/pyup-update-transformers-5.4.…
stephenhky Apr 2, 2026
5d178ea
Merge pull request #572 from stephenhky/pyup-update-transformers-5.4.…
stephenhky Apr 2, 2026
e0d28b7
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 2, 2026
e440f9a
Update keras from 3.13.2 to 3.14.0
pyup-bot Apr 3, 2026
3a7e79a
Merge pull request #576 from stephenhky/pyup-update-keras-3.13.2-to-3…
stephenhky Apr 3, 2026
ba8a36e
Merge pull request #575 from stephenhky/pyup-update-keras-3.13.2-to-3…
stephenhky Apr 3, 2026
e856032
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 3, 2026
45ddf70
updated unit tests for DTM
stephenhky Apr 3, 2026
f683f88
updated DTM classes
stephenhky Apr 3, 2026
4c98be2
updated attributes in unit tests
stephenhky Apr 3, 2026
bc59adc
updated DTM unit tests
stephenhky Apr 3, 2026
7cda6bc
updated load function for DTM and the documentations
stephenhky Apr 3, 2026
3f23e80
updated unit tests
stephenhky Apr 3, 2026
3dfd055
use super()
stephenhky Apr 4, 2026
e75dfd5
Merge branch 'develop' into typehint
stephenhky Apr 5, 2026
7a710ea
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 5, 2026
a16cd77
Merge pull request #577 from stephenhky/typehint
stephenhky Apr 5, 2026
5bcef7e
updated to a3
stephenhky Apr 5, 2026
52365ca
updated to a3
stephenhky Apr 5, 2026
7e1320e
some type hinting for MaxEntClassification.py
stephenhky Apr 5, 2026
4533839
updated maxent classifier
stephenhky Apr 6, 2026
e5eaaa7
type hinting debugged
stephenhky Apr 6, 2026
519c0b9
use float
stephenhky Apr 6, 2026
8267603
debug
stephenhky Apr 6, 2026
c94a522
preprocess and tokenize function separated
stephenhky Apr 6, 2026
0bd93ae
removed unused imports
stephenhky Apr 6, 2026
b4812d9
typo fixed
stephenhky Apr 6, 2026
8e68c37
handled non-existent key problem
stephenhky Apr 6, 2026
359a497
Merge branch 'develop' into typehint
stephenhky Apr 6, 2026
c8a7238
Merge pull request #578 from stephenhky/typehint
stephenhky Apr 6, 2026
46b5a52
initiate change for topic modelers
stephenhky Apr 6, 2026
6284f04
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 6, 2026
036b4f5
updated DTM type hinting
stephenhky Apr 7, 2026
c610e59
updated exceptions messages
stephenhky Apr 7, 2026
1b23de2
use native NotImplemented
stephenhky Apr 7, 2026
81fc41e
use native NotImplemented
stephenhky Apr 7, 2026
1b9ab2a
type hinting to exception classes
stephenhky Apr 7, 2026
a0d3329
handled topic modelers
stephenhky Apr 7, 2026
ecc6fd7
updated Python requirements
stephenhky Apr 7, 2026
d4b4a48
updated tutorial on maxent
stephenhky Apr 7, 2026
1361a0e
all topic modelers have type hints
stephenhky Apr 7, 2026
5a4ea1b
handled TopicVectorDistanceClassification.py
stephenhky Apr 7, 2026
2b4ab13
forgot a returned type
stephenhky Apr 7, 2026
25f8114
cleaning SkLearnClassifier
stephenhky Apr 7, 2026
b5b8781
update topic modeling classes
stephenhky Apr 8, 2026
4329a5c
fixing inheritance mess
stephenhky Apr 8, 2026
6a8ff70
updated unit tests for fuzzy logic
stephenhky Apr 8, 2026
b150e84
pydantic base models for CompactIO
stephenhky Apr 8, 2026
a7e62fa
removed deprecation status for two functions related to gensim
stephenhky Apr 8, 2026
216d9cc
convert test_textpreprocessing.py using pytest purely
stephenhky Apr 8, 2026
e9146aa
updated onehot unit test
stephenhky Apr 8, 2026
0b5fdc2
testing autoencoder model
stephenhky Apr 8, 2026
cad0ff3
Update transformers from 5.4.0 to 5.5.1
pyup-bot Apr 9, 2026
bb15f7c
Merge branch 'typehint' into pyup-update-transformers-5.4.0-to-5.5.1
stephenhky Apr 9, 2026
be6cf90
Merge pull request #561 from stephenhky/pyup-update-numpy-2.4.3-to-2.4.4
stephenhky Apr 9, 2026
a4d0c09
Merge pull request #563 from stephenhky/pyup-update-pandas-3.0.1-to-3…
stephenhky Apr 9, 2026
53d40d3
Merge pull request #574 from stephenhky/pyup-update-keras-3.13.2-to-3…
stephenhky Apr 9, 2026
1d664a5
updated transformers requirement
stephenhky Apr 9, 2026
e689bc8
Update transformers version in requirements.txt
stephenhky Apr 9, 2026
7f9ef99
Merge pull request #581 from stephenhky/pyup-update-transformers-5.4.…
stephenhky Apr 9, 2026
5e7daf0
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 9, 2026
cc65337
updated the keyword arguments for the keras model in the autoencoder
stephenhky Apr 9, 2026
6fec926
Update transformers from 5.5.1 to 5.5.2
pyup-bot Apr 9, 2026
b551652
Merge branch 'typehint' into pyup-update-transformers-5.5.1-to-5.5.2
stephenhky Apr 9, 2026
437e593
Merge pull request #585 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 9, 2026
321d48e
Merge pull request #584 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 9, 2026
5ecfe9b
Update transformers version to 5.5.2
stephenhky Apr 9, 2026
17a69fe
Update npdict version to 0.0.10
stephenhky Apr 9, 2026
a9b7a7e
embedvecs optimization
stephenhky Apr 9, 2026
1d4bef1
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 9, 2026
d2e5529
dtype fixing for bow
stephenhky Apr 9, 2026
54fc60d
Update transformers from 5.5.2 to 5.5.3
pyup-bot Apr 9, 2026
1377071
fixing tokenization
stephenhky Apr 9, 2026
aadf2ae
using np.array for embedvecs
stephenhky Apr 9, 2026
7a9a66b
preprocessing
stephenhky Apr 9, 2026
382cfb6
handle empty entry
stephenhky Apr 9, 2026
ec89e4d
Merge pull request #586 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 9, 2026
cc08352
output data download in error stream
stephenhky Apr 9, 2026
e4f2498
test to topic modeling
stephenhky Apr 9, 2026
702718f
equally distributing the topics
stephenhky Apr 9, 2026
2f5a63b
float type casting
stephenhky Apr 10, 2026
7ee65d2
test cosine similarity scorer
stephenhky Apr 10, 2026
98f974b
simplify workflow
stephenhky Apr 10, 2026
63edde5
changing how topic model get_info
stephenhky Apr 10, 2026
a08e971
handling nan error
stephenhky Apr 10, 2026
8e0beb6
updated test
stephenhky Apr 10, 2026
30635bb
typo
stephenhky Apr 10, 2026
648823b
big change to test_stacking
stephenhky Apr 10, 2026
fd52fb0
added loguru package
stephenhky Apr 10, 2026
b2f1cf5
updated I/O test
stephenhky Apr 10, 2026
52390a6
code optimization
stephenhky Apr 10, 2026
8675946
approximation in test
stephenhky Apr 10, 2026
9b5223f
Merge pull request #587 from stephenhky/typehint
stephenhky Apr 10, 2026
fadc0db
removed imports
stephenhky Apr 10, 2026
c843ebd
updated version
stephenhky Apr 10, 2026
68868fb
type hinting for DNN frameworks
stephenhky Apr 10, 2026
ecb489f
type hinting for VarNNEmbedVecClassification.py
stephenhky Apr 10, 2026
a5c185c
removed pandas dependence
stephenhky Apr 10, 2026
ef6a576
sum embed classifier
stephenhky Apr 10, 2026
fc94c2c
tune down float type
stephenhky Apr 10, 2026
e60fb68
retreated back to float64
stephenhky Apr 10, 2026
5063300
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 11, 2026
998f185
C-contiguous for np.dot
stephenhky Apr 13, 2026
85c9a5f
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 13, 2026
99e1f61
debugging
stephenhky Apr 13, 2026
b2fe355
dtype
stephenhky Apr 13, 2026
d42250e
convert all wvmodel output to np.float64
stephenhky Apr 13, 2026
4435c14
Update transformers from 5.5.3 to 5.5.4
pyup-bot Apr 13, 2026
c7c4d8d
Merge pull request #588 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 13, 2026
38f5869
Merge branch 'develop' into pyup-update-transformers-5.5.3-to-5.5.4
stephenhky Apr 13, 2026
8b4a09e
Merge pull request #589 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 13, 2026
39cbdd6
Merge pull request #590 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 13, 2026
cb3e88d
updated requirements
stephenhky Apr 13, 2026
aa5f8a8
code optimization
stephenhky Apr 13, 2026
0018a10
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 13, 2026
880fd38
test cosine similarity
stephenhky Apr 13, 2026
c9af399
shorten code
stephenhky Apr 13, 2026
63dc751
shorten code
stephenhky Apr 13, 2026
23bb4c7
handling IO
stephenhky Apr 13, 2026
50fb386
handling IO 2
stephenhky Apr 13, 2026
8b42ef4
removed a unit test due to the stochasticity of LDA model
stephenhky Apr 13, 2026
851fc3a
import orjson
stephenhky Apr 13, 2026
2906e93
debugged
stephenhky Apr 14, 2026
ca31717
type bugs fixing
stephenhky Apr 14, 2026
00a166e
Merge pull request #591 from stephenhky/typehint
stephenhky Apr 14, 2026
89dc775
release 4.0.0a6 update
stephenhky Apr 14, 2026
ab6c751
command line functions updated
stephenhky Apr 14, 2026
41e6230
data retrieval
stephenhky Apr 14, 2026
ccd5c46
char2vec
stephenhky Apr 14, 2026
7dc8d9a
change typing from TextIO to TextIOWrapper
stephenhky Apr 14, 2026
571fd25
type hinting for seq2seq models
stephenhky Apr 14, 2026
a72f6e4
update type hinting for metrics
stephenhky Apr 14, 2026
36749f8
updated spell corrector
stephenhky Apr 14, 2026
9806439
wrong imports
stephenhky Apr 14, 2026
5393cf4
fixed namespace error
stephenhky Apr 14, 2026
eb180b4
added Scorable abstract class
stephenhky Apr 14, 2026
12ecc85
added imports
stephenhky Apr 14, 2026
27e3927
added imports
stephenhky Apr 14, 2026
da40655
Merge pull request #592 from stephenhky/typehint
stephenhky Apr 14, 2026
fa23267
new alpha
stephenhky Apr 14, 2026
b122c80
updated neural network unit tests
stephenhky Apr 14, 2026
062c633
cleaner exception
stephenhky Apr 14, 2026
c58d8b8
defined AbsoluteScorer
stephenhky Apr 14, 2026
f7351b0
stacking and AbsoluteScorer
stephenhky Apr 15, 2026
d040612
typing updated
stephenhky Apr 15, 2026
736da80
debugged
stephenhky Apr 15, 2026
42c8463
debugging with logger
stephenhky Apr 15, 2026
711c67b
use list for classlabels in stacking
stephenhky Apr 15, 2026
84ab0fb
removed some assertion test due to the stochasticity of the gensim mo…
stephenhky Apr 15, 2026
897fbf6
Merge pull request #593 from stephenhky/typehint
stephenhky Apr 15, 2026
90b8e96
update version
stephenhky Apr 15, 2026
b98d91e
cleaning up tests
stephenhky Apr 15, 2026
b6fc812
use logistic regression
stephenhky Apr 15, 2026
7c3fe82
Merge pull request #594 from stephenhky/typehint
stephenhky Apr 15, 2026
796c5cb
increase version alpha
stephenhky Apr 15, 2026
97f9fac
docstring updated
stephenhky Apr 16, 2026
b0555af
do not use != for NoneType
stephenhky Apr 16, 2026
32634d1
deprecate mergedict
stephenhky Apr 18, 2026
68e2faf
revert changes, and refactor the `mergedict` function
stephenhky Apr 18, 2026
94a15ca
updated instance instantiation functions
stephenhky Apr 18, 2026
825b765
Merge pull request #595 from stephenhky/typehint
stephenhky Apr 18, 2026
996f7ca
initiate new version alpha; updated documentation style
stephenhky Apr 18, 2026
cab5429
Merge pull request #596 from stephenhky/typehint
stephenhky Apr 18, 2026
ec6c8b6
Release 4.0.0
stephenhky Apr 19, 2026
550e2cb
Merge pull request #597 from stephenhky/develop
stephenhky Apr 19, 2026
85974b5
initiate development of 4.0.1
stephenhky Apr 19, 2026
a3a4e3e
deprecation
stephenhky Apr 20, 2026
cb39ec2
deprecation
stephenhky Apr 20, 2026
4d59bbf
Update deprecation from 2.0.0 to 2.1.0
pyup-bot Apr 20, 2026
fd22a46
deprecation
stephenhky Apr 20, 2026
6b15d21
deprecation
stephenhky Apr 20, 2026
a2981c3
deprecation
stephenhky Apr 20, 2026
93d1aa4
Merge pull request #599 from stephenhky/pyup-update-deprecation-2.0.0…
stephenhky Apr 20, 2026
ddf4b56
Merge pull request #598 from stephenhky/pyup-update-deprecation-2.0.0…
stephenhky Apr 20, 2026
a48c608
updated requirements_minimal.txt
stephenhky Apr 20, 2026
fc0e95b
Merge remote-tracking branch 'origin/develop' into develop
stephenhky Apr 20, 2026
f110127
updated requirements_minimal.txt
stephenhky Apr 20, 2026
528f948
Merge branch 'develop' into readthedocs
stephenhky Apr 20, 2026
583b63f
updated HTML theme
stephenhky Apr 20, 2026
569a1ab
Merge branch 'master' into develop
stephenhky Apr 20, 2026
b6d802c
Merge pull request #600 from stephenhky/readthedocs
stephenhky Apr 20, 2026
785a065
Merge pull request #601 from stephenhky/develop
stephenhky Apr 20, 2026
55 changes: 55 additions & 0 deletions .circleci/config.yml
@@ -0,0 +1,55 @@
version: 2


shared: &shared
  working_directory: ~/shorttext

  steps:
    - checkout

    - run:
        name: Apt Install
        command: |
          sudo apt-get update
          sudo apt-get install libc6
          sudo apt-get install python3-dev
          sudo apt-get install -y g++

    - run:
        name: Installing Miniconda and Packages
        command: |
          pip install --upgrade --user pip
          pip install --upgrade --user google-compute-engine
          pip install --user .

    - run:
        name: Run Unit Tests
        command: |
          pip install --user .[test]
          pytest


jobs:
  py311:
    <<: *shared
    docker:
      - image: cimg/python:3.11

  py312:
    <<: *shared
    docker:
      - image: cimg/python:3.12

  py313:
    <<: *shared
    docker:
      - image: cimg/python:3.13


workflows:
  version: 2
  build:
    jobs:
      - py311
      - py312
      - py313
35 changes: 35 additions & 0 deletions .github/workflows/publish-to-pypi.yml
@@ -0,0 +1,35 @@
name: Publish to PyPI

on:
  release:
    types: [published]

jobs:
  publish-to-pypi:
    name: Publish to PyPI
    runs-on: ubuntu-latest
    permissions:
      id-token: write  # IMPORTANT: this permission is mandatory for trusted publishing
    steps:
      - name: Checkout code
        uses: actions/checkout@v5

      - name: Set up Python
        uses: actions/setup-python@v6
        with:
          python-version: "3.10"

      - name: Install build dependencies
        run: |
          python -m pip install --upgrade pip
          pip install build

      - name: Build package
        run: python -m build

      - name: Publish package to PyPI
        uses: pypa/gh-action-pypi-publish@release/v1
        with:
          user: __token__
          password: ${{ secrets.PYPI_API_TOKEN }}
          skip-existing: true