Merged
194 commits
3c453f0
initiate development of 3.1.2
stephenhky Mar 23, 2026
05bfdc1
Merge pull request #558 from stephenhky/pyup-update-torch-2.10.0-to-2…
stephenhky Mar 24, 2026
b3e491a
Merge pull request #560 from stephenhky/pyup-update-transformers-5.3.…
stephenhky Mar 27, 2026
1ea8417
Merge pull request #562 from stephenhky/pyup-update-numpy-2.4.3-to-2.4.4
stephenhky Mar 29, 2026
e30ee70
Merge pull request #564 from stephenhky/pyup-update-pandas-3.0.1-to-3…
stephenhky Mar 31, 2026
bd93a97
removed support for Python 3.9
stephenhky Mar 31, 2026
5bf355a
reimplemented smartload.py
stephenhky Mar 31, 2026
cd18d0f
removed deprecated functions
stephenhky Mar 31, 2026
e05c3ad
branching off for type hinting and advanced modern updates
stephenhky Mar 31, 2026
e6ac110
make CompactIOMachine an abstract class
stephenhky Mar 31, 2026
7eb388a
updating codes and unit tests for DTM
stephenhky Mar 31, 2026
aba6db0
updating codes and unit tests for DTM
stephenhky Mar 31, 2026
43d27bf
removed deleted imports
stephenhky Mar 31, 2026
26a3060
regular expression updated
stephenhky Mar 31, 2026
4dc9cd5
updated test_dtm.py to use pytest
stephenhky Mar 31, 2026
4baea7e
debug for pytest
stephenhky Mar 31, 2026
d5032d7
updated gensim_corpora.py
stephenhky Mar 31, 2026
8a57184
Merge pull request #567 from stephenhky/pyup-update-numba-0.64.0-to-0…
stephenhky Apr 1, 2026
9c28aaa
updated requirements for npdict
stephenhky Apr 1, 2026
379fa5d
Merge pull request #566 from stephenhky/pyup-update-numba-0.64.0-to-0…
stephenhky Apr 1, 2026
5564df0
added IO for NPDTM
stephenhky Apr 1, 2026
3024d02
removed unused import
stephenhky Apr 1, 2026
9f416b6
removed typing-extensions dependence and support for Python 3.10
stephenhky Apr 1, 2026
eb8ac42
debugging
stephenhky Apr 1, 2026
1d059f9
updated tokenizing functions
stephenhky Apr 1, 2026
3e097e1
refactoring
stephenhky Apr 1, 2026
c33e6f2
updated requirements for npdict
stephenhky Apr 1, 2026
bfa4d07
updated gensim_corpora.py
stephenhky Apr 1, 2026
5cbb520
kerasmodel_io.py updated
stephenhky Apr 1, 2026
25c510b
beautification
stephenhky Apr 1, 2026
d15ddc5
more type hinting
stephenhky Apr 1, 2026
ca2c6c9
do not split corpus by spaces
stephenhky Apr 1, 2026
402f99f
updated regular expression
stephenhky Apr 2, 2026
f0ebd31
split the text again
stephenhky Apr 2, 2026
92b43e3
use Counter
stephenhky Apr 2, 2026
036e8b3
numba optimization
stephenhky Apr 2, 2026
87a0f45
not using numba
stephenhky Apr 2, 2026
edec034
changed typing from TextWrapperIO to TextIO
stephenhky Apr 2, 2026
ecf9b74
fixing bugs concerning methods and class methods
stephenhky Apr 2, 2026
e208511
Update transformers from 5.4.0 to 5.5.0
pyup-bot Apr 2, 2026
35f9c59
removed unused codes
stephenhky Apr 2, 2026
45e21e8
Merge pull request #573 from stephenhky/pyup-update-transformers-5.4.…
stephenhky Apr 2, 2026
5d178ea
Merge pull request #572 from stephenhky/pyup-update-transformers-5.4.…
stephenhky Apr 2, 2026
e0d28b7
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 2, 2026
3a7e79a
Merge pull request #576 from stephenhky/pyup-update-keras-3.13.2-to-3…
stephenhky Apr 3, 2026
ba8a36e
Merge pull request #575 from stephenhky/pyup-update-keras-3.13.2-to-3…
stephenhky Apr 3, 2026
e856032
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 3, 2026
45ddf70
updated unit tests for DTM
stephenhky Apr 3, 2026
f683f88
updated DTM classes
stephenhky Apr 3, 2026
4c98be2
updated attributes in unit tests
stephenhky Apr 3, 2026
bc59adc
updated DTM unit tests
stephenhky Apr 3, 2026
7cda6bc
updated load function for DTM and the documentations
stephenhky Apr 3, 2026
3f23e80
updated unit tests
stephenhky Apr 3, 2026
3dfd055
use super()
stephenhky Apr 4, 2026
e75dfd5
Merge branch 'develop' into typehint
stephenhky Apr 5, 2026
7a710ea
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 5, 2026
a16cd77
Merge pull request #577 from stephenhky/typehint
stephenhky Apr 5, 2026
5bcef7e
updated to a3
stephenhky Apr 5, 2026
52365ca
updated to a3
stephenhky Apr 5, 2026
7e1320e
some type hinting for MaxEntClassification.py
stephenhky Apr 5, 2026
4533839
updated maxent classifier
stephenhky Apr 6, 2026
e5eaaa7
type hinting debugged
stephenhky Apr 6, 2026
519c0b9
use float
stephenhky Apr 6, 2026
8267603
debug
stephenhky Apr 6, 2026
c94a522
preprocess and tokenize function separated
stephenhky Apr 6, 2026
0bd93ae
removed unused imports
stephenhky Apr 6, 2026
b4812d9
typo fixed
stephenhky Apr 6, 2026
8e68c37
handled non-existent key problem
stephenhky Apr 6, 2026
359a497
Merge branch 'develop' into typehint
stephenhky Apr 6, 2026
c8a7238
Merge pull request #578 from stephenhky/typehint
stephenhky Apr 6, 2026
46b5a52
initiate change for topic modelers
stephenhky Apr 6, 2026
6284f04
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 6, 2026
036b4f5
updated DTM type hinting
stephenhky Apr 7, 2026
c610e59
updated exceptions messages
stephenhky Apr 7, 2026
1b23de2
use native NotImplemented
stephenhky Apr 7, 2026
81fc41e
use native NotImplemented
stephenhky Apr 7, 2026
1b9ab2a
type hinting to exception classes
stephenhky Apr 7, 2026
a0d3329
handled topic modelers
stephenhky Apr 7, 2026
ecc6fd7
updated Python requirements
stephenhky Apr 7, 2026
d4b4a48
updated tutorial on maxent
stephenhky Apr 7, 2026
1361a0e
all topic modelers have type hints
stephenhky Apr 7, 2026
5a4ea1b
handled TopicVectorDistanceClassification.py
stephenhky Apr 7, 2026
2b4ab13
forgot a returned type
stephenhky Apr 7, 2026
25f8114
cleaning SkLearnClassifier
stephenhky Apr 7, 2026
b5b8781
update topic modeling classes
stephenhky Apr 8, 2026
4329a5c
fixing inheritance mess
stephenhky Apr 8, 2026
6a8ff70
updated unit tests for fuzzy logic
stephenhky Apr 8, 2026
b150e84
pydantic base models for CompactIO
stephenhky Apr 8, 2026
a7e62fa
removed deprecation status for two functions related to gensim
stephenhky Apr 8, 2026
216d9cc
convert test_textpreprocessing.py using pytest purely
stephenhky Apr 8, 2026
e9146aa
updated onehot unit test
stephenhky Apr 8, 2026
0b5fdc2
testing autoencoder model
stephenhky Apr 8, 2026
cad0ff3
Update transformers from 5.4.0 to 5.5.1
pyup-bot Apr 9, 2026
bb15f7c
Merge branch 'typehint' into pyup-update-transformers-5.4.0-to-5.5.1
stephenhky Apr 9, 2026
1d664a5
updated transformers requirement
stephenhky Apr 9, 2026
7f9ef99
Merge pull request #581 from stephenhky/pyup-update-transformers-5.4.…
stephenhky Apr 9, 2026
5e7daf0
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 9, 2026
cc65337
updated the keyword arguments for the keras model in the autoencoder
stephenhky Apr 9, 2026
6fec926
Update transformers from 5.5.1 to 5.5.2
pyup-bot Apr 9, 2026
b551652
Merge branch 'typehint' into pyup-update-transformers-5.5.1-to-5.5.2
stephenhky Apr 9, 2026
437e593
Merge pull request #585 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 9, 2026
321d48e
Merge pull request #584 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 9, 2026
a9b7a7e
embedvecs optimization
stephenhky Apr 9, 2026
1d4bef1
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 9, 2026
d2e5529
dtype fixing for bow
stephenhky Apr 9, 2026
1377071
fixing tokenization
stephenhky Apr 9, 2026
aadf2ae
using np.array for embedvecs
stephenhky Apr 9, 2026
7a9a66b
preprocessing
stephenhky Apr 9, 2026
382cfb6
handle empty entry
stephenhky Apr 9, 2026
cc08352
output data download in error stream
stephenhky Apr 9, 2026
e4f2498
test to topic modeling
stephenhky Apr 9, 2026
702718f
equally distributing the topics
stephenhky Apr 9, 2026
2f5a63b
float type casting
stephenhky Apr 10, 2026
7ee65d2
test cosine similarity scorer
stephenhky Apr 10, 2026
98f974b
simplify workflow
stephenhky Apr 10, 2026
63edde5
changing how topic model get_info
stephenhky Apr 10, 2026
a08e971
handling nan error
stephenhky Apr 10, 2026
8e0beb6
updated test
stephenhky Apr 10, 2026
30635bb
typo
stephenhky Apr 10, 2026
648823b
big change to test_stacking
stephenhky Apr 10, 2026
fd52fb0
added loguru package
stephenhky Apr 10, 2026
b2f1cf5
uopdated I/O test
stephenhky Apr 10, 2026
52390a6
code optimization
stephenhky Apr 10, 2026
8675946
approximation in test
stephenhky Apr 10, 2026
9b5223f
Merge pull request #587 from stephenhky/typehint
stephenhky Apr 10, 2026
fadc0db
removed imports
stephenhky Apr 10, 2026
c843ebd
updated version
stephenhky Apr 10, 2026
68868fb
type hinting for DNN frameworks
stephenhky Apr 10, 2026
ecb489f
type hinting for VarNNEmbedVecClassification.py
stephenhky Apr 10, 2026
a5c185c
removed pandas dependence
stephenhky Apr 10, 2026
ef6a576
sum embed classifier
stephenhky Apr 10, 2026
fc94c2c
tune down float type
stephenhky Apr 10, 2026
e60fb68
retreated back to float64
stephenhky Apr 10, 2026
5063300
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 11, 2026
998f185
C-contiguous for np.dot
stephenhky Apr 13, 2026
85c9a5f
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 13, 2026
99e1f61
debugging
stephenhky Apr 13, 2026
b2fe355
dtype
stephenhky Apr 13, 2026
d42250e
convert all wvmodel output to np.float64
stephenhky Apr 13, 2026
38f5869
Merge branch 'develop' into pyup-update-transformers-5.5.3-to-5.5.4
stephenhky Apr 13, 2026
8b4a09e
Merge pull request #589 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 13, 2026
39cbdd6
Merge pull request #590 from stephenhky/pyup-update-transformers-5.5.…
stephenhky Apr 13, 2026
cb3e88d
updated requirements
stephenhky Apr 13, 2026
aa5f8a8
code optimization
stephenhky Apr 13, 2026
0018a10
Merge remote-tracking branch 'origin/typehint' into typehint
stephenhky Apr 13, 2026
880fd38
test cosine similarity
stephenhky Apr 13, 2026
c9af399
shorten code
stephenhky Apr 13, 2026
63dc751
shorten code
stephenhky Apr 13, 2026
23bb4c7
handling IO
stephenhky Apr 13, 2026
50fb386
handling IO 2
stephenhky Apr 13, 2026
8b42ef4
removed a unit test due to the stochasticity of LDA model
stephenhky Apr 13, 2026
851fc3a
import orjson
stephenhky Apr 13, 2026
2906e93
debugged
stephenhky Apr 14, 2026
ca31717
type bugs fixing
stephenhky Apr 14, 2026
00a166e
Merge pull request #591 from stephenhky/typehint
stephenhky Apr 14, 2026
89dc775
release 4.0.0a6 update
stephenhky Apr 14, 2026
ab6c751
command line functions updated
stephenhky Apr 14, 2026
41e6230
data retrieval
stephenhky Apr 14, 2026
ccd5c46
char2vec
stephenhky Apr 14, 2026
7dc8d9a
change typing from TextIO to TextIOWrapper
stephenhky Apr 14, 2026
571fd25
type hinting for seq2seq models
stephenhky Apr 14, 2026
a72f6e4
update type hinting for metrics
stephenhky Apr 14, 2026
36749f8
updated spell corrector
stephenhky Apr 14, 2026
9806439
wrong imports
stephenhky Apr 14, 2026
5393cf4
fixed namespace error
stephenhky Apr 14, 2026
eb180b4
added Scorable abstract class
stephenhky Apr 14, 2026
12ecc85
added imports
stephenhky Apr 14, 2026
27e3927
added imports
stephenhky Apr 14, 2026
da40655
Merge pull request #592 from stephenhky/typehint
stephenhky Apr 14, 2026
fa23267
new alpha
stephenhky Apr 14, 2026
b122c80
updated neural network unit tests
stephenhky Apr 14, 2026
062c633
cleaner exception
stephenhky Apr 14, 2026
c58d8b8
defined AbsoluteScorer
stephenhky Apr 14, 2026
f7351b0
stacking and AbsoluteScorer
stephenhky Apr 15, 2026
d040612
typing updated
stephenhky Apr 15, 2026
736da80
debugged
stephenhky Apr 15, 2026
42c8463
debugging with logger
stephenhky Apr 15, 2026
711c67b
use list for classlabels in stacking
stephenhky Apr 15, 2026
84ab0fb
removed some assertion test due to the stochasticity of the gensim mo…
stephenhky Apr 15, 2026
897fbf6
Merge pull request #593 from stephenhky/typehint
stephenhky Apr 15, 2026
90b8e96
update version
stephenhky Apr 15, 2026
b98d91e
cleaning up tests
stephenhky Apr 15, 2026
b6fc812
use logistic regression
stephenhky Apr 15, 2026
7c3fe82
Merge pull request #594 from stephenhky/typehint
stephenhky Apr 15, 2026
796c5cb
increase version alpha
stephenhky Apr 15, 2026
97f9fac
docstring updated
stephenhky Apr 16, 2026
b0555af
do not use != for NoneType
stephenhky Apr 16, 2026
32634d1
deprecate mergedict
stephenhky Apr 18, 2026
68e2faf
revert changes, and refactor the `mergedict` function
stephenhky Apr 18, 2026
94a15ca
updated instance instantiation functions
stephenhky Apr 18, 2026
825b765
Merge pull request #595 from stephenhky/typehint
stephenhky Apr 18, 2026
996f7ca
initiate new version alpha; updated documentation style
stephenhky Apr 18, 2026
cab5429
Merge pull request #596 from stephenhky/typehint
stephenhky Apr 18, 2026
ec6c8b6
Release 4.0.0
stephenhky Apr 19, 2026
14 changes: 1 addition & 13 deletions .circleci/config.yml
@@ -30,16 +30,6 @@ shared: &shared


 jobs:
-  py39:
-    <<: *shared
-    docker:
-      - image: cimg/python:3.9
-
-  py310:
-    <<: *shared
-    docker:
-      - image: cimg/python:3.10
-
   py311:
     <<: *shared
     docker:
@@ -53,15 +43,13 @@ jobs:
   py313:
     <<: *shared
     docker:
-      - image: cimg/python:3.12
+      - image: cimg/python:3.13


 workflows:
   version: 2
   build:
     jobs:
-      - py39
-      - py310
       - py311
       - py312
       - py313
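As a reading aid, the `jobs` and `workflows` sections after this change would look roughly like the sketch below. The indentation, the `py311` image tag, and the entire `py312` job fall outside the hunks shown, so they are assumptions inferred from the pattern of the neighbouring jobs, not content taken from the diff. The fragment also relies on the `shared` anchor defined earlier in the file.

```yaml
jobs:
  py311:
    <<: *shared
    docker:
      - image: cimg/python:3.11   # assumed; this line is collapsed in the diff

  py312:                          # assumed; the whole job is collapsed in the diff
    <<: *shared
    docker:
      - image: cimg/python:3.12

  py313:
    <<: *shared
    docker:
      - image: cimg/python:3.13   # was cimg/python:3.12 before this change

workflows:
  version: 2
  build:
    jobs:
      - py311
      - py312
      - py313
```

The only addition in this file is the `py313` image tag: the job previously ran on the 3.12 image despite its name.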
1 change: 1 addition & 0 deletions README.md
@@ -80,6 +80,7 @@ You can talk to me in advance through e-mails or the [Issues](https://github.com

 ## News

+* 04/19/2026: `shorttext` 4.0.0 released.
 * 03/22/2026: `shorttext` 3.1.1 released.
 * 03/02/2026: `shorttext` 3.1.0 reelased.
 * 10/27/2025: `shorttext` 3.0.1 released.