Topic Modeling Pipelines

Two LDA topic modeling pipelines in Python, built on the pymystem (a Python wrapper for the mystem stemmer), gensim, and tomotopy packages. The pipelines cover the key steps of topic modeling: lemmatization, stop-word filtering, model training, and saving the phi and theta matrices.
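To make the steps above concrete, here is a minimal, dependency-free sketch of the same flow. It is not the code from this repository: the `LEMMAS` lookup table is a hypothetical stand-in for mystem lemmatization, the tiny collapsed Gibbs sampler is swapped in for the gensim and tomotopy trainers, and all function names, parameters, and default values are assumptions made for illustration. Here phi is taken to be the topic-by-word matrix and theta the document-by-topic matrix.

```python
import csv
import random
import re

# Hypothetical stand-ins: the real pipelines lemmatize with mystem.
LEMMAS = {"cats": "cat", "dogs": "dog", "stocks": "stock", "markets": "market"}
STOP_WORDS = {"the", "and", "a", "of", "in"}


def preprocess(text):
    """Tokenize, lemmatize via the lookup table, and drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    lemmas = [LEMMAS.get(token, token) for token in tokens]
    return [lemma for lemma in lemmas if lemma not in STOP_WORDS]


def train_lda(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Toy collapsed Gibbs sampler for LDA; returns (vocab, phi, theta)."""
    rng = random.Random(seed)
    vocab = sorted({word for doc in docs for word in doc})
    word_id = {word: i for i, word in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]            # counts per document
    topic_word = [[0] * vocab_size for _ in range(num_topics)]
    topic_total = [0] * num_topics
    assignments = []

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        doc_assignments = []
        for word in doc:
            k = rng.randrange(num_topics)
            doc_assignments.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[word]] += 1
            topic_total[k] += 1
        assignments.append(doc_assignments)

    # Resample each token's topic from its conditional distribution.
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, word in enumerate(doc):
                v = word_id[word]
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # phi: P(word | topic), one row per topic.
    phi = [
        [(topic_word[k][v] + beta) / (topic_total[k] + vocab_size * beta)
         for v in range(vocab_size)]
        for k in range(num_topics)
    ]
    # theta: P(topic | document), one row per document.
    theta = [
        [(doc_topic[d][k] + alpha) / (len(doc) + num_topics * alpha)
         for k in range(num_topics)]
        for d, doc in enumerate(docs)
    ]
    return vocab, phi, theta


def save_matrix(path, header, rows):
    """Write a matrix to CSV with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)
        writer.writerows(rows)
```

A typical run would call `preprocess` on each raw document, pass the resulting token lists to `train_lda`, and then call `save_matrix` twice, once with the vocabulary as the header for phi and once with topic labels as the header for theta. In practice the library trainers are the better choice; the sampler here only illustrates what the two matrices contain.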