Topic Modeling with Non Negative Matrix Factorization

A concise guide to uncovering hidden themes in text data.

Libraries Used 📚

NLTK: For text preprocessing
TfidfVectorizer: To convert text to numerical features
Non Negative Matrix Factorization: For topic modeling

Data Preprocessing 🧹

Clean the Text with TfidfVectorizer

Remove stop words
Tokenize text
Lemmatize/Stem words
Convert to lowercase

Feature Extraction with TfidfVectorizer

Create document-term matrix

Model Training 🧠

Initialize NMF

Set number of topics
Tune hyperparameters

Fit the Model

Train on preprocessed data

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
dataset		dataset
notebook		notebook
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Topic Modeling with Non Negative Matrix Factorization

Libraries Used 📚

Data Preprocessing 🧹

Clean the Text with TfidfVectorizer

Feature Extraction with TfidfVectorizer

Model Training 🧠

Initialize NMF

Fit the Model

About

Releases

Packages

Languages

petroritse1/NMF_model

Folders and files

Latest commit

History

Repository files navigation

Topic Modeling with Non Negative Matrix Factorization

Libraries Used 📚

Data Preprocessing 🧹

Clean the Text with TfidfVectorizer

Feature Extraction with TfidfVectorizer

Model Training 🧠

Initialize NMF

Fit the Model

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages