Skip to content

This project aims to identify key topics in Quora questions related to popular applications. We'll use Non-Negative Matrix Factorization (NMF) to extract meaningful themes and patterns from this data.

Notifications You must be signed in to change notification settings

petroritse1/NMF_model

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Topic Modeling with Non Negative Matrix Factorization

A concise guide to uncovering hidden themes in text data.

Libraries Used 📚

  • NLTK: For text preprocessing
  • TfidfVectorizer: To convert text to numerical features
  • Non Negative Matrix Factorization: For topic modeling

Data Preprocessing 🧹

Clean the Text with TfidfVectorizer

  • Remove stop words
  • Tokenize text
  • Lemmatize/Stem words
  • Convert to lowercase

Feature Extraction with TfidfVectorizer

  • Create document-term matrix

Model Training 🧠

Initialize NMF

  • Set number of topics
  • Tune hyperparameters

Fit the Model

  • Train on preprocessed data

About

This project aims to identify key topics in Quora questions related to popular applications. We'll use Non-Negative Matrix Factorization (NMF) to extract meaningful themes and patterns from this data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published