This repository hosts Django Application Deploying ML and DL Models for Arabic Dialects in texts Classification.
Arabic language many dialects across MENA, and among sources for people to communicate, twitter is a good source where we can get a huge amount of data to represent such diversity. There's a paper discussed this already, and I encourage you to skim through it, you'll find it easy to follow with.
I've used SGDClassifier, which performed quite well relatively to published results in the paper. Also, I've used a pretrained AraBERT v0.2 from HugginFace, which is helpful as they did most of the heavy lifting.
- Nice & fancy UI
- Support TPU training for AraBERT Model
- Apply Out-of-core Classifitcation with proper setup