Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
shweta-yadav15 authored Jun 3, 2020
1 parent e48cb43 commit 52843bc
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,7 @@ The dataset contains 30,000 messages drawn from events including an earthquake i

The data has been encoded with 36 different categories related to disaster response and has been stripped of messages with sensitive information in their entirety.

Disaster response messages dataset consists of imbalanced category labels data. Some labels like aid-related, weather-related have much more examples as compared to other categories. This imbalance might affect the model training as the classes are not represented equally. It can be handled by resampling the dataset or by generating synthetic samples. Although I have not applied these methods for now but I am planning to do it in future.
---

## Installation
Expand Down

0 comments on commit 52843bc

Please sign in to comment.