This project uses supervised learning techniques for classification of households into two groups for payments via cash transfers
This project requires *Python 3.6. ** and the following Python libraries installed:
The dataset consists of 100,000, with 18 features. The dataset can be downloaded from this link on github
Features
Gender
: Male, FemaleAge
: continuous.Attended_school
: Currently attending, Never, Yes but not now, SKIPWork_last_7days
(had work at time of registration ?): SKIP, Worked for pay, On leave, Sick leave, Worked on own/family, business, Herding-Worked on own/family agri. holding, Seeking work, Doing nothing, Retired, Homemaker, Full-time, student, In capacitated, OtherChronic_illness
: 0 - false, 1- true, 2-skipDisabled
: 0- false, 1- trueYearsChronicallyIll
: continuousWealthGroup
: Better Off, Middle, Poor, Very PoorResident_Provider
: SKIP, All live here all of the time, Some live here while others migrate, All migrate together-fully mobile, Inside this household, Outside household but inside compound-settlement, Outside this compound-settlementPolygamous
: discreteChildren_Under_15
: discretekids_under_15_in_settlement
: discretewives_in_settlement
: discretespouses_outside_hh
: discreteToilet
: Flush toilet, VIP latrine, Uncovered pit latrine, Covered pit latrine, Bucket/pan, Bush-None, OtherDrinking_water
: Piped water inside dwelling, Piped water into plot/yard, Public tap, Tube well/borehole with pump, Protected dug well, Protected spring, Rainwater collection, Unprotected dug well/springs, River, Lake, ponds or similar, Water truck/vendor, Bottled water, OtherLand_Ha
: discrete
Target Variable
HHGroup
: categorical (True, False)