This repo contains Form34A tallies from the Kenya August 2022 eleciton. This data has been collected via crowdsourcing and is not the official results. For official results please see the Independent Electoral and Boundaries Commission website
This data is provided for education purposes only.
In August 2022, elections were held in Kenya to decide the next leaders, the main one being the presidential elections.
The way elections work is, that there are 46,229 polling centers where elections take place. After the voting period ends (typically around 5pm on election day) the polling station officials commence counting of votes.
They then record the tally manually on a physical form. This form is referred to as Form 34A. The form is then uploaded on public portal for transparency. This repo contains the results of an OCR project that focused on extracting those numbers and creating a tally. For more info on the project see this blog post: Using AI to verify Kenya Election Results.
Train an OCR model that correctly identifies the hand written digits. Use the data in this repo ~ forms-export.csv A sample pipeline looks like this.
Download Images -> Crop Images -> Train an OCR model -> Analyse Accuracy -> Submit results here
Get the images from the IEBC website here
When cropping images, you may have to crop individual numbers as most OCR models using deep learning can only read one line at a time.
To submit feel free to create an issue, and share you results and methodology. Please note that the data is truncated and only 30,000 out of the 46,000 forms are provided. This is to enusre no cheating, once submitted you can share you model weights, also a jupyter or colab notebook if possible and I will run on a test the remaining data.
In this economy, we do it the for the glory.