Skip to content

Code repository for Mattias Beming and Ludvig Joborn's master thesis project.

License

Notifications You must be signed in to change notification settings

Ludvig-Joborn/Thesis-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

98 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Thesis Project

An AI and Machine Learning project developed by Mattias Beming and Ludvig Joborn (former students at Linköping University).

The thesis work was conducted during the spring of 2022 in collaboration with The Swedish Police Authority - National Forensic Centre (sv. Polismyndigheten - Nationellt Forensiskt Centrum).

The thesis is mostly concerned with Deep Learning in the context of Voice Activity Detection; that is, using neural networks to find voice activity in audio recordings. It also briefly touches the area of Sound Event Detection. If these areas are of interest to you, feel free to check out our published thesis!

Our thesis has been published at DiVA. You can find the publication here.

Overview

Architecture

A simplified image of the project architecture is shown below. Understanding the architecture is not necessary for simple usage, but will hopefully be helpful to anyone that desires to make deeper use of the project material.

Arrows indicate code-wise dependencies.

Note: evaluate.py has been split up into validate.py, eval.py and predict.py, but they all fit where evaluate.py is currently placed.

architecture

Setup

Instructions for setting up the project is found in setup.md.

Acquire and Build Datasets

The instructions on how to download the datasets used is found in datasets.md.

Run Code

Instructions to run code can be found in run.md.

Citation

If you find the code useful in your projects or research, consider citing our thesis:

@mastersthesis{Joborn-Beming,
   author = {Joborn, Ludvig and Beming, Mattias},
   institution = {Linköping University, Department of Computer and Information Science},
   pages = {68},
   school = {Linköping University, Department of Computer and Information Science},
   title = {Towards a Nuanced Evaluation of Voice Activity Detection Systems : An Examination of Metrics, Sampling Rates and Noise with Deep Learning},
   year = {2022}
}

About

Code repository for Mattias Beming and Ludvig Joborn's master thesis project.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages