Reinforcement Learning is a trial and error application of ML, in which everytime the
algorithm performs an action it gets rewarded. The overall obejective of the algorithm
is to maximize the reward. It remembers the previously taken actions through two very
important matrices: the R-Matrix and the Q-Matrix.
The R-Matrix maps the current state of the algorithm with the reward it will recibe for
taking a determined action. On the other hand, the Q-Matrix is the "memory" of the algorithm
and maps the rewards of the R-Matrix showing how to achieve the highest reward based on the
previous states.
In this implementation, we will use the graph shown below to find the shortest path between
two nodes. The user needs to enter both the initial and the final node.
-
Notifications
You must be signed in to change notification settings - Fork 0
felipeary91/Short-Path-Using-Q-Learning
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Implementation of a short path finder using Q learning in C language
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published