TF Reader is an offline alternative to TensorBoard that I created while working on a Reinforcement Learning project. I found TensorBoard to be limited for the use I had in mind, specifically with regard to the thorough division of Scalars and the ability to display custom plots. This project also features Azure ttk theme.
Although the scope of this app is narrow and its usage is constrained by my personal use cases, I plan to maintain this project in my free time and share it, as it may be helpful to others.
TF Reader is built using the tkinter
GUI and embedded with matplotlib
to plot training results from TensorBoard binary files using tbparse
SummaryReader. Since loading all detected files at the start of the program would take too long, the program saves every training session and relevant file path in an internal dataclass and only loads the data when it's requested for plotting.
Currently, the app only supports plotting Scalars, but more features will be added in the future.
The features implemented at the moment are the following:
- Automatic training session retrieval:
- Predefined folder structure to allow automatic search
- Training sessions are identified by model tags, reward tags and size
- Further selection based on the Scalar Tag
- Multiple plots:
- The window can now be rescaled, allowing for bigger plots
- Curves on the plot can be smoothed via slider, similar to TensorBoard
- Full capabilities of the
matplotlib
plot toolbar - Now multiple plots (up to 6) corresponding to different scalar tags can be displayed
- Each scalar plotted is being kept track:
- Plotted scalars are displayed along with its model and reward tag, and size
- The labels are ordered based on how well the model performed on average during testing
- Hovering the name will highlight each plotted line
- Info window can be opened to view the training session hyperparameters
- An ordered list of the plotted scalars can be saved to a .txt file
By being born from a Reainforcement Learning need, tf_reader
is based on a particular folder structure.
.
└── trainings_dir/
├── Env [MT1] [MT2] [...] | [RT1] [RT2] [...]/
| ├── YYYYMMDD-HHMMSS/
| | └── dir/
| | ├──-ModelName-TPID/
| | | ├──tf_events_file
| | | └──best-model.pt
| | └──ModelName-TPID.params
| ├── YYYYMMDD-HHMMSS/
| | └── ...
| └── YYYYMMDD-HHMMSS/
| └── ...
├── Env [MT1] [MT2] [...] | [RT1] [RT2] [...]/
| ├── YYYYMMDD-HHMMSS/
| | └── ...
| ├── YYYYMMDD-HHMMSS/
| | └── ...
| └── YYYYMMDD-HHMMSS/
| └── ...
└── ...
Despite sounding complex, the directory tree used in this project is simply a fancy way of organizing different trainings. The first encountered folder contains the name of the environment (in this case, Gymnasium environments), model tags (such as MT1, indicating changes to the model like modifications to the number of layers), and reward tags (such as [RT1], indicating changes to the reward given to the agent).
For each model and reward variation, there are multiple sessions, each containing model checkpoints, tf_events files (which can be uploaded to TensorBoard), and a defining .params file.
The .params file is generated by saving a custom dataclass containing hyperparameters and other values of interest for the scope of the training.
The params.py
file includes the dataclass I used for the training. However, given the project's structure, any dataclass can be used without editing the code, as long as it uses the same render_to_string
function.
Since the network size is obtained from the .params file, the option entries in the program are currently hardcoded. Therefore, editing the code is necessary to use the program with different network sizes. However, I plan to fix this issue soon, as it's easy to implement and will greatly improve the versatility of the program.
The first thing needed is tkinter
backend; to install use sudo apt install python3-tk
.
Then run pip install -r requirements.txt
to install all the requirements needed.
To run TF Reader, simply run the command python3 main.py ./path/to/trainings