The official implementation of Tensor ProducT ATTenTion Transformer (T6)
-
Updated
Jan 24, 2025 - Python
The official implementation of Tensor ProducT ATTenTion Transformer (T6)
Implementing and training/testing popular model architectures on the CIFAR10 dataset.
A beginner's investigation into the world of neural networks, using the MNIST image dataset
Add a description, image, and links to the model-architectures topic page so that developers can more easily learn about it.
To associate your repository with the model-architectures topic, visit your repo's landing page and select "manage topics."