- This repo contains a GPT-style language model built from scratch and trained on an OpenWebCorpus dataset of Reddit comments and reviews.
- The model and environment were set up with ipykernel and CUDA, using a custom Jupyter kernel. Feel free to use your default python3 kernel instead, but keep CUDA available for faster data processing and training.
- The code is inspired by and adapted from Andrej Karpathy's explanation of how LLMs are built. Feel free to use the pretrained model, or use the code to train on your own dataset!
- Because this model was trained on the OpenWebCorpus dataset, unzipping all the files and creating vocab.txt works as follows: run the model training first, then run the extract.py script. After that, save the changes, then generate text with your model.
- For the last step, run the final_bot.py script and start typing prompts to generate text.
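
The role of vocab.txt can be illustrated with a minimal sketch. It assumes a character-level vocabulary, as in Karpathy-style tutorials; the repo does not confirm this. The helper names (`build_vocab`, `save_vocab`, `load_vocab`, `make_codec`) and the file layout (all unique characters written as one contiguous string) are hypothetical and are not taken from extract.py.

```python
def build_vocab(texts):
    """Collect the sorted list of unique characters across all input texts."""
    chars = set()
    for text in texts:
        chars.update(text)
    return sorted(chars)


def save_vocab(chars, path):
    """Write the vocabulary as one contiguous string (an assumed file layout)."""
    # newline="" disables newline translation so "\n" and "\r" survive as-is.
    with open(path, "w", encoding="utf-8", newline="") as f:
        f.write("".join(chars))


def load_vocab(path):
    """Read the vocabulary file back into a sorted list of characters."""
    with open(path, "r", encoding="utf-8", newline="") as f:
        return sorted(set(f.read()))


def make_codec(chars):
    """Return encode/decode functions mapping characters to integer ids and back."""
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for i, ch in enumerate(chars)}

    def encode(text):
        # Raises KeyError for characters that are not in the vocabulary.
        return [stoi[ch] for ch in text]

    def decode(ids):
        return "".join(itos[i] for i in ids)

    return encode, decode
```

With a vocabulary built this way, `encode` turns a prompt into the integer ids a model consumes, and `decode` turns generated ids back into text. A character that never appeared in the training text raises a `KeyError`, which is one reason the vocabulary is usually built from the full corpus.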
Nash1818/SmartGPT