From 23265f48d1e86e580dea3474b476ef270f259681 Mon Sep 17 00:00:00 2001 From: Titiksha Agrawal <117917014+AgrawalTitiksha@users.noreply.github.com> Date: Tue, 16 Jul 2024 13:48:14 +0530 Subject: [PATCH] Create readme.md --- MLM Analysis of ChatGPT using NLP/Images/readme.md | 10 ++++++++++ 1 file changed, 10 insertions(+) create mode 100644 MLM Analysis of ChatGPT using NLP/Images/readme.md diff --git a/MLM Analysis of ChatGPT using NLP/Images/readme.md b/MLM Analysis of ChatGPT using NLP/Images/readme.md new file mode 100644 index 000000000..150ee59c5 --- /dev/null +++ b/MLM Analysis of ChatGPT using NLP/Images/readme.md @@ -0,0 +1,10 @@ +# EDA Images File +### Distribution of Prompts by Query Type: +This counts the occurrences of different query types in your prompts. We create a new column query_type based on keywords in the prompts and then visualize the counts. +![image](https://github.com/user-attachments/assets/a45a6fc2-c4cc-4edc-8c6f-e3af8b8365af) +### Split Distribution: +This shows the number of samples in each split (e.g., train, validation, test). +![image](https://github.com/user-attachments/assets/ee2cb201-1f03-48c7-ae2a-fbd9bb154314) +### Word Cloud of Prompts: +This generates a word cloud from the text in the prompts to visualize the most common words. +![image](https://github.com/user-attachments/assets/738f3aaa-1e54-4628-bda8-e615a409189d)