update website

souzatharsis · Jan 24, 2025 · aa0bbbb · aa0bbbb
1 parent 4f4e91a
commit aa0bbbb
Show file tree

Hide file tree

Showing 10 changed files with 43 additions and 82 deletions.
diff --git a/README.md b/README.md
@@ -12,19 +12,19 @@ Please [open an issue](https://github.com/souzatharsis/tamingLLMs/issues) with y
 
 Abstract: *The current discourse around Large Language Models (LLMs) tends to focus heavily on their capabilities while glossing over fundamental challenges. Conversely, this book takes a critical look at the key limitations and implementation pitfalls that engineers and technical leaders encounter when building LLM-powered applications. Through practical Python examples and proven open source solutions, it provides an introductory yet comprehensive guide for navigating these challenges. The focus is on concrete problems with reproducible code examples and battle-tested open source tools. By understanding these pitfalls upfront, readers will be better equipped to build products that harness the power of LLMs while sidestepping their inherent limitations.*
 
-| Chapter                                   | PDF          | Podcast      | Website      | Notebook      | Status               |
-|-------------------------------------------|--------------|--------------|--------------|---------------|----------------------|
-| Preface                   | [pdf](https://docsend.com/view/vdnwucxnpbtzu9pk) | | [html](https://www.tamingllms.com/markdown/preface.html) | N/A           | *Ready for Review*                   |
-| About the Book                  | [pdf](https://docsend.com/view/h9zrk4su559d2mmk) | | [html](https://www.tamingllms.com/markdown/intro.html) | N/A           | *Ready for Review*                   |
-| Chapter 1: The Evals Gap                | [pdf](https://docsend.com/view/a3s9qk8ix3btaa3u)  | [podcast](https://tamingllm.substack.com/p/chapter-1-podcast-the-evals-gap) | [html](https://www.tamingllms.com/notebooks/evals.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/evals.ipynb) | *Ready for Review*     |
-| Chapter 2: Structured Output| [pdf](https://docsend.com/view/5i88kgvkvtxucj3b) | [podcast](https://tamingllm.substack.com/p/chapter-2-podcast-structured-output) | [html](https://www.tamingllms.com/notebooks/structured_output.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/structured_output.ipynb) |  *Ready for Review*  |
-| Chapter 3: Managing Input Data | [pdf](https://docsend.com/view/dp8b6j8jgpexkvkx) | | [html](https://www.tamingllms.com/notebooks/input.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/input.ipynb) |   *Ready for Review*   | 
-| Chapter 4: Safety              | [pdf](https://docsend.com/view/xzb3f6fqvy5cjzps) | | [html](https://www.tamingllms.com/notebooks/safety.html)  |  [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/safety.ipynb) |  *Ready for Review*   |
-| Chapter 5: Preference-Based Alignment     | [pdf](https://docsend.com/view/fi4z8c4jmzncviwa) | | [html](https://www.tamingllms.com/notebooks/alignment.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/alignment.ipynb) |  *Ready for Review*   |
-| Chapter 6: Local LLMs in Practice | [pdf](https://docsend.com/view/ybbh5nifcazw3bdh) | | [html](https://www.tamingllms.com/notebooks/local.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/local.ipynb) |  *Ready for Review*  |
-| Chapter 7: The Falling Cost Paradox               | | |              |               |        WIP              |
-| Chapter 8: Frontiers                | | |              |               |                 |
-| Appendix A: Tools and Resources           | | |              |               |                      |
+| Chapter                                   | Website      | Notebook      |
+|-------------------------------------------|--------------|---------------|
+| Preface                   | [html](https://www.tamingllms.com/markdown/preface.html) | N/A           |
+| About the Book                  | [html](https://www.tamingllms.com/markdown/intro.html) | N/A           |
+| Chapter 1: The Evals Gap                | [html](https://www.tamingllms.com/notebooks/evals.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/evals.ipynb) |
+| Chapter 2: Structured Output| [html](https://www.tamingllms.com/notebooks/structured_output.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/structured_output.ipynb) |
+| Chapter 3: Managing Input Data | [html](https://www.tamingllms.com/notebooks/input.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/input.ipynb) |
+| Chapter 4: Safety              | [html](https://www.tamingllms.com/notebooks/safety.html)  |  [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/safety.ipynb) |
+| Chapter 5: Preference-Based Alignment     | [html](https://www.tamingllms.com/notebooks/alignment.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/alignment.ipynb) |
+| Chapter 6: Local LLMs in Practice | [html](https://www.tamingllms.com/notebooks/local.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/local.ipynb) |
+| Chapter 7: The Falling Cost Paradox               |              |               |
+| Chapter 8: Frontiers                |              |               |
+| Appendix A: Tools and Resources           |              |               |
 
 ## Citation
 [![CC BY-NC-SA 4.0][cc-by-nc-sa-image]][cc-by-nc-sa]

diff --git a/tamingllms/_build/.doctrees/environment.pickle b/tamingllms/_build/.doctrees/environment.pickle
diff --git a/tamingllms/_build/.doctrees/markdown/toc.doctree b/tamingllms/_build/.doctrees/markdown/toc.doctree
diff --git a/tamingllms/_build/html/_sources/markdown/toc.md b/tamingllms/_build/html/_sources/markdown/toc.md
@@ -16,20 +16,19 @@ Abstract: *The current discourse around Large Language Models (LLMs) tends to fo
 
 ---
 
-(*) *The pdf version is preferred as it contains corrections and side notes.*
-| Chapter                                   | PDF          | Podcast      | Website      | Notebook      | Status               |
-|-------------------------------------------|--------------|--------------|--------------|---------------|----------------------|
-| Preface                   | [pdf](https://docsend.com/view/vdnwucxnpbtzu9pk) | | [html](https://www.tamingllms.com/markdown/preface.html) | N/A           | *Ready for Review*                   |
-| About the Book                  | [pdf](https://docsend.com/view/h9zrk4su559d2mmk) | | [html](https://www.tamingllms.com/markdown/intro.html) | N/A           | *Ready for Review*                   |
-| Chapter 1: The Evals Gap                | [pdf](https://docsend.com/view/a3s9qk8ix3btaa3u)  | [podcast](https://tamingllm.substack.com/p/chapter-1-podcast-the-evals-gap) | [html](https://www.tamingllms.com/notebooks/evals.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/evals.ipynb) | *Ready for Review*     |
-| Chapter 2: Structured Output| [pdf](https://docsend.com/view/5i88kgvkvtxucj3b) | [podcast](https://tamingllm.substack.com/p/chapter-2-podcast-structured-output) | [html](https://www.tamingllms.com/notebooks/structured_output.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/structured_output.ipynb) |  *Ready for Review*  |
-| Chapter 3: Managing Input Data | [pdf](https://docsend.com/view/dp8b6j8jgpexkvkx) | | [html](https://www.tamingllms.com/notebooks/input.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/input.ipynb) |   *Ready for Review*   | 
-| Chapter 4: Safety              | [pdf](https://docsend.com/view/xzb3f6fqvy5cjzps) | | [html](https://www.tamingllms.com/notebooks/safety.html)  |  [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/safety.ipynb) |  *Ready for Review*   |
-| Chapter 5: Preference-Based Alignment     | [pdf](https://docsend.com/view/fi4z8c4jmzncviwa) | | [html](https://www.tamingllms.com/notebooks/alignment.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/alignment.ipynb) |  *Ready for Review*   |
-| Chapter 6: Local LLMs in Practice | [pdf](https://docsend.com/view/ybbh5nifcazw3bdh) | | [html](https://www.tamingllms.com/notebooks/local.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/local.ipynb) |  *Ready for Review*  |
-| Chapter 7: The Falling Cost Paradox               | | |              |               |        WIP              |
-| Chapter 8: Frontiers                | | |              |               |                 |
-| Appendix A: Tools and Resources           | | |              |               |                      |
+| Chapter                                   | Website      | Notebook      |
+|-------------------------------------------|--------------|---------------|
+| Preface                   | [html](https://www.tamingllms.com/markdown/preface.html) | N/A           |
+| About the Book                  | [html](https://www.tamingllms.com/markdown/intro.html) | N/A           |
+| Chapter 1: The Evals Gap                | [html](https://www.tamingllms.com/notebooks/evals.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/evals.ipynb) |
+| Chapter 2: Structured Output| [html](https://www.tamingllms.com/notebooks/structured_output.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/structured_output.ipynb) |
+| Chapter 3: Managing Input Data | [html](https://www.tamingllms.com/notebooks/input.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/input.ipynb) |
+| Chapter 4: Safety              | [html](https://www.tamingllms.com/notebooks/safety.html)  |  [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/safety.ipynb) |
+| Chapter 5: Preference-Based Alignment     | [html](https://www.tamingllms.com/notebooks/alignment.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/alignment.ipynb) |
+| Chapter 6: Local LLMs in Practice | [html](https://www.tamingllms.com/notebooks/local.html) | [ipynb](https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/local.ipynb) |
+| Chapter 7: The Falling Cost Paradox               |              |               |
+| Chapter 8: Frontiers                |              |               |
+| Appendix A: Tools and Resources           |              |               |
 
 
 [![CC BY-NC-SA 4.0][cc-by-nc-sa-image]][cc-by-nc-sa]

diff --git a/tamingllms/_build/html/markdown/toc.html b/tamingllms/_build/html/markdown/toc.html
@@ -239,94 +239,57 @@ <h1><a class="reference external" href="https://www.tamingllms.com">Taming LLMs<
 <h2><em>A Practical Guide to LLM Pitfalls with Open Source Software</em><a class="headerlink" href="#a-practical-guide-to-llm-pitfalls-with-open-source-software" title="Permalink to this heading">¶</a></h2>
 <p>Abstract: <em>The current discourse around Large Language Models (LLMs) tends to focus heavily on their capabilities while glossing over fundamental challenges. Conversely, this book takes a critical look at the key limitations and implementation pitfalls that engineers and technical leaders encounter when building LLM-powered applications. Through practical Python examples and proven open source solutions, it provides an introductory yet comprehensive guide for navigating these challenges. The focus is on concrete problems with reproducible code examples and battle-tested open source tools. By understanding these pitfalls upfront, readers will be better equipped to build products that harness the power of LLMs while sidestepping their inherent limitations.</em></p>
 <hr class="docutils" />
-<p>(*) <em>The pdf version is preferred as it contains corrections and side notes.</em></p>
 <table class="docutils align-default">
 <thead>
 <tr class="row-odd"><th class="head"><p>Chapter</p></th>
-<th class="head"><p>PDF</p></th>
-<th class="head"><p>Podcast</p></th>
 <th class="head"><p>Website</p></th>
 <th class="head"><p>Notebook</p></th>
-<th class="head"><p>Status</p></th>
 </tr>
 </thead>
 <tbody>
 <tr class="row-even"><td><p>Preface</p></td>
-<td><p><a class="reference external" href="https://docsend.com/view/vdnwucxnpbtzu9pk">pdf</a></p></td>
-<td><p></p></td>
 <td><p><a class="reference external" href="https://www.tamingllms.com/markdown/preface.html">html</a></p></td>
 <td><p>N/A</p></td>
-<td><p><em>Ready for Review</em></p></td>
 </tr>
 <tr class="row-odd"><td><p>About the Book</p></td>
-<td><p><a class="reference external" href="https://docsend.com/view/h9zrk4su559d2mmk">pdf</a></p></td>
-<td><p></p></td>
 <td><p><a class="reference external" href="https://www.tamingllms.com/markdown/intro.html">html</a></p></td>
 <td><p>N/A</p></td>
-<td><p><em>Ready for Review</em></p></td>
 </tr>
 <tr class="row-even"><td><p>Chapter 1: The Evals Gap</p></td>
-<td><p><a class="reference external" href="https://docsend.com/view/a3s9qk8ix3btaa3u">pdf</a></p></td>
-<td><p><a class="reference external" href="https://tamingllm.substack.com/p/chapter-1-podcast-the-evals-gap">podcast</a></p></td>
 <td><p><a class="reference external" href="https://www.tamingllms.com/notebooks/evals.html">html</a></p></td>
 <td><p><a class="reference external" href="https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/evals.ipynb">ipynb</a></p></td>
-<td><p><em>Ready for Review</em></p></td>
 </tr>
 <tr class="row-odd"><td><p>Chapter 2: Structured Output</p></td>
-<td><p><a class="reference external" href="https://docsend.com/view/5i88kgvkvtxucj3b">pdf</a></p></td>
-<td><p><a class="reference external" href="https://tamingllm.substack.com/p/chapter-2-podcast-structured-output">podcast</a></p></td>
 <td><p><a class="reference external" href="https://www.tamingllms.com/notebooks/structured_output.html">html</a></p></td>
 <td><p><a class="reference external" href="https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/structured_output.ipynb">ipynb</a></p></td>
-<td><p><em>Ready for Review</em></p></td>
 </tr>
 <tr class="row-even"><td><p>Chapter 3: Managing Input Data</p></td>
-<td><p><a class="reference external" href="https://docsend.com/view/dp8b6j8jgpexkvkx">pdf</a></p></td>
-<td><p></p></td>
 <td><p><a class="reference external" href="https://www.tamingllms.com/notebooks/input.html">html</a></p></td>
 <td><p><a class="reference external" href="https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/input.ipynb">ipynb</a></p></td>
-<td><p><em>Ready for Review</em></p></td>
 </tr>
 <tr class="row-odd"><td><p>Chapter 4: Safety</p></td>
-<td><p><a class="reference external" href="https://docsend.com/view/xzb3f6fqvy5cjzps">pdf</a></p></td>
-<td><p></p></td>
 <td><p><a class="reference external" href="https://www.tamingllms.com/notebooks/safety.html">html</a></p></td>
 <td><p><a class="reference external" href="https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/safety.ipynb">ipynb</a></p></td>
-<td><p><em>Ready for Review</em></p></td>
 </tr>
 <tr class="row-even"><td><p>Chapter 5: Preference-Based Alignment</p></td>
-<td><p><a class="reference external" href="https://docsend.com/view/fi4z8c4jmzncviwa">pdf</a></p></td>
-<td><p></p></td>
 <td><p><a class="reference external" href="https://www.tamingllms.com/notebooks/alignment.html">html</a></p></td>
 <td><p><a class="reference external" href="https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/alignment.ipynb">ipynb</a></p></td>
-<td><p><em>Ready for Review</em></p></td>
 </tr>
 <tr class="row-odd"><td><p>Chapter 6: Local LLMs in Practice</p></td>
-<td><p><a class="reference external" href="https://docsend.com/view/ybbh5nifcazw3bdh">pdf</a></p></td>
-<td><p></p></td>
 <td><p><a class="reference external" href="https://www.tamingllms.com/notebooks/local.html">html</a></p></td>
 <td><p><a class="reference external" href="https://github.com/souzatharsis/tamingLLMs/blob/master/tamingllms/notebooks/local.ipynb">ipynb</a></p></td>
-<td><p><em>Ready for Review</em></p></td>
 </tr>
 <tr class="row-even"><td><p>Chapter 7: The Falling Cost Paradox</p></td>
 <td><p></p></td>
 <td><p></p></td>
-<td><p></p></td>
-<td><p></p></td>
-<td><p>WIP</p></td>
 </tr>
 <tr class="row-odd"><td><p>Chapter 8: Frontiers</p></td>
 <td><p></p></td>
 <td><p></p></td>
-<td><p></p></td>
-<td><p></p></td>
-<td><p></p></td>
 </tr>
 <tr class="row-even"><td><p>Appendix A: Tools and Resources</p></td>
 <td><p></p></td>
 <td><p></p></td>
-<td><p></p></td>
-<td><p></p></td>
-<td><p></p></td>
 </tr>
 </tbody>
 </table>

diff --git a/tamingllms/_build/html/searchindex.js b/tamingllms/_build/html/searchindex.js
diff --git a/tamingllms/_build/jupyter_execute/markdown/intro.ipynb b/tamingllms/_build/jupyter_execute/markdown/intro.ipynb
@@ -2,7 +2,7 @@
  "cells": [
   {
    "cell_type": "markdown",
-   "id": "87ed3c40",
+   "id": "789163a3",
    "metadata": {},
    "source": [
     "(intro)=\n",

diff --git a/tamingllms/latex/chapters/alignment.tex b/tamingllms/latex/chapters/alignment.tex
@@ -723,7 +723,7 @@ \subsection{User Prompts}
 \subsection{Rejected Responses}
 
 
-We create sample rejected responses from an unaligned base model, which here we use\texttt{SmolLM2-360M-Instruct}~\sidenote[][*1]{
+We create sample rejected responses from an unaligned base model, which here we use \texttt{SmolLM2-360M-Instruct}~\sidenote[][*1]{
 The \texttt{ResponseGenerator} class creates a dataset of responses from an unaligned base model that we aim to improve through fine-tuning. These responses serve as ``rejected'' examples in our training data since they may not properly align with safety policies and guidelines. The class supports both local model inference using the Hugging Face Transformers library and remote inference through the Hugging Face Inference API. When instantiated with a model name, it loads the model locally. Otherwise, if a cloud API URL is provided, it connects to the remote API endpoint for inference.
 }.
 

diff --git a/tamingllms/latex/chapters/safety.tex b/tamingllms/latex/chapters/safety.tex
@@ -1136,7 +1136,7 @@ \subsubsection{LLM-Based Safety Filtering}
 \label{granite}
 \end{figure}
 
-The industry is increasingly focusing on the fine-tuning of pre-trained base models targeting a specific dimension of requirements and standards, here Safety being a critical one. This trend encompasses the release of open-source, fine-tuned safety models that can act as protective guardrails for LLM applications, as exemplified by LLaMa-Guard and IBM Granite Guardian. Additionally, there is a notable rise in models fine-tuned through techniques such as Reinforcement Learning from Human Feedback (RLHF), utilizing human preference datasets that incorporate safety considerations. These specialized models can function as safety filters as discussed but also as main models that alone could accomplished their original intended task safely without the need of external filters. We will cover this specific topic in the Chapter \ref{alignment}, where we will explore the process of aligning language models with human preferences ultimately leading to the development of an open source fine-tuned model that complies with user provided policy-based requirements.
+The industry is increasingly focusing on the fine-tuning of pre-trained base models targeting a specific dimension of requirements and standards, here Safety being a critical one. This trend encompasses the release of open-source, fine-tuned safety models that can act as protective guardrails for LLM applications, as exemplified by LLaMa-Guard and IBM Granite Guardian. Additionally, there is a notable rise in models fine-tuned through techniques such as Reinforcement Learning from Human Feedback (RLHF), utilizing human preference datasets that incorporate safety considerations. These specialized models can function as safety filters as discussed but also as main models that alone could accomplished their original intended task safely without the need of external filters. We will cover this specific topic in the Chapter \ref{chapter:alignment}, where we will explore the process of aligning language models with human preferences ultimately leading to the development of an open source fine-tuned model that complies with user provided policy-based requirements.
 
 \subsection{Custom Moderation}
 \label{custom-moderation}