-
Notifications
You must be signed in to change notification settings - Fork 17
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #3605 from bright/cms/blog/understanding-embedding…
…s-a-short-guide-with-an-example Automatically generated. Merged on Netlify CMS.
- Loading branch information
Showing
5 changed files
with
49 additions
and
0 deletions.
There are no files selected for viewing
49 changes: 49 additions & 0 deletions
49
content/blog/understanding-embeddings-a-short-guide-with-an-example.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,49 @@ | ||
--- | ||
author: pawel-p | ||
tags: | ||
- AI | ||
- Embeddings | ||
date: 2024-01-16T12:08:54.954Z | ||
meaningfullyUpdatedAt: 2024-01-16T12:08:54.972Z | ||
title: "Understanding Embeddings: A Short Guide with an Example" | ||
layout: post | ||
image: /images/embeddings_blog_cover.png | ||
hidden: false | ||
comments: true | ||
published: true | ||
language: en | ||
--- | ||
**Embeddings are an invisible, yet important part of many technologies we encounter. From internet search engines, through recommendation systems and advertisement personalization, to advanced analyses of images, videos, and technologies for speech and sound recognition – embeddings play a key role everywhere. In this article, we will explain how embeddings work and how they facilitate, and enrich our daily experiences with technology.** | ||
|
||
<div className="image">data:image/s3,"s3://crabby-images/58d5c/58d5cbd9eb79682d0b0492d801697915b5fcf2cb" alt="Embedding explained"</div> | ||
|
||
As I mentioned above, vector embeddings are a popular technique to represent information in a format (typically as a vector of numerical values) that can be easily processed by algorithms, especially deep learning models. This ‘information’ can be text, pictures, video, and audio.\ | ||
\ | ||
For example, the conversion of the word 'dog' into a numerical vector representation could look like this: | ||
|
||
<div className="image">data:image/s3,"s3://crabby-images/324be/324be5c4a6511d08480509401e563fd8a035733e" alt="embeddings convert process"</div> | ||
|
||
## What is the embedding dimension? | ||
|
||
**A crucial factor in determining the quality and effectiveness of the embedding is the embedding dimension. Generally, the term 'dimensionality of word embedding' refers to the total count of dimensions used to define a word's vector representation.** This number is usually established during the development of the word embedding and indicates how many distinct features are included in the vector representation of the word. | ||
|
||
For text embeddings, these vectors are constructed in a way that captures the semantic meaning of the text. This ensures that words or sentences conveying similar meanings are close to each other in the embedded space, often referred to as a vector space. | ||
|
||
## What does that mean? Here’s a simple example | ||
|
||
Let's say we have a space with only two dimensions - \[x, y], where x represents sex and y represents activity. | ||
|
||
<div className="image">data:image/s3,"s3://crabby-images/0c00c/0c00cba872510df849a1884650a67ce5db7372db" alt="embeddings dimensional space"</div> | ||
|
||
\ | ||
Now, for example, by asking the question 'Who is walking?' the embeddings will search for vectors that include the 'Walk' sentence in the y dimension. | ||
|
||
<div className="image">data:image/s3,"s3://crabby-images/83385/83385e1a9f8cc10f1dfd36d8ad7e6519591224d6" alt="embeddings dimensional space with similarity comprasion"</div> | ||
|
||
We can observe that the sentence "Walk" is associated with a woman, a man, a boy, and a girl. This means that all of them are walking. And thus, we get our answer: A woman, a man, a boy, and a girl are walking. | ||
|
||
Remember, this is a very simplified example with only two dimensions of meaning. In reality, the more such dimensions there are, the better our embedding is at guessing the meanings of saved sentences as words or phrases. | ||
|
||
## Ready for more embedding tutorials? | ||
|
||
In the upcoming articles, we'll delve deeper into various applications of embedding in AI. Subscribe to our [newsletter](https://bright-dev-newsletter.getresponsepages.com/) to stay updated and not miss out on any insights! |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.