Back to Home

What Are Word Embeddings?

vie v3

Source: https://www.youtube.com/watch?v=hVM8qGRTaOA

Author: Under The Hood

Published At: 2025-02-22T00:00:00

Length: 19:32

Summary

Description

#word2vec #llm

Converting text into numbers is the first step in training any machine learning model for NLP tasks. While one-hot encoding and bag of words provide simple ways to represent text as numbers, they lack semantic and contextual understanding of words, making them unsuitable for NLP tasks like language translation and text generation. Embeddings help represent words as vectors that capture their semantic meaning.

In this video, I provide a detailed explanation of embeddings and popular embedding techniques like Word2Vec, along with custom embeddings used in Transformer architectures for language generation and other NLP tasks.

Watch Videos in Understanding Large Language Model:

1) Introduction to Large Language Model

https://youtu.be/NLOBYtfdxuM?si=PyVqqNLFsbRPvBI6

2) Preparing dataset and Tokenization

https://youtu.be/bNjVxUDZQfM?si=81fHwXskdl9abdZy

Timestamp:

0:00 - Intro

0:20 - Representing image into numbers

0:54 - Representing text into numbers

2:20 - One Hot Encoding

3:40 - Bag of Words (Unigram, Bigram and N-Gram)

4:59 - Semantic and Contextual Understanding of text