How LLMs Actually Work

2024-11-29

LLM Application Development

TODO

text -> tokenizer -> embedding lookup -> transformer -> unembedding -> softmax -> next token
Tokenization
TODO
Embeddings
Positional encoding
Attention
Feed-forward network
Next-token prediction
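
A minimal sketch of the pipeline at the top of this outline, touching each stage listed above once. Everything in it is an assumption for illustration: the six-word vocabulary, the whitespace tokenizer, the model width `D_MODEL`, the random untrained weights, sinusoidal positions, a single attention head, and reusing the embedding matrix for unembedding. Layer normalization, multiple heads and stacked layers are left out, and the prediction is meaningless because nothing is trained. Only the data flow is the point.

```python
import numpy as np

# Hypothetical toy vocabulary; real tokenizers use learned subword units.
VOCAB = ["<unk>", "the", "cat", "sat", "on", "mat"]
TOKEN_TO_ID = {tok: i for i, tok in enumerate(VOCAB)}
D_MODEL = 8  # assumed model width, chosen only to keep the example small

rng = np.random.default_rng(0)
# Random, untrained weights: outputs are meaningless, only shapes and flow matter.
W_embed = rng.normal(size=(len(VOCAB), D_MODEL))
W_q, W_k, W_v = (rng.normal(size=(D_MODEL, D_MODEL)) for _ in range(3))
W_ff1 = rng.normal(size=(D_MODEL, 4 * D_MODEL))
W_ff2 = rng.normal(size=(4 * D_MODEL, D_MODEL))


def tokenize(text):
    """Tokenizer: split on whitespace and map each word to an id."""
    return [TOKEN_TO_ID.get(w, TOKEN_TO_ID["<unk>"]) for w in text.lower().split()]


def positional_encoding(seq_len, d_model):
    """Sinusoidal position signal, one common choice among several."""
    pos = np.arange(seq_len)[:, None]
    i = np.arange(d_model // 2)[None, :]
    angles = pos / (10000 ** (2 * i / d_model))
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)
    pe[:, 1::2] = np.cos(angles)
    return pe


def softmax(x):
    """Numerically stable softmax over the last axis."""
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)


def causal_self_attention(x):
    """Single-head attention where each position sees only itself and earlier ones."""
    q, k, v = x @ W_q, x @ W_k, x @ W_v
    scores = q @ k.T / np.sqrt(x.shape[-1])
    future = np.triu(np.ones_like(scores, dtype=bool), k=1)
    scores = np.where(future, -np.inf, scores)
    return softmax(scores) @ v


def feed_forward(x):
    """Position-wise two-layer network with a ReLU in between."""
    return np.maximum(0, x @ W_ff1) @ W_ff2


def next_token_probs(text):
    """text -> tokenizer -> embedding lookup -> transformer -> unembedding -> softmax."""
    ids = tokenize(text)
    x = W_embed[ids] + positional_encoding(len(ids), D_MODEL)
    x = x + causal_self_attention(x)  # residual connection around attention
    x = x + feed_forward(x)           # residual connection around the feed-forward network
    logits = x[-1] @ W_embed.T        # unembedding of the last position (tied weights assumed)
    return softmax(logits)


probs = next_token_probs("the cat sat on the")
next_token = VOCAB[int(np.argmax(probs))]  # greedy pick; sampling from probs is the usual alternative
print(next_token)
```

To generate more than one token, append `next_token` to the text and call `next_token_probs` again.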

References

How LLMs Actually Work
LLM Application Development With Python
[Hands-on Large Language Models: Language Understanding and Generation]
