How LLMs Actually Work

LLM Application Development

TODO

How LLMs Actually Work

  • text -> tokenizer -> embedding lookup -> transformer -> unembedding -> softmax -> next token

  • Tokenization
    TODO

  • Embeddings

  • Positional encoding

  • Attention

  • feed-forward network

  • next-token prediction

reference

Links to

Mentioned but not linked (4)

Other pages referenced in this note's text. Add [[wikilinks]] to connect them.