Ask HN: Build Your Own LLM?

  • Posted 4 months ago by retube
  • 15 points
The best way to really understand how something works is to build it yourself. So I am wondering if there are any good tutorials on building your own LLM from scratch. I.e. implementing tokenisation, embeddings, attention and so on. I am not suggesting one could replicate chatGPT, but more a toy model that implements the core features but based on a much smaller corpus and training data.

6 comments

    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..