Ask HN: Build Your Own LLM?

  • Posted 3 days ago by retube
  • 13 points
The best way to really understand how something works is to build it yourself. So I am wondering if there are any good tutorials on building your own LLM from scratch, i.e. implementing tokenisation, embeddings, attention and so on. I am not suggesting one could replicate ChatGPT; I mean a toy model that implements the core features but is trained on a much smaller corpus.

5 comments
