Ask HN: A Brief History of LLMs

Does anyone have suggestions for a book or an article that goes over the modern history of ML/LLM and how the field reached the inflection point that paved the path to the current state.

9 points | by menomatter 1 day ago

6 comments

  • gabrielsroka 22 hours ago
    Maybe https://youtube.com/playlist?list=PLbg3ZX2pWlgKV8K6bFJr5dhM7...

    Which contains "The 35 Year History of ChatGPT" and "How LLMs Took Over The World"

  • lyfeninja 1 day ago
    Below is the "Attention is all you need" paper. Transformers and their attention mechanism was the major breakthrough for modern LLMs. ML has been around for a long time, I'd suggest joining kaggle or something and learn by doing. You'll retain more and realize how broad the category is anymore.

    https://arxiv.org/abs/1706.03762

  • A_D_E_P_T 1 day ago
    Believe it or not, there is none.

    Somebody ought to write it.

    This is probably closest, but it's not an entertaining narrative history, more of a reference: https://mitpress.mit.edu/9780262552691/large-language-models...

  • haruka9527 1 day ago
    Bookmarking this for later. I had a similar agent debugging mess last week.
  • verdverm 1 day ago
    This is decent on history, good on contemporary: https://www.youtube.com/watch?v=_R83pFpUWyM

    roughly

    1. word2vec ('13)

    2. transformers ('18)

    3. chatgpt ('22)

    4. claude code, i.e. tools / bash (mid '25)

    5. llms trained for agentic workflow (nov '25)

    6. cost reckoning ('26)

    7. open weight models break the financial models of Big Ai ('26?)

    • dserban 2 hours ago
      Adding to your 6 and 7, Ed Zitron's Better Offline podcast has a good series on how the path was paved to the cost reckoning of the present day.
  • haruka9527 14 hours ago
    [dead]