
Title: Building LLMs from the Ground Up

Category: Tutorial

URL:

Authors: Sebastian Raschka

Published: 31 August 2024

Review: Dennis Kuriakose

Review Date: 6 September 2024

Summary

This tutorial is aimed at coders interested in understanding the building blocks of large language models (LLMs), how LLMs work, and how to code them from the ground up in PyTorch.

Review & Notes: 

DESCRIPTION:


This tutorial is aimed at coders interested in understanding the building blocks of large language models (LLMs), how LLMs work, and how to code them from the ground up in PyTorch. We will kick off this tutorial with an introduction to LLMs, recent milestones, and their use cases. Then, we will code a small GPT-like LLM ourselves, including its data input pipeline, core architecture components, and pretraining code. After understanding how everything fits together and how to pretrain an LLM, we will learn how to load pretrained weights and finetune LLMs using open-source libraries.
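
To give a flavor of what the data input pipeline involves, here is a minimal sketch of the sliding-window batching used to build next-token training pairs. It assumes the tiktoken and PyTorch packages; the class name and hyperparameters are illustrative, not the workshop's own.

# Minimal sketch: turning raw text into (input, target) pairs for
# next-token prediction with a sliding window.
import tiktoken
import torch
from torch.utils.data import Dataset, DataLoader

class TextDataset(Dataset):  # illustrative name, not the workshop's
    def __init__(self, text, context_length=32, stride=32):
        enc = tiktoken.get_encoding("gpt2")  # GPT-2 BPE tokenizer
        ids = enc.encode(text)
        self.inputs, self.targets = [], []
        # The target window is the input window shifted right by one token.
        for i in range(0, len(ids) - context_length, stride):
            self.inputs.append(torch.tensor(ids[i:i + context_length]))
            self.targets.append(torch.tensor(ids[i + 1:i + context_length + 1]))

    def __len__(self):
        return len(self.inputs)

    def __getitem__(self, idx):
        return self.inputs[idx], self.targets[idx]

loader = DataLoader(TextDataset("some long training text ... " * 100),
                    batch_size=4, shuffle=True)
x, y = next(iter(loader))
print(x.shape, y.shape)  # torch.Size([4, 32]) for both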


OUTLINE:


0:00 – Workshop overview

2:17 – Part 1: Intro to LLMs

9:14 – Workshop materials

10:48 – Part 2: Understanding LLM input data

23:25 – A simple tokenizer class (see sketch below)

41:03 – Part 3: Coding an LLM architecture (see sketch below)

45:01 – GPT-2 and Llama 2

1:07:11 – Part 4: Pretraining (see sketch below)

1:29:37 – Part 5.1: Loading pretrained weights

1:45:12 – Part 5.2: Pretrained weights via LitGPT (see sketch below)

1:53:09 – Part 6.1: Instruction finetuning (see sketch below)

2:08:21 – Part 6.2: Instruction finetuning via LitGPT

2:26:45 – Part 6.3: Benchmark evaluation

2:36:55 – Part 6.4: Evaluating conversational performance

2:42:40 – Conclusion
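
CODE SKETCHES:


For the simple tokenizer class in Part 2, a from-scratch word-level tokenizer looks roughly like the following. This is a hedged reconstruction of the general idea, not the workshop's exact code; the regex split and the <|unk|> fallback are illustrative choices.

# Sketch of a minimal vocabulary-based tokenizer that encodes and
# decodes via lookup tables built from the training text.
import re

class SimpleTokenizer:  # illustrative name
    def __init__(self, text):
        tokens = re.findall(r"\w+|[^\w\s]", text)
        vocab = sorted(set(tokens)) + ["<|unk|>"]  # unknown-word fallback
        self.str_to_int = {tok: i for i, tok in enumerate(vocab)}
        self.int_to_str = {i: tok for tok, i in self.str_to_int.items()}

    def encode(self, text):
        tokens = re.findall(r"\w+|[^\w\s]", text)
        unk = self.str_to_int["<|unk|>"]
        return [self.str_to_int.get(tok, unk) for tok in tokens]

    def decode(self, ids):
        # Naive spacing: real detokenizers handle punctuation more carefully.
        return " ".join(self.int_to_str[i] for i in ids)

tok = SimpleTokenizer("Hello, world. This is a test.")
print(tok.encode("Hello, test!"))  # "!" is unseen, so it maps to <|unk|>
print(tok.decode(tok.encode("Hello, world.")))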
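
For Part 3, a toy GPT-style architecture: token plus position embeddings feeding a stack of pre-norm transformer blocks with causal self-attention, followed by a linear output head. The hyperparameters and layer layout are a simplified sketch, not the workshop's exact model.

# Sketch of a toy GPT-style model in PyTorch.
import torch
import torch.nn as nn

class Block(nn.Module):
    def __init__(self, d, n_heads):
        super().__init__()
        self.norm1 = nn.LayerNorm(d)
        self.attn = nn.MultiheadAttention(d, n_heads, batch_first=True)
        self.norm2 = nn.LayerNorm(d)
        self.mlp = nn.Sequential(nn.Linear(d, 4 * d), nn.GELU(), nn.Linear(4 * d, d))

    def forward(self, x):
        T = x.size(1)
        # Causal mask: True marks positions a token may NOT attend to (the future).
        mask = torch.triu(torch.ones(T, T, dtype=torch.bool, device=x.device), 1)
        h = self.norm1(x)
        attn_out, _ = self.attn(h, h, h, attn_mask=mask)
        x = x + attn_out                 # residual connection around attention
        x = x + self.mlp(self.norm2(x))  # residual connection around the MLP
        return x

class ToyGPT(nn.Module):
    def __init__(self, vocab_size, context_length=32, d=128, n_heads=4, n_layers=2):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d)
        self.pos_emb = nn.Embedding(context_length, d)
        self.blocks = nn.Sequential(*[Block(d, n_heads) for _ in range(n_layers)])
        self.norm = nn.LayerNorm(d)
        self.head = nn.Linear(d, vocab_size, bias=False)

    def forward(self, idx):
        T = idx.size(1)
        x = self.tok_emb(idx) + self.pos_emb(torch.arange(T, device=idx.device))
        x = self.blocks(x)
        return self.head(self.norm(x))  # logits of shape (batch, T, vocab_size)

model = ToyGPT(vocab_size=50257)  # GPT-2's vocabulary size
logits = model(torch.randint(0, 50257, (4, 32)))
print(logits.shape)  # torch.Size([4, 32, 50257])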
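
For Part 4, the heart of pretraining is a standard next-token cross-entropy loop. This sketch reuses the model and loader from the sketches above; the optimizer settings are illustrative, not the workshop's.

# Sketch of the core pretraining loop: cross-entropy between the
# model's next-token logits and the right-shifted targets.
import torch
import torch.nn.functional as F

optimizer = torch.optim.AdamW(model.parameters(), lr=5e-4, weight_decay=0.1)

for epoch in range(3):
    for x, y in loader:    # x, y: (batch, T) token ids, y shifted by one
        logits = model(x)  # (batch, T, vocab_size)
        loss = F.cross_entropy(logits.flatten(0, 1), y.flatten())
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    print(f"epoch {epoch}: last-batch loss {loss.item():.3f}")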
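
For Part 5.2, loading pretrained weights via LitGPT can be done through its high-level Python API, as shown in the LitGPT README (https://github.com/Lightning-AI/litgpt). Exact signatures may differ between versions, so treat this as a starting point rather than the workshop's exact code.

# Sketch: loading a pretrained model and generating text with LitGPT.
from litgpt import LLM

llm = LLM.load("microsoft/phi-2")  # downloads the weights on first use
print(llm.generate("What do llamas eat?", max_new_tokens=50))

The workshop also drives LitGPT from the command line (litgpt download, litgpt finetune, litgpt chat); the repo README documents the current syntax.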
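
For Part 6.1, instruction finetuning starts by formatting each example into a single prompt string. The sketch below uses the widely used Alpaca-style template; treating that as the workshop's convention is an assumption here, and the instruction/input field names are the standard Alpaca keys rather than anything taken from the video.

# Sketch of Alpaca-style prompt formatting for instruction finetuning.
# The template and field names follow the common Alpaca convention,
# which is an assumption here, not necessarily the workshop's format.
def format_prompt(entry):
    prompt = (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n\n"
        f"### Instruction:\n{entry['instruction']}\n\n"
    )
    if entry.get("input"):  # optional context field
        prompt += f"### Input:\n{entry['input']}\n\n"
    return prompt + "### Response:\n"

example = {"instruction": "Rewrite the sentence in passive voice.",
           "input": "The chef cooked the meal."}
print(format_prompt(example))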


REFERENCES:


1. Build an LLM from Scratch book: https://mng.bz/M96o 

2. Build an LLM from Scratch repo: https://github.com/rasbt/LLMs-from-sc... 

3. GitHub repository with workshop code: https://github.com/rasbt/LLM-workshop... 

4. Lightning Studio for this workshop: https://lightning.ai/lightning-ai/stu... 

5. LitGPT: https://github.com/Lightning-AI/litgpt 
