Building LLMs from the Ground Up: A 3-hour Coding Workshop

Author: Sebastian Raschka
Published: 31/8/2024
Views: 117.6K
Description:
This tutorial is aimed at coders interested in understanding the building blocks of large language models (LLMs), how LLMs work, and how to code them from the ground up in PyTorch. We will kick off this tutorial with an introduction to LLMs, recent milestones, and their use cases. Then, we will code a small GPT-like LLM ourselves, including its data input pipeline, core architecture components, and pretraining code. After understanding how everything fits together and how to pretrain an LLM, we will learn how to load pretrained weights and finetune LLMs using open-source libraries.

