LLM101n is an educational repository that walks you through building and understanding large language models from first principles. It emphasizes intuition and hands-on implementation, guiding you from tokenization and embeddings through attention, transformer blocks, and sampling. The materials favor compact, readable code and incremental steps, so learners can verify each concept before moving on. You'll see how data pipelines, batching, masking, and positional encodings fit together to train a small GPT-style model end to end. Explanations are paired with runnable notebooks and scripts that encourage experimentation and modification. By the end, the goal is less a polished production system and more an internalized understanding of how LLM components interact to produce coherent text.
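
As a taste of how these components look in code, here is a minimal sketch of single-head causal self-attention with masking in PyTorch. It is illustrative only: the class and variable names are assumptions made for this example, not code taken from the LLM101n repo.

    # Illustrative sketch, not code from LLM101n: single-head causal
    # self-attention, showing how a lower-triangular mask keeps each
    # position from attending to future tokens.
    import math
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class CausalSelfAttention(nn.Module):
        def __init__(self, d_model, max_len=256):
            super().__init__()
            self.qkv = nn.Linear(d_model, 3 * d_model)  # queries, keys, values in one projection
            self.proj = nn.Linear(d_model, d_model)     # output projection
            # position i may attend only to positions <= i
            self.register_buffer("mask", torch.tril(torch.ones(max_len, max_len)).bool())

        def forward(self, x):                           # x: (batch, seq, d_model)
            B, T, C = x.shape
            q, k, v = self.qkv(x).chunk(3, dim=-1)
            att = (q @ k.transpose(-2, -1)) / math.sqrt(C)        # (B, T, T) attention scores
            att = att.masked_fill(~self.mask[:T, :T], float("-inf"))
            att = F.softmax(att, dim=-1)
            return self.proj(att @ v)

    x = torch.randn(2, 8, 32)                           # toy batch of embeddings
    print(CausalSelfAttention(32)(x).shape)             # torch.Size([2, 8, 32])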

Features

  • Step-by-step build of a GPT-style transformer from scratch
  • Clear coverage of tokenization, embeddings, attention, and MLP blocks
  • Runnable code and exercises for experiential learning
  • Demonstrations of batching, masking, and positional encodings
  • Training and sampling loops you can inspect and modify (see the sampling sketch after this list)
  • Emphasis on readability and conceptual understanding over framework magic
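
To make the training-and-sampling bullet concrete, here is a hedged sketch of an autoregressive sampling loop with temperature. The model interface assumed here (token ids in, next-token logits out) is an illustration, not the repo's actual API.

    # Illustrative sketch, not LLM101n's sampling code: generate tokens one
    # at a time from any model mapping ids (batch, seq) to logits
    # (batch, seq, vocab).
    import torch

    @torch.no_grad()
    def sample(model, idx, max_new_tokens, temperature=1.0):
        for _ in range(max_new_tokens):
            logits = model(idx)[:, -1, :] / temperature        # logits for the last position
            probs = torch.softmax(logits, dim=-1)
            next_id = torch.multinomial(probs, num_samples=1)  # draw one token per sequence
            idx = torch.cat([idx, next_id], dim=1)             # append and continue
        return idx

Lower temperatures sharpen the distribution toward greedy decoding, while higher temperatures increase diversity; this is the kind of knob the materials encourage you to experiment with.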

Categories

Education


Additional Project Details

Registered

2025-10-15