nanoGPT: GPT training and finetuning codebase
Minimal PyTorch implementation for training GPT models.
nanoGPT is a Python-based training framework for GPT-scale language models built on PyTorch. It consists of approximately 300 lines each for the training loop (train.py) and model definition (model.py), with support for loading pretrained GPT-2 weights from OpenAI. The codebase handles data preprocessing, distributed training on multi-GPU setups, and checkpoint management with optional Weights & Biases logging. It is used for training models ranging from character-level networks on small datasets to reproducing GPT-2 (124M parameters) on large text corpora like OpenWebText.
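As a flavor of the data preprocessing step, the character-level datasets are prepared by building a vocabulary of unique characters and encoding the text to integer ids. The snippet below is a minimal stdlib-only sketch of that idea (the repo's actual `prepare.py` scripts additionally write train/val splits to binary files with numpy):

```python
# Minimal sketch of character-level preprocessing, in the spirit of
# nanoGPT's data/<dataset>/prepare.py scripts (illustrative, not the
# real code): build a char vocabulary and encode text to integer ids.
text = "hello nanogpt"

# vocabulary: every unique character, sorted for a stable mapping
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}   # string -> int
itos = {i: ch for ch, i in stoi.items()}       # int -> string

def encode(s):
    return [stoi[c] for c in s]

def decode(ids):
    return "".join(itos[i] for i in ids)

ids = encode(text)
assert decode(ids) == text  # lossless round trip
```

The trained model then predicts the next id in such a sequence; decoding maps generated ids back to text.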
Minimal codebase
The core training and model logic is contained in two approximately 300-line files, making the implementation straightforward to understand and modify without abstraction layers.
Pretrained weight loading
Can load official GPT-2 weights from OpenAI and finetune them on custom datasets, supporting checkpoints up to GPT-2 XL (1.5B parameters) as a starting point.
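One wrinkle in loading these checkpoints is that OpenAI's release stores certain weights for a "Conv1D" module, which are transposed relative to `torch.nn.Linear`, so they must be flipped while copying. The sketch below illustrates that conversion with plain nested lists standing in for tensors; it is not nanoGPT's actual loading code, though the list of transposed parameter names follows the convention in `model.py`'s `from_pretrained`:

```python
# Illustrative sketch of the GPT-2 checkpoint quirk: Conv1D-style
# weights are stored transposed relative to nn.Linear and must be
# flipped when copied into a Linear-based model.

# parameter names whose weights need transposing
TRANSPOSED = ('attn.c_attn.weight', 'attn.c_proj.weight',
              'mlp.c_fc.weight', 'mlp.c_proj.weight')

def transpose(matrix):
    """Transpose a matrix represented as a list of rows."""
    return [list(row) for row in zip(*matrix)]

def convert(checkpoint):
    """Copy a {name: matrix} checkpoint, transposing Conv1D weights."""
    out = {}
    for name, w in checkpoint.items():
        out[name] = transpose(w) if name.endswith(TRANSPOSED) else w
    return out

ckpt = {'h.0.attn.c_attn.weight': [[1, 2, 3], [4, 5, 6]],  # shape (2, 3)
        'h.0.ln_1.weight': [[7, 8]]}                        # left as-is
converted = convert(ckpt)
```

In the real code the same idea is applied tensor-by-tensor with `.t()` while copying the Hugging Face state dict into nanoGPT's module layout.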
Multi-GPU training
Supports distributed training across multiple GPUs with configuration files for different hardware setups, from CPU-only machines to multi-A100 nodes.
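The repo's documented launch pattern uses `torchrun` for distributed data parallel training; commands of roughly this shape (config paths per the repo's `config/` directory) cover the single-GPU and multi-GPU cases:

```sh
# single GPU (or CPU, with the appropriate config overrides)
python train.py config/train_gpt2.py

# one node with 8 GPUs, via PyTorch DDP
torchrun --standalone --nproc_per_node=8 train.py config/train_gpt2.py
```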
Related Repositories
Ray
Unified framework for scaling AI and Python applications from laptops to clusters with distributed runtime.
AI-Trader
LLM agent benchmarking framework for autonomous market trading.
Model Context Protocol Servers
Reference implementations for LLM tool and data integration.
ADK
Modular Python framework for building production AI agents.
OpenHands
LLM agent framework automating development in sandboxed containers.