This repository contains the code and model weights for GPT-2, a large-scale unsupervised language model described in the OpenAI paper “Language Models are Unsupervised Multitask Learners.” It is intended as a starting point for researchers and engineers experimenting with GPT-2: generating text, fine-tuning on custom datasets, or studying the model's behavior and internals. The repository includes scripts for sampling, training, and downloading pre-trained models, along with utilities for tokenization and model handling.
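As a rough illustration of the generation workflow, the sketch below loads a downloaded checkpoint and samples a short continuation with TensorFlow 1.x. It is a minimal sketch modeled on the repository's sample scripts, not the official procedure: the module paths, the `get_encoder` arguments, and the `117M` checkpoint name are assumptions that may need adjusting to your local checkout (a checkpoint must already be downloaded, e.g. via the repository's download script).

```python
# Minimal text-generation sketch (TensorFlow 1.x), modeled on the repo's
# sample scripts. Paths, the "117M" checkpoint name, and the get_encoder
# signature are assumptions -- adjust to your local setup.
import json
import os
import sys

import tensorflow as tf

sys.path.insert(0, "src")          # the repo keeps its modules under src/ (assumed layout)
import model, sample, encoder      # noqa: E402

model_name, models_dir = "117M", "models"

# Tokenizer and hyperparameters shipped alongside the downloaded checkpoint
enc = encoder.get_encoder(model_name, models_dir)
hparams = model.default_hparams()
with open(os.path.join(models_dir, model_name, "hparams.json")) as f:
    hparams.override_from_dict(json.load(f))

with tf.Session() as sess:
    context = tf.placeholder(tf.int32, [1, None])
    output = sample.sample_sequence(
        hparams=hparams, length=40, context=context,
        batch_size=1, temperature=1.0, top_k=40)

    # Restore the pre-trained weights
    saver = tf.train.Saver()
    saver.restore(sess, tf.train.latest_checkpoint(
        os.path.join(models_dir, model_name)))

    # Encode a prompt, sample a continuation, and decode it back to text
    tokens = enc.encode("The meaning of life is")
    out = sess.run(output, feed_dict={context: [tokens]})
    print(enc.decode(out[0, len(tokens):]))
```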
Features
- Pretrained model weights for multiple GPT-2 sizes (e.g., 117M, 345M, up to 1.5B parameters)
- Sampling/generation scripts (conditional, unconditional, interactive)
- Tokenizer with encoding/decoding utilities (see the sketch after this list)
- Training/fine-tuning script support (for the smaller models)
- Memory-saving gradient techniques and optimizations during training
- Scripted utilities to download and manage model checkpoints
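To illustrate the encoding/decoding utilities mentioned above, here is a small hedged sketch that round-trips text through the byte-pair-encoding tokenizer bundled with a downloaded checkpoint; the module path and the `get_encoder` arguments are assumptions based on the repository's layout.

```python
# Round-trip text through the BPE tokenizer that ships with a downloaded
# checkpoint. Module path and get_encoder arguments are assumptions.
import sys

sys.path.insert(0, "src")
import encoder  # noqa: E402

enc = encoder.get_encoder("117M", "models")

ids = enc.encode("Hello, world!")   # text -> list of BPE token ids
print(ids)
print(enc.decode(ids))              # token ids -> original text
```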
Categories
Artificial Intelligence
License
MIT License