DALL-E: PyTorch discrete VAE implementation
Official PyTorch package implementing the discrete VAE component for image tokenization used in OpenAI's DALL-E system.
Learn more about DALL-E
DALL-E is a PyTorch implementation of the discrete Variational Autoencoder (dVAE) component from OpenAI's DALL-E image generation system. The package uses discrete latent representations to encode and decode images, converting continuous image data into discrete tokens that can be processed by transformer models. The implementation focuses specifically on the VAE architecture that handles image tokenization and reconstruction. This component is commonly used for research into discrete image representations and as a foundation for building text-to-image generation systems.
Official Implementation
Direct release from OpenAI providing the exact discrete VAE architecture used in the original DALL-E research paper.
Discrete Tokenization
Converts images into discrete token representations suitable for processing by transformer-based language models.
Modular Design
Provides only the VAE component, allowing researchers to integrate it with custom transformer architectures for text-to-image tasks.
Top in AI & ML
Related Repositories
Discover similar tools and frameworks used by developers
YOLOX
PyTorch anchor-free object detector with scalable model variants.
mlx
Lazy-evaluated NumPy-like arrays optimized for Apple silicon.
unsloth
Memory-efficient Python library for accelerated LLM training.
Qwen
Alibaba Cloud's pretrained LLMs supporting Chinese/English with up to 32K context length.
crewAI
Python framework for autonomous multi-agent AI collaboration.