Implementation / replication of DALL-E (paper), OpenAI's text-to-image transformer, in PyTorch. It also contains CLIP for ranking the generations. Kobiso, a research engineer from Naver, has trained DALL-E on the CUB200 dataset using both full and DeepSpeed sparse attention.

You can skip training the VAE altogether by using the pretrained model released by OpenAI; the wrapper class takes care of downloading and caching the model for you auto-magically. You can also use the pretrained VAE offered by the authors of Taming Transformers. Currently only the VAE with a codebook size of 1024 is offered, in the hope that it may train a little faster than OpenAI's, which has a codebook size of 8192. In contrast to OpenAI's VAE, it also has an extra layer of downsampling, so the image sequence length is 256 instead of 1024; since attention cost grows quadratically with sequence length, the 4× shorter image sequence works out to roughly a 16× reduction in training cost.
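
As a rough illustration, the sketch below shows how a pretrained VAE might be wired into the DALL-E wrapper. It assumes the OpenAIDiscreteVAE, VQGanVAE, and DALLE classes exported by the dalle_pytorch package; the constructor arguments (dim, num_text_tokens, text_seq_len, depth, heads, dim_head) follow the repository's README and may differ between versions, so treat this as a sketch rather than the definitive API.

```python
import torch
from dalle_pytorch import OpenAIDiscreteVAE, VQGanVAE, DALLE

# Pretrained OpenAI discrete VAE; weights are downloaded and cached automatically.
vae = OpenAIDiscreteVAE()

# Alternatively, the pretrained VQGAN from Taming Transformers (codebook size 1024):
# vae = VQGanVAE()

dalle = DALLE(
    dim = 1024,
    vae = vae,                # image sequence length and number of image tokens are inferred from the VAE
    num_text_tokens = 10000,  # text vocabulary size
    text_seq_len = 256,       # text sequence length
    depth = 12,               # transformer depth
    heads = 16,               # attention heads
    dim_head = 64             # dimension per attention head
)

# Dummy batch: integer text tokens and 256x256 images.
text = torch.randint(0, 10000, (2, 256))
images = torch.randn(2, 3, 256, 256)

loss = dalle(text, images, return_loss = True)
loss.backward()
```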

Features

  • Train DALL-E with a pretrained VAE
  • The default VQGAN is the codebook-size-1024 model trained on ImageNet
  • Adjust text conditioning strength
  • Rank the generations with CLIP (see the generation sketch after this list)
  • Train with Microsoft DeepSpeed's sparse attention
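
As a loose sketch of the generation-related features, sampling is driven by dalle.generate_images (continuing from the training sketch above). The filter_thres argument is taken from the repository's README; the cond_scale and clip keywords shown in comments are assumptions inferred from the feature list, so verify the exact argument names against the installed version.

```python
import torch

# `dalle` is the trained model from the sketch above.
text = torch.randint(0, 10000, (4, 256))   # tokenized text prompts (random ids for illustration)

# Plain sampling; filter_thres keeps only the most probable logits at each step.
images = dalle.generate_images(text, filter_thres = 0.9)

# Hypothetical: strengthen adherence to the text prompt. The `cond_scale` keyword is
# assumed from the "adjust text conditioning strength" feature; check your version.
# images = dalle.generate_images(text, filter_thres = 0.9, cond_scale = 3.)

# Hypothetical: pass a trained CLIP instance to score and rank the generations.
# images, scores = dalle.generate_images(text, filter_thres = 0.9, clip = clip)
```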

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python AI Image Generators, Python Generative AI

Registered

2022-08-02