Simple command-line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). In true deep learning fashion, more layers will yield better results. Default is at 16, but can be increased to 32 depending on your resources. Technique first devised and shared by Mario Klingemann, it allows you to prime the generator network with a starting image, before being steered towards the text. Simply specify the path to the image you wish to use, and optionally the number of initial training steps. We can also feed in an image as an optimization goal, instead of only priming the generator network. Deepdaze will then render its own interpretation of that image. The regular mode for texts only allows 77 tokens. If you want to visualize a full story/paragraph/song/poem, set create_story to True.

Features

  • This will require that you have an Nvidia GPU or AMD GPU
  • Recommended 16GB VRAM
  • Minimum requirements are 4GB VRAM
  • For Windows
  • Creates files with both the timestamp and the sequence number
  • Optimize for the interpretation of an image

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Deep Daze

Deep Daze Web Site

Other Useful Business Software
Fully Managed MySQL, PostgreSQL, and SQL Server Icon
Fully Managed MySQL, PostgreSQL, and SQL Server

Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end. Migrate from on-prem or other clouds with free migration tools.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Deep Daze!

Additional Project Details

Operating Systems

Windows

Programming Language

Python

Related Categories

Python Terminals, Python Command Line Tools, Python AI Image Generators, Python Deep Learning Frameworks, Python Generative AI

Registered

2022-02-01