A multimodal AI story teller, built with Stable Diffusion, GPT, and neural text-to-speech (TTS). Given a prompt as the opening line of a story, GPT writes the rest of the plot, Stable Diffusion draws an image for each sentence, and a TTS model narrates each line. The result is a fully animated video of a short story, complete with audio and visuals. The final video is saved as /out/out.mp4, alongside the intermediate images, audio files, and subtitles. For more advanced use cases, you can also interface with Story Teller directly in Python code. To develop locally, install the dev dependencies and the pre-commit hooks; the hooks then run linting and code quality checks automatically before each commit.
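
The three-stage flow described above (write the plot, then draw and narrate each sentence) can be sketched roughly as follows. This is only an illustration of the pipeline shape, not Story Teller's actual API: `Scene`, `split_sentences`, `tell_story`, the `write`/`paint`/`speak` callables, and the per-sentence file naming are all hypothetical names chosen for this example, and the real models are replaced by stand-in functions.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Scene:
    """One sentence of the story with its (hypothetical) asset paths."""
    index: int
    sentence: str
    image_path: str
    audio_path: str


def split_sentences(text: str) -> List[str]:
    # Naive splitter: break after ., ! or ? followed by whitespace.
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tell_story(
    prompt: str,
    write: Callable[[str], str],        # stand-in for the GPT writer
    paint: Callable[[str, str], None],  # stand-in for Stable Diffusion
    speak: Callable[[str, str], None],  # stand-in for the TTS model
    out_dir: str = "out",
) -> List[Scene]:
    """Expand the prompt into a story, then create one image and one
    audio clip per sentence. File names here are an assumption."""
    story = write(prompt)
    scenes = []
    for i, sentence in enumerate(split_sentences(story)):
        image_path = f"{out_dir}/{i}.png"
        audio_path = f"{out_dir}/{i}.wav"
        paint(sentence, image_path)
        speak(sentence, audio_path)
        scenes.append(Scene(i, sentence, image_path, audio_path))
    return scenes


if __name__ == "__main__":
    # Dummy stages so the sketch runs without any models installed.
    scenes = tell_story(
        "Once upon a time.",
        write=lambda p: p + " A fox ran! The end.",
        paint=lambda sentence, path: None,
        speak=lambda sentence, path: None,
    )
    for scene in scenes:
        print(scene.index, scene.sentence, scene.image_path, scene.audio_path)
```

Passing the three stages in as callables keeps the sketch independent of any particular model library; a final step, not shown here, would stitch the per-sentence images and audio into the video.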

Features

  • Story Teller is available on PyPI
  • The quickest way to run a demo is through the CLI
  • To adjust the defaults with custom parameters, toggle the CLI flags as needed
  • Configure the model with custom settings

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python AI Video Generators, Python ChatGPT Apps, Python Generative AI

Registered

2023-03-22