About
In this post, you'll see how I started my first full artwork, which builds a bridge between:
- Data
- Sound design
- Generative text-to-speech
- Video artwork
- Digital content streamlining
- Social networks and content embedding
Inception
What triggered this creation is the following tweet:
... I immediately started to think:
"... what if I could create a fully digital, multimodal customer experience that would be ready to be shared on social platforms?"
Also, people talk a lot about generative AI tools like Midjourney and DALL-E... but far less about generative AI for TTS (text-to-speech).
"Voice prompts", a.k.a. "history prompts"
Like every other generative AI tool, suno-ai/bark is no exception: it relies on prompts.
Luckily, the Bark community is very active and shares its voice prompt (and tag) discoveries.
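To make the idea of a voice prompt concrete, here is a minimal sketch of how a script could be voiced with Bark using one fixed history prompt. It is not the exact code behind this artwork: `split_script`, `synthesize`, `VOICE_PRESET`, and the output file name are names made up for illustration. The sketch assumes Bark's documented `generate_audio(text, history_prompt=...)` call and one of its bundled speaker presets, and it splits the script into short chunks because Bark is described as producing short clips per call.

```python
import re
from pathlib import Path

# Assumption: one of the speaker presets bundled with Bark.
# Swap in any voice prompt shared by the community.
VOICE_PRESET = "v2/en_speaker_6"


def split_script(script, max_chars=200):
    """Group sentences into chunks of at most ~max_chars characters."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", script) if s.strip()]
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # The current chunk is full: store it and start a new one.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def synthesize(script, voice=VOICE_PRESET, out_path="soundtrack.wav"):
    """Voice each chunk with the same history prompt and write one wav file."""
    # Heavy imports are kept local so the text helper above works without Bark.
    import numpy as np
    from bark import SAMPLE_RATE, generate_audio, preload_models
    from scipy.io.wavfile import write as write_wav

    preload_models()  # downloads and loads the models on first use
    pieces = [generate_audio(chunk, history_prompt=voice) for chunk in split_script(script)]
    write_wav(out_path, SAMPLE_RATE, np.concatenate(pieces))
    return Path(out_path)
```

Calling `synthesize("Welcome! [laughs] Here is today's menu.")` would write `soundtrack.wav`. Reusing the same history prompt for every chunk is what keeps the voice consistent across the whole script, and tags such as `[laughs]` go directly into the text.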
Creative workflow
Here is the workflow I have experimented with so far:
- Create & release an SDK to get the data
- Imagine a customer experience at a restaurant
- Develop & tune the data-driven script and build the soundtrack
- Create an avatar and scene for the waitress
- Put the soundtrack & avatar together into a video
Tools
Here are the open source tools I have used so far:
- auptitcafe package to get the data
- bluewillow.ai: "AI Artwork Generator" for avatar creation
- suno-ai/bark to build an effective soundtrack
- pydub to compress wav (~20 MB) to mp3 (825.65 kB) & webm (1.68 MB)
- OpenTalker/SadTalker
- T4x2 GPU from Kaggle
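The compression step in the list above can be sketched with pydub as follows. This is an illustration under assumptions, not the script used for this project: `target_paths`, `compress`, the file name, and the `64k` bitrate are placeholders, and pydub needs ffmpeg installed to export mp3 and webm.

```python
from pathlib import Path


def target_paths(wav_path, formats=("mp3", "webm")):
    """Map each output format to a file next to the source wav."""
    source = Path(wav_path)
    return {fmt: source.with_suffix(f".{fmt}") for fmt in formats}


def compress(wav_path, formats=("mp3", "webm"), bitrate="64k"):
    """Export the wav soundtrack to lighter formats for sharing."""
    # Local import: pydub (and ffmpeg) are only needed when exporting.
    from pydub import AudioSegment

    audio = AudioSegment.from_wav(str(wav_path))
    outputs = target_paths(wav_path, formats)
    for fmt, target in outputs.items():
        # The bitrate is an assumed value; tune it to balance size and quality.
        audio.export(str(target), format=fmt, bitrate=bitrate)
    return outputs
```

Calling `compress("soundtrack.wav")` would write `soundtrack.mp3` and `soundtrack.webm` next to the source file. The resulting sizes depend on the bitrate and on the codecs available in the local ffmpeg build.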
Demos
Below are the demos:
How it's built (author's words)
Soundtrack
Output soundtrack with bark:
Movie
Then put the sound into an avatar with SadTalker:
Ideas for "later"
Automate:
- Video creation
- Video upload to dedicated cloud services for easier collaboration, digital marketing, ...
- Avatar creation, so the video is fully code-driven and each release gets more original (and funnier) content thanks to a one-time generative prompt (prompt design required)
Conclusion
The more I think about designing - and achieving - such experiences, the more evident it seems that the core of this kind of project is to:
- Get a clear idea and stay strongly focused on what you want to achieve (i.e. don't get lost in your creative journey)
- Design a clean linear workflow that focuses on tasks (not tools), so you can adapt it easily as AI projects evolve at an amazing pace (new tools appear every week)
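One way to picture a workflow that focuses on tasks rather than tools is a plain list of named steps, each backed by a swappable function. The sketch below is purely illustrative: `run_workflow`, the step names, and the stub data are hypothetical, and the lambdas stand in for the real tools (data SDK, Bark, avatar generator, SadTalker).

```python
def run_workflow(steps, context=None):
    """Run named tasks in order; each tool reads the state and returns updates."""
    state = dict(context or {})
    state.setdefault("completed", [])
    for task, tool in steps:
        state.update(tool(state))
        state["completed"].append(task)
    return state


# Each entry is (task name, tool). Replacing a tool never changes the task list.
# All values below are stub data for illustration only.
WORKFLOW = [
    ("get data", lambda s: {"menu": ["espresso", "croissant"]}),
    ("write script", lambda s: {"script": "Today we serve " + " and ".join(s["menu"]) + "."}),
    ("build soundtrack", lambda s: {"audio": "soundtrack.wav"}),
    ("create avatar", lambda s: {"avatar": "waitress.png"}),
    ("render video", lambda s: {"video": f"{s['avatar']}+{s['audio']}"}),
]

result = run_workflow(WORKFLOW)
print(result["completed"])
```

When a better text-to-speech or avatar tool appears, only the function behind the matching task changes, while the order of tasks stays the same.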
Resources
Tools to prototype