📛 Any-to-Any Generation via Composable Diffusion
🧠 CoDi (Composable Diffusion) is a generative model that produces any combination of output modalities (language, image, video, or audio) from any combination of input modalities, while maintaining high generation quality.
🐦 7
❤️ 177
🔗 https://arxiv.org/pdf/2305.11846.pdf
https://nitter.moomoo.me/ArXivGPT/status/1660540769323692032#m