Turn any LLM multimodal; generate images, voices, videos, 3D models, music, and more.