An impressive new AI system from Google can generate music in any genre given a text description. But the company, fearing the risks, has no immediate plans to release it. From a report:
Called MusicLM, Google’s certainly isn’t the first generative AI system for song. There’s been other attempts, including Riffusion, an AI that composes music by visualizing it, as well as Dance Diffusion, Google’s own AudioML and OpenAI’s Jukebox. But owing to technical limitations and limited training data, none have been able to produce songs particularly complex in composition or high-fidelity. MusicLM is perhaps the first that can.
Detailed in an academic paper, MusicLM was trained on a data set of 280,000 hours of music to learn to generate coherent songs for descriptions of – as the creators put it – “significant complexity” (e.g. “enchanting jazz song with a memorable saxophone solo and a solo singer” or “Berlin '90s techno with a low bass and strong kick.” Its songs, remarkably, sound something like a human artist might compose, albeit not necessarily as inventive or musically cohesive. […] That’s not to suggest MusicLM’s flawless – far from it, truthfully. Some of the samples have a distorted quality to them, an unavoidable side effect of the training process. And while MusicLM can technically generate vocals, including choral harmonies, they leave a lot to be desired. Still, the Google researchers note the many ethical challenges posed by a system like MusicLM, including a tendency to incorporate copyrighted material from training data into the generated songs.