NVIDIA Fugatto AI: Creates Music, Voices & Sounds On Demand

A new generative AI model Fugatto from NVIDIA can produce any mix of music, speech, and noises given text and audio as inputs

Music producers may use Fugatto to rapidly modify or prototype a song concept, experimenting with various instruments, vocals, and genres

By applying various dialects and emotions to voiceovers, an advertising firm may use Fugatto to swiftly target an existing campaign

For example, Fugatto can meow on a saxophone or bark on a trumpet. The model can generate whatever that users may describe

Fugatto lets users construct soundscapes they have never seen before, such a thunderstorm fading into a morning with the sound of birdsong

The entire version was trained on a bank of NVIDIA DGX computers with 32 NVIDIA H100 Tensor Core GPUs and employs 2.5 billion parameters

Additionally, they examined pre-existing datasets to uncover novel connections between them. The entire project took almost a year to complete