Image 1 (IMAGE)
Caption
Figure 1. Amuse system configuration. After extracting music keywords from user input, a large language model-based code progression is generated and refined through rejection sampling (left). Code extraction from audio input is also possible (right). The bottom is an example visualizing the chord structure of the generated code.
Credit
Authors: Yewon Kim, Sung-Ju Lee, Chris Donahue
Usage Restrictions
Credit must be given to the creator.
License
CC BY