We propose a generative model for artificial composition of both classical and popular music with the goal of producing music as well as humans do. The problem is that music is based on a highly sophisticated hierarchical structure and it is hard to measure its quality automatically.
Contrary to other's work, we try to generate a symbolic representation of music with multiple different instruments playing simultaneously to cover a broader musical space. We train three modules based on LSTM networks to generate the music; a lot of effort is put into reducing the high complexity of multi-instrumental music representation by a thorough musical analysis.
We believe that the proposed preprocessing techniques and symbolic representation constitute a useful resource for future research in this field