Music Generation Based on Convolution-LSTM

Yongjie Huang; Xiaofeng Huang; Qiakai Cai

doi:10.5539/cis.v11n3p50


Abstract

In this paper, we propose a model that combines a Convolutional Neural Network (CNN) with Long Short-Term Memory (LSTM) for music generation. We first convert MIDI-format music files into a musical score matrix and then apply convolution layers to extract features from that matrix. The output of the convolution layers is then split along the time axis and fed into the LSTM, which generates the music. The model was evaluated by comparing accuracy, by time-domain and frequency-domain analysis, and by human auditory evaluation. The results show that Convolution-LSTM performs better in music generation than LSTM, producing more pronounced undulations and a clearer melody.
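The data flow described in the abstract (score matrix, convolution, split along the time axis, LSTM) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives no layer counts, kernel sizes, or hidden sizes, so every number below is an assumption, the weights are random rather than trained, and the function names (`notes_to_score_matrix`, `conv2d_valid`, `split_along_time`, `lstm_forward`) are hypothetical. The note list stands in for events that would normally be parsed from a MIDI file.

```python
import numpy as np

N_PITCHES = 128  # MIDI note numbers run from 0 to 127


def notes_to_score_matrix(notes, n_steps):
    """Build a binary (time, pitch) matrix from (pitch, start_step, end_step) tuples."""
    roll = np.zeros((n_steps, N_PITCHES), dtype=np.float32)
    for pitch, start, end in notes:
        roll[start:end, pitch] = 1.0
    return roll


def conv2d_valid(x, kernels):
    """Cross-correlate x (T, P) with kernels (C, kt, kp), no padding, then ReLU.

    Returns an array of shape (C, T - kt + 1, P - kp + 1).
    """
    _, kt, kp = kernels.shape
    windows = np.lib.stride_tricks.sliding_window_view(x, (kt, kp))
    out = np.einsum("tpij,cij->ctp", windows, kernels)
    return np.maximum(out, 0.0)


def split_along_time(features):
    """Turn (C, T, P) feature maps into T vectors of length C * P, one per time step."""
    c, t, p = features.shape
    return features.transpose(1, 0, 2).reshape(t, c * p)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_forward(seq, W, U, b):
    """Run a standard LSTM cell over seq (T, D) and return the final hidden state.

    W is (4H, D), U is (4H, H), b is (4H,); gate order is input, forget, output, candidate.
    """
    hidden = U.shape[1]
    h = np.zeros(hidden)
    c = np.zeros(hidden)
    for x in seq:
        z = W @ x + U @ h + b
        i, f, o = (sigmoid(z[k * hidden:(k + 1) * hidden]) for k in range(3))
        g = np.tanh(z[3 * hidden:])
        c = f * c + i * g
        h = o * np.tanh(c)
    return h


# Toy input: three notes (C4, E4, G4) on a 16-step grid. All sizes are assumptions.
rng = np.random.default_rng(0)
notes = [(60, 0, 4), (64, 4, 8), (67, 8, 16)]
score = notes_to_score_matrix(notes, n_steps=16)        # (16, 128)

kernels = rng.normal(size=(2, 3, 12)) * 0.1             # 2 filters, 3 steps x 12 pitches
features = conv2d_valid(score, kernels)                 # (2, 14, 117)
seq = split_along_time(features)                        # (14, 234)

hidden = 8
W = rng.normal(size=(4 * hidden, seq.shape[1])) * 0.1
U = rng.normal(size=(4 * hidden, hidden)) * 0.1
b = np.zeros(4 * hidden)
h_final = lstm_forward(seq, W, U, b)                    # (8,)
```

In a trained model, the final hidden state would typically feed an output layer that predicts the next time step of the score matrix; that step is omitted here because the abstract does not describe it.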


This work is licensed under a Creative Commons Attribution 4.0 License.

ISSN(Print): 1913-8989
ISSN(Online): 1913-8997
Started: 2008
Frequency: semiannual


