
Research Article
Automated Music Sheet to MIDI Conversion and Spectrogram Analysis for Multi-Instrument and Multi-Genre Composition
@INPROCEEDINGS{10.4108/eai.28-4-2025.2357778, author={Chukkapalli Ramya and Shaik Shaheena and Garnepudi Parimala}, title={Automated Music Sheet to MIDI Conversion and Spectrogram Analysis for Multi-Instrument and Multi-Genre Composition}, proceedings={Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I}, publisher={EAI}, proceedings_a={ICITSM PART I}, year={2025}, month={10}, keywords={optical music recognition (omr) deep learning midi generation image processing spectrogram analysis digital signal processing (dsp) music transcription computer vision}, doi={10.4108/eai.28-4-2025.2357778} }
- Chukkapalli Ramya
Shaik Shaheena
Garnepudi Parimala
Year: 2025
Automated Music Sheet to MIDI Conversion and Spectrogram Analysis for Multi-Instrument and Multi-Genre Composition
ICITSM PART I
EAI
DOI: 10.4108/eai.28-4-2025.2357778
Abstract
Automated music transcription has emerged as a crucial field in Optical Music Recognition (OMR) and digital signal processing, enabling seamless conversion of sheet music into machine-readable formats. This paper presents a comprehensive system for automated music sheet-to-MIDI conversion and spectrogram analysis, leveraging advanced image processing techniques, deep learning models, and MIDI synthesis algorithms. The proposed framework first processes scanned sheet music by detecting and extracting musical symbols using contour analysis, morphological operations, and the Hough Transform. A deep learning-based approach is employed for note head recognition, pitch estimation, and rhythm extraction, ensuring accurate musical interpretation. The extracted notes are converted into MIDI format, which is further synthesized into WAV audio for playback and spectrogram visualization. The system incorporates a user-defined playback duration feature, optimizing real-time applications for composers, researchers, and musicians. The generated spectrogram provides a time-frequency representation, facilitating detailed harmonic analysis. This frame- work integrates computer vision, artificial intelligence, and digital signal processing to enhance music transcription, performance evaluation, and AI-assisted composition.