3D - Learning Representations From Audio Using Autoencoders

Bharathi A; Prakash J

Proceedings of the First International Conference on Combinatorial and Optimization, ICCAP 2021, December 7-8 2021, Chennai, India

Research Article

3D - Learning Representations From Audio Using Autoencoders

Download1185 downloads

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.4108/eai.7-12-2021.2314968,
    author={Bharathi  A and Prakash  J},
    title={3D - Learning Representations From Audio Using Autoencoders},
    proceedings={Proceedings of the First International Conference on Combinatorial and Optimization, ICCAP 2021, December 7-8 2021, Chennai, India},
    publisher={EAI},
    proceedings_a={ICCAP},
    year={2021},
    month={12},
    keywords={: audio signal auto correlogram rnn encoder rnn decoder},
    doi={10.4108/eai.7-12-2021.2314968}
}

Bharathi A
Prakash J
Year: 2021
3D - Learning Representations From Audio Using Autoencoders
ICCAP
EAI
DOI: 10.4108/eai.7-12-2021.2314968

Bharathi A¹^,*, Prakash J¹

1: PSG College of technology

*Contact email: abi.cse@psgtech.ac.in

Abstract

Deep learning methods permit us to tackle signal processing challenges from a dissimilar perspective, which is currently overlooked in the composition of music in cinema industry. Audio is inherently added time-sensitive than movie. Audios are encoded using other past methods, resulting in data loss or temporal anomalies. This problem is alleviated by using an auto correlogram with a 3-dimensional view, including time, power, and frequency, to improve accuracy. First, acoustic data should be competently encoded into a compressed format using RNN autoencoder by interrelating with the information. As a result of the compressed format, audio waves should be accurately represented. After that, audio waves are rebuilt into an audio structure with little data loss. The accuracy is improved by 10% by using the RNN encoder and decoder.

Keywords: : audio signal auto correlogram rnn encoder rnn decoder

Published: 2021-12-22
Publisher: EAI

: http://dx.doi.org/10.4108/eai.7-12-2021.2314968

3D - Learning Representations From Audio Using Autoencoders

Abstract

About EAI

Community

Publish with EAI