About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Mobile Multimedia Communications. 16th EAI International Conference, MobiMedia 2023, Guilin, China, July 22-24, 2023, Proceedings

Research Article

A Supervised Domain Adaptive Method for Multi Device Acoustic Scene Classification

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-031-60347-1_2,
        author={Zhanqi Liu and Mei Wang and Liyan Luo and Zhenghong Liu and Guan Wang},
        title={A Supervised Domain Adaptive Method for Multi Device Acoustic Scene Classification},
        proceedings={Mobile Multimedia Communications. 16th EAI International Conference, MobiMedia 2023, Guilin, China, July 22-24, 2023, Proceedings},
        proceedings_a={MOBIMEDIA},
        year={2024},
        month={10},
        keywords={Acoustic Scene Classification Supervised Domain Adaptive Domain generalization Frequency band standardization},
        doi={10.1007/978-3-031-60347-1_2}
    }
    
  • Zhanqi Liu
    Mei Wang
    Liyan Luo
    Zhenghong Liu
    Guan Wang
    Year: 2024
    A Supervised Domain Adaptive Method for Multi Device Acoustic Scene Classification
    MOBIMEDIA
    Springer
    DOI: 10.1007/978-3-031-60347-1_2
Zhanqi Liu1, Mei Wang1, Liyan Luo1,*, Zhenghong Liu1, Guan Wang1
  • 1: Ministry of Education Key Laboratory of Cognitive Radio and Information Processing
*Contact email: lly1227@guet.edu.cn

Abstract

Acoustic scene classification faces performance degradation due to device mismatch when different pickup devices are used in the training and testing phases. To solve the device mismatch problem and improve the performance of the acoustic scene classification system on unseen pickup devices, we propose a joint frequency band standardization and supervised domain adaptation algorithm, which can effectively extract the domain invariant features of the acoustic scene signal to generalize the model to the unseen The algorithm can effectively extract the domain invariant features of the acoustic scene signal and generalize the model to the unseen distribution to solve the performance degradation of the model on the unseen pickups. The frequency band standardization is first used to linearly correct the extracted Log-Mel features, and then combined with supervised domain adaptation to reduce the difference between the source and target domains and correct the nonlinear differences between different kinds of pickup devices. Experimental results on the DCASE Challenge 2020 Task 1A dataset show an overall improvement of 13.9% compared to the baseline model, 6.2% on seen devices, and 20% on unseen device. The algorithm can better extract the domain invariant features of the model, has better classification performance, and enables the model to generalize to unseen device.

Keywords
Acoustic Scene Classification Supervised Domain Adaptive Domain generalization Frequency band standardization
Published
2024-10-25
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-60347-1_2
Copyright © 2023–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL