Audio multi-label classification method based on deep learning
A deep learning, multi-label technology, applied in the field of multi-label classification
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0035] The hardware environment of the present invention is mainly a server whose GPU model is GeForce GTX 2080Ti. The software implementation uses ubuntu 16.04 as the platform, adopts the Python programming language, and is developed based on the deep learning framework TensorFlow. The experimental data set comes from the FSDKaggle2019 data set on the Kaggle platform. The data set consists of two parts, namely Freesound Dataset (FSD) and Yahoo Flickr Creative Commons 100M dataset (YFCC). FSD is based on AudioSet, and YFCC is a set of Audio track for Flickr videos. The entire dataset contains 80 class labels, such as applause, cows, rain, etc. The specific implementation process is mainly divided into five parts: data preprocessing, audio feature extraction, model construction and training, model evaluation, and audio label classification. details as follows:
[0036] 1. Data preprocessing
[0037] Since the original audio data set contains noise interference, this patent ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



