Skip to content
#

mel-spectrogram

Here are 71 public repositories matching this topic...

This study converts piano recordings to mel spectrogram and classifies them by SOTA pre-trained neural network backbones in CV. Comparative experiments show that SqueezeNet achieves a best classification accuracy of 92.37%.|该项目将钢琴录音转为为mel频谱图,使用微调后的前沿计算机视觉领域预训练深度学习骨干网络对其进行分类,对比实验可知SqueezeNet作为最优网络正确率可达92.37%

  • Updated Jul 13, 2024
  • Python

Step onto the stage with Saxophone Hero, where your tenor saxophone is the key to unlocking a rhythmic adventure through a world of sheet music. In this game, your character scores points by hitting the right notes. Powered by machine learning, the game captures the pitch from your saxophone and translates it to player movement in real time.

  • Updated Dec 15, 2023
  • Jupyter Notebook

Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.

  • Updated Sep 12, 2023
  • Python

Improve this page

Add a description, image, and links to the mel-spectrogram topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mel-spectrogram topic, visit your repo's landing page and select "manage topics."

Learn more