1. First, create a new Python file and import the required libraries:
   from scipy.io import wavfile
   from python_speech_features import mfcc, logfbank
   import matplotlib.pylab as plt
2. Read the audio file. wavfile.read returns the sampling rate in Hz and the samples as a NumPy array:
   sampling_freq, audio = wavfile.read("data/input_freq.wav")


5. Visualize the MFCC features. Transpose the matrix so that the time axis is horizontal:
   mfcc_features = mfcc_features.T
   plt.matshow(mfcc_features)
   plt.title('MFCC')
