{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "\n", "# 1. 识别声音\n", " \n", " 通过听取声音,人的大脑会获取到大量的信息,其中的一个场景是识别和归类,如:识别熟悉的亲人或朋友的声音、识别不同乐器发出的声音和识别不同环境产生的声音,等等。\n", "\n", " 我们可以根据不同声音的特征(频率,音色等)进行区分,这种区分行为的本质,就是对声音进行分类。\n", "\n", "声音分类根据用途还可以继续细分:\n", "\n", "* 副语言识别:说话人识别(Speaker Recognition), 情绪识别(Speech Emotion Recognition),性别分类(Speaker gender classification)\n", "* 音乐识别:音乐流派分类(Music Genre Classification)\n", "* 场景识别:环境声音分类(Environmental Sound Classification)\n", "* 声音事件检测:各个环境中的声学事件检测\n", " \n", "\n", "