Maral Dadvar
Who Spoke When?: Audio-based Speaker Location Estimation for Diarization Maral Dadvar

Name: Who Spoke When?: Audio-based Speaker Location Estimation for Diarization
Price: 316 CNY
Availability: OutOfStock
Author: Maral Dadvar

价格

元 316

不含税

远程仓调货

预计送达时间年7月23日 - 年8月4日

顾客评价：

Top-vurdering på Google Reviews, baseret på tusinder af anmeldelser.

根据欧洲消费者保护法享受14天退换货政策

Trustpilot平台高分认证

添加至iMusic心愿单

Not rated yet

Who Spoke When?: Audio-based Speaker Location Estimation for Diarization

Maral Dadvar

Speaker diarization is the process which detects active speakers and groups those speech signals which has been uttered by the same speaker. Generally we can find two main applications for speaker diarization. Automatic Speech Recognition systems make use of the speaker homogeneous clusters to adapt the acoustic models to be speaker dependent and therefore increase recognition performance. Speaker indexing and rich transcription systems also use the speaker diarization output as one of information extracted from a recording, which allow its automatic indexation and other further processing. In this study a speaker diarization application is developed ? using multiparty binaural speech recordings ? to track speaker activity based on interaural time difference (ITD) cues. These cues, for a given speech signal frame, are computed using gammatone filtering and cross-correlation technique. Their values are used to determine which speaker in the recording produce the considered speech fragment. This study has been supervised by Dr. Jon Barker, and defended to fulfill the requirements for the degree of Master in Advanced Computer Science, University of Sheffield, United Kingdom, 2007.

介质类型	图书 Paperback Book (平装胶订图书)
已发行	2011年7月1日
ISBN13	9783844386288
出版商	LAP LAMBERT Academic Publishing
页数	68
商品尺寸	150 × 4 × 226 mm · 119 g
语言	德语