Chinese Journal of Phonetics (Volume 11)
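xSegmenter: A Tool for Automatic Segmentation and Annotation of Speech Segments

XIONG Ziyu

Abstract xSegmenter, a tool for the automatic segmentation and annotation of speech segments, is written in the Perl scripting language. It is aimed mainly at building relatively large speech corpora that come with text scripts, and is intended to address the efficiency and consistency of segment annotation. The program calls the HTK tools to train HMM acoustic models from the corpus and related resources supplied by the user, then performs forced alignment of the segments, and finally generates, for each sound file, a corresponding annotation file (*.TextGrid) containing word, syllable, and phoneme tiers. The tool itself provides no acoustic models or dictionaries. Instead, it trains the acoustic models automatically from user-supplied resources (sound files, pronunciation text files with word segmentation information, and pronunciation dictionary files) and then uses those models to carry out the automatic segmentation and annotation. The tool can therefore be used to build speech corpora for any language or dialect.

Keywords xSegmenter; segment segmentation; speech corpus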

xSegmenter: A Tool for Automatic Segmentation and Annotation

XIONG Ziyu

Abstract The automatic segmentation and annotation tool xSegmenter is written in the Perl scripting language. It is designed for building large-scale speech corpora that have text scripts, and aims to improve the efficiency and consistency of segment annotation. The program calls the HTK toolkit to train an HMM acoustic model from the speech corpus and related resources provided by the user, and then performs forced alignment of the segments. Finally, for each sound file, it generates a corresponding speech annotation file (*.TextGrid) containing word, syllable, and phoneme tiers. The tool itself does not provide any acoustic models or dictionaries. Instead, it automatically trains the acoustic models from data resources provided by the user, namely sound files, pronunciation text files with word segmentation information, and pronunciation dictionary files, and then uses the trained models to carry out the automatic segmentation and annotation. The tool can therefore be applied to the construction of speech corpora for any language or dialect.

Key words xSegmenter, speech segmentation, speech corpus

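Manual segmentation and annotation of speech segments is time-consuming and laborious, yet detailed and systematic segmentation and annotation is indispensable for building and using speech corpora. This paper describes how to use xSegmenter, the automatic segmentation and annotation tool developed by the author, and the points that need attention when using it.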
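
As an illustration of the final step of the pipeline summarized in the abstract, the following Python sketch converts forced-alignment output into a Praat TextGrid tier. It is not xSegmenter's own code (the tool is written in Perl). The function names `parse_htk_labels` and `to_textgrid` and the tier name are hypothetical, and the sketch assumes that the alignment is available as HTK-style label lines of the form `start end label`, with times in units of 100 ns and segments that are contiguous from time 0.

```python
HTK_UNITS_PER_SECOND = 10_000_000  # HTK label times are in units of 100 ns


def parse_htk_labels(text):
    """Parse 'start end label' lines into (start_s, end_s, label) tuples.

    Any further fields on a line (e.g. scores) are ignored in this sketch.
    """
    segments = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        start, end, label = int(fields[0]), int(fields[1]), fields[2]
        segments.append((start / HTK_UNITS_PER_SECOND,
                         end / HTK_UNITS_PER_SECOND,
                         label))
    return segments


def to_textgrid(tiers):
    """Render {tier_name: [(start, end, label), ...]} as a long-format TextGrid.

    Assumes every tier is non-empty and its intervals are contiguous from 0.
    """
    xmax = max(seg[1] for segs in tiers.values() for seg in segs)
    out = [
        'File type = "ooTextFile"',
        'Object class = "TextGrid"',
        '',
        'xmin = 0',
        f'xmax = {xmax:.4f}',
        'tiers? <exists>',
        f'size = {len(tiers)}',
        'item []:',
    ]
    for i, (name, segs) in enumerate(tiers.items(), start=1):
        out += [
            f'    item [{i}]:',
            '        class = "IntervalTier"',
            f'        name = "{name}"',
            '        xmin = 0',
            f'        xmax = {xmax:.4f}',
            f'        intervals: size = {len(segs)}',
        ]
        for j, (start, end, label) in enumerate(segs, start=1):
            escaped = label.replace('"', '""')  # Praat doubles embedded quotes
            out += [
                f'        intervals [{j}]:',
                f'            xmin = {start:.4f}',
                f'            xmax = {end:.4f}',
                f'            text = "{escaped}"',
            ]
    return '\n'.join(out) + '\n'


if __name__ == "__main__":
    # Made-up alignment for illustration only.
    demo = "0 1500000 sil\n1500000 4200000 n\n"
    print(to_textgrid({"phoneme": parse_htk_labels(demo)}))
```

A word or syllable tier could be produced in the same way by passing additional entries in the `tiers` dictionary, provided the corresponding boundaries have been derived from the alignment.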