kaldi-FreeNAS中文网

admin 管理员组

文章数量: 900433

kaldi

一、服务器客户端识别系统建立方法
- 1. Command line to start the server（服务器端启动方式）:
- 2. Command line to start the client（客户端启动方式）:
二、使用麦克风建立客户端与服务器的实时解码
- 1. 使用online-server-gmm-decode-faster启动服务器：
- 2. 使用online-net-client启动客户端：

在kaldi 的工具集里有好几个程序可以用于在线识别。这些程序都位在src/onlinebin文件夹里，他们是由src/online文件夹里的文件编译而成(你现在可以用make ext 命令进行编译)。这些程序大多还需要tools文件夹中的portaudio 库文件支持，portaudio 库文件可以使用tools文件夹中的相应脚本文件下载安装。

# 安装portaudio
yum -y install *alsa*
cd kaldi/tools
./install_portaudio.sh
# 编译在线识别工具
cd src/
make ext

一、服务器客户端识别系统建立方法

建立整个在线识别系统需要：

准备两台机器，都安装kaldi；
作为服务器的机器，准备好声音模型、词典、解码网络、特征转换矩阵（我还没有使用转换矩阵）
首先启动服务器，待服务器运行后，再启动客户端连接。

1. Command line to start the server（服务器端启动方式）:

使用如下指令online-audio-server-decode-faster启动服务器：

online-audio-server-decode-faster --verbose=1 --rt-min=0.5 --rt-max=3.0 --max-active=6000 \
--beam=72.0 --acoustic-scale=0.0769 final.mdl graph/HCLG.fst graph/words.txt '1:2:3:4:5' \
graph/word_boundary.int 5010 final.mat

1.1 Arguments are as follow（参数意义）:

final.mdl - the acoustic model
HCLG.fst - the complete FST
words.txt - word dictionary (mapping word ids to their textual representation)
'1:2:3:4:5' - list of silence phoneme ids
5010 - port the server is listening on
word_boundary.int- a list of phoneme boundary information required for word alignemnt
final.mat - feature LDA matrix

注意：如果没有word_boundary.int 需要重新运行prepare_lang.sh生成。修改如下：

#原指令：
utils/prepare_lang.sh --position-dependent-phones false data/local/dict "<SPOKEN_NOISE>" \
data/local/lang data/lang
#改为：
utils/prepare_lang.sh data/local/dict "<SPOKEN_NOISE>" data/local/lang data/lang

启动后结果如下：

2. Command line to start the client（客户端启动方式）:

直接运行如下指令即可启动客户端：

 online-audio-client --htk --vtt localhost 5010 scp:test.scp

2.1 Arguments are as follow（参数意义）:

–htk - save results as an HTK label file
–vtt - save results as a WebVTT file
localhost - server to connect to
5010 - port to connect to
scp:test.scp - list of WAV files to send
启动后客户端不断传输数据，服务器实时进行解码！结果如下：

结果是边传输边识别的：

* Command line to start the Java client（移动客户端）:
移动客户端我还未尝试：

java -jar online-audio-client.jar

Or simply double-click the JAR file in the graphical interface.

二、使用麦克风建立客户端与服务器的实时解码

kaldi提供了读取客户端麦克风数据的解码工具，可以在客户端使用麦克风发送音频，服务器实时返回解码数据。

1. 使用`online-server-gmm-decode-faster`启动服务器：

通过网络接收特征进行解码。话语分词是即时完成的。如果给出可选（最后）参数，则使用特征拼接/ LDA变换。否则默认使用delta / delta-delta（2阶）特征。

Usage: online-server-gmm-decode-faster [options] model-infst-in word-symbol-table silence-phones udp-port [lda-matrix-in]
Example: online-server-gmm-decode-faster --rt-min=0.3 --rt-max=0.5 --max-active=4000 --beam=12.0 --acoustic-scale=0.0769 model HCLG.fst words.txt '1:2:3:4:5' 1234 lda-matrix

2. 使用`online-net-client`启动客户端：

通过online-net-client工具，使用麦克风（portaudio）作为输入，提取特征并通过网络连接发送它们到服务器上。具体设置如下：

Usage: online-net-client server-address server-port Options:--batch-size                : The number of feature vectors to be extracted and sent in one go (int, default = 27)Standard options:--config                    : Configuration file to read (this option may be repeated) (string, default = "")--help                      : Print out usage message (bool, default = false)--print-args                : Print the command line arguments (to stderr) (bool, default = true)--verbose                   : Verbose level (higher->more logging) (int, default = 0)

引用：kaldi-asr
转载请注明：.html

参考：kaldi首页

转载于:.html

本文标签： kaldi

版权声明：本文标题：kaldi 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.freenas.com.cn/jishu/1687681120h128484.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

技术交流 – FreeNAS中文网

kaldi

kaldi

一、服务器客户端识别系统建立方法

1. Command line to start the server（服务器端启动方式）:

1.1 Arguments are as follow（参数意义）:

2. Command line to start the client（客户端启动方式）:

2.1 Arguments are as follow（参数意义）:

* Command line to start the Java client（移动客户端）:

二、使用麦克风建立客户端与服务器的实时解码

1. 使用online-server-gmm-decode-faster启动服务器：

2. 使用online-net-client启动客户端：

更多相关文章

kaldi

Kaldi语音识别工具包简介及安装说明

kaldi windows安装_Kaldi环境配置及安装手册（更新加强版）

发表评论

推荐文章

U盘变成RAW格式怎么办？数据如何恢复？

通过路由器设置，让外网可以访问内网电脑应用服务，非常适合在公司访问家里的电脑应用

电脑系统知识：Windows原版系统与Ghost系统的区别，你知道吗？

Linux磁盘与文件系统管理

【算法】十大经典算法

热门文章

01-树莓派基本配置-基础配置配置

SpringBoot整合Minio实现文件上传、下载

Win10设置系统补丁更新服务器,Win10 windows更新和安全选项内手动更新补丁升级方法图解...

Chrome，谷歌浏览器已经不会自动更新了，但是会有更新提示，关闭更新提示，window10系统，本人亲自操作过，不坑人

Windows10禁止自动重启和按键盘任意键自动开机

win7记事本如何转换html,记事本格式_win7记事本怎么改格式

微博营销入门知识

github copilot X - chat 使用体验分享

Ghost每个功能详细解释

简单制作U盘启动盘安装Ghost XP系统（大白菜+深度）

最新文章

Raid技术

LSI_阵列卡操作手册

破解Centos7_root用户密码

Redhat重置Root用户密码方法

远程批量修改linux服务器密码的脚本

电脑没声音了怎么恢复正常？一键恢复电脑声音

Windows系统电脑使用图形化SFTP客户端WinSCP远程发送文件至内网Linux系统设备

windos读写ext3工具_“ ext2fsd” Windows系统工具，用于读写ext234文件系统

深度剖析Linux与Windows系统的区别

windows系统ping包显示时间

1. 使用`online-server-gmm-decode-faster`启动服务器：

2. 使用`online-net-client`启动客户端：