Papers - LEE Akinobu
-
再帰型ニューラルネットに基づく音素情報を用いた応答選択
牧野 健一郎,李 晃伸
情報処理学会 音声言語情報処理研究会研究報告 2017-SLP-117 ( 4 ) 1 - 6 2017.07
Language:Japanese Publishing type:Research paper (conference, symposium, etc.) Publisher:情報処理学会
-
日本語におけるG2Pによる統計的学習を用いた話し言葉に頑健な発音辞書の自動構築
寺田 卓矢,李 晃伸
情報処理学会 音声言語情報処理研究会研究報告 2017-SLP-117 ( 11 ) 1 - 6 2017.07
Language:Japanese Publishing type:Research paper (conference, symposium, etc.) Publisher:情報処理学会
-
国際会議 ICASSP2017 報告
浅見 太一, 大谷 大和, 岡本 拓磨, 小川 哲司, 落合 翼, 亀岡 弘和, 駒谷 和範, 高木 信二, 高道 慎之介, 俵 直弘, 南條 浩輝, 橋本 佳, 福田 隆, 増村 亮, 松田 繁樹, 李 晃伸, 渡部 晋治
第117回音声言語情報処理研究会 (SIG-SLP) SLP-3 2017.07
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
User generated dialogue systems: uDialogue Reviewed
Keiichi Tokuda, Akinobu Lee, Yoshihiko Nankaku, Keiichiro Oura, Kei Hashimoto, Daisuke Yamamoto, Ichi Takumi, Takahiro Uchiya, Shuhei Tsutsumi, Steve Renals, Junichi Yamagishi
Human-Harmonized Information Technology 2 77 - 114 2017.04
Language:English Publishing type:Part of collection (book) Publisher:Springer Japan
This chapter introduces the idea of user-generated dialogue content and describes our experimental exploration aimed at clarifying the mechanism and conditions that makes it workable in practice. One of the attractive points of a speech interface is to provide a vivid sense of interactivity that cannot be achieved with a text interface alone. This study proposes a framework that spoken dialogue systems are separated into content that can be produced and modified by users, and the systems that drive the content, and seek to clarify (1) the requirements of systems that enable the creation of attractive spoken dialogue, and (2) the conditions for the active generation of attractive dialogue content by users, while attempting to establish a method for realizing them. Experiments for validating user dialogue content generation were performed by installing interactive digital signage with a speech interface in public spaces as a dialogue device, and implementing a content generation environment for users via the Internet. The proposed framework is expected to lead to a breakthrough in the spread of using speech technology.
-
78 ( 78 ) 125 - 128 2016.10
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
78 ( 78 ) 129 - 134 2016.10
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
ユーザフレンドリィな音声対話システム実現のためのユーザ話速および発話内容に基づくシステム話速制御手法の検討
三原 寛哉, 李 晃伸
研究報告音声言語情報処理(SLP) 2016-SLP-112 ( 15 ) 1 - 6 2016.07
Language:Japanese Publishing type:Research paper (conference, symposium, etc.) Publisher:情報処理学会
-
音声対話システムのオープンコンテンツ化実現のためのモジュール仕様および管理手法
山西 元樹,船谷内 泰斗,李 晃伸
研究報告音声言語情報処理(SLP) 2016-SLP-112 ( 14 ) 1 - 6 2016.07
Language:Japanese Publishing type:Research paper (conference, symposium, etc.) Publisher:情報処理学会
-
音声対話システムにおけるシステムからの話しかけと他者性認知の関連性の調査
村上拓也, 李 晃伸, 西川 由里, 小島 良広, 遠藤 充
HAIシンポジウム2015 238 - 243 2015.12
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
音声対話インタフェースにおけるマルチタスク性の適切な表出方法の検討
小中 彩貴, 李 晃伸
HAIシンポジウム2015 108 - 112 2015.12
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
音声対話システムにおける音環境への反応表出によるアフォーダンスの評価
夏目 龍司, 李 晃伸
HAIシンポジウム2015 94 - 98 2015.12
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
利用者による履歴付き対話の共同構築・拡張が可能なユーザ生成音声対話システム
宮木 京介, 飯塚 遼, 李 晃伸
日本音響学会2015年秋季研究発表会講演論文集 3-Q-22 2015.09
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
単語間非共有ノードに基づく単語信頼度を用いたキーワードの発話中遂次確定
松尾 涼平, 小林 大晃, 李 晃伸
日本音響学会2015年秋季研究発表会講演論文集 3-Q-12 2015.09
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
Prosodically-Enhanced Recurrent Neural Network Language Models Reviewed International coauthorship International journal
Siva Reddy Gangireddy, Steve Renals, Yoshihiko Nankaku, Akinobu Lee
Proc. Conference of the International Speech Communiation Association (INTERSPEECH) 2390 - 2394 2015.09
Language:English Publishing type:Research paper (international conference proceedings)
-
Prosodically-enhanced Recurrent Neural Network Language Models Reviewed
Siva Reddy Gangireddy, Steve Renals, Yoshihiko Nankaku, Akinobu Lee
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 2390 - 2394 2015
Language:English Publishing type:Research paper (international conference proceedings) Publisher:ISCA-INT SPEECH COMMUNICATION ASSOC
Recurrent neural network language models have been shown to consistently reduce the word error rates (WERs) of large vocabulary speech recognition tasks. In this work we propose to enhance the RNNLMs with prosodic features computed using the context of the current word. Since it is plausible to compute the prosody features at the word and syllable level we have trained the models on prosody features computed at both these levels. To investigate the effectiveness of proposed models we report perplexity and WER for two speech recognition tasks, Switchboard and TED. We observed substantial improvements in perplexity and small improvements in WER.
-
Voice interaction system with 3D-CG virtual agent for stand-alone smartphones Reviewed
Daisuke Yamamoto, Keiichiro Oura, Ryota Nishimura, Takahiro Uchiya, Akinobu Lee, Keiichi Tokuda, Ichi Takumi
HAI 2014 - Proceedings of the 2nd International Conference on Human-Agent Interaction 323 - 330 2014.10
Language:English Publishing type:Research paper (international conference proceedings) Publisher:Association for Computing Machinery, Inc
In this paper, we propose a voice interaction system using 3D-CG virtual agents for stand-alone smartphones. Because the proposed system can handle speech recognition and speech synthesis on a stand-alone smartphone differently from the existing mobile voice interaction systems, this system enables us to talk naturally without encountering delays caused by network communications. Moreover, proposed system can be fully customized by dialogue scripts, Java-based plugins, and Android APIs. Therefore, developers can make original voice interaction systems for smartphones easily based on proposed system. We have made a subset of the proposed system available as opensource software. We expect that this system will contribute to studies of human-agent interaction using smartphones.
-
Voice interaction system with 3D-CG virtual agent for stand-alone smartphones Reviewed International journal
Daisuke Yamamoto, Keiichiro Oura, Ryota Nishimura, Takahiro Uchirya, Akinobu Lee, Ichi Takumi, Keiichi Tokuda
the 2nd International Conference on Human Agent Interaction (HAI 2014), ACM digital library, 320 - 330 2014.10
Language:English Publishing type:Research paper (international conference proceedings)
-
統計的音声対話システムにおける音素系列を用いた頑健な応答選択
佐伯 昌幸, 李 晃伸
音声言語情報処理研究会SIG-SLP第101回研究会 2014.05
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
ユーザ生成型音声対話システムにおけるクリエイターとユーザの相互刺激によるインセンティブ向上の検討
飯塚 遼, 李 晃伸
音声言語情報処理研究会SIG-SLP第101回研究会 2014.05
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
条件付き確率場に基づく仮説の逐次早期確定を用いた低遅延音声インタフェース
伊神 陽介,李 晃伸,徳田 恵一,南角 吉彦
音響学会公演論文集 63 - 64 2014.03
Language:Japanese Publishing type:Research paper (scientific journal)