Papers - UENO Sei
-
CGアバター対話における音声からの頭部動作および表情の自動生成
藤岡 侑貴, 上乃 聖, 李晃伸
人工知能学会全国大会 2023.06
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
Continuous Integrate-and-Fire を用いた音声区間検出とターン終了検知のマルチタスク学習
池口 弘尚, 東 佑樹, 上乃 聖,李 晃伸
日本音響学会講演論文集 2023.03
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
複数設定のスペクトログラムを用いた音声合成に基づく音声認識のデータ拡張
上乃聖, 李晃伸
日本音響学会講演論文集 2023.03
Authorship:Lead author Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
連続的な感情表出を用いたカウンセリング対話エージェントの評価
川又 朱莉, 上乃 聖, 李 晃伸
HAIシンポジウム 2023.03
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
多様な笑い声生成のための有声音・無声音間隔の制御
木全亮太朗, 上乃 聖, 李 晃伸
情報処理学会全国大会講演論文集 2023.03
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM Reviewed
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara
INTERSPEECH 3889 - 3893 2022.09
Language:English Publishing type:Research paper (international conference proceedings)
-
音声認識のデータ拡張のための話者情報およびマスクを用いた合成音声の周波数スペクトログラム強調
上乃 聖,李 晃伸,河原 達也
日本音響学会講演論文集 1149 - 1150 2022.09
Authorship:Lead author, Corresponding author Language:Japanese Publishing type:Research paper (other academic)
-
Phone-informed refinement of synthesized mel spectrogram for data augmentation in speech recognition Reviewed
Sei Ueno and Tatsuya Kawahara
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 8572 - 8576 2022.05
Authorship:Lead author, Corresponding author Language:English Publishing type:Research paper (international conference proceedings)
-
Data Augmentation Approaches for Automatic Speech Recognition Using Text-to-Speech
Sei Ueno
2022.03
Authorship:Lead author, Corresponding author Language:English Publishing type:Doctoral thesis
-
音声認識のデータ拡張のための音素情報を用いた合成音声の強調
上乃 聖,河原 達也
日本音響学会講演論文集 887 - 888 2022.03
Authorship:Lead author, Corresponding author Language:Japanese Publishing type:Research paper (other academic)
-
Data Augmentation for ASR Using TTS Via a Discrete Representation Reviewed
Sei Ueno, Masato Mimura, Shinsuke Sakai, and Tatsuya Kawahara
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 68 - 74 2021.12
Authorship:Lead author, Corresponding author Language:English Publishing type:Research paper (international conference proceedings)
-
音声認識のデータ拡張のための合成音声の周波数スペクトログラム強調
上乃 聖,河原 達也
研究報告音声言語情報処理(SLP) 2021-SLP-139 ( 28 ) 1 - 6 2021.11
Authorship:Lead author, Corresponding author Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
Synthesizing Waveform Sequence-to-sequence to Augment Training Data for Sequence-to-sequence Speech Recognition Reviewed
Sei Ueno, Masato Mimura, Shinsuke Sakai, and Tatsuya Kawahara
Acoustical Science and Technology 42 ( 6 ) 333 - 343 2021.11
Authorship:Lead author, Corresponding author Language:English Publishing type:Research paper (scientific journal)
DOI: https://doi.org/10.1250/ast.42.333
Other Link: https://www.jstage.jst.go.jp/article/ast/42/6/42_E2108/_article
-
wav2vec 2.0を用いた音声合成による音声認識のデータ拡張
上乃 聖,河原 達也
日本音響学会講演論文集 857 - 858 2021.09
Authorship:Lead author, Corresponding author Language:Japanese Publishing type:Research paper (other academic)
-
vq-wav2vecによる離散IDを扱う音声認識のデータ拡張
上乃 聖,三村 正人,河原 達也
日本音響学会講演論文集 825 - 826 2021.03
Authorship:Lead author, Corresponding author Language:Japanese Publishing type:Research paper (other academic)
-
複数話者を対象とした非自己回帰型ニューラル音声合成
上乃 聖,三村 正人,河原 達也
日本音響学会講演論文集 825 - 826 2021.03
Authorship:Lead author, Corresponding author Language:Japanese Publishing type:Research paper (other academic)
-
ELECTRAによる音声認識仮説のリスコアリング
二見 颯,稲熊 寛文,上乃 聖,三村 正人,坂井 信輔,河原 達也
日本音響学会講演論文集 827 - 828 2021.03
Language:Japanese Publishing type:Research paper (other academic)
-
BERTによるSequence-to-Sequence音声認識への知識蒸留
二見 颯,稲熊 寛文,上乃 聖,三村 正人,坂井 信輔,河原 達也
研究報告音声言語情報処理(SLP) 2020-SLP-134 ( 2 ) 1 - 6 2020.11
Language:Japanese Publishing type:Research paper (conference, symposium, etc.)
-
Endto-End Speech-to-Dialog-Act Recognition Reviewed
Viet-Trung Dang, Tianyu Zhao, Sei Ueno, Hirofumi Inaguma, Tatsuya Kawahara
INTERSPEECH 3910 - 3914 2020.10
Language:English Publishing type:Research paper (international conference proceedings)
-
End-to-End Speech Emotion Recognition Combined with Acoustic-to-Word ASR Model Reviewed
Han Feng, Sei Ueno, Tatsuya Kawahara
INTERSPEECH 501 - 505 2020.10
Language:English Publishing type:Research paper (international conference proceedings)