Papers - UENO Sei

  • Automatic Generation of Head Motion and Facial Expressions from Speech for CG Avatar Dialogue

    藤岡 侑貴, 上乃 聖, 李晃伸

    人工知能学会全国大会   2023.06

    Language:Japanese   Publishing type:Research paper (conference, symposium, etc.)  

  • Multi-task Learning of Voice Activity Detection and End-of-Turn Detection Using Continuous Integrate-and-Fire

    池口 弘尚, 東 佑樹, 上乃 聖,李 晃伸

    日本音響学会講演論文集   2023.03

    Language:Japanese   Publishing type:Research paper (conference, symposium, etc.)  

  • Data Augmentation for Speech Recognition Based on Speech Synthesis Using Spectrograms with Multiple Settings

    上乃聖, 李晃伸

    日本音響学会講演論文集   2023.03

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (conference, symposium, etc.)  

  • Evaluation of a Counseling Dialogue Agent Using Continuous Emotional Expression

    川又 朱莉, 上乃 聖, 李 晃伸

    HAIシンポジウム   2023.03

    Language:Japanese   Publishing type:Research paper (conference, symposium, etc.)  

  • Controlling Voiced and Unvoiced Intervals for Diverse Laughter Generation

    木全亮太朗, 上乃 聖, 李 晃伸

    情報処理学会全国大会講演論文集   2023.03

    Language:Japanese   Publishing type:Research paper (conference, symposium, etc.)  

  • Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM (Reviewed)

    Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara

    INTERSPEECH   3889 - 3893   2022.09

    Language:English   Publishing type:Research paper (international conference proceedings)  

  • Spectrogram Enhancement of Synthesized Speech Using Speaker Information and Masking for Data Augmentation in Speech Recognition

    上乃 聖,李 晃伸,河原 達也

    日本音響学会講演論文集   1149 - 1150   2022.09

    Authorship:Lead author, Corresponding author   Language:Japanese   Publishing type:Research paper (other academic)  

  • Phone-informed refinement of synthesized mel spectrogram for data augmentation in speech recognition (Reviewed)

    Sei Ueno and Tatsuya Kawahara

    International Conference on Acoustics, Speech, and Signal Processing (ICASSP)   8572 - 8576   2022.05

    Authorship:Lead author, Corresponding author   Language:English   Publishing type:Research paper (international conference proceedings)  

  • Data Augmentation Approaches for Automatic Speech Recognition Using Text-to-Speech

    Sei Ueno

    2022.03

    Authorship:Lead author, Corresponding author   Language:English   Publishing type:Doctoral thesis  

    DOI: https://doi.org/10.14989/doctor.k24027

  • Enhancement of Synthesized Speech Using Phoneme Information for Data Augmentation in Speech Recognition

    上乃 聖,河原 達也

    日本音響学会講演論文集   887 - 888   2022.03

    Authorship:Lead author, Corresponding author   Language:Japanese   Publishing type:Research paper (other academic)  

  • Data Augmentation for ASR Using TTS Via a Discrete Representation (Reviewed)

    Sei Ueno, Masato Mimura, Shinsuke Sakai, and Tatsuya Kawahara

    IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)   68 - 74   2021.12

    Authorship:Lead author, Corresponding author   Language:English   Publishing type:Research paper (international conference proceedings)  

  • Spectrogram Enhancement of Synthesized Speech for Data Augmentation in Speech Recognition

    上乃 聖,河原 達也

    研究報告音声言語情報処理(SLP)   2021-SLP-139 ( 28 )   1 - 6   2021.11

    Authorship:Lead author, Corresponding author   Language:Japanese   Publishing type:Research paper (conference, symposium, etc.)  

  • Synthesizing Waveform Sequence-to-sequence to Augment Training Data for Sequence-to-sequence Speech Recognition (Reviewed)

    Sei Ueno, Masato Mimura, Shinsuke Sakai, and Tatsuya Kawahara

    Acoustical Science and Technology   42 ( 6 )   333 - 343   2021.11

    Authorship:Lead author, Corresponding author   Language:English   Publishing type:Research paper (scientific journal)  

    DOI: https://doi.org/10.1250/ast.42.333

    Other Link: https://www.jstage.jst.go.jp/article/ast/42/6/42_E2108/_article

  • Data Augmentation for Speech Recognition via Speech Synthesis Using wav2vec 2.0

    上乃 聖,河原 達也

    日本音響学会講演論文集   857 - 858   2021.09

    Authorship:Lead author, Corresponding author   Language:Japanese   Publishing type:Research paper (other academic)  

  • Data Augmentation for Speech Recognition Handling Discrete IDs from vq-wav2vec

    上乃 聖,三村 正人,河原 達也

    日本音響学会講演論文集   825 - 826   2021.03

    Authorship:Lead author, Corresponding author   Language:Japanese   Publishing type:Research paper (other academic)  

  • Non-autoregressive Neural Speech Synthesis for Multiple Speakers

    上乃 聖,三村 正人,河原 達也

    日本音響学会講演論文集   825 - 826   2021.03

    Authorship:Lead author, Corresponding author   Language:Japanese   Publishing type:Research paper (other academic)  

  • Rescoring Speech Recognition Hypotheses with ELECTRA

    二見 颯,稲熊 寛文,上乃 聖,三村 正人,坂井 信輔,河原 達也

    日本音響学会講演論文集   827 - 828   2021.03

    Language:Japanese   Publishing type:Research paper (other academic)  

  • Knowledge Distillation from BERT to Sequence-to-Sequence Speech Recognition

    二見 颯,稲熊 寛文,上乃 聖,三村 正人,坂井 信輔,河原 達也

    研究報告音声言語情報処理(SLP)   2020-SLP-134 ( 2 )   1 - 6   2020.11

    Language:Japanese   Publishing type:Research paper (conference, symposium, etc.)  

  • End-to-End Speech-to-Dialog-Act Recognition (Reviewed)

    Viet-Trung Dang, Tianyu Zhao, Sei Ueno, Hirofumi Inaguma, Tatsuya Kawahara

    INTERSPEECH   3910 - 3914   2020.10

    Language:English   Publishing type:Research paper (international conference proceedings)  

    DOI: https://doi.org/10.21437/Interspeech.2020-1062

  • End-to-End Speech Emotion Recognition Combined with Acoustic-to-Word ASR Model (Reviewed)

    Han Feng, Sei Ueno, Tatsuya Kawahara

    INTERSPEECH   501 - 505   2020.10

    Language:English   Publishing type:Research paper (international conference proceedings)  

    DOI: https://doi.org/10.21437/Interspeech.2020-1180
