Speech Processing
-
- CONFERENCE (INTERNATIONAL)
- Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- August 20, 2023
-
- JOURNAL (INTERNATIONAL)
- Audio Signal Processing in the 21st Century
- Gaël Richard (Telecom-Paris), Paris Smaragdis (University of Illinois Urbana-Champaign), Sharon Gannot (Bar-Ilan University), Patrick A. Naylor (Imperial College London), Shoji Makino (Waseda University), Walter Kellermann (University of Erlangen-N ̈urnberg), Akihiko Sugiyama
- IEEE Signal Processing Magazine (Signal Processing Magazine)
- July 19, 2023
-
- CONFERENCE (INTERNATIONAL)
- Fully Unsupervised Topic Clustering of Unlabelled Spoken Audio Using Self-Supervised Representation Learning and Topic Model
- Takashi Maekaku, Yuya Fujita, Xuankai Chang (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University)
- The International Conference on Acoustics, Speech, & Signal Processing 2023 (ICASSP 2023)
- June 07, 2023
-
- CONFERENCE (INTERNATIONAL)
- Adaptive Noise Canceller Algorithm with SNR-Based Stepsize and Data-Dependent Averaging
- Akihiko Sugiyama
- 2023 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 05, 2023
-
- CONFERENCE (INTERNATIONAL)
- Linear Microphone Array Parallel to the Driving Direction for In-Car Speech Enhancement
- Masanori Tsujikawa (NEC), Akihiko Sugiyama, Ken Hanazawa (NEC America), Yoshinobu Kajikawa (Kansai University)
- 2023 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 05, 2023
-
- OTHERS (INTERNATIONAL)
- Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- arXiv
- May 29, 2023
-
- CONFERENCE (INTERNATIONAL)
- Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation
- Motoi Omachi, Brian Yan (Carnegie Mellon University), Siddharth Dalmia (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023)
- May 08, 2023
-
- CONFERENCE (DOMESTIC)
- 訳語対の推定と順序入れ替え操作による説明可能なEnd-to-end音声翻訳
- 大町 基, Brian Yan (Carnegie Mellon University), Siddharth Dalmia (Carnegie Mellon University), 藤田 悠哉, 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会 (音響学会)
- March 22, 2023
-
- CONFERENCE (DOMESTIC)
- Transformerを用いた音声認識モデルにおける事前分布を用いた注意重みの平滑化の検討
- 前角 高史, 藤田 悠哉, Yifang Peng (Carnegie Mellon University), 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会
- March 16, 2023
-
- CONFERENCE (DOMESTIC)
- ストリーミング End-to-End 音声認識のための RNN Transducer の最小遅延学習
- 篠原 雄介, 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会
- March 15, 2023
-
- CONFERENCE (INTERNATIONAL)
- Adaptive Noise Canceller Algorithm with an SNR-Based Stepsize and Controlled Averaging
- Akihiko Sugiyama
- IEEE International Conference on Consumer Electronics (ICCE)
- January 06, 2023
-
- OTHERS (DOMESTIC)
- Raw or cooked? That is the Question in Adaptive Noise Cancelling
- Akihiko Sugiyama
- 電子情報通信学会第37回信号処理シンポジウム (SIPシンポジウム)
- December 13, 2022
-
- OTHERS (INTERNATIONAL)
- Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation
- Motoi Omachi, Brian Yan (Carnegie Mellon University), Siddharth Dalmia (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- arXiv.org (arXiv)
- November 14, 2022
-
- CONFERENCE (INTERNATIONAL)
- Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR
- Takashi Maekaku, Yuya Fujita, Yifan Peng (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 19, 2022
-
- CONFERENCE (INTERNATIONAL)
- End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
- Xuankai Chang (Carnegie Mellon University), Takashi Maekaku, Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 19, 2022
-
- CONFERENCE (INTERNATIONAL)
- Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition
- Yusuke Shinohara, Shinji Watanabe (Carnegie Mellon University)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- WORKSHOP (INTERNATIONAL)
- User Preference between Residual Noise and Speech Distortion in Speech Enhancement
- Akihiko Sugiyama, Osamu Shimada (NEC Corporation), Toshiyuki Nomura (NEC Corporation)
- International Workshop on Acoustic Signal Enhancement (IWAENC)
- September 05, 2022
-
- CONFERENCE (INTERNATIONAL)
- An Exploration of Hubert with Large Number of Cluster Units and Model Assessment Using Bayesian Information Criterion
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- The International Conference on Acoustics, Speech, & Signal Processing 2022 (ICASSP 2022)
- May 10, 2022
-
- CONFERENCE (INTERNATIONAL)
- Robust Adaptive Noise Canceller Algorithm with SNR-Based Stepsize Control and Noise-Path Gain Compensation
- Akihiko Sugiyama
- 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022)
- May 08, 2022
-
- CONFERENCE (INTERNATIONAL)
- Non-Autoregressive End-to-End Automatic Speech Recognition Incorporating Downstream Natural Language Processing
- Motoi Omachi, Yuya Fujita, Shinji Watanabe (Carnegie Mellon University), Tianzi Wang (Johns Hopkins University)
- 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2022)
- April 27, 2022