-
Yui Sudo, Kazuya Hata, Kazuhiro Nakadai, Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation,
Interspeech 2023,
vol., no., pp.-,
, 20230801, Ireland, ,
-
Haris Gulzar, Monikka Roslianna Busto, Takeharu Eda, Katsutoshi Itoyama, Kazuhiro Nakadai, miniStreamer: Enhancing Small Conformer with Chunked-Context Masking for Streaming ASR Applications on the Edge,
Interspeech 2023,
vol., no., pp.-,
, 20230801, Ireland, ,
-
Takahiro Aizawa, Yoshiaki Bando, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai, Masaki Onishi, Unsupervised domain adaptation of universal source separation based on neural full-rank spatial covariance analysis,
2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP),
vol., no., pp.1-6,
, 20230901, Rome, Italy, ,
10.1109/MLSP55844.2023.10285999
-
Tan, Sihan, Khan, Nabeela Khanum, Itoyama, Katsutoshi, Nakadai, Kazuhiro, Improving Sign Language Understanding Introducing Label Smoothing,
32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN 2023),
vol., no., pp.-,
, 20230801, Busan, Korea, ,
https://ras.papercept.net/conferences/scripts/rtf/ROMAN23_Conten...
-
Sudo, Yui, Takigahira, Masayuki, Tsuru, Hideo, Nakadai, Kazuhiro, Nakajima, Hirofumi, Online Adaptation of Fourier Series Based Acoustic Transfer Function Model to Improve Sound Source Localization and Separation,
32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN 2023),
vol., no., pp.-,
, 20230801, Busan, Korea, ,
https://ras.papercept.net/conferences/scripts/rtf/ROMAN23_Conten...
-
S. Matsubayashi, F. Saito, R. Suzuki, K. Nakadai, H. G. Okuno, Calling dynamics of the Ruddy-breasted crake (Porzana fusca) in a fragmented landscape,
AOS & SCO - SOC 2023,
vol., no., pp.-,
, 20230801, , ,
-
Hao Zhao, Reiji Suzuki, Ryosuke Kojima, Takaya Arita, Kazuhiro Nakadai, A soundscape analysis of bird and cicada vocalizations based on azimuth and elevation localization using robot audition and machine learning techniques,
28th International Symposium on Artificial Life and Robotics (AROB 28th 2023), 8th International Symposium on BioComplexity (ISBC8), 6th International Symposium on Swarm Behavior and Bio-Inspired Robotics (SWARM6),
vol., no., pp.-,
, 20230125, , ,
-
Kazuhiro Nakadai, Robot Audition 5.0 and Beyond, Southern University of Science and Technology,
,
20230101, Southern University of Science and Technology (SUSTech),
-
Naoki Yamamoto, Kenji Nishida, Katsutoshi Itoyama, Kazuhiro Nakadai, Classification of Ball Rotation Direction Using Hitting Sound in Tennis and Investigation of Generalization Performance Improvement,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Kei Suzuki, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai, Audio-Visual Class Association Based on Two-Stage Self-Supervised Contrastive Learning towards Robust Scene Analysis,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Masahiko Fujita, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai, An Ensemble Method for Multiple Speech Enhancement Using Deep Learning,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Hidehiko Kishinami, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai, Reconstruction of Depth Scenes Based on Echolocation,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai, Metric-Based Multimodal Meta-Learning for Human Movement Identification Via Footstep Recognition,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Reiji Suzuki, Shinji Sumitani, Zachary Harlow, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno, Extracting Bird Vocalizations from a Complex Natural Soundscape in Forests Using Robot Audition Techniques,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Haris Gulzar, Muhammad Shakeel, Katsutoshi Itoyama, Kazuhiro Nakadai, Kenji Nishida, Hideharu Amano, Takeharu Eda, FPGA Based Power-Efficient Edge Server to Accelerate Speech Interface for Socially Assistive Robotics,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Chishio Sugiyama, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai, Assessment of Simultaneous Calibration for Positions, Orientations, and Time Offsets in Multiple Microphone Arrays Systems,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Yuanzheng He, Jiang Wang, Daobilige Su, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, You-Fu Li, He Kong, Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization,
Proceedings of IEEE/SICE International Symposium on System Integration (SII 2023),
20230101,
-
Benjamin Yen, Taiki Yamada, Katsutoshi Itoyama, Kazuhiro Nakadai, Performance evaluation of sound source localisation and tracking methods using multiple drones,
Internoise 2023,
20230101, Makuhari, Japan,
-
Benjamin Yen, C. T. Justine Hui, Esther Bergin, Eleesa Jensen, Suzanne C. Purdy, William Keith, Yusuke Hioka, James Whitlock, George Dodd, Development of a continuous classroom signal-to-noise ratio measurement system,
Internoise 2023,
20230101, Makuhari, Japan,
-
Sihan Tan, Nabeela Khanum Khan, Katsutoshi Itoyama and Kazuhiro Nakadai, Current status of sign language datasets,
RO-MAN 2023 Workshop on Speech-based communication for robots and systems,
vol., no., pp.-,
, 20230828, , ,
-
Khan Nabeela Khanum, Tan Sihan, Itoyama Katsutoshi, and Nakadai Kazuhiro, Sign Language: Generation of Non-Verbal Communication,
RO-MAN 2023 Workshop on Speech-based communication for robots and systems,
vol., no., pp.-,
, 20230828, , ,
-
Shuhei Asaka, Katsutoshi Itoyama, and Kazuhiro Nakadai, Towards Natural Spoken Dialogue Systems Based on AI Services,
RO-MAN 2023 Workshop on Speech-based communication for robots and systems,
vol., no., pp.-,
, 20230828, , ,
-
Benjamin Yen, Taiki Yamada, Katsutoshi Itoyama, Kazuhiro Nakadai, Rotor Noise-Informed Sound Source Tracking with Multiple Drones Using Microphone Arrays,
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023) LBR,
vol., no., pp.-,
, 20231001, , ,
-
Kazuhiro Nakadai, Masayuki Takigahira, Katsutoshi Itoyama, PyHARK: A Python Package for Robot Audition Based on HARK,
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023) LBR,
vol., no., pp.-,
, 20231001, , ,
-
Kazuhiro Nakadai, Masayuki Takigahira, Katsutoshi Itoyama, PyHARK: A HARK-based python package for robot audition and computational auditory scene analysis,
Asia Pacific Conference on Robot IoT System Development and Platform (APRIS 2023),
vol., no., pp.65-66,
, 20231101, , ,
-
Zirui Lin, Katsutoshi Itoyama, Kazuhiro Nakadai, Masayuki Takigahira, Haris Gulzar, Takeharu Eda, Monikka Roslianna Busto, Hideharu Amano, PyHARK Acceleration: A GPU-based Approach,
Asia Pacific Conference on Robot IoT System Development and Platform (APRIS 2023),
vol., no., pp.61-62,
, 20231101, , ,
-
Yanke Long, Riku Yasuda, Yui Sudo, Katsutoshi Itoyama, Kazuhiro Nakadai, Hideharu Amano, Kenji Nishida, Sound event localization and detection utilizing overlapping end-to-end learning,
Asia Pacific Conference on Robot IoT System Development and Platform (APRIS 2023),
vol., no., pp.63-64,
, 20231101, , ,
-
Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai, Is the Ideal Ratio Mask Really the Best? Exploring the Best Extraction Performance and Optimal Mask of Mask-Based Beamformers,
Asia Pacific Signal and Information Processing Association (APSIPA 2023),
vol., no., pp.4-5,
, 20231031, , ,
-
Kazuhiro Nakadai, Robot Audition 5.0 and Beyond,
POSTECH,
vol., no., pp.-,
, 20230101, Pohang, Korea, ,