Publication and Awards of Okuno Laboratory


[ AY2013|AY2012|AY2011|AY2010|AY2009|AY2008 |AY2007 | AY2006 | AY2005 | AY2004 | AY2003 | AY2002 | AY2001 | (Before AY2000) | Japanese | Awards ]


DBLP: H.G. Okuno, K. Itoyama, S. Nishide, T. Mizumoto, L.K. Cahier, E-H. Kim, T. Otsuka, A. Lim, T. Itohara, K. Nagira,
OB: T. Ogata, T. Takahashi, N. Yamakawa, Y. Hirasawa, N. Nishikawa, R. Takeda, W. Hinoshita, A. Maezawa, N. Yasuraoka, K. Matsuyama, H. Awano, K. Komatani, T. Yoshioka, H. Fujihara, Katsumaru, S. Shiramatsu, S. Ikeda, H. Kanda Y. Kubota Sumi, H-D. Kim, S. Yamamoto, K. Yoshii, R. Yokoya S. Naito, T. Kitahara, H. Niwa, M. Yoshida, T. Tasaki, S. Matsumoto, Valin, Y. Akiba, T. Watanabe, K. Ishihara, Kodaka, M. Toda, T. Misu, I. Lane, Y. Akita, Y. Yamakata, A. Raux, Ito, T. Kawahara,
Google Scnholar, Microsoft Academic Search, odysa

o Academic Year 2013o

Thesis | Journal Papers | Book Chapters | International Conferences | Domestic Conferences | Patents

    Peer-Reviewed Journal Papers

  1. Tsuyoshi Tasaki, Tetsuya Ogata, Hiroshi G. Okuno: The Iteraction between a Robt and Multiple People based on Spatially Mapping of Friendliness and Motion Parameters, Advanced Robotics, accepted with minor modification, Jul. 2, 2013.
  2. Ui-Hyun Kim, Hiroshi G. Okuno: Improved Binaural Sound Localization and Tracking for Unknown Time- Varying Number of Speakers, Advanced Robotics, accepted, Dec. 28, 2012. in print, Vo.27, Issue 17 (Nov. 2013). Published Online on 1 July 2013. doi: 10.1080/01691864.2013.812177
  3. Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno: A real-tome super-resolution robot audition system that improves the robustness of simultaneous speech recognition Advanced Robotics, Vol.27 Issue 12, pp.933-945, Published Online on 10 May 2013. doi:10.1080/01691864.2013.797139
  4. Daichi Sakaue, Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno: Robust Multipitch Analyzer against Initialization based on Latent Harmonic Allocation using Overtone Corpus, Journal of Information Processing, Vol.21, No.2 (Apr. 2013), pp.246-256, IPSJ, Jan. 2013. doi:10.2197/ipsjjip.21.246

    Peer-Reviewed Book Chapters

  5. Ui-Hyun Kim, Kazuhiro Nakadai, Hiroshi G. Okuno: Improved Sound Source Localization and Front-Back Disambiguation for Humanoid Robots with Two Ears, Recent Trends in Applied Artificial Intelligence, Lecture Notes in Computer Science, Vol.7906, pp.282-291, Springer. June 17-21, Amsterdam, The Netherland. (Acceptance rate for long papers: 34.7%). The Best Paper Award (1/103 papers) pdf doi: 10.1007/978-3-642-38577-3_29
  6. YangYang Huang, Takuma Otsuka, Hiroshi G. Okuno: A Speaker Dialization System with Robust Speaker Localization and Voice Activity Detection, Comtemprary Challenges and solutions in Applied Artificial Intelligence, Studies in Computational Intelligence, Vol. 489 (2013) pp.77-82, Springer. June 17-21, Amsterdam, The Netherland. doi: 10.1007/978-3-319-00651-2_11

    Peer-Reviewed International Conference Papers

  7. Mayumi J. Hikita, Hiroshi G. Okuno: PROPOSAL OF INTERNATIONAL CONFERENCE PROMOTION --- Destination Branding and Risk Management by a network of conference centres ---, Proceedings of the First International Conference of Serviceology, accepted, Oct. 16-18, AIST Tokyo Waterfront, Japan.
  8. Kenta Mochizuki, Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata: DevelopmentalHuman-Robot Imitation Learning of Drawing with a Neuro Dynamical System, Proceeding of IEEE International Conference on Systems, Man, and Cybernetics (SMC 2011), part SMC: Human-Machine, accepted, Manchester, UK, 13-16 Oct. 2013.
  9. Yoshinori Bando, Takeshi Muzumoto, Katsutoshi Itoyama, Kazuhiro Nakadai, Hiroshi G. Okuno: Posture Estimation of Horse-Shaped Robot using Microphone Array Localization, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2013), accepted, (acceptance rate 43%), IEEE, RSJ, Tokyo Big Sight, Japan, 3-7 Nov. 2013.
  10. Kotaro Furukawa, Keita Okutani, Kohei Nagira, Takuma Otsuka, Katsutoshi Itoyama, Kazuhiro Nakadai, Hiroshi G. Okuno: Noise Correlation Matrix Estimation for Improving Sound Source Localization by Multirotor UAV, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2013), accepted, (acceptance rate 43%), IEEE, RSJ, Tokyo Big Sight, Japan, 3-7 Nov. 2013.
  11. Naoki Hirayama, Koichiro Yoshino, Katsutoshi Itoyama, Shunsuke Mori, Hiroshi G. Okuno: Automatic Estimation of Dialect Mixing Ratio for Dialect Speech Recognition, Proceedings of International Conference on Spoken Language Processing (Interspeech 2013), (acceptance rate 52%), Aug. , 2013. Lyon, France.
  12. Ui-Hyun Kim, Hiroshi G. Okuno: Robust Localization and Tracking of Multiple Speakers in Real Environments for Binaural Robot Audition, Proceedings of the 14th International Workshop on Image and Audio Analysis for Multimedia Interactive Services (WIA2MIS 2013), pp.1-4, Paris, July 3-5, 2013. pdf
  13. Kazuki Yazawa, Daichi Sakaue, Kohei Nagira, Katsutoshi Itoyama, Hiroshi G. Okuno: AUDIO-BASED GUITAR TABLATURE TRANSCRIPTION USING MULTIPITCH ANALYSIS AND PLAYABILITY CONSTRAINTS, Proceedings of 2013 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), AASP-P.1.4, pp.196-200. Vancouver, Canada, May 26-31. pdf
  14. Daichi Sakaue, Takuma Otsuka, Katsutoshi Itoyama, Hiroshi G. Okuno: INITIALIZATION-ROBUST BAYESIAN MULTIPITCH ANALYZER BASED ON PSYCHOACOUSTICAL AND MUSICAL CRITERIA, Proceedings of 2013 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), AASP-P1.10, pp.226-230, Vancouver, Canada, May 26-31. pdf
  15. Naoyuki Kanda, Katsutoshi Itoyama: Hiroshi G. Okuno: MULTIPLE INDEX COMBINATION FOR JAPANESE SPOKEN TERM DETECTION WITH OPTIMUM INDEX SELECTION BASED ON OOV-REGION CLASSIFIER, Proceedings of 2013 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), SLP-P5.7, pp.8540-8544, pdf Vancouver, Canada, May 26-31.
  16. Tetsuya Ogata, Hiroshi G. Okuno: Integration of behaviors and languages with a hierarchal structure self-organized in a neuro-dynamial model, IEEE Symposium Series on Computational Intelligence 2013, accepted Singapore, Apr. 16-19, 2013.
  17. Randy Gomez, Keisuke Nakamura, Kazuhiro Nakadai, Ui-Hyun Kim, Hiroshi G. Okuno, Tatsuya Kawahara: Hands-Free Human Robot Communication Robust to Speaker's Radial Position, Proceedings of 2013 IEEE International Conference on Robots and Automation (ICRA 2013), accepted (acceptance rate 39%), Karlsruhe, Germany, May 6-10, 2013.

    Domestic Conference Papers

  18. Yoshiaki Bando, Takuma Otsuka, Takeshi Mizumoto, Katsutoshi Itoyama, Kazuhiro Nakadai, Hiroshi G. Okuno: ホース型ロボットのマイクロホンアレイを用いた姿勢推定, 日本ロボット学会第31回学術講演会, , 首都大学東京, Sep. 4-6, 2013.
  19. 古川 孝太郎, 大塚 琢馬, 糸山 克寿, 中臺 一博, 奥乃 博: Multirotor UAV を用いた音源定位のための雑音相関行列推定, 日本ロボット学会第31回学術講演会, , 首都大学東京, Sep. 4-6, 2013.
  20. 西牟田 勇哉, 平山 直樹, 大塚 琢馬, 杉山 治, 糸山 克寿, 奥乃 博: HARKを用いたロボットクイズ司会者HATTACK25 の開発, 日本ロボット学会第31回学術講演会, , 首都大学東京, Sep. 4-6, 2013.

    Patents

  21. 音楽音響信号生成システム, 発明者: 安部 武宏, 安良岡 直希, 糸山 克寿, 奥乃 博, 特許第5283289号,登録日2013 (平成25年) 6月7日. 特願2011-500614号, 出願日2011年8月9日, 国立大学法人京都大学.
  22. Musical score position estimating apparatus, musical score position estimating method, and musical score position estimating program. Inventors: Kazuhiro Nakadai, Takuma Otsuka, Hiroshi Okuno, Patent No. US 8,440,901, Date of Patent: May 14, 2013. Date of Application: Mar. 1, 2011.
  23. Audio Source Detection System, Inventors: Hiroshi Tsujino, Kazuhiro Nakadai, Hiroshi Okuno, Takeshi Mizumoto, and Ikkyu Aihara. Patent No. US 8,416,957, Date of Patent: Apr. 9, 2013. Date of Application: Dec. 4, 2009.

o Academic Year 2012o

Thesis | Journal Papers | Book Chapters | International Conferences | Domestic Conferences | Patents

    Peer-reviewed Journal Papers

  1. Daichi Sakaue, Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno: Robust Multipitch Analyzer against Initialization based on Latent Harmonic Allocation using Overtone Corpus, Journal of Information Processing, accepted, IPSJ, Jan. 2013.
  2. Kohei Nagira, Takuma Otsuka, Hiroshi G. Okuno: Nonparametric Bayesina Sparse Factor Analysis for Frequency Doain Blind Source Separation without Pearmuation Ambiguity, EURASIP Journal on Audio, Speech, and Music Processing, 2013, 2013:3. doi:10.1186/1687-4722-2013-4 目次 (学会サーバ)
  3. Kazunori Komatani, Mikio Nakano, Masaki Katsumaru, Kotaro Funakoshi, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Allocation of Training Data for Speech Understanding based on Multiple Model Combinations, IEICE Transactions on Information and Systems, Vol.E95-D, No.9 (Sep. 2012) pp.2298-2307. pdf (学会サーバ)
  4. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Long-Term Analysis of User Behaviors in Deployed Spoken Dialogue System Dialogue & Discourse, conditionally accepted pending required revisions, May 21, 2012.
  5. Akira Maezawa, Katsutoshi Itoyama, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automated Violin Fingering Transcription Through Analysis of an Audio Recording, Computer Music Journal, Fall 2012, Vol.36, No.3, Pages 57-72 Posted Online August 14, 2012 doi:10.1162/COMJ_a_00129 (Free at MIT Press)
  6. Shun Nishide, Jun Tani, Toru Takahashi, Hiroshi G. Okuno, Tetsuya Ogata: Tool-Body Assimilation of Humanoid Robt using Neuro-Dynamical System, IEEE Transactions on Autonomous Mental Development, Vol.4, Issue:2 (june 2012) pp.139-149, 2012. doi:10.1109/TAMD.2011.2177660

    Book Chapters

  7. Angelica Lim: Musical Robots and Interactive Multimodal Systems, Book Review, International Journal of Synthetic Emotions, Vol. 3, No. 2 (2012) 84-86. doi: 10.4018/jse.2012070105
  8. Kohei Nagira, Takuma Otsuka, Tetsuya Ogata, Hiroshi G. Okuno: Infinite Sparse Factor Analysis for Blind Source Separation in Reverberant Environments, SSPR/SPR2012, Lecture Notes in Computer Science, Vol. 7626, pp.638-647, Nov. 7-9. Hiroshima, Japan. doi: 10.1007/978-3-642-34166-3_70
  9. Angelica Lim, Hiroshi G. Okuno: Using speech data to recognize emotion in human gait, Human Behavior Understanding, A.A. Salah et al. (Eds): Human Behavior Understanding 2012, Lecture Notes in Computer Science, Vol.7559, pp.52-64, Springer, Algarve, Portgul, October 7, 2012. acceptance rate 42%. Abstractpdf doi: 10.1007/978-3-642-34014-7_5
  10. Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Recognition Based on Probabilistic Integration of Acoustic Features and Chord Transition, He Jiang et al. (Eds.): Advanced Research in Applied Artificial Intelligence, IEA/AIE-2012, pp.58-67, LNAI Vol.7345. Springer. June 9-12, Dalian, China. doi: 10.1007/978-3-642-31087-4_7
  11. Takeshi Mizumoto, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Adaptive Pitch Control for Robot Thereminist using Unscented Kalman Filter. H. Jiang, M. Ali, and M. Li (Eds.), Modern Advances in Intelligent Systems and Tools, Studies in In Computational Intelligence, pp.19-24, Springer. June 9-12, Dalian, China, 2012. (IEA/AIE-2012) doi: 10.1007/978-3-642-30732-4_3
  12. Takeshi Mizumoto, Hiromitsu Awano, Yoshiaki Bando, Playing with 3D Printer (in Japanese) Joho Shori , Vol.53, No.8 (Aug. 2012) Information Processing Society of Japan, DL
  13. Hiroshi G. Okuno: Preface, Special Section for Summer Vacation, Joho Shori , Vol.53, No.8 (Aug. 2012) Information Processing Society of Japan, DL
  14. Hiroshi G. Okuno, Kazuhiro Nakadai, Takeshi Mizumoto: Sensing technology for listening to several things at once, IEICE Magazine, Vol.95, No.5 (May 2012) pp.401-404, IEICE. pdf
  15. Masataka Goto, Hiroshi G. Okuno, Preface Information Processiong, Special Issue on Present and Future of CGM, Vol.53, No.5 (May 2012) pp.464-465, IPSJ DL

    International Conference Papers

  16. Naoki Hirayama, Shinsuke Mori, Hiroshi G. Okuno: Statistical Method of Building Dialect Language Models for ASR Systems, Proceedings of the 24th International Conference on Computational Linguistics (Coling-2012), 1179-1194, Mumbai, India, Dec. 8-15, 2012.
  17. Tatsuhiko Itohara, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno: Improvement of Audio-Visual Score Following in Robot Ensemble with Human Guitarist, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2012), accepted as oral (acceptance rate 57% = 133/233), IEEE, Osaka, Nov. 30 - Dec. 1, 2012.
  18. Hiroshi G. Okuno: Human Robot Interaction through Robot Audition, Keynote, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2012) Workshop on Motivational Aspect of Robotics in Physical Therapy, Vilamoura, Algarve, Portgul, October 12, 2012.
  19. Keita Mochizuki, Harumitsu Nobuta, Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata: Developmental Human-Robot Imitation Learning with Phased Structuring in Neuro Dynamical Systems, Proceedings of IROS-2012 Workshop on Cognitive Neuroscience Robotics, Pos-3, 6 pages, IEEE, RSJ, Vilamoura, Algarve, Portgul, October 12, 2012. pdf
  20. Yuki Yamaguchi, Harumitsu Nobuta, Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata: Developmental Human-Robot Imitation Learning with Phased Structuring in Neuro Dynamical Systems, Proceedings of IROS-2012 Workshop on Cognitive Neuroscience Robotics, PM2-3, 6 pages, IEEE, RSJ, Vilamoura, Algarve, Portgul, October 12, 2012. pdf
  21. Takeshi Mizumoto, Tetsuya Ogata, Hiroshi G. Okuno: Who is the leader in a multiperson ensemble? ---Multiperson human-robot ensemble model with leaderness---, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2012), pp.1413-1419 (812/1801=45.0%), IEEE, RSJ, Vilamoura, Algarve, Portgul, October 7-12, 2012. pdf doi: 10.1109/IROS.2012.6385782
  22. Joao Lobato Oliveira, Gokhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luis Paulo Reis, Fabien Gouyon, Live Assessment of Beat Tracking for Robot Audition, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2012), pp.992-997 (812/1801=45.0%), IEEE, RSJ, Vilamoura, Algarve, Portgul, October 7-12, 2012. pdf doi 10.1109/IROS.2012.6386100
  23. Takuma Otsuka, Katsutoshi Ishiguro, Hiroshi Sawada, Hiroshi G. Okuno: Unified Auditory Functions based on Bayesian Topic Model, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2012), pp.2370-2376 (812/1801=45.0%), IEEE, RSJ, Vilamoura, Algarve, Portgul, October 7-12, 2012. pdf doi: 10.1109/IROS.2012.6385787
  24. Yusuke Yamamura, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Unified Auditory Functions based on Bayesian Topic Model, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2012), pp.2364-2369 (812/1801=45.0%), IEEE, RSJ, Vilamoura, Algarve, Portgul, October 7-12, 2012. pdf doi: 10.1109/IROS.2012.6385765
  25. Hiroshi G. Okuno: Human Robot Interaction through Robot Audition Keynote, Workshop on Motivational Aspect of Robotics in Physical Therapy. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2012), Vilamoura, Algarve, Portgul, October 12, 2012.
  26. Daichi Sakaue, Takuma Otsuka, Katsutoshi Itoyama, Hiroshi G. Okuno: Bayesian Nonnegative Harmonic-Temporal Factorization and Its Application to Multipitch Analysis Proceedings of 13th International Society for Musical Information Retrieval Conference (ISMIR-2012), pp.91-96, Porto, Portgul, Oct, 2012.
  27. Joao Lobato Oliveira, Gokhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luis Paulo Reis, Fabien Gouyon, An Active Audition Framework for Auditory-driven HRI: Application to Interactive Robot Dancing, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man-2012), pp.1078-1085, IEEE, Paris, Sep 9-13, 2012. pdf doi: 10.1109/ROMAN.2012.6343892
  28. Takuya Yoshioka, Daichi Sakaue: Log-normal matrix factorization with application to speech-music separation, Proceedings of SAPA-SCALE Conference 2012, pp.80-85, Portland, OR, 7-8 September, 2012.
  29. Ikkyu Aihara, Takeshi Mizumoto, Takuma Otsuka, Hiromitsu Awano, Hiroshi G. Okuno, Kazuyuki Aihara: Possible Functions of Call Alternation in Frog Choruses, Tenth International Congress of Neuroethology, accepted, Aug. 5-10, 2012, University of Maryland, College Park, MD, USA. (poster) doi: 10.3389/conf.fnbeh.2012.27.00267
  30. Takeshi Mizumoto, Hiromitsu Awano, Ikkyu Aihara, Takuma Otsuka, Hiroshi G. Okuno: Sound imaging system for visualizing multiple sound sources from two species, Tenth International Congress of Neuroethology, accepted, Aug. 5-10, 2012, University of Maryland, College Park, MD, USA. (poster) doi: 10.3389/conf.fnbeh.2012.27.00247
  31. Takuma Otsuka, Katsuhiko Ishiguro, Hiroshi Sawada, Hiroshi G. Okuno: Bayesian Unification of Sound Source Localization and Separation with Permutation Resolution, Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI-12), 2038-2045 (26%, 294/1129), July 22-26 (26), 2012, Toronto, Canada. pdf AAAI server
  32. Harumitsu Nobuta, Kenta Kawamoto, Kuniaki Noda, Kohtaro Sabe, Hiroshi G. Okuno, Tetsuya Ogata: Body area segmentation from visual scene based on predictability of neuro-dynamical system, Proc. of the 2012 International Joint Conference on Neural Networks (IJCNN 2012), pp.1-8, Brisbane, Australia, June 10-15, 2012. doi:10.1109/IJCNN.2012.6252530
  33. Shun Nishide, Jun Tani, Hiroshi G. Okuno, Tetsuya Ogata: Self-organization of Object Features Representing Motion Using Multiple Timescales Recurrent Neural Network, Proc. of the 2012 International Joint Conference on Neural Networks (IJCNN 2012), pp.1-8, Brisbane, Australia, June 10-15, 2012. doi:10.1109/IJCNN.2012.6252714
  34. Luis-Kenzo Furuya Cahier, Tetsuya Ogata, Hiroshi G. Okuno: Incremental Probabilistic Geometry Estimation for Robot Scene Understanding, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2012), pp.3265-3630, (acceptance rate 40%), May 14-18, 2012, St. Paul, MN. pdf doi: 10.1109/ICRA.2012.6225343

    Patents

  35. Speech recognition system and method for generating a mask of the system. Inventors: Kazuhiro Nakadai, Toru Takahashi, Hiroshi Okuno, Patent No. US 8,392,185, Date of Patent: Mar. 5, 2013. Date of Application: Aug. 19, 2009.
  36. Reverberation suppressing apparatus and reverberation suppressing method . Inventors: Kazuhiro Nakadai, Hirofumi Nakajima, Hiroshi Okuno, and Ryu Takeda Patent No. US 8,391,505, Date of Patent: Mar. 5, 2013. Date of Application: Jun. 1, 2010.
  37. Musical piece recommendation system and method . Inventors: Masataka Goto, Kazuyoshi Yoshii, Hiroshi Okuno, Patent No. US 8,370,277, Date of Patent: Feb. 5, 2013. Date of Application: Jul. 31, 2008.
  38. 音源分離システム, 音源分離方法及び音源分離用コンピュータプログラム, 発明者: 糸山 克寿, 奥乃 博, 後藤 真孝, 特許第5201602号, 登録日平成25年2月22日. 特願2009-511801号, 2009年4月14日.
  39. 音声認識装置及び音声認識装置のマスク生成法, 発明者: 中臺 一博, 高橋 徹, 奥乃 博, 特許第5180928号, 登録日平成25年1月18日. 特開2010-49249号, 2010年3月4日. 特願2009-185164号, 2009年8月7日.
  40. 音源分離システム, 発明者: 武田 龍, 中臺 一博, 辻野 広司, 奥乃 博, 特許第5178370号, 登録日平成25年1月18日. 特開2009-42754号, 2009年2月26日. 特願2008-191382号, 2008年7月24日.
  41. 音源追跡システム,方法,およびロボット, 発明者: 中臺 一博, 辻野 広司, 長谷川 雄二, 奥乃 博, 特許第5170440号, 登録日平成25年1月11日. WO2007/129731, 2007年11月15日. 特願2008-514510号, 2007年5月9日. PCT/JP2007/059599
  42. 音源定位システム及び音源定位方法, 発明者: 中臺 一博, 奥乃 博, 大塚 琢馬, 特開2013-44950号, 2013年3月4日. 特願2011-182774号, 2011年8月24日.
  43. Sound source tracking system, method and robot. Inventors: Kazuhiro Nakadai, Hiroshi Tsujino, yuji Hasegawa, Hiroshi Okuno, Patent No. US 8,155,331, Date of Patent: Apr. 10, 2012. Date of Application: May 9, 2007.
  44. 文単位検索方法, 文単位検索装置, コンピュータプログラム, 記憶媒体, 及び文書記憶装置, 発明者: 白松 俊, 駒谷 和範, 奥乃 博, 特許第5167546号, 登録日平成25年1月11日. 特願2008-530812号, 2007年3月16日. 出願者: 京都大学
  45. ロボット, 発明者: 中臺 一博, 長谷川 雄二, 辻野 広司, 村田 和真, 武田 龍, 奥乃 博, 特許第5150573号, 登録日平成24年12月7日. 特開2010-026513号, 2010年2月4日. 特願2009-166049号, 2009年7月14日.
  46. 音分離装置、及び、それを備えたカメラユニット. 発明者:梅田 修志, 堀邊 隆介, 奥乃 博, 高橋 徹. 特開2012-238964号, 2012年12月6日. 特願2011-105404号, 2011年5月10日.
  47. 音楽音響信号と歌詞の時間的対応付けを自動で行うシステム及び方法, 発明者: 藤原 弘将, 奥乃 博, 後藤 真孝, 特許第5131904号,登録日: 2012年11月16日. 特開2008-134606号, 2008年6月12日. 特願2007-233682号, 2007年9月10日.
  48. Language Understanding Device. Inventors: Mikio Nakano, Hiroshi Okuno, Kazunori Komatani, Yuichiro Fukubayashi, Kotaro Funakoshi. Patent No. US 8,244,522, Date of Patent: Aug. 14, 2012. Date of Application: May 20, 2008.
  49. Sound source separation system, sound source separation method, and computer program for sound source separation. Inventors: Katsutoshi Itoyama, Hiroshi Okuno, and Masataka Goto. Patent No. US 8,239,052, Date of Patent: Aug. 7, 2012. Date of Application: Apr. 14, 2008.
  50. 楽譜位置推定装置、及び楽譜位置推定方法, 発明者: 中臺 一博, 大塚 琢馬, 奥乃 博 特開2012-168538号, 2012年9月6日. 特願2012-29802号, 2012年2月14日
  51. 言語理解装置, 発明者: 中野 幹生, 奥乃 博, 福林 雄一朗, 船越 孝太郎, 特許第50664834号,登録日: 2012年8月17日. 特開2008-293019号, 2008年12月4日. 特願2008-134401号, 2008年5月22日.
  52. Sound source tracking system, method and robot」 Inventors: Kazuhiro Nakadai, Hiroshi Tsujino, Yuji Hasegawa, and Hiroshi Okuno. Patent No. US 8,155,331, Date of Patent: Apr. 10, 2012. Date of Application: May 9, 2007.

o Academic Year 2011o

Thesis | Journal Papers | Book Chapters | International Conferences | Domestic Conferences | Patents

    Master and Bachelor Thesis

  1. Yasuharu Hirasawa: Under-Determined Blind Speech Separation Using a GMM-Based Sound Spectral Model. Master Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2012.
  2. Naoki Nishikawa: 音楽情報検索のための歌詞と音響特徴量を用いた楽曲印象軌跡推定. Master Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2012.
  3. Shinpei Aso: 歌声話声自動識別及び歌声の話声自動変換への応用. Master Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2012.
  4. Angelica Lim: Design and Implementation of Emotions for Humanoid Robots based on the Modality-independent DESIRE Model. Master Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2012.
  5. Hideki Takano: Automatic CHord Recognition System with Adaptation to Modulation Using Kye Slelection by Reliability, MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Aug. 2011.
  6. Nobuhide Yamakawa: Sound Source Recognition of Impulsive Sound Events using Matching Pursuit and Formant-wave Function and Audio Feature Design and Sound Source Adaptation for Their COnversion of Sound-Imitation Words, MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Aug. 2011.

    Peer-reviewed Journal Papers

  7. Angelica Lim, Takeshi Mizumoto, Tetsuya Ogata, Hiroshi G. Okuno: A musical robot that synchronizes with a co-player using non-verbal cues, Advanced Robotics, Special Issue on Cutting Edge of Robtics in Japan, Vol.26 (2012) pp.363-381. doi:10.1163/156855311X614626
  8. Angelica Lim, Tetsuya Ogata, Hiroshi G. Okuno: Towards expressive musical robots: A cross-modal framework for emotional gesture, voice and music, EURASIP Journal on Audio, Speech, and Music Processing, 2012:3, Published: 17 January 2012. doi:10.1186/1687-4722-2012-3
  9. Tatsuhiko Itohara, Takuma Otsuka, Takeshi Muzumoto, Angelica Lim, Tetsuya Ogata, Hiroshi G. Okuno: A multi-modal tempo and beat tracking system based on audio-visual information from live guitar performances. EURASIP Journal on Audio, Speech, and Music Processing, 2012, 2012:6, Published: 20 January 2012. doi:10.1186/1687-4722-2012-6
  10. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Efficient Blind Dereverberation and Echo Cancellation based on Independent Component Analysis for Actual Acoustic Signals, Neural Computation, Vol.24, Issue 1 (Jan. 2012) pp. 234-272, MIT Press. doi:10.1162/NECO_a_00219 Posted online Dec. 9, 2011. IF: 2.290
  11. Tsuyoshi Tasaki, Fumio Ozaki, Nobuto Matsuhira, Tetsuya Ogata, Hiroshi G. Okuno: People Detection Based on Spatial Mapping of Friendliness and Floor Boundary Points for a Mobile Navigation Robot, Journal of Robotics, vol. 2011, Article ID 683975, 10 pages, 2011. doi:10.1155/2011/683975 Hindawi Publishing Corp.
  12. Kazunori Komatani, Kyoto Matsuyama, Ryu Takeda, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Spoken Dialogue System that Uses Information on Locutionary Acts to Interprete User Utterances, IPSJ Journal, Vol.52, No.12 (Dec., 2011) pp.3374-3385, IPSJ. pdf, DL
  13. Shinsuke Mori, Kazunori Komatani, Masaki Katsumaru, Tetsuya Okgata, Hiroshi G. Okuno: Automatic Vocabulary Expansion for Abbreviation Recognition in Spoken Dialogue System, IPSJ Journal, Vol.52, No.12 (Dec., 2011) pp.3398-3407, IPSJ. pdf, DL
  14. Angelica Lim, Takeshi Mizumoto, Takuma Otsuka, Luis-Kenzo Furuya Cahier, Tetsuya Ogata, Hiroshi G. Okuno: Musical Robot Co-Player: Real-time Syncrhorinzation with a Human Flutist Recognizing Visual Start and End Cues, IPSJ Journal, Vol.52, No.12 (Dec., 2011) pp.3599-3610, IPSJ. pdf, DL
  15. Naoki Yasuraoka, Takuya Yoshioka, Katsutoshi Itoyama, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Musical Sound Separation and Synthesis Using Harmonic/Inharmonic GMM and NMF for Phrase Replacing System, IPSJ Journal, Vol.52, No.12 (Dec., 2011) pp.3839-3852, IPSJ. pdf, DL
  16. Hiromasa Fujihara, Masataka Goto, Jun Ogata, Hiroshi G. Okuno: LyricSynchronizer: Automatic Synchronization Method Between Musical Audio Signals and Lyrics, IEEE Journal of Selected Topics in Signal Processing, Vol.5, No.6 (Oct. 2011) pp.1252-1261, doi:10.1109/JSTSP.2011.2159577
  17. Tetsuya Ogata, Tetsuo Sawaragi, Tadahiro Taniguchi: Preface, Advanced Robotics, Vol.25, No.17 (2011) pp. 2125-2126. 10.1163/016918611X59476
  18. Zhang Yang, Tetsuya Ogata, Shun Nishide, Toru Takahashi, Hiroshi G. Okuno: Classification of Known and Unknown Environmental Sounds based on Self-organized Space using Recurrent Neural Network, Advanced Robotics, Vol.25, No.17 (2011) pp. 2127-2141. 10.1163/016918611X595017
  19. Shun Nishide, Jun Tani, Hiroshi G. Okuno, Tetsuya Ogata: Towards Written Text Recognition based on Handwriting Experiences using Recurrent Neural Network, Advanced Robotics, Vol.25, No.7 (2011) pp.2173-2187. 10.1163/016918611X595026
  20. Takeshi Mizumoto, Ikkyu Aihara, Takuma Otsuka, Ryu Takeda, Kazuyuki Aihara, Hiroshi G. Okuno: Sound imaging of nocturnal animal calls in their natural habitat, Journal of Comparative Physiology A: Neuroethology, Sensory, Neural, and Behavioral Physiology, Vol.197, No.9, 915-921, Online First, 17 May 2011. doi:10.1007/s00359-011-0652-7 pdf, html, Supplementary material at MetaPress.
  21. Wataru Hinoshita, Horiaki Arie, Jun Tani, Hiroshi G. Okuno, Tetsuya Ogata: Emergence of Hierarchical Structure mirroring Linguistic Composition in a Recurrent Neural Network, Neural Networks, Vol.24, Issue 4 (May 2011) pages 311-320, Elsevier. doi:10.1016/j.neunet.2010.12.006
  22. Kazuhiro Nakadai, Hiroshi G. Okuno: Development of Robot Auditon Open-Sourced Software HARK (in Japanese), Digital Practice, Vol.2, No.2 (Apr. 2011) pp.133-140, IPSJ. pdf
  23. Kohei Sumi, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Recognition Based on Integration of Chord and Bass Pitches Features (in Japanese), IPSJ Journal, Vol.52, No.4 (Apr. 2011) pp.1803-1812. Information Processing Society of Japan. pdf, DL

    Book Chapters, Reviews

  24. Kohei Nagira, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Complex Extension of Infinite Sparse Factor Analysis for Blind Source Separation of Speech Signals, F. Theis et al. (Eds.): LVA/ICA 2012, Lecture Notes in Computer Science 7191, Springer-Verlag, pp.388-396, 2012. Tel-Aviv, Israel, Mar. 12-15, 2012.
  25. Yasuharu Hirasawa, Naoki Yasuraoka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: A GMM Sound Source Model for Blind Speech Separation in Under-determined Condisions, F. Theis et al. (Eds.): LVA/ICA 2012, Lecture Notes in Computer Science 7191, Springer-Verlag, pp.446-453, 2012. Tel-Aviv, Israel, Mar. 12-15, 2012.
  26. Hiromitsu Awano, Shun Nishide, Hiroaki, Arie, June Tani, Hiroshi G. Okuno, Tetsuya Ogata: Use of a Sparse Structure to Improve Learning Performance of Recurrent Neural Networks, Proceeding of 18th International Conference on Systems, Man, and Cybernetics (ICONIP 2011), Part III, pp.323-331, Lecture Notes in Computer Science 7064, Springer-Verlag, Shanghei, Nov. 13-17, 2011.
  27. Hiromitsu Awano, Shun Nishide, Hiroaki, Arie, June Tani, Hiroshi G. Okuno, Tetsuya Ogata: Use of a Sparse Structure to Improve Learning Performance of Recurrent Neural Networks, Proceeding of 18th International Conference on Systems, Man, and Cybernetics (ICONIP 2011), Part III, pp.323-331, Lecture Notes in Computer Science 7064, Springer-Verlag, Shanghei, Nov. 13-17, 2011.
  28. Kazunori Komatani, Kyoko Matsuyama, Ryu Takeda, Tetsuya Ogata, Hiroshi G. Okuno: Evaluation of Spoken Dialogue System that uses Utterance Timing to Interprete User Utterances, R.L-C. Delgado and T. Kobayashi (Eds.): Proceedings of International Workshop on Spoken Dialogue Systems (IWSDS2011), pp.315-326, Springer, Sep. 2011. doi:10.1007/978-1-4614-1335-6
  29. Yasuharu Hirasawa, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Robot with Two Ears Listens to More Than Two Simultaneous Utterances by Exploiting Harmonic Structures, K.G. Mehrotra et al. (Eds.): IEA/AIE-2011, Part I, LNAI 6703, pp.348-358. Springer. Syracuse, NY, June 28 - July 1, 2011.
  30. Nobuhide Yamakawa, Toru Takahashi, Tetsuro Kitahara, Tetsuya Ogata, Hiroshi G. Okuno: Environmental Sound Recognition for Robot Audition using Matching-pursuit, , K.G. Mehrotra et al. (Eds.): IEA/AIE-2011, Part II, LNAI 6704, pp.1-10. Springer, Syracuse, NY, June 28 - July 1, 2011.
  31. Zhang Yang, Tetsuya Ogata, Shun Nishide, Toru Takahashi, Hiroshi G. Okuno: Cluster Self-organization of Known and Unknown Environmental Sounds using Recurrent Neural Network, T. Honkela, W. Duch, M. A. Girolami, S. Kaski (Eds.): Artificial Neural Networks and Machine Learning - ICANN 2011, LNCS 6791, pp.167-175, Springer, (58%, 108/185), Espoo, Finland, June 14-17, 2011.
  32. Hiroshi G. Okuno, Kazuhiro Nakadai, Hyun-Don Kim: Robot Audition: Missing Feature Theory Approach and Active Audition, K. Pradalier, R. Siegward, and G. Hirzinger (Eds.): Robotics Research, STAR 70, pp.227-244, Springer-Verlag. doi:10.1007/978-3-642-19457-3_14

    Peer-reviewed International Conference Papers

  33. Luis-Kenzo Furuya Cahier, Tetsuya Ogata, Hiroshi G. Okuno: Incremental Probabilistic Geometry Estimation for Robot Scene Understanding, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2012), accepted (acceptance rate 40%), May 14-18, 2012, St. Paul, MN.
  34. Daichi Sakaue, Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno: INITIALIZATION-ROBUST MULTIPITCH ESTIMATION BASED ON LATENT HARMONIC ALLOCATION USING OVERTONE CORPUS, Proceedings of 2012 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012), accepted, IEEE, Kyoto, Japan, March 25-30, 2012.
  35. Harumitsu Nobuta, Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata: Identification of self-body based on dynamic predictability using neuro-dynamical system, Proceedings of 2011 IEEE/SICE International Symposium on System Integration (SII2011), accepted, Dec. 20-22, 2011, Kyoto.
  36. Shotaro Sano, Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata: Predicting Listener Back-Channels for Human-Agent Interaction using Neuro-dynamical Model, Proceedings of 2011 IEEE/SICE International Symposium on System Integration (SII2011), accepted, Dec. 20-22, 2011, Kyoto.
  37. Naoki Nishikawa, Hiromasa Fujihara, Masataka Goto, Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno: A Musical Mood Trajectory Estimation Method Using Lyrics and Acoustic Features, Proceedings of International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies (MIRUM'11), 51-56, ACM, Nov. 28 - Dec. 1, 2011, Scottsdale, AZ.
  38. Angelica Lim, Tetsuya Ogata, Hiroshi G. Okuno: Converting emotional voice to motion for robot telepresence, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2011), accepted as oral (acceptance rate 17.4% = 28/190), IEEE, Bled, Slovenia, Oct. 26-28, 2011.
  39. Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno: Incremental Bayesian Audio-to-Score Alignment with Flexible Harmonic Structure Models, Proceedings of 12th International Society for Musical Information Retrieval Conference (ISMIR-2011), accepte Miami, FL, Oct. 24-28. 2011.
  40. Angelica Lim, Takeshi Muzumoto, Takuma Otsuka, Tatsuhiko Itohara, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno: More cowbell! A musical ensemble with the NAO thereminist, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2011), IROS 2011 Standard Platform Demo, IEEE, RSJ, San Francisco, 25-30 Sep. 2011.
  41. Tatsuhiko Itohara, Takeshi Muzumoto, Takuma Otsuka, Tetsuya Ogata, Hiroshi G. Okuno: Particle-filter Based Audio-visual Beat-tracking for Music Robot Ensemble with Human Guitarist, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2011), pp.118-124, (790/2459 =32.1%), IEEE, RSJ, San Francisco, 25-30 (26) Sep. 2011. doi:10.1109/IROS.2011.6094773 IEEE Robotics and Automation Society Japan Chapter Young Award pdf
  42. Eui-Hyun Kim, Takeshi Muzumoto, Tetsuya Ogata, Hiroshi G. Okuno: Improvement of Speaker Localization by Considering Multipath Interference of Sound Wave for Binaural Robot Audition, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2011), pp.2910-2915, (790/2459 =32.1%), IEEE, RSJ, San Francisco, 25-30 Sep. 2011. doi:10.1109/IROS.2011.6094778 pdf
  43. Shun Nishide, Jun Tani, Hiroshi G. Okuno, Tetsuya Ogata: Handwriting Prediction Based Character Recognition using Recurrent Neural Network, Proceeding of IEEE International Conference on Systems, Man, and Cybernetics (SMC 2011), pp.2549-2554, June 2011. doi:10.1109/ICSMC.2011.6084060
  44. Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno: Bayesian Extension of MUSIC for Sound Source Localization and Tracking, Proceedings of International Conference on Spoken Language Processing (Interspeech 2011), pp.3109-3112, (oral), Aug. 30, 2011. Florence, Italy. pdf
  45. Yasuharu Hirasawa, Naoki Yasuraoka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Fast and simple iterative algorithm of Lp-norm minimization for under-determined speech separation, Proceedings of International Conference on Spoken Language Processing (Interspeech 2011), pp.1745-1748, Florence, Italy, Aug. 29, 2011. pdf
  46. Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata, Jun Tani: Handwriting prediction based character recognition using recurrent neural network Proceeding of IEEE International Conference on Systems, Man, and Cybernetics (SMC 2011), pp. 2549-2554, Anchorage, Oct. 9-12, 2010.
  47. Mikio Nakano, Shun Sato, Kazunori Komatani, Kyoto Matsuyama, Kotaro Funakoshi, Hiroshi G. Okuno: A Two-Stage Domain Selection Framework for Extensible Multi-Domain Spoken Dialogue Systems, Proceedings of the 12th SIGDIAL Meeting on Discourse and Dialogue (SIGDIAL 2011), pp.18-29, accepted as an oral presentation, June 17-18, 2011, Portland, OR, USA. pdf Best paper award nomination finalist (4 papers)
  48. Katsutoshi Itoyama, Masataka Goto, Tetsuya Ogata, Hiroshi G. Okuno: SIMULTANEOUS PROCESSING OF SOUND SOURCE SEPARATION AND MUSICAL INSTRUMENT IDENTIFICATION USING BAYESIAN SPECTRAL MODELING, Proceedings of 2011 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011), pp.3816-3819, Poster, IEEE, Prague, Czech Republic, May 22-28, 2011. pdf doi:10.1109/ICASSP.2011.5947187
  49. Akira Maezawa, Hiroshi G. Okuno, Tetsuya Ogata, Masataka Goto: POLYPHONIC AUDIO-TO-SCORE ALIGNMENT BASED ON BAYESIAN LATENT HARMONIC ALLOCATION HIDDEN MARKOV MODEL, Proceedings of 2011 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011), pp.3816-3819, Poster, IEEE, Prague, Czech Republic, May 22-28, 2011. pdf
  50. Naoki Yasuraoka, Hirokazu Kameoka, Takuya Yoshioka, Hiroshi G. Okuno: I-DIVERGENCE-BASED DEREVERBERATION METHOD WITH AUXILIARY FUNCTION APPROACH, Proceedings of 2011 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011), pp.369-372, Poster, IEEE, Prague, Czech Republic, May 22-28, 2011. pdf
  51. Takeshi Mizumoto, Takami Yoshida, Kazuhiro Nakadai, Ryu Takeda, Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno: Design and Implementation of Selectable Sound Separation on a Texai Telepresence System using HARK, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2011), pp.2130-2137, May 9-13 (10), 2011, Shanghai, China. pdf

o Academic Year 2010o

Thesis | Journal Papers | Book Chapters | International Conferences | Domestic Conferences | Patents

    Peer-reviewed Journal Papers

  1. Ryu Takeda: A Unified Framework of Blind Separation, Blind Dereverberation and Self-Voice Cancellation for Real-Time Robot Audition, Ph.D Thesis (Supervisor: Prof. Hiroshi G. Okuno), Jan. 2011.
  2. Katsutoshi Itoyama: , Ph.D Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.

  3. Takuma Otsuka: Real-time Audio-to-Score Alignment using Particle Filter for Co-player Robots, MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
  4. Wataru Hinoshita Cognitive Integration of Language and Sensory-Motor System for Robots using Neuro-Dynamical Models, MS Thesis (Supervisor: Assoc. Prof. Tetsuya Ogata), Feb. 2011.
  5. Akira Maezawa Score-Aided Inference of Classical Music Interpretation, MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
  6. Kyoko Matsuyama MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
  7. Naoki Yasuraoka Musical Audio Signal Modeling Based on Harmonic-Domain Parametric NMF and I-Divergence-Based Dereveberation for Application to Phrase Replacing System, MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.

  8. Tatsuya Itohara Particle Filter-Based Audio-Visual Beat Tracking for Music Robot Ensemble with Human Guitarist, BE Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
  9. Shotaro Sano Prediction of Back Channel Timing using Neurodynamical Model, BE Thesis (Supervisor: Assoc. Prof. Tetsuya Ogata), Feb. 2011.
  10. Kohei Nagira Blind Source Separation of Actual Speech Signals in Time-Frequency Domain using isFA, BE Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
  11. Harumitsu Nobuta Identification of self body and acquisition of body scheme based on dynamic predictability using neuro-dynamical system, BE Thesis (Supervisor: Assoc. Prof. Tetsuya Ogata), Feb. 2011.
  12. Zhang Yang: Prediction and Classification of Environmental Sounds using Recurrent Neural Network, Master Thesis (Supervisor: Assoc. Prof. Tetsuya Ogata), Sep. 2010.

    Peer-reviewed Journal Papers

  13. Ikkyu Aihara, Ryu Takeda, Takeshi Mizumoto, Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno, Kazuyuki Aihara: Complex and Transitive Synchronization in a Frustrated System of Calling Frogs, Physical Review E, Vol.83, Issue 3. 031913 (2011) [5 pages], 21 Mar. 2011. doi:10.1103/PhysRevE.83.031913
  14. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Query-by-Example Music Information Retrieval by Score-Informed Source Separation and Remixing Technologies, EURASIP Journal on Advances in Signal Processing, Vol.2010, Article ID 172961, 14 pages, Hindawi Pub., Jan. 2011. online page, doi:10.1155/2010/172961
  15. Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Audio-to-Score Alignment Using Particle Filter for Co-player Music Robots, EURASIP Journal on Advances in Signal Processing, Vol.2011, Article ID 384651, 13 pages, 2011, Hindawi Pub. online page, doi:10.1155/2011/384651
  16. Takuya Yoshioka, Tomohiro Nakatani, Masato Miyoshi, Hiroshi G. Okuno: Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization, IEEE Transactions on Audio, Speech and Language Processing, Vol.19, Issue 1 (Jan. 2011) pp.69-84, IEEE. doi:10.1109/TASL.2010.2045183
  17. Mikio Nakano, Yuji Hasegawa, Kotaro Funakoshi, Yohane Takeuchi, Toyotaka Torii, Kazuhiro Nakadai, Naoyuki Kandai, Kazunori Komatani, Hiroshi G. Okuno, Hiroshi Tsujino: A multi-expert model for dialogue and behavior control of conversational robots and agents. Knowledge-Based Systems, Vol.24, No.2 (Mar. 2011) pp.248-256, Elsevier. doi:10.1016/j.knosys.2010.08.004, Preprint
  18. Kazunori Komatani, Yuichiro Fukubayashi, Satoshi Ikeda, Tetsuya Ogata, Hiroshi G. Okuno: Selecting Help Messages by Using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems. IEICE Transactions Information and Systems, Vol.E93-D, No.12 (Dec. 2010) pp.3359-3367. doi:10.1587/transinf.E93.D.3359
  19. Hiromasa Fujihara, Masataka Goto, Hiroshi G. Okuno, Simultaneous Estimation of Fundamental Frequency of vocal and Vowel Phonems in Polyphonic Music (in Japanese), Journal of Information Processing Society of Japan, Vol.51, No.10 (Oct. 2010) pp.1995-2006. DL
  20. Takeshi Mizumoto, Hiroshi Tsujino, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno, Development of a Theremin Player Robot Based on Arm-Position-to-Pitch and -Volume Models (in Japanese), Journal of Information Processing Society of Japan, Vol.51, No.10 (Oct. 2010) pp.2007-2019. DL
  21. Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: An Improvement in Auido-Visual Voice Activity Detection for Automatic Speech Recognition, Journal of Robotic Society of Japan, Vol.28, No.8 (Oct. 2010) pp.970-977. Abstract
  22. Tetsuya Ogata, Shun Nishide, Hideki Kozima, Kazunori Komatani, Hiroshi G. Okuno: Inter-modality Mapping in Robot with Recurrent Neural Network, Pattern Recognition Letters, Vol.31, Issue 12 (Sep. 2010) 1560-1569. doi:10.1016/j.patrec.2010.05.002
  23. Wataru Hinoshita, Tetsuya Ogata, Hideki Kojima, Toru Takahashi, Hiroshi G. Okuno: Journal of Robotic Society of Japan, Vol.28, No.4 (Apr. 2010) 532-543. Abstract
  24. Takuya Yoshioka, Tomohiro Nakatani, Masato Miyoshi, Hiroshi G. Okuno: Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization, IEEE Transactions on Audio, Speech and Language Processing, in print, IEEE, Mar. 2010. doi:10.1109/TASL.2010.2045183
  25. Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, Kotaro Funakoshi, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: Improving Speech Understanding Accuracy by Using Multiple Language Models and Multiple Language Understanding Models, IEICE Trans D, Special Issue on Information Explosion, Vol.J93-D, No.6 (June 2010) 879-888, IEICE. PDF at IEICE server
  26. Kazuhiro Nakadai, Toru Takahashi, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: Design and Implementation of Robot Audition System "HARK" - Open Source Software for Listening to Three Simulteaneous Speakers, Advanced Robotics, Vol.24 (2019) 739-761, VSP and Robotics Society of Japan. doi:10.1163/016918610X493561

    Book Chapters, Reviews

  27. Tetsuya Ogata, Wataru Hinoshita: Symbolic Processes in Multi-modara Communication between Robots, System/Control/Information, Vol.54, No.11 (Nov. 2011) pp.434-439.
  28. Wataru Hinoshita, Horiaki Arie, Jun Tani, Tetsuya Ogata, Hiroshi G. Okuno: Recognition and Generation of Sentences through Self-organizing Linguistic Hierarchy using MTRNN, Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.): Trends in Applied Intelligent Systems , LNAI 6098, 42-51, Cordoba, Spain, June 1-4 (2), 2010. pdf
  29. Takuma Otsuka, Takeshi Mizumoto, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Music-ensemble robot that is capable of playing the theremin while listening to the accompanied music, Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.): Trends in Applied Intelligent Systems , LNAI 6096, 102-112, Cordoba, Spain, June 1-4 (2), 2010. pdf Best paper award, Heisei 22nd Year C\amp; C Young Researcher Best Paper Award
  30. Akira Maezawa, Katsutoshi Itoyama, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Violin Fingering Estimation Based on Violin Pedagogical Fingering Model Constrained by Bowed Sequence Estimation from Audio Input, Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.): Trends in Applied Intelligent Systems , LNAI 6098, 249-259, Cordoba, Spain, June 1-4 (3), 2010. pdf
  31. Kyoko Matsuyama, Kazunori Komatani, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing, Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.): Trends in Applied Intelligent Systems , LNAI 6097, 585-594, Cordoba, Spain, June 1-4 (3), 2010. pdf
  32. Shun Shiramatsu, Jun Takasaki, Tatiana Zidrasco, Tadachika Ozono, Toramatsu Shintani, Hiroshi G. Okuno: System for Supporting Web-based Public Debate Using Transcripts of Face-to-face Meeting, Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.): Trends in Applied Intelligent Systems , LNAI 6098, 311-320, Cordoba, Spain, June 1-4 (4), 2010. pdf
  33. Takumi Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: An Improvement in Auido-Visual Voice Activity Detection for Automatic Speech Recognition, Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.): Trends in Applied Intelligent Systems , LNAI 6096, 51-61, Cordoba, Spain, June 1-4 (2), 2010. pdf

    Peer-reviewed International Conference Papers

  34. Zhang Yang, Tetsuya Ogata, Shun Nishide, Toru Takahashi, Hiroshi G. Okuno: Method of Discriminating Known and Unknown Environmental Sounds using Recurrent Neural Network, Proceedings of oint 5th International Conference on Soft Computing and Intelligent Systems and 11th International Symposium on advanced Intelligent Systems (SCIS & ISIS 2010), pp. 378-383, Okayama, JAPAN, December 8-12, 2010.
  35. Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Two-level Synchronization using Particle Filter for Co-player Music Robots, Proceedings of IEEE/RSJ-2010 Workshop on Robots and Musical Expression, CD-ROM, Oct. 18, 2010, Taipei, Taiwan. pdf
  36. Takeshi Mizumoto, Angelica Lim, Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno : Integration of flutist gesture recognition and beat tracking for human-robot ensemble, Proceedings of IEEE/RSJ-2010 Workshop on Robots and Musical Expression, CD-ROM, Oct. 18, 2010, Taipei, Taiwan. pdf
  37. Angelica Lim, Takeshi Mizumoto, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Programming by Playing and Approaches for Expressive Robot Performances, Proceedings of IEEE/RSJ-2010 Workshop on Robots and Musical Expression, CD-ROM, Oct. 18, 2010, Taipei, Taiwan. pdf
  38. Yasuharu Hirasawa, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Exploiting Harmonic Structures to Improve Separating Simultaneous Speech in Under-Determined Conditions (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2010), pp.450-457 (49.1%), TuBT12.5, IEEE, RSJ, Taipei, 18-22 Oct. 2010. doi:10.1109/IROS.2010.5651078 pdf IEEE Robotics and Automation Society Japan Chapter Young Award
  39. Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: An Improvement in Automatic Speech Recognition Using Soft Missing Feature Masks for Robot Audition, (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2010), pp.964-969, TuCT12.2, IEEE, RSJ, Taipei, 18-22 Oct. 2010. pdf
  40. Takumi Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: Two-Layered Audio-Visual Speech Recognition for Robots in Noisy Environments (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2010), pp.988-993, TuCT12.6, IEEE, RSJ, Taipei, 18-22 Oct. 2010. doi:10.1109/IROS.2010.5651205 pdf
  41. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Speedup and Performance Improvement of ICA-based Robot Audition by Parallel and Resampling-based Block-wise Processing (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2010), pp.1949-1954 (49.1%), TuET12.1, IEEE, RSJ, Taipei, 18-22 Oct. 2010. doi:10.1109/IROS.2010.5652757 pdf
  42. Takeshi Mizumoto, Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Human-Robot Ensemble between Robot Thereminst and Human Percussionist using Coupled Oscillator Model, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2010), pp.1957-1962, TuET12.2, IEEE, RSJ, Taipei, 18-22 Oct. 2010. doi:10.1109/IROS.2010.5650364 pdf
  43. Angelica Lim, Takeshi Mizumoto, Lois-Kenzo Cahier, Takuma Otsuka, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Musical Accompaniment: Integrating Audio and Visual Cues for Real-time Synchronization with a Human Flutist (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2010), pp.1964-1969, TuET12.3, IEEE, RSJ, Taipei, 18-22 Oct. 2010. doi:10.1109/IROS.2010.5650427 pdf IROS-2011NTF Award for Entertainment Robots and Systems.
  44. Shun Nishide, Tetsuya Ogata, Jun Tani, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Motion Generation Based on Reliable Predictability using Self-organized Object Features (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2010), pp.3453-3458, WeCT13.2, IEEE, RSJ, Taipei, 18-22 Oct. 2010. doi:10.1109/IROS.2010.5652609 pdf
  45. Nobuhide Yamakawa, Tetsuro Kitahara, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Effects of modelling within- and between-frame temporal variations in power spectra on non-verbal sound recognition, Proceedings of International Conference on Spoken Language Processing (Interspeech 2010), 2342-2345, (oral), Makuhari, 29 Sep. 2010. pdf
  46. Kyoko Matsuyama, Kazunori Komatani, Ryu Takeda, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Analyzing User Utterances in Barge-in-able Spoken Dialogue System for Improving Identification Accuracy Proceedings of International Conference on Spoken Language Processing (Interspeech 2010), 3050-3053, (acceptance rate 58.2%), Makuhari, 30 Sep. 2010. pdf
  47. Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nishimura, Toshio Irino: Simplification and extension of non-periodic excitation source representation for high-quality speech manipulation systems, Proceedings of International Conference on Spoken Language Processing (Interspeech 2010), 38-41, (oral), Makuhari, 27 Sep. 2010. pdf
  48. Kazunori Komatani, Hiroshi G. Okuno: Online Error Detection of Barge-In Utterances by Using Individual Users' Utterance Histories in Spoken Dialogue System, Proceedings of the 11th SIGDIAL Meeting on Discourse and Dialogue (SIGDIAL 2010), 289-296, Tokyo, Sep. 24-25, 2010. pdf pdf at SIGDial Server.
  49. Hiromitsu Awano, Tetsuya Ogata, Shun Nishide, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Human-Robot Cooperation in Arrangement of Objects Using Confidence Measure of Neuro-dynamcal Systems, Proceeding of IEEE International Conference on Systems, Man, adn Cybernetics (SMC 2010), accepted, June 2010.
  50. Shimpei Aso, Takuya Saitou, Masataka Goto, Katsutoshi Itoyama, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SpeakBySinging: Converting Singing Voices to Speaking Voices While Retaining Voice Timbre, Proceedings of the 13th International Conference on Digital Audio Effects (DAFx-10), 114-121, Graz, October, 2010, PDF at DAFx
  51. Kazunori Komatani, Masaki Katsumaru, Mikio Nakano, Kotaro Funakoshi, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Allocation of Training Data for Rapid Prototyping of Speech Understanding based on Multiple Model Combination, Proceedings of COLING 2010, accepted as poster presentation, (acceptance rate 42%), Beijing, China, September, 2010.
  52. Shun Shiramatsu, Tadachika Ozono, Toramatsu Shintani, Hiroshi G. Okuno: A Corpus-based Analysis of Coreferential Recency Effect in Japanese Discourse for Tracking Dynamic Topic. Proceedings of the 9th IEEE/ACIS International Conference on Computer and Information Science (ACIS-ICIS 2010), 645-650, Yamagata, Japan, Aug. 2010. doi:10.1109/ICIS.2010.65
  53. Akira Maezawa, Katsutoshi Itoyama, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Query-by-Conducting: An interface to retrieve classical-music interpretations by real-time tempo input, Proceedings of 11th International Society for Musical Information Retrieval Conference (ISMIR-2010), 477-482, Dresden, Aug. 2010. PDF at ISMIR
  54. Ikkyu Aihara, Takeshi Mizumoto, Ryu Takeda, Takuma Otsuka, Toru Takahashi, Kazuyuki Aihara, Hiroshi G. Okuno: Frustration in Synchronized Calling Behavior of Japanese Tree Frogs, International Conference on Nyuroethology, Aug. 5-7 (6-7), 2010, Salamanca, Spain. (poster)
  55. Takeshi Mizumoto, Ikkyu Aihara, Takuma Otsuka, Ryu Takeda, Kazuyuki Aihara, Hiroshi G. Okuno: Sound imaging system for visualizing spatio-temporal behavior of calling nocturnal animals, International Conference on Nyuroethology, Aug. 5-7 (6-7), 2010, Salamanca, Spain. (poster)
  56. Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Implementation of Two-level Synchronization for Interactive Music Robot, Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI-10), 1138-1244 (26.9%, 264/982), July 11-15 (15), 2010, Atlanta, GA. pdf
  57. Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improvement in Listening Capability for Humanoid Robot HRP-2. Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2010), 470-475, (847/2062), May 3-8 (4), 2010, Anchorage, Aalaska, USA. pdf doi:10.1109/ROBOT.2010.5509830
  58. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Upper-limit Evaluation of a Robot Audition based on ICA-BSS in Multi-source, Barge-in and Highly Reveberant Conditions. Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2010), 4366-4371, (847/2062), May 3-8, 2010, Anchorage, Aalaska, USA. pdf doi:10.1109/ROBOT.2010.5509891

    Non-reviewed papers

  59. Angelica Lim, Takeshi Mizumoto, Louis-Kenzo Cahier, Takuma Otsuka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Multimodal gesture recognition for robot musical accompaniment, 28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
  60. Louis-Kenzo Cahier, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno : Probabilistic polygonal mesh for 3D SLAM, 28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
  61. Nobuhide Yakamawa, Toru Takahashi, Tetsuro Kitahara, Tetsuya Ogata, Hiroshi G. Okuno : ロボット聴覚のための Matching-Pursuit による環境音の分離音認識, 28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
  62. 平澤 恭治, 高橋 徹, 尾形 哲也, 奥乃 博 : 調波構造を用いた L1 ノルム最小化に基づく劣決定音源分離手法の性能評価, 28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
  63. Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno. : Predictive Score Following user Particle Filter for Music Robots, 28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
  64. 水本 武志, 中臺 一博, 大塚 琢馬, 高橋 徹, 尾形 哲也, 奥乃 博 : 打楽器とロボットの合奏のための結合振動子モデルに基づく打撃時刻予想手法, 28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
  65. 武田 龍, 中臺 一博, 高橋 徹, 尾形 哲也, 奥乃 博 : リサンプル-ブロック処理と並列化に基づく ICA の実時間実装 , 28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
  66. Shimpei Aso, Takeshi Saitou, Masataka Goto, Katsutoshi Itoyama, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SpeakBySinging: A Speaking Voice Synthesis System Onverting Singing Voices to Speaking Voices, SIGMUS, Vol.2010-MUS-86, No.2, pp. IPSJ, Jul. 2010.
  67. Akira Maezawa, Hiroshi G. Okuno: Query-by-Conducting: A classical-music interpretation retrieval interface based on tempo similarity, SIGMUS, Vol.2010-MUS-86, No.2, pp. IPSJ, Jul. 2010.
  68. Naoki Yasuraoka, Katsutoshi Itoyama, Takuya Yoshioka, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Phrase Replacing System for Polyphonic MUsic Waveforms, 音楽情報科学研究会, Vol.2010-MUS-, No., pp. 情報処理学会, Jul. 2010.
  69. Kyoko Matsuyama, Kazunori Komatani, Ryu Takeda, Tetsuya Ogata, Hiroshi G. Okuno: Analysis of User Utterances and Application to Identify User's Referent in Barge-in-able Spoken Dialogue System, SIGMUS, Vol.2010-SLP-86, No.2, pp. IPSJ, Jul. 2010.

    Patent

  70. Sound location Estimation System, Inventors: Hiroshi Tsujino, Kazuhiro Nakadai, Hiroshi G. Okuno, Takeshi Mizumoto, Kazuyuki Aihara, Issued: No.2010-133964, June 17, 2010. Filed: No.2009-277075, Dec. 4, 2009.

o Academic Year 2009o

Thesis | Journal Papers | Book Chapters | International Conferences | Domestic Conferences | Patents

    Thesis

  1. Hiromasa Fujihara: Statistical Modeling for Recognizing Singing Voices in Polyphonic Music, Ph.D Thesis, Feb. 2010.
  2. Takuya Yoshioka: Speech Enhancement in Reverberatn Environments Feb. 2010. Ph.D Thesis, Feb. 2010.

  3. Masaki Katsumaru: 複数の言語モデルと言語理解モデルによる音声理解の高精度化とそのラピッドプロトタイピングへの適用, MS Thesis, Feb. 2010.
  4. Takeshi Mizumoto: ロボットによるテルミン演奏のための音高・音量特性のモデル化とフィードフォワード制御, MS Thesis, Feb. 2010.

  5. Soramichi Akiyama: 文法検証を統合したPOMDPによる対話管理, BE Thesis, Feb.10, 2009.
  6. Shinpei Aso: 音韻長・F0・振幅の制御により歌声を話声に変換する話声合成システム SpeakBySinging, BE Thesis, Feb.10, 2009.
  7. Akimitsu Awano: 人間とロボットの作業確信度を利用した協調物体配置システム, BE Thesis, Feb.10, 2009.
  8. Kyoji Hirasawa: 調波構造を用いた音源分離によるマイク数以上の同時発話認識, BE Thesis, Feb.10, 2009.

    Peer-reviewed Journal Papers

  9. Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Soft missing-feature mask generation for robot audition, PALADYN Journal of Behavioral Robotics, Vol.1, No.1 (Mar. 2010) pp. 37-47, doi:10.2478/s13230-010-0005-1
  10. Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Voice-awareness control for a humanoid robot consistent with its body posture and movements, PALADYN Journal of Behavioral Robotics, Vol.1, No.1 (Mar. 2010) pp.80-88, doi:10.2478/s13230-010-0009-x
  11. Hiromasa Fujihara, Masataka Goto, Tetsuro Kitahara Hiroshi G. Okuno: A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval, IEEE Transactions on Audio, Speech and Language Processing, Vol.18, No.3 (Mar. 2010) pp. 638 - 648, IEEE. doi:10.1109/TASL.2010.2041386
  12. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Binaural active audition for humanoid robots to localise speech over entire azimuth range, Applied Bionics and Biomechanics, Special Issue on "Humanoid Robots", Vol.6, Issue 3-4 (Sep. 2009) pp.355-368, Taylor & Francis 2009. doi:10.1080/11762320903007430
  13. Hyun-Don Kim, Jinsung Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Target Speech Detection and Separation for Communication with Humanoid Robots in Noisy Home Environments, Advanced Robotics, Vol.23, No.15 (2009) 2093-2111, VSP and Robotics Society of Japan. doi:10.1163/016918609X12529300552105
  14. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Self-Organization of Dynamic Object Features based on Bi-Directional Training, Advanced Robotics, Vol.23, No.15 (2009) 2035-2057. doi:10.1163/016918609X12529289797027 VSP and Robotics Society of Japan.
  15. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Autonomous Motion Generation based on Reliable Predictability, Journal of Robotics and Mechatronics, special issue on Kukanchi Interactive Human-Space Design and Intelligence Dedicated to Dr. Kazuo Tanie, Vol.21, No.4 (2009) 478-488.
  16. Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot audition by multi-channel input Independent Component Analysis (in Japanese). Journal of Robotics Society of Japan, Vol.27, No.7 (July, 2009) 782-792. pdf, pdf at RSJ server.
  17. Kazumasa Murata, Kazuhiro Nakadai, Ryu Takeda, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: Musical Beat-Tracking for Robots and Its Application to A Music Robot, Journal of Robotics Society of Japan, Vol.27, No.7 (July, 2009) 793-801. pdf, pdf at RSJ server.
  18. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Simulation of Phoneme Aquisition Process (in Japanese), Journal of Robotics Society of Japan, Vol.27, No.7 (2009) 902-813. pdf, pdf at RSJ server.
  19. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions, IPSJ Journal, Vol.50, No.7 (Jul. 2009) 1757-1767, IPSJ. Journal of Information Processing, Vol.17 (2009) 191-201, IPSJ. pdf D-Library
  20. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Human Tracking System Integrating Sound and Face Localization Using an Expection-Maximization Algorithm in Real Environments, Advanced Robotics, Vol.23, No.6 (May 2009) 629-653, doi:10.1163/156855309X431659 VSP and Robotics Society of Japan.

    Book Chapters, Reviews

  21. Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Adjusting Occurrence Probabilities of Automatically-Generated Abbreviated Words in Spoken Dialogue Systems, B.-C. Chien, T.-P. Hong, S.-M. Chen, M. Ali (Eds.): Next-Generation Applied Intelligence, 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, Lecture Notes in Artificial Intelligence 5579, pp.481-490, Tainan, Taiwan, Jun. 24-27, 2009. doi:10.1007/978-3-642-02568-6_49
  22. Shun Shiramatsu, Yuji Kubota, Kazunori Komatani, Tetsuya Ogata, Toru Takahashi, Hiroshi G. Okuno: Visualization-based Approaches to Support Context Sharing towards Public Involment Support System, Opportunities and Challenges for Next-Generation Applied Intelligence, Studies in Computational Intelligence, Springer, Vol.214, pp.111--117, Tainan, Taiwan, Jun. 24-27, 2009. doi:10.1007/978-3-540-92814-0_18
  23. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: A Model of Temporally Changing User Behaviors in a Deployed Spoken Dialogue System, G.-J. Houben et al. (Eds.): UMAP 2009, First and Seventeenth International Conference on User Modeling, Adaptation, and Personalization, Lecture Notes in Computer Science 5535, pp.408-414, Trento, Italy, Jun. 22-26, 2009. doi:10.1007/978-3-642-02247-0_45

    Peer-reviewed Conference Papers

  24. Naoki Yasuraoka, Takuya Yoshioka, Tomohiro Nakatani, Aatsushi Nakamura, Hiroshi G. Okuno: MUSIC DEREVERBERATION USING HARMONIC STRUCTURE SOURCE MODEL AND WIENER FILTER, Proceedings of 2010 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2010), pp.53-56, (lecture), Dallus, March, 2010. pdf
  25. Takuya Yoshioka, Tomohiro Nakatani, Hiroshi G. Okuno: NOISY SPEECH ENHANCEMENT BASED ON PRIOR KNOWLEDGE ABOUT SPECTRAL ENVELOPE AND HARMONIC STRUCTURE, Proceedings of 2010 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2010), pp.4270-4273, (lecture+poster 48.8\%), Dallus, March, 2010. pdf
  26. Akira Maezawa, Katsutoshi Itoyama, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Bowed String Sequence Estimation of a Violin Based on Adaptive Audio Signal Classification and Context-Dependent Error Correction, Proceedings of IEEE International Symposium on Multimedia (ISM2009), pp.9-16, (acceptance rate for full papers, 19.6%), San Diego, Dec. 14-16, 2009. doi:10.1109/ISM.2009.30
  27. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Estimation of Reverberation Time with Robot Speech to Improve ICA-based Robot Audition, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2009), pp.250-355, IEEE, Paris, Dec. 7-10, 2009. pdf. doi:10.1109/ICHR.2009.5379572
  28. Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Voice quality manipulation for humanoid robots consistent with their head movements, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2009), pp.405-410, IEEE, Paris, Dec. 7-10, 2009. pdf. doi:10.1109/ICHR.2009.5379569
  29. Takumi Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: Automatic Speech Recognition Improved by Two-Layered Audio-Visual, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), pp.604-609, IEEE, Paris, Dec. 7-10, 2009. pdf. doi:10.1109/ICHR.2009.5379586
  30. Hiromasa Fujihara, Masataka Goto, Hiroshi G. Okuno: A NOVEL FRAMEWORK FOR RECOGNIZING PHONEMES OF SINGING VOICE IN POLYPHONIC MUSIC, Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), pp.17-20, Oct. 18-21, New Paltz, NY, 2009. doi:10.1109/IROS.2009.5354527
  31. Takuya Yoshioka, Hirokazu Kameoka, Tomohiro Nakatani, Hiroshi G. Okuno: Statistical models for speech dereverberation, Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), pp.145-148, Oct. 18-21, New Paltz, NY, 2009. doi:10.1109/IROS.2009.5354489
  32. Naoki Yasuraoka, Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Changing Timbre and Phrase in Existing Musical Performances as You Like, Proceedings of the ACM International Confernece on Multimedia (ACM Multimedia 2009), 203-212 (16% 22/138), Beijing, China, Oct. 19-24, 2009. pdf, doi:10.1145/1631272.1631302
  33. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Step-size Parameter Adaptation of Multi-channel Semi-blind ICA with Piecewise Linear Model for Barge-in-able Robot Audition (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2273-2282, (900/1650), IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009. pdf doi:10.1109/IROS.2009.5354527
  34. Takuma Otsuka, Kazumasa Murata, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Incremental Polyphonic Audio to Score Alignment using Beat Tracking for Singer Robots (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2289-2296, IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009. pdf doi:10.1109/IROS.2009.5354637
  35. Takeshi Mizumoto, Hiroshi Tsujino, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Thereminist Robot: Development of a Robot Theremin Player with Feedforward and Feedback Arm Control based on a Theremin's Pitch Model (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2297-2302, IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009. pdf doi:10.1109/IROS.2009.5354473
  36. Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2730-2735, IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009. pdf doi:10.1109/IROS.2009.5354201
  37. Wataru Hinoshita, Tetsuya Ogata, Hideki Kozima, Hisashi Kanda, Toru Takahashi, Hiroshi G. Okuno: Emergence of Evolutional Interaction with Voice and Motion between Two Robots using RNN, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.4196-4291, IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009. pdf doi:10.1109/IROS.2009.5354887
  38. Shun Nishide, Tetsuhiro Nakagawa, Tetsuya Ogata, Jun Tani, Toru Takahashi, Hiroshi G. Okuno: Modeling Tool-Body Assimilation using Second-order Recurrent Neural Network, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.5376-5381, (900/1650), IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009. pdf doi:10.1109/IROS.2009.5354655
  39. Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Phoneme Acquisition Model based on Vowel Imitation using Recurrent Neural Network, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.5388-5393, IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009. pdf doi:10.1109/IROS.2009.5354825
  40. Kazunori Komatani, Satoshi Ikeda, Yuichiro Fukubayashi, Tetsuya Ogata, Hiroshi G. Okuno: Ranking Help Message Candidates Based on Robust Grammar Verification Results and Utterance History in Spoken Dialogue Systems, Proceedings of the 10th SIGdial Workshop on Discourse and Dialogue (SigDial 2009), 314-321, Sep. 12, 2009.
  41. Kyoko Matsuyama, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Enabling A User To Specify An Item At Any Time During System Enumeration, Proceedings of International Conference on Spoken Language Processing (Interspeech-2009), Mon-Ses2-P4-1, (57.7%), Brighton, 6-10 Sep. 2009. pdf
  42. Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, Kotaro Funakoshi, Tetsuya Ogata, Hiroshi G. Okuno: Improving Speech Understanding Accuracy with Limited Training Data Using Multiple Language Models and Multiple Understanding Models, Proceedings of International Conference on Spoken Language Processing (Interspeech-2009), Thu-Ses1-P4-9, (57.7%), Brighton, 6-10 (10) Sep. 2009. pdf
  43. Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nishimura, Toshio Irino: Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion, Proceedings of International Conference on Spoken Language Processing (Interspeech-2009), Thu-Ses1-P2-6, (57.7%), Brighton, 6-10 Sep. 2009. pdf
  44. Hiroshi G. Okuno, Kazuhiro Nakadai, Hyun-Don Kim: Robot Auditon: Missing Feature Theory Approach and Active Audition (Invited talk), Proceeding of the 14th International Symposium of Robotics Research (ISRR 2009), August 31 - September 3, 2009, Lucerne, Switzerland, International Foundation of Robotics Research. Springer STAR series.
  45. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: QUERY-BY-EXAMPLE MUSIC RETRIEVAL APPROACH BASED ON MUSICAL GENRE SHIFT BY CHANGING INSTRUMENT VOLUME, Proceeding of the 12th International Conference on Digital Audio Effects (DAFx-09), accepted, Como, Italy, Sep.1-4. 2009.
  46. Shun Shiramatsu, Tadachika Ozono, Toramatsu Shintani, Kazunori Komatani, Tetsuya Ogata, Toru Takahashi, Hiroshi G. Okuno: Development of a Meeting Browser towards Supporting Public Involvement, Proceedings of the 12th IEEE International Conference on Computational Science and Engineering (CSE-09), pp.717-722, Vancouver, Canada, Aug., 2009. doi:10.1109/CSE.2009.362
  47. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Analysis of Motion Searching based on Reliable Predictability using Recurrent Neural Network, Proceedings of 2009 IEEE/ASME Conference on Advanced Intelligent Mechatronics (AIM 2009), 192-197, Singapore, July 14-19, 2009. doi:10.1145/10.1109/AIM.2009.5230015
  48. Kazunori Komatani, Alexander I. Rudnicky: Predicting Barge-in Utterance Errors by using Implicitly-Supervised ASR Accuracy and Barge-in Rate per User, Proceedings of the Fourth International Joint Conference on Natural Language Processing (ACL-IJCNLP 2009), pp.89-92, Jul. 2009.
  49. Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, Kotaro Funakoshi, Hiroshi G. Okuno: A Speech Understanding Framework that Uses Multiple Language Models and Multiple Understanding Models, Proceeding of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 Conference, pp.133-136, (40%), Boulder, CO, May 31 - Jun. 5, 2009.
  50. Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Prediction and Imitation of Other's Motions by Reusing Own Forward-Inverse Model in Robots, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2009), pp.4144-4149, (699/1624), (May 12-17 (16), 2009), Kobe. pdf doi:10.1145/10.1109/ROBOT.2009.5152363
  51. Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Continuous Vocal Imitation with Self-organized Vowel Spaces in Recurrent Neural Network, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2009), pp.4438-4443, (May 12-17 (16), 2009), Kobe. pdf doi:10.1145/10.1109/ROBOT.2009.5152818
  52. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: ICA-BASED EFFICIENT BLIND DEREVERBERATION AND ECHO CANCELLATION METHOD FOR BARGE-IN-ABLE ROBOT AUDITION, Proceedings of 2009 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2009), SS-L7.1, pp.3677-3680, (1178/2633), Taipei, Taiwan, April 19--24 (23), 2009. pdf doi:10.1145/10.1109//ICASSP.2009.4960424
  53. Hideki Kawahara, Ryuichi Nisimura, Toshio Irino, Masanori Morise, Toru Takahashi, Hideki Banno: TEMPORALLY VARIABLE MULTI-ASPECT AUDITORY MORPHING ENABLING EXTRAPOLATION WITHOUT OBJECTIVE AND PERCEPTUAL BREAKDOWN, Proceedings of 2009 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2009), pp. , April 23. pdf

    Patents

  54. Robotics visual and auditory system, Patent No. US 7,526,361, Date of Patent: Apr. 28, 2009. Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano, PCT No.: PCT/JP02/08827.

o Academic Year 2008o

Thesis | Journal Papers | Book Chapters | International Conferences | Domestic Conferences | Patents

    Thesis

  1. Shun Nishide: Self-Organization of Invariants for Motion Generation based on Reliable Predictability, Ph.D Thesis, Feb. 2009.
  2. Hyun-Don Kim: Binaural Active Audition for Humanoid Robots, Ph.D Thesis, Sep. 2008.

  3. Takehiro Abe, MS Thesis, Feb. 2008.
  4. Satoshi Ikeda, MS Thesis, Feb. 2008.
  5. Hisashi Kanda, MS Thesis, Feb. 2008.
  6. Yuji Kubota, MS Thesis, Feb. 2008.
  7. Kaiping Wang, MS Thesis, Feb. 2008.

  8. Takuma Otsuka, BE Thesis, Feb. 2008.
  9. Wataru Hinoshita, BE Thesis, Feb. 2008.
  10. Kyoko Matsuyama, BE Thesis, Feb. 2008.
  11. Tadanori Yasuraoka, BE Thesis, Feb. 2008.
  12. Tatsuhiro Nakagawa, BE Thesis, Feb. 2008.

    Peer-reviewed Journal Papers

  13. Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: An Analysis-and-Synthesis Approach for Manipulating Pitch of a Musical Instrument Sound Considering Pitch-dependency of Timbral Characteristics, IPSJ Journal, Vol.50, No.3 (Mar., 2009) 1054-1066 IPSJ. pdf, D-Lib
  14. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-Domain Spoken Dialogue Systems, IPSJ Journal, Vol.50, No.2 (Feb., 2009) 488-500, IPSJ. pdf, D-Lib
  15. Masaharu Morise, Toru Takahashi, Hideki Kawahara, Toshio Irino: IEIC Trans. A, Vol.J92-A, No.3 (Mar. 2009).
  16. Shun Shiramatsu, Kazunori Komatani, Koiti Hasida, Tetsuya Ogata, Hiroshi G. Okuno: A Game-Theoretic Model of Referential Coherence and Its Empirical Verification Using Large Japanese and English Corpora, ACM Transactions on Speech and Language Processing, Vol.5, No.3 (Oct. 2008) Article 6, ACM. pdf, doi:10.1145/1410358.1410360
  17. Hiromasa Fujihara, Masataka Goto, Hiroshi G. Okuno: An F0 Estimation Method of Vocal Part in Polyphonic Music by Using Statistical Modelling of Singing Voice and Viterbi Search, IPSJ Journal, Vol.49, No.10 (Oct. 2008) 3682-3693, IPSJ. pdf, D-Lib
  18. Chyon Hae Kin, Tetsuya Ogata, Shigeki Sugano: Reinforcement Signal Propagation Algorithm for Logic Circuit, Journal of Robotics and Mechatronics, Vol.20, No.5 (Oct. 2008) pp757-774.
  19. Kazunori Komatani, Satoshi Ikeda, Tetsuya Ogata, Hiroshi G. Okuno: Managing out-of-grammar utterances by topic estimation with domain extensibility in multi-domain spoken dialogue systems, Speech Communication, Vol.50, No.10 (2008) 836-870. doi:10.1016/j.specom.2008.05.010
  20. Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Audition using an Adaptive Filter Based on Independent Component Analysis, Journal of Robotic Society of Japan, Vol.26, No.6 (Sep. 2008) pp.529-536. Digital Library
  21. Yuki Suga, Tetsuya Ogata, Shigeki Sugano: Human-Adaptive Robot Interaction using Interactive EC with Human-Machine Hybrid Evaluation, Journal of Robotics and Mechatronics, Vol.20, No.4 (Aug. 2008) pp.610-620.
  22. Hayeong Jeong, Shun Shiramatsu, Kiyoshi Kobayashi, and Tsuyoshi Hatori: Discourse Analysis of Public Debates Using Corpus Linguistic Methodologies, Journal of Computers, Vol.3, No.8 (Aug. 2008) pp.58--68.
  23. Yuichiro Fukubayashi, Kazunori Komatani, Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: WFST-based Language Understanding for Rapid Prototyping of Spoken Dialogue Systems, IPSJ Journal, Vol.49, No.8 (Aug. 2008) pp.2762-2772, Information Processing Society of Japan, pdf, Digital Library.
  24. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Predicting Object Dynamics from Visual Images through Active Sensing Experiences, Advanced Robotics, Vol.22, No.5 (May 2008) pp.527-546, doi:10.1163/156855308X294879 Online version, VSP and Robotics Society of Japan.
  25. Hiroshi G. Okuno, Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata: A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals, Journal of Acoustic Society of America, Vol.123, No.5 (May 2008) Pt.2, pp.3066-3067.
  26. Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nishimura, Hideki Banno, Toshio Irino: A temporally stable representation of power spectra of periodic signals and its application to F0 and periodicity estimation, Journal of Acoustic Society of America, Vol.123, No.5 (May 2008) Pt.2, pp.3074-3075.

    Book Chapters, Reviews

  27. Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SalienceGraph: Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA, T. B. Ho and Z-H. Zhou (Eds.): PRICAI-2008: Trends in Artificial Intelligence, 890-902, (84/234, 35.8%), Lecture Notes in Computer Science, Vol.5351, Springer-Verlag, Dec. 2008. doi:10.1007/978-3-540-89197-0_83
  28. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-Domain Spoken Dialogue Systems, Ngoc Thanh Nguyen,Leszek Borzemski,Adam Grzech,Moonis Ali (Eds.): New Frontiers in Applied Artificial Intelligence, pp.294-304, Lecture Notes in Artificial Intelligence, Vol.5027, June, 2008. doi:10.1007/978-3-540-69052-8_31
  29. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Vowel Imitation using Vocal Tract Model and Recurrent Neural Network, Masumi Ishikawa, Kenji Doya, Hiroyuki Miyamoto, Takeshi Yamakawa (Eds.): Neural Information Processing, 14th International Conference, ICONIP 2007, Revised Selected Papers, Part II, pp.222-232, Lecture Notes in Computer Science 4985, Springer-Verlag, June 2008. doi:10.1007/978-3-540-69162-4_24
  30. Tetsuya Ogata, Hideki Kojima, Hiroshi G. Okuno: Motion Emergence from Sound using Cross-Modal Mapping on Recurrent Neural Network, Aucouturier, J.-J. (ed.) Cheek to Chip: Dancing Robots and AI's Future, IEEE Intelligent Systems, Vol.23, No.2 (Apr. 2008), 74--84, doi:10.1109/MIS.2008.22

    Peer-reviewed Conference Papers

  31. Masato Onishi, Toru Takahashi, Toshio Irino, Hideki Kawahara: Vowel-based frequency alignment function design and recognition-based time alignment for automatic speech morphing, Proceedings of IEEE Workshop on Spoken Language Technology 2008 (SLT 2008), accepted, Goa, India, December, 15--18, 2008,
  32. Yuji Kubota, Masatoshi Yoshida, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking, Proceedings of IEEE International Symposium on Multimedia (ISM2008), pp.468-476 (acceptance rate for regular papers, 24%), Berkeley, Dec. 16. 2008. pdf doi:10.1109/ISM.2008.107
  33. Yuji Kubota, Shun Shiramatsu, Masatoshi Yoshida, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: 3D Auditory Scene Visualizer With Face Tracking: Design and Implementation For Auditory Awareness Compensation, Proceedings of 2nd International Symposium on Universal Communication (ISUC2008), pp.42-49, IEEE, Osaka, Dec. 15. 2008. pdf doi:10.1109/ISUC.2008.59
  34. Kazuhiro Nakadai, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: An Open Source Software System For Robot Audition HARK and Its Evaluation, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), pp.561-566, Daejeon, Korea, Dec. 3, 2008. pdf doi:10.1109/ICHR.2008.4756031
  35. Kazumasa Murata, Kazuhiro Nakadai, Ryu Takeda, Hiroshi G. Okuno, Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino: A Beat-Tracking Robot for Human-Robot Interaction and Its Evaluation, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), pp.79-84, Daejeon, Korea, Dec. 2, 2008. pdf doi:10.1109/ICHR.2008.4755935
  36. Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SalienceGraph: Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA, Proceedings of the Tenth Pacific Rim International Conference on Artificial Intelligence (PRICAI-08), 890-902, (84/234, 35.8%), Lecture Notes in Computer Science, Vol.5351, Springer-Verlag, Hanoi, Vienam, Dec. 15-19. 2008. doi:10.1007/978-3-540-89197-0_83
  37. Hiroshi G. Okuno: Computational Auditory Scene Analysis and Its Application to Robot Audition (Invited Talk), Proceedings of the Second International Symposium on Robotics and Artificial Intelligence, University of Electro-Communication and Shanghai Jiao Tong University, 9 Oct. 2008.
  38. Ikkyu Aihara: Synchronization and Frustration in Calling Behavior of Japanese Tree Frogs, Dynamics Days Asia Pacific 5 (DDAP5), September, 2008.(oral)
  39. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Analysis of Reliable Predictability based Motion Generation using RNNPB, Proceedings of Joint 4th International Conference on Soft Computing and Intelligent Systems and 9th International Symposium on advanced Intelligent Systems (SCIS & ISIS 2008), pp.305-310, Nagoya, JAPAN, September 17-21, 2008.
  40. Hideki Kawahara, Masanori Morise, Hideki Banno, Toru Takahashi, Ryuichi Nishimura, Toshio Irino: Spectral Envelope Recovery beyond the Nyquist Limit for High-Quality Manipulation of Speech Sounds, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.22-26, Brisbane, Sept. 24, 2008.
  41. Toru Takahashi, Shun'ichi Yamamoto, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Soft Missing-Feature Mask Generation for Simultaneous Speech Recognition System in Robots, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.992-997, Brisbane, Sept. 24, 2008.
  42. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Predicting ASR Errors by Exploiting Barge-In Rate of Individual Users for Spoken Dialogue Systems, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.183--186, Brisbane, Sept. 2008.
  43. Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno Expanding Vocabulary for Recognizing User\'s Abbreviations of Proper Nouns without Increasing ASR Error Rates in Spoken Dialogue Systems, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.187-190, Brisbane, Sept. 2008.
  44. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:Extensibility Verification of Robust Domain Selection against Out-of-Grammar Utterances in Multi-Domain Spoken Dialogue System, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.487-490, Brisbane, Sept. 2008.
  45. Shun Nishide, Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Active Ssensing based Dynamical Object Feature Extraction, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), pp.1-7, TuAT1.1, IEEE, RSJ, Nice, 23 Sep. 2008. pdf doi:10.1109/IROS.2008.4650794
  46. Takeshi Mizumoto, Ryu Takeda, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: A Robot Listens to Music and Counts Its Beats Aloud by Separating Music from Counting Voice, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), 1538-1543, WeAT6.1 IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4650821
    Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist.
  47. Hyun-Don Kim, Jinsung Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Target Speech Detection and Separation for Humanoid Robot in Sparse Dialogue with Noisy Home Environments (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), 1705-1711, WeAT10.4 IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4650977
  48. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Segmenting Acoustic Signal with Articulatory Movement using Recurrent Neural Network for Phoneme Aquisition (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), 1712-1717, WeAT10.5 IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4651060
  49. Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Barge-in-able Robot Audition Based on ICA and Missing Feature Theory under Semi-Blind Situation (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), 1718-1723, WeAT10.6, IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4650821
  50. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Evaluation of Two-Channel Sound Source Localization over Entire Azimuth Range for Moving Talker (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), pp.2197-2203 Sept. 2008 pdf IEEE, RSJ, Nice, Sept. 2008. doi:10.1109/IROS.2008.4650947
  51. Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotaka Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: A Robot Uses Its Own Microphone to Synchronize Its Steps to Musical Beats While Scatting and Singing, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), pp.2459-2464, WeCT6.1, IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4650596
    Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist.
  52. Kohei Sumi, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch Estimation, Proceedings of 9th International Conference on Musical Information Retrieval (ISMIR-2008), 39-44, Philadelphia, 15 Sep. 2008. pdf
  53. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source Separation based on Integrated Harmonic and Inharmonic Models, Proceedings of 9th International Conference on Musical Information Retrieval (ISMIR-2008),133-138, Philadelphia, 15 Sep. 2008. pdf
  54. Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotake Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: A Robot Singer with Music Recognition Based on Real-Time Beat Tracking, Proceedings of 9th International Conference on Musical Information Retrieval (ISMIR-2008), 199-204, Philadelphia, 15 Sep. 2008. pdf
  55. Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Synthesis Approach for Manipulating Pitch of a Musical Instrument Sound with Considering Timbral Characteristics, Proceeding of the 11th International Conference on Digital Audio Effects (DAFx-08), 249-256, Espoo, Finland, Sep.1-4. 2008. pdf
  56. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-Domain Spoken Dialogue Systems, Proceeding of the 21st International Conference on Industrial, Engineering and Other Applications of Applied Intelligence Systems (IEA/AIE-2008), pp.294-304, (acceptance rate is about 30%), LNAI 5027, Wroclaw, Poland, Jun. 18, 2008. doi:10.1007/978-3-540-69052-8_31
  57. Hiroshi G. Okuno, Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata: A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals, Proceedings of Acoustics'08, CD-ROM , 1pSCa8, June 30, 2008.
  58. Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nishimura, Hideki Banno, Toshio Irino: A temporally stable representation of power spectra of periodic signals and its application to F0 and periodicity estimation, Proceedings of Acoustics'08, CD-ROM , 1pSCc24, June 30, 2008.
  59. Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nishimura, Hideki Banno, Toshio Irino: A unified approach for F0 extraction and aperiodicity estimation based on a temporally stable power spectral representation, Proceedings of ISCA Tutorial and Research Workshop (ITRW) on "Speech Analysis and Processing for Knowledge Discovery", June 4, 2008, Aalborg, DK.
  60. Shun Nishide, Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Object Dynamics Prediction and Motion Generation based on Reliable Predictability, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2008), 1608-1614, (May 20, 2008). pdf doi:10.1109/ROBOT.2008.4543431
  61. Kazuhiro Nakadai, Shun'ichi Yamamoto, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: A Robot Referee for Rock-Paper-Scissors Sound Games, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2008), 3469--3474, (May 20, 2008). pdf doi:10.1109/ROBOT.2008.4543741
  62. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Two-Channel-Based Voice Activity Detection for Humanoid Robots in Noisy Home Environments, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2008), 3495-3501, (May 20, 2008). pdf doi:10.1109/ROBOT.2008.4543745
  63. Hiroshi G. Okuno, Kazuhiro Nakadai: COMPUTATIONAL AUDITORY SCENE ANALYSIS AND ITS APPLICATION TO ROBOT AUDITION (invited talk), Proceedings of Hands-free Speech Communication and Microphone Arrays (HSCMA-2008), pp.123-127, May 7, 2008, Trento, Italy. pdf doi:10.1109/HSCMA.2008.4538702
  64. Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nishimura, Toshio Irino, Hideki Banno: TANDEM-STRAIGHT: A Temporally Stable Power Spectral Representation for Periodic Signals and Applications to Interference-free Spectrum, F0, and Aperiodicity Estimation, Proceedings of 2008 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2008), pp.3933-3936, Las Vegas, Nevada, USA, March 30 - April 4, 2008.

    Patents

  65. Sound Source Separation System, Sound Source Separation Method, and Computer Program for Sound Source Separation, PCT/JP2008/057310, WO 2008/133097 Date of Open: 06.11.2008, Inventors: Katsutoshi Itoyama, Hiroshi Okuno, Masataka Goto. Assignee: Kyoto University, AIST.
  66. Moving object equipped with ultra-directional speaker, Patent No. US 7,424,118, Date of Patent: Sep. 9, 2008. Inventors: Kiyofumi Mori, Shunji Yoshida, Hiroshi Okuno, Kazuhiro Nakadai, Hiroshi Tsujino, PCT No.: PCT/JP2005/002043.
  67. Speech Recognition Apparatus, Application No. 20080167869. Filed: July 10, 2008. Inventors: Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto. PCT No.: PCT/JP05/22601.

o Academic Year 2007o

    Thesis

  1. Shun Shiramatsu: Salience-based Modeling of Discourse Context, Ph.D Thesis, Feb. 2008. pdf
  2. Shun'ichi Yamamoto: Real-Time Robot Audition Software Based on Missing Feature Theory for Multiple Simultaneous Talkers in Real Environments, Ph.D Thesis, Feb. 2008.
  3. Kazuyoshi Yoshii: Studies on Hybrid Music Recommendation Using Timbral and Rhythmic Features, Ph.D Thesis, Feb. 2008.

  4. Katsutoshi Itoyama: MS Thesis, Feb. 2008.
  5. Ryu Takeda MS Thesis, Feb. 2008.
  6. Yuichiro Fukubayashi: MS Thesis, Feb. 2008.
  7. Koichi Tokuda: MS Thesis, Feb. 2008.
  8. Ryunosuke Yokoya: MS Thesis, Feb. 2008.

  9. Kohei Sumi: BE Thesis, Feb. 2007.
  10. Masaki Katsumaru: BE Thesis, Feb. 2008.
  11. Hiroki Saito: BE Thesis, Feb. 2007.
  12. Zhang: BE Thesis, Feb. 2007.
  13. Takeshi Mizumoto: BE Thesis, Feb. 2007.

    Peer-reviewed Journal Papers

  14. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Simultaneous Realization of Score-informed Sound Source Separation of Polyphonic Musical Siganals and Constrained Parameter Estimation for Integrated Model of Harmonic and Inharmonic Structure, IPSJ Journal, Vol.49, No.3 (Mar., 2008) pp.1465-1479, Information Processing Society of Japan, Digital Library, pdf
  15. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: , Transactions of Human Interface Society, Vol.10, No.1 (Feb. 2008) pp.59-72.
  16. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Hybrid Collaborative and Content-based Music Recommendation Using Incrementally-trainable Probabilistic Generative Model, IEEE Transactions on Audio, Speech and Language Processing, Vol.16, No.2 (Feb. 2008) pp.435-447, pdf, doi:10.1109/TASL.2007.911503
  17. Shun Shiramatsu, Kazunori Komatani, Koiti Hasida, Tetsuya Ogata, Hiroshi G. Okuno: A Game-Theoretic Model of Referential Coherence and Its Statistical Verification Based on Large Japanese and English Corpora, Natural Language Processing, Vol.14, No.4 (Oct. 2007) pp.199-239.
  18. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Experience Based Imitation Using RNNPB, Advanced Robotics, Vol.21, No.12 (2007) pp.1351-1367, doi:10.1163/156855307781746106 Online version, VSP and Robotics Society of Japan.
  19. Chyon Hae Kim, Jun-ichi Idesawa, Tetsuya Ogata, Shigeki Sugano: Restraining of Noises in Self-Organizing Network Elements, Journal of Robotics Society of Japan, Vol.25, No.6 (Sep. 2007) pp.115-122. Digital Library
  20. Kazuhiro Nakadai, Hirofumi Nakashima, Masamitsu Murase, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: Tracking of Mulitiple Sound Sources by Integration of Robot-Embedded and In-Room Microphone Arrays, Journal of Robotics Society of Japan, Vol.25, No.6 (Sep. 2007) pp.181-191. pdf, pdf at RSJ server.
  21. Jean-Marc Valin, Shun'ichi Yamamoto, Jean Rouat, Francois Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno: Robust Recognition of Simultaneous Speech By a Mobile Robot, IEEE Transactions on Robotics, Vol.23, No.4 (Aug. 2007) pp.742--752. pdf, doi:10.1109/TRO.2007.900612
  22. Hiroaki Arie, Tetsuya Ogata, Jun Tani, and Shigeki Sugano: Reinforcement learning of continuous motor sequence with hidden state, Advanced Robotics, Special Issue on Robotic Platforms for Research in Neuroscience, VSP and Robotics Society of Japan, Vol.21, No.10 (July 2007), pp.1215-1229. Online version doi:10.1163/156855307781389365
  23. Taro Watanabe, Kenji Imamura, Eiichiro Sumita, Hiroshi G. Okuno: Statistical machine translation using hierarchical phrase alignment, Systems and Computers in Japan, Vol.38, No.6 (June 2007) pp.70-79, doi:10.1002/scj.20271
  24. Naoyuki Kanda, Kazunori Komatani, Mikio Nakano, Kazuhiro Nakadai, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: Robust Domain Selection Using Dialogue History in Multi-domain Spoken Dialogue Systems, IPSJ Journal, Vol.48, No.5 (May 2007) pp.1980-1989, IPSJ.

    Book Chapters, Articles

  25. Hiroshi G. Okuno, Tetsuya Ogata, Kazunori Komatani: Robot Audition from the viewpoint of Computational Auditory Scene Analysis, Informatics Education and Research for Knowledge-Creation Society Infrastructure (ICKS'08), pp.35-40, Jan. 2008. doi:10.1109/ICKS.2008.10
  26. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Structual Feature Extraction based on Active Sensing Experiences, Informatics Education and Research for Knowledge-Creation Society Infrastructure (ICKS'08), pp.209-212, Jan. 2008. doi:10.1109/ICKS.2008.9
  27. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Evaluation of Two-Channel-Based Sound Source Localization using 3D Moving Sound Creation Tool, Informatics Education and Research for Knowledge-Creation Society Infrastructure (ICKS'08), pp.210-216. doi:10.1109/ICKS.2008.25
  28. Koiti Hasida, Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Meaning Games, LENLS 2007 Postproceedings, accepted, LNCS Oct. 2007.
  29. Hiroshi G. Okuno, Moonis Ali (Eds.): New Trends in Applied Artificial Intelligence (IEA/AIE-2007), Lecture Notes in Computer Science, Vol.4570, Springer-Verlag, 14 Jun. 2007, XXI, 1194p. ISBN: 978-3-540-73322-5. doi:10.1007/978-3-540-73325-6
  30. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm and Particle Filter, New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570, pp.280-290, Springer-Verlag. Kyoto, Jun. 2007. doi:10.1007/978-3-540-73325-6_28
  31. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and MTF-based ASR, New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570, pp.384-394, Springer-Verlag. Kyoto, Jun. 2007. doi:10.1007/978-3-540-73325-6_38
  32. Hiroshi G. Okuno, Tetsuro Kitahara, Kazuyoshi Yoshii: Music Feature Extraction and Music Information Retrieval, IEE Journal, Vol.127, No.7 (Jul. 2007).
  33. Hiroshi G. Okuno, Hiroshi Mizoguchi: Information Integration for Robot Audition: the State-of-the-art and issues, SICE, Vol.46, No.6 (Jun. 2007) pp.415-419.
  34. Shun'ichi Yamamoto, Ryu Takeda, Hiroshi G. Okuno: Missing Feature Theory Based Automatic Speech Recognition and Its Application to Simultaneous Multiple Speaker Speech Recognition, SICE, Vol.46, No.6 (Jun. 2007) pp.447-452.
  35. Shinichi Ueno, Fumihiro Adachi, Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Bus Information System Based on User Models and Dynamic Generation of VoiceXML Scripts, New Frontiers in Artificial Intelligence (JSAI 2003/2004), LNAI 3609, pp.45-60, 2007. Springer-Verlag. doi:10.1007/978-3-540-71009-7_4

    Peer-reviewed Conference Papers

  36. Yuichiro Fukubayashi, Kazunori Komatani, Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: Rapid Prototyping of Robust Language Understanding Modules with Less Training Data for Spoken Dialogue Systems, Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP 2008), pp.210-216, Jan. 2008, Hyderabad, India.
  37. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Implementation of A Robot Audition System for Automatic Speech Recognition of Simultaneous Speech, Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-2007), 111-116, acceptance rate (115/267), IEEE, Kyoto, Dec. 2007. pdf doi:10.1109/ASRU.2007.4430093
  38. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Vocal Imitation using Vocal Tract Model and Recurrent Neural Network, Proceedings of International Conference on Neural Information Processing (ICONIP-2007), Vol.2, pp.222-232, Nov. 2007.
  39. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Vocal Imitation Using Physical Vocal Tract Model, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1846-1851, IEEE, RSJ, San Diego, Oct. 2007. pdf doi:10.1109/IROS.2007.4399137
  40. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Discovery of Other Individuals by Projecting a Self-Model Through Imitation, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1009-1014, IEEE, RSJ, San Diego, Oct. 2007. pdf doi:10.1109/IROS.2007.4399153
  41. Kazuyoshi Yoshii, Kazuhiro Nakadai, Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: A Biped Robot that Keeps Steps in Time with Musical Beats while Listening to Music with Its Own Ears, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1743-1750, IEEE, RSJ, San Diego, Oct. 2007. pdf doi:10.1109/IROS.2007.4399244
  42. Tetsuya Ogata, Masamitsu Murase, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno, Two-way Translation of Compound Sentences and Arm Motions by Recurrent Neural Networks, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1858-1863, IEEE, RSJ, San Diego, Oct. 2007. pdf doi:10.1109/IROS.2007.4399265
  43. Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Exploiting Known Sound Sources to Improve ICA-based Robot Audition in Speech Separation and Recognition, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1757-1762, IEEE, RSJ, San Diego, Oct. 2007. pdf
  44. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Auditory and Visual Integration based Localization and Tracking of Humans in Daily-life Environments, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.2021-2027, IEEE, RSJ, San Diego, Oct. 2007. pdf
  45. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences, Proceedings of 8th International Conference on Musical Information Retrieval (ISMIR-2007), long paper (15.8% of 214 submissions), pp.89-94, Vienna, Sep. 2007.
  46. Kazunori Komatani, Yuichiro Fukubayashi, Tetsuya Ogata, Hiroshi G. Okuno: Introducing Utterance Verification in Spoken Dialogue System to Improve Dynamic Help Generation for Novice Users, Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, pp.202-205, Sep. 2007
  47. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Topic Estimation with Domain Extensibility for Guiding User's Out-of-Grammar Utterance in Multi-Domain Spoken DIalogue Systems, Proceedings of International Conference on Spoken Language Processing (Interspeech-2007), pp.2561-2564,, Antwerp, Sep. 2007. pdf
  48. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Analyzing Temporal Transition of Real User's Behaviors in a Spoken Dialogue System, Proceedings of International Conference on Spoken Language Processing (Interspeech-2007), pp.142-145, Antwerp, Sep. 2007. pdf
  49. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Auditory and VIsual Integration based Localization and Tracking of Multiple Moving Sounds in Daily-life Environments, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man 2007), 399-404, IEEE, Jeju Island, Korea, Aug. 2007. pdf
  50. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm and Particle Filter, New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570, pp.280-290, Springer-Verlag. Kyoto, Jun. 2007. doi:10.1007/978-3-540-73325-6_28
  51. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and MTF-based ASR, New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570, pp.384-394, Springer-Verlag. Kyoto, Jun. 2007. doi:10.1007/978-3-540-73325-6_38
  52. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: INTEGRATION AND ADAPTATION OF HARMONIC AND INHARMONIC MODELS FOR SEPARATING POLYPHONIC MUSICAL SIGNALS, Proceedings of 2007 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2007), pp.57-60, Hawaii, April 2007, pp.57-60, (15.1% acceptance rate for lecture presentation) doi:10.1109/ICASSP.2007.366615
  53. Haruhiko Niwa, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Distance Estimation of Hidden Objects Based on Acoustical Holography by applying Acoustic Diffraction of Audible Sound, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2007), pp.423-428, (Apr. 2007). doi:10.1109/ROBOT.2007.363823
  54. Tetsuya Ogata, Shohei Matsumoto, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Human-Robot Cooperation using Quasi-symbols Generated by RNNPB Model, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2007), pp.2156-2161, (Apr. 2007). doi:10.1109/ROBOT.2007.363640
  55. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Predicting Object Dynamics from Visual Images through Active Sensing Experiences, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2007), pp.2501-2506, (Apr. 2007). doi:10.1109/ROBOT.2007.363841
  56. Chyon Hae Kim, Tetsuya Ogata, Shigeki Sugano: Enhancement of Self Organizing Network Elements for Supervised Learning, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2007), WeA3.5, (Apr. 2007).

    Patents

  57. Robot acoustic device and robot acoustic system Patent No. US 7,215,786. Date of Patent: May 8, 2007. Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano, Assignee: Japan Science and Technology Agency.

o Academic Year 2006o

    Peer-reviewed Journal Papers

  1. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Drumix: An Audio Player with Functions of Realtime Drum-Part Rearrangement for Active Music Listening, IPSJ Journal, Vol.48, No.3 (Mar. 2007), 1229-1239, IPSJ, IPSJ Digital Courier, Vol.3 (2007), pp.134-144. DL
  2. Hyun-Don Kim, Jong-Suk Choi, and Munsang Kim: Human-robot interaction in real environments by audio-visual integration, International Journal of Control Automation and Systems, Vol.5, No.1 (Feb. 2007) pp.61-69.
  3. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music, IPSJ Journal, Vol.48, No.1 (Jan. 2007) pp.214-226, IPSJ. IPSJ Digital Courier, Vol.3 (2007) pp.1-13.
  4. Shunsuke Kurotaki, Noriaki Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno, Hideharu Aamano: Sound Source Separation Filter for Robot Audition used by Dynamic Reconfigurable Device, DRP (in Japanese), IEICE Transaction on Information and Systems, Vol.J90-D, No.3, pp.897-907, Mar. 2007, IEICE. DL
  5. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G,. Okuno: Simultaneous Speech Recognition based on Automatic Missing-Feature Mask Generation integrated with Sound Source Separation (in Japanese), Journal of Robotics Society of Japan, Vol.25, No.1 (Jan. 2007) pp.92-102. pdf pdf at RSJ server.
  6. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectogram Templates with Harmonic Structure Suppression, IEEE Transactions on Audio, Speech and Language Processing, Vol.15, No.1 (Jan. 2007) pp.333-345, pdf, doi:10.1109/TASL.2006.876754
  7. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps, EURASIP Journal on Applied Signal Processing, Special issue on Music Information Retrieval Based on Signal Processing, Vol.2007, Article ID 51979, 15 pages, 2007, doi:10.1155/2007/51979
  8. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Identification in Polyphonic Music: Feature Weighting Based on Mixed-Sound Template and Use of Musical Context (in Japanese), IEICE Transaction on Information and Systems, Vol.J89-D, No.12 (Dec. 2006), pp.2721-2733, IEICE.
  9. Naoyuki Kanda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Spoken Language Understinding Using Dialogue Context in Database Search (in Japanese), IPSJ Journal, Vol.47, No.6 (June 2006) pp.1802-1811, IPSJ. pdf
  10. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: A Singer Identification Method for Musical Pieces on the Basis of Accompaniment Sound Reduction and Reliable Frame Selection (in Japanese), IPSJ Journal, Vol.47, No.6 (June 2006) pp.1831-1843, IPSJ.
  11. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. OKuno: Missing Feature Theory Based Interface Between Sound Source Separation and Automatic Speech Recognition and Applying to Multiple Robots, (in Japanese), Journal of Human Interface Society, Vol.8, No.2 (Jun. 2006) pp.203-212.
  12. Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: A Privacy-Enhanced Access Control, Systems and Computers in Japan, (2006) A Privacy-Enhanced Access Control, Systems and Computers in Japan, Vol.37, No.5 (May 2006) pp.77-86. doi:10.1002/scj.10214
  13. Tenkai Kim, 尾形 哲也, Shigeki Sugano; ローカルルールに基づいた論理回路の自己組織アルゴリズム (in Japanese), Transaction on SICE, Vol.42, No.4 (Apr. 2006) pp.334-341.
  14. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improving Location-Based Speech Recognition of Simultaneous Speech Signals by Parameter Optimization with Genetic Algorithm (in Japanese), Human Interface, Vol.8, No.2 (Jun. 2006) pp.203-212.
  15. Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: A Privacy-Enhanced Access Control, Systems and Computers in Japan, (2006) A Privacy-Enhanced Access Control, Systems and Computers in Japan, Vol.37, No.5 (May 2006) pp.77-86. doi:10.1002/scj.10214

    Book Chapters, Survey Papers, and Articles

  16. Hiroaki Arie, Jun Namikawa, Tetsuya Ogata, Jun Tani, Shigeki Sugano: Reinforcement Learning Algorithm with CTRNN in Continuous Action Space, Neural Information Processing (ICONIP-2006), Part I, LNCS 4232, pp.387-396. Oct. 2006. doi:10.1007/11893028_44
  17. Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition, PRICAI 2006: Trends in Artificial Intelligence, LNCS 4099, pp.484-494, accepted as regular paper for ORAL Presentation (14.1%), Springer-Verlag, Guilin, China, Aug. 2006. doi:10.1007/11801603_52
  18. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Genetic Algorithm based Improvement of Robot's Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals, Moonis Ali, Richard Dapoigny (Eds.): Advances in Applied Artificial Intelligence (IEA/AIE-2006), LNAI 4031, pp.207-217, Springer-Verlag. Annecy, France, Jun. 2006. doi:10.1007/11779568_24

    Peer-reviewed Conference Papers

  19. Hiroshi G. Okuno, Tetsuya Ogata, Kazunori Komatani: Computational Auditory Scene Analysis and Its Application to Robot Audition: Five Years Experience, Proceedings of the 2nd International Conference on Informatics Research for Development of Knowledge Society Infrastructure (ICKS 2007), pp.69-76, Jan. 2007. doi:10.1109/ICKS.2007.7
  20. Shun Shiramatsu, Kazunori Komatani, Koiti Hasida, Tetsuya Ogata, Hiroshi G. Okuno: Meaning-Game-based Centering Model with Statistical Definition of Utility of Referential Expression and Its Verification Using Japanese and English Corpora, Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC2007), pp.121-126, Lisbon, Mar. 2007.
  21. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Musical Instrument Recognizer ``Instrogram'' and Its Application to Music Retrieval based on Instrumentation Similarity, Proceedings of IEEE International Symposium on Multimedia (ISM2006), pp.265-272, San Diego, Dec. 2006. doi:10.1109/ISM.2006.113
  22. Hiromasa Fujihara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic synchronization between lyrics and music CD recordings based on Viterbi alignment of segregated vocal signals, Proceedings of IEEE International Symposium on Multimedia (ISM2006), pp.257-264, San Diego, Dec. 2006. doi:10.1109/ISM.2006.38
  23. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences, Proceedings of 7th International Conference on Musical Information Retrieval (ISMIR-2006), pp.296-301, Vancouver, CA, Sep. 2006. pdf
  24. Katsutoshi Itoyama, Tetsuro Kitahara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Feature Weighting in Automatic Transcription of Specified Part in Polyphonic Music, Proceedings of 7th International Conference on Musical Information Retrieval (ISMIR-2006), pp.172-175, Vancouver, CA, Sep. 2006. pdf
  25. Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, Satoshi Kaijiri, Kentaro Yamada, Yuji Hasegawa, Hiroshi G. Okuno, Hiroshi Tsujino: Real-Time Tracking of Multiple Sound Sources by Integration of In-Room and Robot-Embedded Microphone Arrays, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 852-859, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.281737
  26. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Missing-Feature based Speech Recognition for Two Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 878-885, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.281741
    IEEE Robotics and Automation Society Japan Chapter Young Award,
    RSJ/SICE Award for IROS 2006 Best Paper Nomination Finalist (2nd to 5th Place) at IROS-2007.
  27. Haruhiko Niwa, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Multiple Acoustical Holography Method for Localization of Objects in Broad Range using Audible Sound, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 1146-1151, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.281844
  28. Chyon Hae Kim, Tetsuya Ogata, Shigeki Sugano: Efficient Organization of Network Topology based on Reinforcement Signals, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 3154-3159, IEEE, RSJ, Beijing, China, Sep. 2006. pdf
  29. Yuki Suga, Chihiro Endo, Daizo Kobayashi, Takeshi Matsumoto, Tetsuya Ogata, Shigeki Sugano: User-Adaptive Human-Robot Interaction System using Interactive EC, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 3663-3668, IEEE, RSJ, Beijing, China, Sep. 2006. pdf
  30. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Experience Based Imitation Using RNNPB, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 3669-3674, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.281724
  31. Jong-Suk Choi, Hyun-Don Kim, and Munsang Kim: Probabilistic Speaker Localization in Noisy Environment by Audio-Visual Integration, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 4704-4709, IEEE, RSJ, Beijing, China, Sep. 2006. pdf
  32. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Robot Audition System That Recognizes Simultaneous Speech in the Real World, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 5333-5338, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.282037
  33. Tetsuya Ogata, Yuya Hattori, Hideki Kojima, Kazunori Komatani, Hiroshi G. Okuno: Generation of Robot Motions from Environmental Sounds using Inter-modality Mapping by RNNPB, Proceedings of Sixth International Workshop on Epigenetic Robotics (EpiRobo-2006), 95-102, Paris, Sep., 2006.
  34. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Speaker Identification under Noisy Environments by using Harmonic Structure Extraction and Reliable Frame Weighting, Proceedings of International Conference on Spoken Language Processing (Interspeech-2006), 1459-1462, Pittsburgh, Sep. 2006. pdf
  35. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improving Speech Recognition of Two Simultaneous Speech Signals by Integrating ICA BSS and Automatic Missing Feature Mask Generation, Proceedings of International Conference on Spoken Language Processing (Interspeech-2006), 2302-2305, Pittsburgh, Sep. 2006. pdf
  36. Yuichiro Fukubayashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic Help Generation by Estimating User's Mental Model in Spoken Dialogue Systems, Proceedings of International Conference on Spoken Language Processing (Interspeech-2006), 1946-1949, Pittsburgh, Sep. 2006. pdf
  37. Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Leak Energy based Missing Feature Mask Generation for ICA and GSS and Its Evaluation with Simultaneous Speech Recognition, Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition (SAPA2006), pp.42-46, pdf
  38. Kazunori Komatani, Naoyuki Kanda, Mikio Nakano, Kazuhiro Nakadai, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: Multi-Domain Spoken Dialogue System with Extensibility and Robustness against Speech Recognition Errors, Proceedings of SIGdial Workshop on Discourse and Dialogue, 9-17, Aug. 2006
  39. Hiroshi G. Okuno: Computational Auditory Scene Analysis - Towards Listening to Several Thinkgs at Once -, 50th Anniversary Summit of Artificial Intelligence (ASAI50) workshop and abstract booklet, accepted for inclusion, Monte Verita, Switzerland, July 2006.
  40. Takuya Yoshioka, Takafumi Hikichi, Masato Miyoshi, Hiroshi G. Okuno: Robust Decomposition of Inverse Filter of Channel and Prediction Error Filter of Speech Signal for Dereverberation, Proceedings of the 14th European Signal Processing Conference (EUSIPCO 2006), CD-ROM Proceedings, Florence, 2006. pdf
  41. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Robot Imitation from Active-Sensing Experiences, Proceedings of Fifth International Conference on Learning and Development (ICDL06), accepted, Bloomington, IN USA, May 2006.
  42. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: AN ERROR CORRECTION FRAMEWORK BASED ON DRUM PATTERN PERIODICITY FOR IMPROVING DRUM SOUND DETECTION, Proceedings of 2006 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2006), Vol.V, pp.237-240, Toulouse, May 2006. pdf, doi:10.1109/ICASSP.2006.11661256 IEEE Kansai Chapter Young Researcher Award
  43. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: F0 ESTIMATION METHOD FOR SINGING VOICE IN POLYPHONIC AUDIO SIGNAL BASED ON STATISTICAL VOCAL MODEL AND VITERBI SEARCH, Proceedings of 2006 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2006), Vol.V, pp.253-256, Toulouse, May 2006. pdf, doi:10.1109/ICASSP.2006.1661260
  44. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrogram: A New Musical Instrument Recognition Technique Without Using Onset Detection Nor F0 Estimation, Proceedings of 2006 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2006), Vol.V, pp.229-232, Toulouse, May 2006. pdf, doi:10.1109/ICASSP.2006.1661254 IEEE Kansai Chapter Young Researcher Award
  45. Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, Satoshi Kaijiri. Kentaro Yamada, Yuji Hasegawa, Hiroshi G. Okuno, Hiroshi Tsujino: ROBUST TRACKING OF MULTIPLE SOUND SOURCES BY SPATIAL INTEGRATION OF ROOM AND ROBOT MICROPHONE ARRAYS, Proceedings of 2006 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2006), Vol.IV, pp.929-932, Toulouse, May 2006. pdf, doi:10.1109/ICASSP.2006.1661122
  46. Hyun-Don Kim, Jong-Suk Choi, and Munsang Kim: Speaker Localization among Multi-faces in Noisy Environment by Audio-Visual Integration, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2006), 1305-1310, (May 2006). 10.1109/ROBOT.2006.1641889

    Patents

  47. Speech Recongition Device, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto, European Patent: EP1691344, Publication Date: 08/16/2006, Application number: EP20040818533, Filing Date: 11/12/2004
  48. Method and Apparatus for Determining Sound Source, Patent No. US 7,035,418. Filing date: June 7, 2000. Issue date: Apr. 25, 2006. Inventors: Hiroshi Okuno, Hiroaki Kitano, Yukiko Nakagawa, Assignee: Japan Science and Technology Agency.
  49. Robot audiovisual system Patent No. US 7,016,505. Filing date: Nov 1, 2000. Issue date: Mar 21, 2006. Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano, Assignee: Japan Science and Technology Agency.

o Academic Year 2005o

    Peer-Reviewed Journal Papers

  1. Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Using Multiple Edit Distances to Automatically Grade Outputs from Machine Translation Systems, IEEE Transactions on Audio, Speech and Language Processing, Vol.14, No.2, (Mar. 2006) 393--402. doi:10.1109/TSA.2005.860770
  2. Mototaka Suzuki, Kuniaki Noda, Yuki Suga, Tetsuya Ogata, and Shigeki Sugano: Dynamic Perception after Visually-Guided Grasping by a Human-Like Autonomous Robot, Advanced Robotics, Vol.20, No.2 (Feb. 2006) 233-254. VSP and Robotics Society of Japan. doi:10.1163/156855306775525785
  3. Takuya Yoshioka, Takafumi Hikichi, Masato Miyoshi, Hiroshi G. Okuno: Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals, IEICE Trans. on Fundamentals of Electronics, Communications, and Computer Sciences, Vol.E89-A, No.1 (Jan. 2006) 240-247, IEICE.
  4. Tetsuya Ogata, Hayato Ohba, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Extracting Multi-Modal Dynamics of Objects using RNNPB, Journal of Robotics and Mechatronics, Vol.17, No.6 (Dec. 2005) pp.681-688, Special Issue on Human Modeling in Robotics.
  5. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Pitch-dependent identification of musical instrument sounds, Applied Intelligence, Vol.23, No.3, pp.267-275, Springer-Verlag (formerly Kluwer Publishers). doi:10.1007/s10489-005-4612-1
  6. Kenri Kodaka, Tetsuya Ogata, Hiroshi G. Okuno: Walking in Virtual Space with Entrainment Based on a Nonlinear Oscillator, Journal of Human Interface Society, Vol.7, No.4, 26-36, 2005.
  7. Shun Shiramatsu, Takashi Miyata, Hiroshi G. Okuno, Koiti Hasida: Dissolution of Centering Theory Based on Game Theory and Its Empirical Verification (in Japanese), Natural Language Processing, Vol.12, No.3 (July 2005) 91-110.
  8. Shunichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno: Missing Feature Theory Based Interface Between Sound Source Separation and Automatic Speech Recognition and Applying to Multiple Robots (in Japanese), Journal of Robotics Society of Japan, Vol.23, No.6 (Aug. 2005) 743-751. pdf, pdf at RSJ server.
  9. Tetsuya Ogata, Shigeki Sugano, and Jun Tani: Open-end Human-Robot Interaction from the Dynamical Systems Perspective - Mutual Adaptation and Incremental Learning, Advanced Roboics, Vol.19. No.6, pp.651-670, VSP and Robotics Society of Japan. doi:10.1163/1568553054255655
  10. Katsuhisa Ishida, Tetsuro Kitahara, Masayuki Takeda: Improvisation Supporting System Using N-gram-based Melody Appropriateness Determination, IPSJ Journal, Vol.46, No.7 (July 2005) pp.1548-1559, IPSJ. in html

    Book Chapters

  11. Masahiro Nisiyama, Hiroaki Kawashima, Takatsugu Hirayama, Takashi Matsuyama: Facial Expression Representation based on Timing Structures in Faces, Proceedings of IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2005), LNCS 3723, pp.139-153, Beijing, Oct. 2005.
  12. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Distance Based Dynamic Interaction of Humanoid Robot with Multiple People. Innovations in Applied Artificial Intelligence (IEA/AIE-2005) LNAI 3533, 111-120, Best paper award, Springer-Verlag. Bari, Italy, Jun. 2005. Paper pdf doi:10.1007/11504894_18
  13. Katsutoshi Uchiyama, Toshiaki Ohji, Mari Oka, Hiroshi G. Okuno, Hiroyuki Suzuki, Kenji Fukaya, Modjtaba Sadria, Hubert Durt Kokoro and Topos -- Bastions of Kokoro, Kyoto Inernational Culture Forum 2005, pp.62-73, Mar. 2006.

    Peer-Reviewed International Conference Papers

  14. Shun Shiramatsu, Kazunori Komatani, Takashi Miyata, Koiti Hasida, Hiroshi G. Okuno: Empirical Verification of Meaning-Game-based Generalization of Centering Theory with Large Japanese Corpus, Proceedings of the 19th Pacific Asia Conference on Language, Information, and Computation (PACLIC 19), 192-210, Taipei, Dec. 2005.
  15. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: INTER:D A Drum Sound Equalizer for Controlling Volume and Timbre of Druams, Proceedings of 2nd European Workshop on the Integration of Knowledge, Semantic and Digital Media Technologies (EWIMT 2005), pp.205--212, EU Commission, IEE Savoy Place, London, Nov. 2005. IEEE
  16. Masahiro Nisiyama, Hiroaki Kawashima, Takatsugu Hirayama, Takashi Matsuyama: Facial Expression Representation based on Timing Structures in Faces, Proceedings of IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2005), LNCS 3723, pp.139-153, accepted, Beijing, Oct. 2005.
  17. Kenri Kodaka, Tetsuya Ogata, Hiroshi G. Okuno: Walking with Body-sense in Virtual Space Using the Nonlinear Oscillator, Proceedings of the International Conference on Systems, Man and Cybernetics (SMC-2005), 324--329, IEEE, Hawaii, Oct. 10-12, 2005. Finalist for Best Student Paper doi:10.1109/ICSMC.2005.1571166
  18. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: AdaMast: A Drum Sound Recognizer based on Adaptation and Matching of Spectrogram Templates, Proceedings of MIREX 2005, London, Sep. 2005. Paper pdf. Best in Class Award.
  19. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SINGER IDENTIFICATION BASED ON ACCOMPANIMENT SOUND REDUCTION AND RELIABLE FRAME SELECTION, Proceedings of 6th International Conference on Musical Information Retrieval (ISMIR-2005), 329-336, London, Sep. 2005.
  20. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: INSTRUMENT IDENTIFICATION IN POLYPHONIC MUSIC: FEATURE WEIGHTING WITH MIXED SOUNDS, PITCH-DEPENDENT TIMBRE MODELING, AND USE OF MUSICAL CONTEXT, Proceedings of 6th International Conference on Musical Information Retrieval (ISMIR-2005), 558-563, London, Sep. 2005.
  21. Kazunori Komatani, Naoyuki Kanda, Tetsuya Ogata, Hiroshi G. Okuno: Contextual Constraints based on Dialogue Models in Database Search Task for Spoken Dialogue Systems, Proceedings of the Nineth European Conference on Speech Communication and Technology (Interspeech-2005), 877-880, Lisboa, Sep. 2005. pdf.
  22. Masamitsu Murase, Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Kentaro Yamada, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Multiple Moving Speaker Tracking by Microphone Array on Mobile Robot, Proceedings of the Nineth European Conference on Speech Communication and Technology (Interspeech-2005), 249-252, Lisboa, Sep. 2005. pdf.
  23. Tetsuro Kitahara, Katsuhisa Ishida, Masayuki Takeda: ism: Improvisation Supporting Systems with Melody Correction and Key Vibration, Proceedings of International Conference on Entertainment Computing (ICEC 2005), Mita, Hyogo, Sep. 2005.
  24. Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Jean Rouat, Francois Michaud, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Making A Robot Recognize Three Simultaneous Sentences in Real-Time, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.4040-4045, IEEE, RSJ, Edmonton, Aug. 2005. pdf. doi:10.1109/IROS.2005.1545094
  25. Syunsuke Kurotaki, Noriaki Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno, Hideharu Amano: Implementation of Active Direction-Pass Filter on Dynamically Reconfigurable Processor, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.3175-3180, IEEE, RSJ, Edmonton, Aug. 2005. pdf. doi:10.1109/IROS.2005.1545033
  26. Tsuyoshi Tasaki, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Spatially Mapping of Friendliness for Human-Robot Interaction, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.1277-1282, IEEE, RSJ, Edmonton, Aug. 2005. pdf. doi:10.1109/IROS.2005.1545034
  27. Mikio Nakano, Naoyuki Kanda, Yuji Hasegawa, Toyotaka Torii, Yohane Takeuchi, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno: A Two-Layer Model for Behavior and Dialogue Planning in Conversational Service Robots, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.3329-3335, IEEE, RSJ, Edmonton, Aug. 2005. pdf. doi:10.1109/IROS.2005.1545198
  28. Tetsuya Ogata, Hayato Ohba, Kazunori Komatani, Jun Tani, Hiroshi G. Okuno: Extracting Multi-Modal Dynamics of Objects using RNNPB Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.966-971, IEEE, RSJ, Edmonton, Aug. 2005. pdf. doi:10.1109/IROS.2005.1544975
  29. Kazunori Komatani, Ryoji Hamabe, Tetsuya Ogata, Hiroshi G. Okuno: Generating Confirmation to Distinguish Phonologically Confusing Word Pairs in Spoken Dialogue Systems Proceedings of 4th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems, pp.40-45, July 2005.
  30. Yuya Hattori, Hideki Kojima, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Gesture Generation from Environmental Sounds Using Inter-modality Mapping, Proceedings of Fifth International Workshop on Epigenetic Robotics (EpiRobo-2005), 139-140, Nara, July 2005.
  31. Takuya Yoshioka, Takafumi Hikichi, Masato Miyoshi, Hiroshi G. Okuno: Blind Estimation of Room Resonances Using Popular, Classical, and Jazz Music. Proceedings of AES 118th Convenvion, 6632, Audio Engineering Society, Barcelona, Spain, May 28-31, 2005.
  32. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Distance Based Dynamic Interaction of Humanoid Robot with Multiple People. Innovations in Applied Artificial Intelligence: Eighteenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2005) LNAI 3533, 111-120, Best paper award, Springer-Verlag. Bari, Italy, Jun. 2005. Paper pdf doi:10.1007/11504894_18
  33. Shun'ichi Yamamoto, Jean-Marc Valin Kazuhiro Nakadai, Hiroshi Tsujino, Jean Rouat, Francois Michaud, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory. Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2005), 1477-1482, IEEE, Barcelona, Apr. 2005.

    Patents

  34. Robot audiovisual system Patent No. US 6,967,455 Filing date: Mar 8, 2002 Issue date: Nov 22, 2005 Inventors: Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Okuno, Hiroaki Kitano Assignee: Japan Science and Technology Agency
  35. Speech Recongition Device, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto, Wipo Patent: WO/2005/048239, Application Number: PCT/JP2004/016883, Publication Date: 05/26/2005, Filing Date: 11/12/2004.

o Academic Year 2004o

    Thesis

  1. Yasuhiro Akiba: Automatic Evaluation Methods for Machine Translation Systems, Ph.D Thesis, Jan. 2005.

  2. Kazushi Ishihara: MS Thesis, Feb. 2005
  3. Kenri Kodaka: MS Thesis, Feb. 2005
  4. Shun'ichi Yamamoto: MS Thesis, Feb. 2005
  5. Ken Yamaguchi: MS Thesis, Feb. 2005
  6. Kazuyoshi Yoshii: Drum Sound Recognition for Polyphonic Audio Signals by Adaptation of Spectral Templates and Suppression of Harmonic Structure, MS Thesis, Feb. 2005

  7. Hayato Ohba: BE Thesis, Feb. 2005
  8. Taku Oya: BE Thesis, Feb. 2005
  9. Satoshi Kaijiri: BE Thesis, Feb. 2005
  10. Ryoji Hamabe: BE Thesis, Feb. 2005
  11. Masahiro Fujihara: BE Thesis, Feb. 2005
  12. Masamitsu Murase: BE Thesis, Feb. 2005

    Peer-Reviewed Journal Papers

  13. Tetsuya Ogata, Shigeki Sugano, and Jun Tani: Acquisition of Motion Primitives of Robot in Human-Navigation Task: Towards Human-Robot Interaction based on "Quasi-Symbol", Transactions of the Japanese Society for Artificial Intelligence, Vol.20, No.3, pp.188-196. Mar. 2005. Online Journal doi:10.1527/tjsai.20.188
  14. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Shun'ichi Yamamoto, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance, Transactions of the Japanese Society for Artificial Intelligence, Vol.20, No.3, pp.209-219, Mar. 2005. Online Journal doi:10.1527/tjsai.20.209
  15. Yasuhiro Akiba, Kenji Imamura, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Automatic Grader of MT Outputs in Colloquial Style by Using Multiple Edit Distance, (in Japanese), Transactions of the Japanese Society for Artificial Intelligence, Vol.20, No.3,pp.139-148 (2005). Online Journal doi:10.1527/tjsai.20.139
  16. Kazushi Ishihara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Recognition of Onomatopoeia for Environmental Sounds, (in Japanese), Transactions of the Japanese Society for Artificial Intelligence, Vol.20, No.3, pp.229-236, March 2005. Online Journal doi:10.1527/tjsai.20.229
  17. Teruhisa Misu, Kazunori Komatani, Youji Seita, Tatsuya Kawahara: IEICE Transaction on Information and Systems, Vol.88-D2, No.3 (Mar. 2005) 499-508, IEICE, pdf
  18. Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance, User Modeling and User-Adapted Interaction, Special Issue on Language-Based Interaction: User Modeling and Adaptation, Vol.15, No.1-2 (2005) pp.169-183, Kluwer. Abstract doi:10.1007/s11257-004-5659-0
  19. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Improvement of Recognition of Simultaneous Speech Signals Using AV Integration and Scattering Theory for Humanoid Robots, Speech Communication, Vol.44, Issues 1-4 (Oct. 2004) 97--112, Elsevier. doi:10.1016/j.specom.2004.10.010
  20. Tino Lourens, Hiroshi G. Okuno, Hiroshi Tsujino: A computational model of monkey cortical grating cells. Biological Cybernetics, Vol.92, No.1 (Jan. 2005) 61--70. Springer-Verlag. pdf doi:10.1007/s00422-004-0522-2
  21. Hiroshi G. Okuno, Kazuhiro Nakadai,Kazuhiro Nakadai, Hiroaki Kitano: Effects of increasing modalities in recognizing three simultaneous speeches, Speech Communication, Vol.43, Issues 4, pp.347-359, Sep. 2004. doi:10.1016/j.specom.2004.03.008
  22. Yasuhisa Hayakawa, Tetsuya Ogata, and Shigeki Sugano: Flexible Assembly Work Cooperating System based on Work State Identifications by Self-Organizing Map, IEEE/ASME Transactions on Mechatronics, Vol.9, No.3, accepted, Sept. 2004.
  23. Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G Okuno: User model for Adaptive Response Generation in Spoken Dialogue System, IEICE Transactions on Information and Systems, Vol.87-D2, No.10 (Oct. 2004) 1921-1928, IEICE. pdf
  24. Hiroshi G. Okuno, Kazuhiro Nakadai, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot, Applied Intelligence, Vol.20, No.3 (May/June, 2004), 253-266, doi:10.1023/B:APIN.0000021417.62541.e0, (accepted in Oct. 2002), Kluwer Publishers.
  25. Taro Watanabe, Kenji Imamura, Eiichiro Sumita, Hiroshi G. Okuno: Statistical machine translation using hierarchical phrase alignment, IEICE Transactions on Information and Systems, Vol.J87-D2, No.4 (Apr. 2004) 978-986, IEICE. pdf

    Book Chapters

  26. Kazushi Ishihara, Tomohiro Nakatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Sound-Imitation Word Recognition from Environmental Sounds focusing on Ambiguity Problem in Determining Phonemes, PRICAI 2004: Trends in Artificial Intelligence (Proc. of Eighth Pacific Rim International Conference on Artificial Intelligence), LNAI 3157, pp.909-918, Springer-Verlag, Auckland, Aug. 2004. doi:10.1007/b99563 html
  27. Kazunori Komatani, Ryosuke Itoh, Tatsuya Kawahara, Hiroshi G. Okuno: Recognition of Emotional States in Spoken Dialogue with a Robot, Innovations in Applied Artificial Intelligence:, Seventeenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems, IEA/AIE-2004, LNAI 3029, 413-423, Springer-Verlag. Ottawa, May. 2004, Springer-Verlag
  28. Tetsuya Ogata, Jun Tani: Open-end Human Robot Interaction from the Dynamical Systems Perspective: Mutual Adaptation and Incremental Learning. Innovations in Applied Artificial Intelligence:, Seventeenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems, IEA/AIE-2004, LNAI 3029, 435-444, Springer-Verlag. Ottawa, May. 2004, Springer-Verlag

    Peer-Reviewed International Conference Papers

  29. Hiroshi G. Okuno: Robot Audition: Its Issues and State of the Art (invited talk), Proceedings of 2nd International Symposium on Life Science, Kyoto, Feb. 2005.
  30. Tetsuya Ogata, Shigeki Sugano, and Jun Tani: Acquisition of Motion Primitives of Robot in Human-Navigation Task: Towards Human-Robot Interaction based on "Quasi-Symbol", Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems, 315-326, Kyoto, Nov. 2004.
  31. Tsuyoshi Tasaki, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Motion Control using Listener's Back-Channels and Head Gesture Information, Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems, 327-338, Kyoto, Nov. 2004.
  32. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic Communication of Humanoid Robot with multiple people based on Interaction Distance, Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems, 385-392, Kyoto, Nov. 2004.
  33. Yuki Suga, Hiroaki ARIE, Tetsuya Ogata, and Shigeki Sugano: Constructivist Approach to Human-Robot Emotional Communication: Design of Evolutionary Function for WAMOEBA-3, Proceedings of IEEE/RAS Interanational Conference on Humanoid Robots (Humanoids 2004), No.76, Los Angels, Nov. 2004.
  34. Yuki Suga, Tetsuya Ogata, and Shigeki Sugano: Development of Emotional Communication Robot, WAMOEBA-3, Proceedings of International Conference on Advanced Mechatronics (ICAM 2004), 413-418, Oct. 2004.
  35. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods, Proceedings of 5th International Conference on Musical Information Retrieval (ISMIR-2004), 184-191, Barcelona, Spain, Oct. 2004. pdf
  36. Takuya Yoshioka, Tetsuro Kitahara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Transcription with Concurrent Recognition of Chord Symbols and Boundaries, Proceedings of 5th International Conference on Musical Information Retrieval (ISMIR-2004), 100-105, Barcelona, Spain, Oct. 2004. pdf
  37. Tsuyoshi Tasaki, Takeshi Yamaguchi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Motion Control using Listener's Back-Channels and Head Gesture Information, Proceedings of 2004 International Conference on Spoken Language Processing (ICSLP-2004), 1033-1036, ASA, ASJ, and ESCA, Korea, Oct. 2004.
  38. Kazushi Ishihara, Yuya Hattori, Tomohiro Nakatani, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Disambiguation in Determining Phonemes of Sound-Imitation Words for Environmental Sound Recognition, Proceedings of 2004 International Conference on Spoken Language Processing (ICSLP-2004), 1485-1488, ASA, ASJ, and ESCA, Korea, Oct. 2004.
  39. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods, Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing (SAPA-2004), accepted, ASA, ASJ, and ESCA, Korea, Oct. 2004.
  40. Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno: Assessment of General Applicability of Robot Audition System by Recognizing Three Simultaneous Speeches, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004), pp.2111-2116, IEEE, RSJ, Sendai, Sep. 2004. IEEE Kansai Chapter Young Researcher Award doi:10.1109/IROS.2004.1389721
  41. Tetsuya Ogata, Masaki Matsunaga, Shigeki Sugano, and Jun Tani: Human Robot Collaboration Using Behavioral Primitives, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004), pp.1592-1597, IEEE, RSJ, Sendai, Sep. 2004. doi:10.1109/IROS.2004.1389721
  42. Yuki SUGA, Tetsuya Ogata, and Shigeki Sugano: Aquisition of Reactive Motion for Communication Robots Using Interactive EC: Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2004), accepted, Sept. 2004. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004), pp.1198-1203, IEEE, RSJ, Sendai, Sep. 2004. doi:10.1109/IROS.2004.1389721
  43. Yoshihiro Sakamoto, Tetsuya Ogata, and Shigeki Sugano: Human-Robot Communication Using Multiple Recurrent Neural Networks, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004), pp.1574-1579, IEEE, RSJ, Sendai, Sep. 2004.
  44. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic Communication of Humanoid Robot with multiple people based on Interaction Distance, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man-2004), 71-76, IEEE, Kurashiki, Sep. 2004. pdf doi:10.1109/ROMAN.2004.1374732
  45. Yuya Hattori, Kazushi Ishihara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Repeat Recognition for Environmental Sounds, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man-2004), 83-88, IEEE, Kurashiki, Sep. 2004. pdf doi:10.1109/ROMAN.2004.1374734
  46. Yusuke Akiwa, Yuki Suga, Tetsuya Ogata, and Shigeki Sugano: Imitation based Human-Robot Communication: Roles of Joint Attention and Motion Prediction, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man-2004), 283-288, IEEE, Kurashiki, Sep. 2004.
  47. Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Using a Mixture of N Best Lists from Multiple MT Systems in Rank-Sum-Based Confidence Measure for MT Outputs, Proceedings of the 20th International Conference on Computational Linguistics (Coling-2004), 322-328, Geneva, Aug. 2004.
  48. Kazunori Komatani, Teruhisa Misu, Tatsuya Kawahara, Hiroshi G. Okuno: Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface, Proceedings of the 20th International Conference on Computational Linguistics (Coling-2004), 1100-1106, Geneva, Aug. 2004.
  49. Katsuhisa Ishida, Masayuki Takeda, Tetsuro Kitahara: ism: Improvisation Supporting Systems with Melody Correction, Proceedings of the International Symposium on Musical Acoustics (NIME2004), 177-180, Hamamatsu, Jun. 2004.
  50. Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Incremental Methods to Select Test Sentence for Evaluating Translation Ability, Proceedings of the fourth international conference on Language Resources and Evaluation (LREC-2004), pp.2015-2018, Lisbon, Portugal, May 2004. pdf
  51. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Category-level Identification of Non-registered Musical Instrument Sounds, Proceedings of 2004 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2004), Vol.IV, 253-256, Montreal, May 2004. pdf doi:10.1109/ICASSP.2004.1326811
  52. Yohei Sakuraba, Tetsuro Kitahara, Hiroshi G. Okuno: Comparing Features for Forming Music Streams in Automatic Music Transcription, Proceedings of 2004 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2004), Vol.IV, 273-276, Montreal, May 2004. pdf
  53. Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Toshio Yokoyama, Hiroshi G. Okuno: Improvement of Robot Audition by Interfacing Sound Source Separation and Automatic Speech Recognition with Missing Feature Theory, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2004), 1517-1523, IEEE, New Orleans, May. 2004. pdf IEEE Robotics and Automation Society Japan Chapter Young Award doi:10.1109/ROBOT.2004.1308039
  54. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Acoustical-similarity-based Musical Instrument Hierarchy and Its Application to Musical Instrument Identification, Proceedings of the International Symposium on Musical Acoustics (ISMA2004), 297-300, Nara, Apr. 2004.

o Academic Year 2003o

    Thesis

  1. Taro Watanabe : Example-Based Statistical Machine Translation, Ph.D Thesis, Feb. 2004.

  2. Tetsuro Kitahara: , MS Thesis, Feb. 2004.
  3. Yohei Sakuraba: , MS Thesis, Feb. 2004.
  4. Mitsuhiro Sakuraba: , MS Thesis, Feb. 2004.

  5. Naoyuki Kanda: , BE Thesis, Feb. 2004.
  6. Tsuyoshi Tasaki: , BE Thesis, Feb. 2004.
  7. Shohei Matsumoto: , BE Thesis, Feb. 2004.
  8. Yuya Hattori: , BE Thesis, Feb. 2004.
  9. Takuya Yoshioka: , BE Thesis, Feb. 2004.

    Peer-Reviewed Journal Papers

  10. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Acoustic-feature-based Musical Instrument Hierarchy and Its Application to Category-level Recognition of Unknown Musical Instruments, IPSJ Journal, Vol.45, No.3 (Mar. 2004) pp.680-689, IPSJ. in html
  11. Katsuhisa Ishida, Tetsuro Kitahara, Masayuki Takeda: N-gram Based Melody Correction for Improvisation, to Category-level Recognition of Unknown Musical Instruments. IPSJ Journal, Vol.45, No.3 (Mar. 2004) pp.680-689, IPSJ. in html
  12. Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno, Michihiko Minoh: Belief Network based Disambiguation of Object Reference in Spoken Dialogue System, Transactions of the Japanese Society for Artificial Intelligence, Vol.19, No.1 F, pp.47-56 (2004). Online Journal
  13. Taro Watanabe, Eiichiro Sumita, Hiroshi G. Okuno: Decoding Algorithms for Statisitcal Machine Translation Considering Generation Directions, IPSJ Journal, Vol.44, No.12 (Dec. 2003) 3202-3210, IPSJ. in html
  14. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Musical Instrument Identification Considering Pitch-dependent Characteristics of Timbre: A Classifier Based on F0-dependent Multivariate Normal Distribution, IPSJ Journal, Vol.44, No.10 (Oct. 2003) 2448-2458, IPSJ. in html
  15. Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroshi Mizoguchi, Hiroaki Kitano: Real-time Multiple Talker Tracking by Audio-Visual Integration for Humanoids: Integration of Active Audition nad Face Recognition, Journal of Robotics Society of Japan, Vol.21, No.5 (Jul. 2003), pp.517--525. pdf, pdf at RSJ server.
  16. Kazunori Komatani, Hiroaki Kashima, Katsuaki Tanaka, Tatsuya Kawahara: Domain-independent Spoken Dialogue Platform for Database Query Using Key-phrase Spotting Based on Combined Language Model, IPSJ Journal, Vol.44, No.5 (May 2003) 1333-1342. in html
  17. Hiroshi G. Okuno, Kazuhiro Nakadai, Active audition for humanoid robots that can listen to three simultaneous talkers. Journal of the Acoustical Society of America, Vol.113, No.4, Pt.2 of 2, Apr. 2003, pp.2230. Abstract at ASA.

    Book Chapters and Survye Papers

  18. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Pitch-dependent Musical Instrument Identification and Its Application to Musical Sound Ontology, In Chung, P,W.H., Hinde, C. and Ali, M. (Eds.) Developments in Applied Artificial Intelligence, LNAI 2718, 112-122, Springer-Verlag. Proceedings of Nineteenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2003), Loughborough, UK, Jun. 2003, doi:10.1007/3-540-45034-3_12
  19. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Design and Implementation of Personality of Humanoids in Human Humanoid Non-verbal Interaction, In Chung, P,W.H., Hinde, C. and Ali, M. (Eds.) Developments in Applied Artificial Intelligence, LNAI 2718, 662-673, Springer-Verlag. Proceedings of Nineteenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2003), Loughborough, UK, Jun. 2003, doi:10.1007/3-540-45034-3_67
  20. Hiroshi G. Okuno, Kazuhiro Nakadai: Real-time Sound Source Localization and Separation based on Active Audio-Visual Integration, In Jose Mira and Jose R. Alvarez (Eds.): Computational Methods in Neural Modeling, LNCS 2686, 118-125, Springer-Verlag. The Seventh International Work Conference on Artificial and Nataural Neural Networks, IWANN 2003, Proceedings, Part 1, Ma\'{o}, Menorca,, Spain, June 2003, pdf doi:10.1007/3-540-44868-3_16
  21. Hiroshi G. Okuno, Kazuhiro Nakadai: Robot Audition: its research topics and current status, Joho SHori, Vol.44, No.11 (Nov. 2003) pp.1138-1144, IPSJ. Article in html

    Peer-Reviewed International Conference Papers

  22. Hiroshi G. Okuno, Tetsuya Ogata, Kazunori Komatani: Computational Auditory Scene Analysis and Its Application to Robot Audition, Proceedings of the International Conference on Informatics Research for Development of Knowledge Society Infrastructure (ICKS 2004), pp.73-80, Mar. 2004, doi:10.1109/ICKS.2004.1313411
  23. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroaki Kitano: Applying Scattering Theory to Robot Audition System, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2003), 1147-1152, IEEE, Las Vegas, Oct. 2003. pdf
  24. Tetsuya Ogata, Shigeki Sugano, Jun Tani: Interactive Learning in Human-Robot Collaboration, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2003), 162-167, IEEE, Las Vegas, Oct. 2003. pdf
  25. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroaki Kitano: Active audition based humanoid system and ist evaluation: Localization, Seperation and Recognition of Simultaneous Speech. Proceedings of IEEE/RSJ International Conference on Humanoids (Humanoids-2003), Springer-Verlag, IEEE, Munchen, Oct. 2003.
  26. Yohei Sakuraba, Hiroshi G. Okuno: Note Recognition of Polyphonic Music by Using Timbre Similarity and Direction Proximity. Proceedings of International Computer Music Conference (ICMC2003), 167-170, Singapore, Oct. 2003. pdf
  27. Yasuhiro AKiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Experimental Comparison of MT Evaluation Methods: RED vs. BLEU. Proceedings of MT Summit IX, 1-8, New Orleans, Sep. 2003.
  28. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Realizing Personality in Audio-Visually Triggered Non-verbal Behaviors. Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2003), 392-397, IEEE, Sep. 2003. pdf
  29. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Robot Recognizes Three Simultaneous Speech By Active Audition. Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2003), 398-403, IEEE, Sep. 2003. pdf
  30. Yasuhiro AKiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, and Hiroshi G. Okuno: Experimental Comparison of MT Evaluation Methods: RED vs. BLEU. Proceedings of MT Summit IX, 1-8, New Orleans, Sep. 2003. pdf
  31. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Improvement of Three Simultaneous Speech Recognition by Using AV Integration and Scattering Theory for Humanoid. Proceedings of Audio Visual Spoken Processing (AVSP-2003), 157-162, St. Jorioz, France, Sep. 2003. pdf
  32. Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: User Modeling in Spoken Dialogue Systems for Flexible Guidance Generation. Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003), 745-748, Geneva, Sep. 2003.
  33. Kazushi Ishihara, Yasushi Tsubota, Hiroshi G. Okuno: Automatic Transformation of Environmental Sounds into Sound-Imitation Words Based on Japanese Syllable Structure. Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003), 3185-3188, Geneva, Sep. 2003.
  34. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Three Simultaneous Speech Recognition by Integration of Active Audition and Face Recognition for Humanoid, Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003), 2705-2708, Geneva, Sep. 2003.
  35. Tatsuya Kawahara, Ryosuke Ito, Kazunori Komatani: Spoken Dialogue System for Queries on Appliance Manuals using Hierarchical Confirmation Strategy. Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003), accepted for presentation, Geneva, Sep. 2003.
  36. Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: Flexible Guidance Generation using User Model in Spoken Dialogue Systems, Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL 2003), pp.256-263, Sapporo, Jul. 2003.
  37. Taro Watanabe, Eiichiro Sumita, and Hiroshi G. Okuno: Chunk-based statistical translation, Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL 2003), pp.303-310, Sapporo, Jul. 2003.
  38. Yasuhiro AKiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, and Hiroshi G. Okuno: A Statistical-Informmation-Based Selector of the Best among Multiple Outputs, Exhibition Brochure of the 41st Annual Meeting of the ACL (ACL 2003), 16, Sapporo, Jul. 2003.
  39. Yoji Kiyota, Sadao Kurohashi, Teruhisa Misu, Kazunori Komatani, Tatsuya Kawahara, Fuyuko Kido: Dialog Navigator''A Spoken Dialog Q-A System based on Large Text Knowledge Base. ACL03 Interactive Poster/Demo Session, pp.149--152 (Companion Volume), 2003.
  40. Kazunori Komatani, Fumihiro Adachi, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: Flexible Spoken Dialogue System based on User Models and Dynamic Generation of VoiceXML Scripts. 4th SIGdial Workshop on Discourse and Dialogue, pp.87--96, 2003.
  41. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution. Proceedings of 2003 International Conference on Muotimedia and Expo (ICME 2003), IEEE, Vol.III, pp.405-409, Baltimore, MD, Jul. 2003.
  42. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution. Proceedings of 2003 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2003), Vol.5, pp.421--424, IEEE, Hong Kong, Apr. 2003. pdf

  43. Shun Tsuchiya (Ed.): "Encyclopedia AI", Feigenbaum, McCarthy. Kyoritsu Publishers, 2003.

o Academic Year 2002o

    Thesis

  1. Shinya Amano: Studies on Natural Language Processing for Kana-to-Kanji Conversion and Machine Translation, Ph.D Thesis, Feb. 2003.
  2. Kazunori Komatani: Spoken Dialogue Systems for Information Retrieval with Domain-Independent Dialogue Strategies, Ph.D Thesis, Oct. 2002.

  3. Ryosuke Ito: , MS Thesis, Feb. 2003.
  4. Takashi Sumiyoshi: , MS Thesis, Feb. 2003.
  5. Masahiro Hasegawa: , MS Thesis, Feb. 2003.
  6. Naofumi Yoshida: , MS Thesis, Feb. 2003.
  7. Ian R. Lane: Language Model Switching Based on Topic Detection for Multi-Domain Dialog Speech Recognition, MS Thesis, Feb. 2003.
  8. Yuha Aakita: , MS Thesis, Aug. 2002.

  9. Kazushi Ishihara: , BS Thesis, Feb. 2003.
  10. Tasuku Kitade: , BS Thesis, Feb. 2003.
  11. Teruhisa Misu: , BS Thesis, Feb. 2003.
  12. Shun-ichi Yamamoto: , BS Thesis, Feb. 2003.
  13. Kazuyoshi Yoshii: , BS Thesis, Feb. 2003.

    Peer-Reviewed Journal Papers

  14. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot, Applied Intelligence, Vol.20, No.3 (May/June, 2004), 253-266, doi:10.1023/B:APIN.0000021417.62541.e0 (accepted in Oct. 2002), Kluwer Publishers.
  15. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot. Applied Intelligence, Kluwer Publisher, accepted for publication, International Society for Applied Intelligence, 2003. doi:10.1007/3-540-45324-5_19
  16. Hiroshi G. Okuno, Kazuhiro Nakadai, Ken'ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano: Human-Robot Non-Verbal Interaction Empowered by Real-Time Auditory and Visual Multiple-Talker Tracking Advanced Robotics, Vol.17, No.2, pp.115-130, VSP and Robotics Society of Japan, 2003. doi:10.1163/156855303321165088 Online version, pdf
  17. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Issues in Humanoid Audition and Sound Source Localization by Active Audition. Transaction of JSAI, Vol.18, No.2 F, pp.103-110 (Mar. 2003). Online Journal
  18. Kazunori Komatani, Tatsuya Kawahara: , IPSJ Journal, Vol.43, No.10, pp.3078--3086, 2002. in html
  19. Ryosuke Ito, Kazunori Komatani, Tatsuya Kawahara: , IPSJ Journal, Vol.43, No.7, pp.2147--2154, 2002. in html
  20. Masahiro Hasegawa, Yuya Akita, Tatsuya Kawahara: , IPSJ Journal, Vol.43, No.7, pp.2222-2229, 2002. html
  21. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Auditory and Visual Multiple-Speaker Tracking For Human-Robot Interaction. Journal of Robotics and Mechatronics, special issue on Human Robot Interaction, Vol.14, No.5 (2002) 479-489, Mechatronics Society of Japan.
  22. Kentaro Umesawa, Takamichi Saito, Hiroshi G. Okuno: , IPSJ Journal, Vol.43, No.8 (Aug. 2002) 1553-1562. in html

    Books, Book Chapters and Survye Papers

  23. Yuasa, T. and Okuno, H.G. (Eds.): Advanced Lisp Technology, Advanced Information Processing Technology, Vol.4, Taylor and Francis Publishers, London, UK, May, 2002.
  24. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Realizing Audio-Visually triggered ELIZA-like non-verbal Behaviors. In Ishizuka, M. and Slaney, J. (eds) PRICAI-2002 Topics in Artificial Intelligence, LNAI 2417, 552--562, Springer-Verlag, Tokyo, Aug. 2002. pdf doi:10.1007/3-540-45683-X_59
  25. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Social Interaction of Humanoid Robot through Auditory and Visual Tracking. In Hendtlass, T., and Ali, M. (Eds.) Developments in Applied Artificial Intelligence (IEA/AIE-2002), Cairns, Australia, June 2002, LNAI 2358, pp.725-735, Springer-Verlag. pdf doi:10.1007/3-540-48035-8_70

    Peer-Reviewed Conference Papers

  26. Takamichi Saito, Toshiyuki Kitoh, Kentaro Umesawa, Hiroshi G. Okuno: Privacy-Enhanced SPKI Access Control on PKIX and Its Application to Web Server. Proc. of the Seventeenth International Conference on Advanced Information Networking and Applications (AINA 2003), 696--703, IEEE, Xi'an, China. Paper. doi:10.1109/AINA.2003.1192970
  27. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Auditory Fovea Based Speech Separation and Its Application to Dialog System. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2002), 1320-1325, IEEE, Geneva, Oct. 2002. doi:10.1109/IRDS.2002.1043937
  28. Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno: BELIEF NETWORK BASED DISAMBIGUATION OF OBJECT REFERENCE IN SPOKEN DIALOGUE SYSTEM FOR ROBOT. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 170-176, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  29. Yasushi Tsubota, Tatsuya Kawahara, Hiroshi G. Okuno, Masatake Dantsuji: RECOGNITION AND VERIFICATION OF ENGLISH BY JAPANESE STUDENTS FOR COMPUTER-ASSISTED LANGUAGE LEARNING SYSTEM. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 1205-1208, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  30. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: AUDITORY FOVEA BASED SPEECH ENCHANCEMENT AND ITS APPLICATION TO HUMAN-ROBOT DIALOG SYSTEM. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 1817-1820, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  31. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: REAL-TIME SOUND SOURCE LOCALIZATION AND SEPARATION FOR ROBOT AUDITION. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 193-196, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  32. Taro Watanabe, Eiichiro Sumita: Statistical Machine Translation Decoder Base On Phrase. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 1889-1892, Spec3Co2, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  33. Kazunori Komatani, Tatsuya Kawahara, Ryosuke Ito, Hiroshi G. Okuno: Efficient Dialogue Strategy to Find Users' Intended Items from Information Query Results, Proceedings of the Nineteenth International Conference on Computational Linguistics (Coling-2002), Vol.1, pp.481-487, Aug. 2002.
  34. Taro Watanabe, Eiichiro Sumita: Bidirectional Decoding for Statistical Machine Translation. Proceedings of the Nineteenth International Conference on Computational Linguistics (Coling-2002), pp.1079- Aug. 2002.
  35. Yasuhiro Akiba, Taro Watanabe, Eiichiro Sumita: Using Language and Translation Models to Select the Best among Outputs from Multiple MT Systems, Proceedings of the Nineteenth International Conference on Computational Linguistics (Coling-2002), pp. Aug. 2002.
  36. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Exploiting Auditory Fovea in Humanoid-Human Interaction. Proceedings of Eighteenth National Conference on Artificial Intelligence (AAAI-2002), 431-438, AAAI, Edmonton, Aug. 2002. pdf AAAI Contents
  37. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Non-Verbal Eliza-like Human Behaviors in Human-Robot Interaction through Real-Time Auditory and Visual Multiple-Talker Tracking. Proceedings of the Third International Workshop on Cognitive Robotics (CogRob-2002), AAAI, Edmonton, Jul. 2002. pdf
  38. Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno: Belief Network based Disambiguation of Word Reference in Spoken Dialogue System for Robot. Proceedings of ISCA Tutorial and Research Workshop on Multi-Modal Dialogue in Mobile Environments, Germany, Jun. 2002.
  39. Kazuhiro Nakadai, Ken'ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time speaker localization and speech separation by Audio-Visual Integration, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2002), pp.1043-1049, IEEE, May 2002. pdf doi:10.1109/ROBOT.2002.1013493

o Academic Year 2001o

    Thesis

  1. Hirofumi Adachi: , MS Thesis, Feb. 2002.
  2. Yoko Yamakata: , MS Thesis, Feb. 2002.
  3. Raux Antoine Roland: Intelligibility Assessment and Adaptive Drill Generation for a Computer-Assisted Pronunciation Learning System, MS Thesis, Feb. 2002.

  4. Shinichi Ueno: , BS Thesis, Feb. 2002.
  5. Yohei Sakuraba: , BS Thesis, Feb. 2002.
  6. Kazuya Shitaoka: , BS Thesis, Feb. 2002.
  7. Masahiro Yokoo: , BS Thesis, Feb. 2002.

    Peer-Reviewed Journal Papers

  8. Hiroshi G. Okuno, Kazuhiro Nakadai, Lourens, T., Hiroaki Kitano: Sound and Visual Tracking by Active Audition. in Jin, Q., Li, J., Zhang, N., Cheng, J., Yu, C., and Noguchi, N (eds) Enabling Society with Information Technology pp.174-185, Springer-Verlag, 2002.
  9. Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: , Trans. IEICE, Vol.J84-D1, No.11 (Nov. 2001) pp.1553-1562, IEICE, pdf
  10. Kentaro Umesawa, Takamichi Saito, Hiroshi G. Okuno: , IPSJ Journal, Vol.42, No.8 (Aug. 2001) pp.2067-2076. TAF Telecom Technology Student Award
  11. Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Koichi Takeda, N. Minematsu, Shigeki Sagayama, Katsuya Itou, A. Ito, M. Yamamoto, A. Yamada, T.Utsuto, Kiyohiro Shikano: Japanese Dictation ToolKit -- 1999 version --, Journal of Acoustic Society of Japan, Vol.57, No.3, pp.210--214, 2001
  12. M. Mimura and Tatsuya Kawahara: Difference of acoustic modeling for read speech and dialogue speech, Acoustical Science & Technology, Vol.22, No.5, pp.373--374, 2001.

    Translated Books, Book Chapters and Survey Papers

  13. Hiroshi G. Okuno, Kazuhiro Nakadai: , JSAJ, Vol.58, No.3 (Mar. 2002) pp.205-210.
  14. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot, Engineering of Intelligent Systems: 14th International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems, IEA/AIE-2001, Proceedings, Budapest, Hungary, June 2001. LNAI 2070, 640-650, Springer. Best Paper Award (1st Place) doi:10.1007/3-540-45034-3_67
  15. Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Detection of Oriented Repetitive Alternating Patterns in Color Images -- A Computational Model of Monkey Grating Cells. Connectionist Models of Neurons, Learning Processes and Artificial Intelligence: Sixth International Work-Conference on Artificial and Natural Neural Networks, IWANN 2001, Proceeding, Part I, Granada, Spain, June 2001. LNCS 2084, 95-107, Springer-Verlag. doi:10.1007/3-540-45720-8_12/
  16. Ian Frank, Kumiko Tanaka, Hiroshi G. Okuno, Jun'ichi Akita, Yukiko Nakagawa, K. Maeda, Kazuhiro Nakadai, Hiroaki Kitano: And The Fans are Going Wild! SIG plus MIKE. RoboCup 2000: Robot Soccer World Cup IV, LNAI 2019, 139-148, Springer-Verlag, May 2001. doi:10.1007/3-540-45324-5_12
  17. Yukiko Nakagawa, Hiroshi G. Okuno, Hiroaki Kitano: Bridging gap between small sized league and simulator league. RoboCup 2000: Robot Soccer World Cup IV, LNAI 2019, 209-218, Springer-Verlag, May 2001. doi:10.1007/3-540-45324-5_19
  18. Hiroaki Kitano: Hiroshi G. Okuno, Mineo Morohashi, Koji Kyoda, Kazuhiro Nakadai : PC Cluster -- Beowulf Sangyo Publisher, 2001.

    Peer-Reviewed International Conference Papers

  19. Kazuhiro Nakadai, Ken'ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Active Human Tracking by Hierarchical Integration of Audition and Vision. Proceedings of IEEE-RAS International Conference on Humanoid Robots (Humanoids2001), pp.91-98, IEEE, Nov. 2001. pdf
  20. Kazuhiro Nakadai, Tatsuya Matsui, Hiroshi G. Okuno, Hiroaki Kitano: Epipolar Geometry Based Sound Localization and Extraction for Humanoid Audition. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2001), 1395-1401, IEEE and RSJ, Oct. 2001. pdf doi:10.1109/IROS.2001.977176
  21. Hiroshi G. Okuno, Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano: Human-Robot Interaction Through Real-Time Auditory and Visual Multiple-Talker Tracking Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2001), 1402-1409, IEEE and RSJ, Oct. 2001. pdf Nakamura Award for IROS-2001 Best Paper Nomination Finalist (2nd or 3rd Place) at IROS-2002 doi:10.1109/IROS.2001.977177
  22. Tino Lourens, Hiroshi G. Okuno, Hiroaki Kitano: Automatic Graph Extraction from Color Images. Proc. of 11th International Conference Image Analysis and Processing (ICIAP 2001), pp.302-308, Granada, Spain, June 2001.
  23. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Multiple Speaker Tracking by Multi-Modal Integration for Mobile Robots. Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.2643-2646, Sep. 2001. pdf
  24. Hiroshi G. Okuno, Kazuhiro Nakadai, T. Lourens, Hiroaki Kitano: Separating Three Simultaneous Speeches with Two Microphones by Integrating Auditory and Visual Processing. Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.1193-1196, Sep. 2001. pdf
  25. Akinobu Lee, Tatsuya Kawahara, and Kiyohiro Shikano: Gaussian mixture selection using context-independent HMM, Proc. IEEE-ICASSP, pp.69--72, 2001.
  26. Hiroaki Nanjo, Tatsuya Kawahara: Speaking-Rate Dependent Decoding and Adaptation for Spontaneous Lecture Speech Recognition, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), pp.725--728, 2002.
  27. Kazunori Komatani, K.Tanaka, H.Kashima, and Tatsuya Kawahara: Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model, Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.1319--1322, 2001.
  28. Akinobu Lee, Tatsuya Kawahara, and Kiyohiro Shikano: Julius -- an open source real-time large vocabulary recognition engine, Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.1691--1694, 2001
  29. Hiroaki Nanjo, Kazuomi Kato, and Tatsuya Kawahara: Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition, Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.2531--2534, 2001
  30. Tatsuya Kawahara, Hiroaki Nanjo, and S.Furui: Automatic transcription of spontaneous lecture speech, Proc. IEEE workshop on Automatic Speech Recognition and Understanding, 2001.
  31. Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Auditory and Visual Multiple-Object Tracking for Robots. Proc. of 17th International Joint Conference on Artificial Intelligence (IJCAI-01) , 1425-1432, Seattle, Aug. 2001. 電気通信普及財団テレコム技術賞奨励賞 pdf
  32. Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Detection of Oriented Repetitive Alternating Patterns in Color Images -- A Computational Model of Monkey Grating Cells. Proc. of Sixth International Work-Conference on Artificial and Natural Neural Networks (IWANN2001), Granada, Spain, June 2001. LNCS 2084, 95-107, Springer-Verlag.
  33. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot, Proc. of 17th International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2001) , Budapest, Hungary, June 2001. Lecture Notes in Artificial Intelligence No.2070, 640-650, Springer. Best Paper Award (1st Place)
  34. Ian Frank, Kumiko Tanaka, Hiroshi G. Okuno, Jun'ichi Akita, Yukiko Nakagawa, K. Maeda, Kazuhiro Nakadai, Hiroaki Kitano: And The Fans are Going Wild! SIG plus MIKE. RoboCup 2000: Robot Soccer World Cup IV, Lecture Notes in Artificial Intelligence No.2019, 139-148, Springer-Verlag, May 2001.
  35. Yukiko Nakagawa, Hiroshi G. Okuno, Hiroaki Kitano: Bridging gap between small sized league and simulator league. RoboCup 2000: Robot Soccer World Cup IV, Lecture Notes in Artificial Intelligence No.2019, 209-218, Springer-Verlag, May 2001.
  36. Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: An Access Control with Handling Private Information Server. Proc. of the First International Workshop on Internet Computing and E-Commerce (ICEC01), IEEE, San Francisco, April 2001.

o Before Academic Year 2000o

    Peer-Reviewed Journal Papers

  1. Takahide Hoshide, Masayoshi Nose, Hisazumi Tsuchida, Kisaku Fujimoto, Hiroshi G. Okuno: Adaptive real-time planning for multimedia communication services by multiagent system, Electronics and Communications in Japan (Part I: Communications), Vol.84, No.2 (Feb. 2001) pp.90-98, Weiley. doi:10.1002/1520-6424(200102)84:2<90::AID-ECJA10>3.0.CO;2-C
  2. Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: Listening to two simultaneous speeches, Speech Communication, Vol.27, Issue 3-4 (Apr. 1999) pp.299-310, doi:10.1016/S0167-6393(98)00080-6
  3. Hiroshi G. Okuno, Shin-ichi Minato, Hideki Isozaki: On the propertyies of combination set operations, Information Processing Letters, Vol.66, Issue 4 (May 1998) pp.195-199, Elsevier. doi:10.1016/S0020-0190(98)00067-2
  4. Hiroshi G. Okuno: Experience of parallel AI programming with parallel Lisp, Future Generation Computer Systems, Vol.7, Issues 2-3 (April 1992) pp.211-219, doi:10.1016/0167-739X(92)90008-Y
  5. Hiroshi G. Okuno, Nobuyasu Osato, Ikuo Takeuchi: Firmware approach to fast Lisp interpreter, SIGMICRO Newsletter, Vol.19, Issue 1-2 (June 1988) pp.5-11, ACM, pdf ACM DL
  6. Ikuo Takeuchi, Hiroshi G. Okuno, Nobuyasu Osato: TAO: a harmonic mean of Lisp, Prolog and Smalltalk, SIGPLAN Notices, Vol.18, Issue 7 (July 1983) pp.65-74, pdf ACM DL

    Books, Book Chapters and Survey Papers

  7. Hiroshi G. Okuno, Koji M. Kyoda, Mineo Morohashi, and Hiroaki Kitano: Initial Assessment of ERATO-1 Beowulf-Class Cluster, Ito, T. and Yuasa, T. (eds.) Parallel and Distributed Computing in Symbolic and Irregular Applications, World Scientific Publishing, 372--383, 2000.
  8. Kazuhiro Nakadai, Lourens, T., Hiroshi G. Okuno, Hiroaki Kitano: Humanoid Active Audition System Improved by The Cover Acoustics. In Mizoguchi, R. and Slaney, J. (eds) "PRICAI 2000 Topics in Artificial Intelligence" Lecture Notes in Artificial Intelligence 1886, 544--554, Springer-Verlag, Melborne, Aug. 2000. doi:10.1007/3-540-44533-1_55
  9. Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: Cocktail-Party Effect with Computational Auditory Scene AAnalysis -- Preliminary Report -- Advances in Human Factors/Ergonomics, Vol.20, Part2, 1995, pp.503-508, doi:10.1016/S0921-2647(06)80266-2

    Peer-Reviewed International Conference Papers

  10. Hiroshi G. Okuno, Kazuhiro Nakadai, T. Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid. Proceedings of 2000 International Conference on Information Society in the 21st Century: Emerging Technologies and New Challenges, Univ. of Aizu, 254--261, Aizu-Wakamatsu, Nov. 2000.
  11. Kazuhiro Nakadai, Matsui, T., Hiroshi G. Okuno, Hiroaki Kitano: Active Audition System and Humanoid Exterior Design. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2000), IEEE, 1453--1461, Takamatsu, Nov. 2000. doi:10.1109/IROS.2000.893225
  12. Hiroaki Kitano, Hiroshi G. Okuno, Kazuhiro Nakadai, Sabisch, T., and Matsui, T.: Design and Architecture of SIG the Humanoid: An Experiemntal Platformfor Integratind Perception in RoboCup Humanoid Challenge. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2000), IEEE, 181--190, Takamatsu, Nov. 2000. doi:10.1109/IROS.2000.894602
  13. Fermin, I., Ishiguro, H., Hiroshi G. Okuno, Hiroaki Kitano: A Framework for Integrating Sensory Information in a Humanoid Robot. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2000), IEEE, 1748--1753, Takamatsu, Nov. 2000. doi:10.1109/IROS.2000.894602
  14. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Humanoid Active Audition System. Proceedings of First IEEE-RAS Internationa l Conference on Humanoid Robots (Humanoids2000), Cambridge, Sep. 2000.
  15. T. Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Selective Attention by Integration of Vision and Audition. Proceedings of First IEEE-RAS Internationa l Conference on Humanoid Robots (Humanoids2000), Cambridge, Sep. 2000.
  16. Frank, I., Tanaka-Ishii, K., Hiroshi G. Okuno,Nakagawa, Y., Meada, K., Kazuhiro Nakadai, Hiroaki Kitano: And The Fans are Going Wild! SIG plus MIKE. Proceedings of the Fourth International Workshop on RoboCup (RoboCup-2000), LNCS 2019, 267--276, Springer-Verlag, Melbourne, Aug. 2000.
  17. Nakagawa, Y., Hiroshi G. Okuno, Hiroaki Kitano: Bridging gap between small sized league and simulator league. Proceedings of the Fourth International Workshop on RoboCup (RoboCup-2000), LNCS 2019, 1--10, Springer-Verlag Melborne, Aug. 2000.
  18. Kazuhiro Nakadai, T. Lourens, Hiroshi G. Okuno, Hiroaki Kitano: Active Audition for Humanoid. Proceedings of the Seventeenth National Conference on Artificial Intelligence (AAAI-2000), 832--839, AAAI, Austin, Aug. 2000. AAAI Contents
  19. Yukiko Nakagawa, Hiroshi G. Okuno, Hiroaki Kitano: Using Vision to Improve Sound Source Separation, Proceedings of the Sixteenth National Conference on Artificial Intelligence (AAAI-1999), 768--773, AAAI, Orlando, Aug. 1999. AAAI Contents
  20. Tomohiro Nakatani, Hiroshi G. Okuno: Sound Ontology for Computational Auditory Scence Analysis, Proceedings of the Fifteenth National Conference on Artificial Intelligence (AAAI-1998), 1004--1009, AAAI, Madison, Aug. 1998. AAAI Contents
  21. Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: Understanding Three Simultaneous Speeches Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-1997), Vol.1, pp.30--35, IJCAI, Nagoya, Aug. 1997. IJCAI Contents pdf
  22. Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: Interfacing Sound Stream Segregation to Automatic Speech Recognition -- Preliminary Results on Listening to Several Sounds Simultaneously, Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-1996), 1082--1089, AAAI, Portland, Aug. 1996. AAAI Contents
  23. Tomohiro Nakatani, Hiroshi G. Okuno, Takeshi Kawabata: Residue-Driven Architecture for Computational Auditory Scene Analysis Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence (IJCAI-1995), 165--174, IJCAI, Montreal, Aug. 1997. IJCAI Contents pdf
  24. Tomohiro Nakatani, Hiroshi G. Okuno, Takeshi Kawabata: Auditory Stream Segregation in Auditory Scene Analysis with a Multi-Agent System, Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-1994), 1082--1089, AAAI, Seattle, Aug. 1994. AAAI Contents
  25. Saito, T., Umesawa, K., Hiroshi G. Okuno: Privacy Enhanced Access Control by SPKI, Proc. of the Seventh International Conference on Parallel and Distributed Systems: International Workshop on Next-Generation Internet Technologies and Applications 2000 (NGITA00), 301--306, IEEE, Iwate, July. 2000. doi:10.1109/PADSW.2000.884605
  26. Saito, T., Umesawa, K., Hiroshi G. Okuno: Privacy-Enhanced Access Control by SPKI and Its Application to Web Server. Proc. of Ninth IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises, IEEE, 201--206, NIST, Maryland, June 2000. doi:10.1109/ENABL.2000.883729
  27. Hiroaki Kitano, Hiroshi G. Okuno, Kazuhiro Nakadai, Fermin, I., Sabisch, T., Nakagawa, Y., and Matsui, T.: Designing a huymanoid head for RoboCup challenge. Proceedings of the Fourth International Conference on Autonomous Agents (Agents2000), ACM, pp.17--18, Barcelona, June 2000. pdf ACM DL
  28. Tomohiro Nakatani, Masataka Goto, Hiroshi G. Okuno: Localization by harmonic structure and its application to harmonic sound stream segregation, Proceedings of 1996 International Conference on Acoustics, Speech, and Signal Processing (ICASSP-1996), Vol.2, 653-656, ASA, ASJ, and ESCA, Atlanta, doi:10.1109/ICASSP.1996.543205
  29. Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: A new speech enhancement: speech stream segregation, Proceedings of 1996 International Conference on Spoken Language Processing (ICSLP-1996), Vol.4, 2356-2359, ASA, ASJ, and ESCA, Yokohama, doi:10.1109/ICSLP.1996.607281
  30. Tomohiro Nakatani, Takeshi Kawabata, Hiroshi G. Okuno: A Computational Model of Sound Stream Segregation with Multi-Agent Paradigm, Proceedings of 1995 International Conference on Acoustics, Speech and Signal Processing (ICASSP'1995), Vol.4, pp.2671--2674, IEEE, 1995. doi:10.1109/ICASSP.1995.480111
  31. Hiroshi G. Okuno, Anoop Gupta: Parallel execution of OPS5 in QLISP, Proceedings of the Fourth Conference on Artificial Intelligence Applications, pp.268-273, IEEE, 1988. doi:10.1109/CAIA.1988.196114
  32. Hiroshi G. Okuno, Nobuyasu Osato, Ikuo Takeuchi: Firmware approach to fast Lisp interpreter, Proceedings of the 20th Annual Workshop on Microprogramming (MICRO-20), pp.1-11, ACM, Boulder, 1987. pdf ACM DL
  33. Hiroshi G. Okuno, Ikuo Takeuchi, Nobuyasu Osato, Yasushi Hibino, Kazufumi Watanabe: TAO: A fast interpreter-centered system on LISP machine ELIS Proceedings of the 1984 ACM Symposium on LISP and functional programming (LFP'84) Austin, Texas, pp.140-149, 1984. pdf ACM DL

ACM author profile page fun Most Prolific DBLP Authors


Last modified: Wed Aug 15 23:23:58 JST 2012


(C) Copyleft All Wrongs Reserved, 2001-2010.