Publication and Awards of Okuno Laboratory


[ AY2009|AY2008 |AY2007 | AY2006 | AY2005 | AY2004 | AY2003 | AY2002 | AY2001 | (Before AY2000) | Japanese | Awards ]


DBLP: H.G. Okuno, T. Ogata, K. Komatani, T. Takahashi, S. Nishide, R. Takeda, K. Itoyama, T. Yoshioka, H. Fujihara, T. Mizumoto Katsumaru, Yasuraoka,
OB: S. Shiramatsu, S. Ikeda, H. Kanda Y. Kubota H-D. Kim, S. Yamamoto, K. Yoshii, R. Yokoya S. Naito, T. Kitahara, H. Niwa, M. Yoshida, T. Tasaki, S. Matsumoto, Valin, Y. Akiba, T. Watanabe, K. Ishihara, Kodaka, M. Toda, T. Misu, I. Lane, Y. Akita, Y. Yamakata, A. Raux, Ito, T. Kawahara,

o Academic Year 2009o

Thesis | Journal Papers | Book Chapters | International Conferences | Domestic Conferences | Patents

    Peer-reviewed Journal Papers

  1. Kazuhiro Nakadai, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: Design and Implementation of Robot uAudition System "HARK", Advanced Robotics, accepted, Sep. 2009, VSP and Robotics Society of Japan.
  2. Hyun-Don Kim, Jinsung Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Target Speech Detection and Separation for Communication with Humanoid Robots in Noisy Home Environments, Advanced Robotics, in print, VSP and Robotics Society of Japan. doi:10.1163/016918609X12529300552105,
  3. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Self-Organization of Dynamic Object Features based on Bi-Directional Training, Advanced Robotics, Vol.23 (2009) 2035-2057. doi:10.1163/016918609X12529289797027, VSP and Robotics Society of Japan.
  4. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Autonomous Motion Generation based on Reliable Predictability, Journal of Robotics and Mechatronics, special issue on Kukanchi Interactive Human-Space Design and Intelligence Dedicated to Dr. Kazuo Tanie, Vol.21, No.4 (2009) 478-488.
  5. Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot audition by multi-channel input Independent Component Analysis (in Japanese), Journal of Robotics Society of Japan, Vol.27, No.7/8 (2009) accepted.
  6. Kazumasa Murata, Kazuhiro Nakadai, Ryu Takeda, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: Musical Beat-Tracking for Robots and Its Application to A Music Robot, Journal of Robotics Society of Japan, Vol.27, No.7/8 (2009) accepted.
  7. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Simulation of Phoneme Aquisition Process (in Japanese), Journal of Robotics Society of Japan, Vol.27, No.7/8 (2009) accepted.
  8. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions IPSJ Journal, Vol.50, No.7 (Jul. 2009) 1757-1767, IPSJ. Journal of Information Processing, Vol.17 (2009) 191-201, IPSJ. pdf D-Library
  9. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Binaural Active Audition for Humanoid Robots to Localize Speech over Entire Azimuth Range, Applied Bionics and Biomechanics, Special Issue on "Humanoid Robots", accepted with minor modifications, Taylor & Francis, Mar. 2009.
  10. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Human Tracking System Integrating Sound and Face Localization using EM Algorithm in Real Environments, Advanced Robotics, Vol.23, No.6 (May 2009) 629-653, doi:10.1163/156855309X431659, VSP and Robotics Society of Japan.

    Book Chapters, Reviews

  11. Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Adjusting Occurence Probabilities of Automatically-Generated Abbreviated Words in Spoken Dialogue Systems, B.-C. Chien, T.-P. Hong, S.-M. Chen, M. Ali (Eds.): Next-Generation Applied Intelligence, 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, Lecture Notes in Artificial Intelligence 5579, pp.481-490, Tainan, Taiwan, Jun. 24-27, 2009. doi:10.1007/978-3-642-02568-6_49
  12. Shun Shiramatsu, Yuji Kubota, Kazunori Komatani, Tetsuya Ogata, Toru Takahashi, Hiroshi G. Okuno: Visualization-based Approaches to Support Context Sharing towards Public Involment Support System, Opportunities and Challenges for Next-Generation Applied Intelligence, Studies in Computational Intelligence, Springer, Vol.214, pp.111--117, Tainan, Taiwan, Jun. 24-27, 2009. doi:10.1007/978-3-540-92814-0_18
  13. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: A Model of Temporally Changing User Behaviors in a Deployed Spoken Dialogue System, G.-J. Houben et al. (Eds.): UMAP 2009, First and Seventeenth International Conference on User Modeling, Adaptation, and Personalization, Lecture Notes in Computer Science 5535, pp.408-414, Trento, Italy, Jun. 22-26, 2009.

    Peer-reviewed Conference Papers

  14. Akira Maezawa, Katsutoshi Itoyama, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Bowed String Sequence Estimation of a Violin Based on Adaptive Audio Signal Classification and Context-Dependent Error Correction, Proceedings of IEEE International Symposium on Multimedia (ISM2009), accepted for full paper presentation (acceptance rate for full papers, 19.6%), San Diego, Dec. 14-16, 2009.
  15. Takuma Ohtsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Voice quality manipulation for humanoid robots consistent with their head movements, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), accepted, IEEE, Paris, Dec. 7-10, 2009
  16. Takumi Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: Automatic Speech Recognition Improved by Two-Layered Audio-Visual, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), accepted, IEEE, Paris, Dec. 7-10, 2009.
  17. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Estimation of Reverberation Time with Robot Speech to Improve ICA-based Robot Audition, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), accepted, IEEE, Paris, Dec. 7-10, 2009.
  18. Hiromasa Fujihara, Masataka Goto, Hiroshi G. Okuno: A NOVEL FRAMEWORK FOR RECOGNIZING PHONEMES OF SINGING VOICE IN POLYPHONIC MUSIC, Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), accepted, Oct. 18-21, New Paltz, NY, 2009.
  19. Takuya Yoshioka, Hirokazu Kameoka, Tomohiro Nakatani, Hiroshi G. Okuno: Statistical models for speech dereverberation, Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), accepted, Oct. 18-21, New Paltz, NY, 2009.
  20. Naoki Yasuraoka, Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Changing Timbre and Phrase in Existing Musical Performances as You Like, ACM Multimedia 2009, 203-212 (16% 22/138), Beijing, China, Oct. 19-24, 2009. pdf, doi:10.1145/1631272.1631302
  21. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Step-size Parameter Adaptation of Multi-channel Semi-blind ICA with Piecewise Linear Model for Barge-in-able Robot Audition (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2273-2282, (900/1650), IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009. pdf
  22. Takuma Ohtsuka, Kazumasa Murata, iToru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Incremental Polyphonic Audio to Score Alignment using Beat Tracking for Singer Robots (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2289-2296, IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009. pdf
  23. Takeshi Mizumoto, Hiroshi Tsujino, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Thereminist Robot: Development of a Robot Theremin Player with Feedforward and Feedback Arm Control based on a Theremin's Pitch Model (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2297-2302, IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009. pdf
  24. Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2730-2735, IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009. pdf
  25. Wataru Hinoshita, Tetsuya Ogata, Hideki Kozima, Hisashi Kanda, Toru Takahashi, Hiroshi G. Okuno: Emergence of Evolutional Interaction with Voice and Motion between Two Robots using RNN, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.4196-4291, IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009. pdf
  26. Shun Nishide, Tetsuhiro Nakagawa, Tetsuya Ogata, Jun Tani, Toru Takaahashi, Hiroshi G. Okuno: Modeling Tool-Body Assimilation using Second-order Recurrent Neural Network, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.5376-5381, (900/1650), IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009. pdf
  27. Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Phoneme Acquisition Model based on Vowel Imitation using Recurrent Neural Network, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.5388-5393, IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009. pdf
  28. Kazunori Komatani, Satoshi Ikeda, Yuichiro Fukubayashi, Tetsuya Ogata, Hiroshi G. Okuno: Ranking Help Message Candidates Based on Robust Grammar Verification Results and Utterance History in Spoken Dialogue Systems, Proceedings of the 10th SIGdial Workshop on Discourse and Dialogue (SigDial 2009), 314-321, Sep. 12, 2009.
  29. Kyoko Matsuyama, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Enabling A User To Specify An Item At Any Time During System Enumeration, Proceedings of International Conference on Spoken Language Processing (Interspeech-2009), Mon-Ses2-P4-1, (57.7%), Brighton, 6-10 Sep. 2009. pdf
  30. Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, Kotaro Funakoshi, Tetsuya Ogata, Hiroshi G. Okuno: Improving Speech Understanding Accuracy with Limited Training Data Using Multiple Language Models and Multiple Understanding Models, Proceedings of International Conference on Spoken Language Processing (Interspeech-2009), Thu-Ses1-P4-9, (57.7%), Brighton, 6-10 (10) Sep. 2009. pdf
  31. Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nishimura, Toshio Irino: Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion, Proceedings of International Conference on Spoken Language Processing (Interspeech-2009), Thu-Ses1-P2-6, (57.7%), Brighton, 6-10 Sep. 2009. pdf
  32. Hiroshi G. Okuno, Kazuhiro Nakadai, Hyun-Don Kim: Robot Auditon: Missing Feature Theory Approach and Active Audition (Invited talk), Proceeding of the 14th International Symposium of Robotics Research (ISRR 2009), August 31 - September 3, 2009, Lucerne, Switzerland, International Foundation of Robotics Research.
  33. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: QUERY-BY-EXAMPLE MUSIC RETRIEVAL APPROACH BASED ON MUSICAL GENRE SHIFT BY CHANGING INSTRUMENT VOLUME, Proceeding of the 12th International Conference on Digital Audio Effects (DAFx-09), accepted, Como, Italy, Sep.1-4. 2009.
  34. Shun Shiramatsu, Tadachika Ozano, Toramatsu Shintani, Kazunori Komatani, Tetsuya Ogata, Toru Takahashi: Hiroshi G. Okuno: Development of a Meeting Browser towards Supporting Public Involvement, Proceedings of International Conference on Computational Science and Engineering, Vol.4, 717-722 (Aug. 2008), IEEE pdf, doi:10.1145/10.1109/CSE.2009.362
  35. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Analysis of Motion Searching based on Reliable Predictability using Recurrent Neural Network, Proceedings of 2009 IEEE/ASME Conference on Advanced Intelligent Mechatronics (AIM 2009), 192-197, Singapore, July 14-19, 2009. doi:10.1145/10.1109/AIM.2009.5230015
  36. Kazunori Komatani, Alexander I. Rudnicky: Predicting Barge-in Utterance Errors by using Implicitly-Supervised ASR Accuracy and Barge-in Rate per User, Proceedings of the Fourth International Joint Conference on Natural Language Processing (ACL-IJCNLP 2009), accepted as a short paper, Jul. 2009.
  37. Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, Kotaro Funakoshi, Hiroshi G. Okuno: A Speech Understanding Framework that Uses Multiple Language Models and Multiple Understanding Models, Proceeding of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 conference, accepted, (73/180) Boulder, CO, May 31 - Jun. 5, 2009.
  38. Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Prediction and Imitation of Other's Motions by Reusing Own Forward-Inverse Model in Robots, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2009), pp.4144-4149, (699/1624), (May 12-17 (16), 2009), Kobe. pdf doi:10.1145/10.1109/ROBOT.2009.5152363
  39. Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Continuous Vocal Imitation with Self-organized Vowel Spaces in Recurrent Neural Network, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2009), pp.4438-4443, (May 12-17 (16), 2009), Kobe. pdf doi:10.1145/10.1109/ROBOT.2009.5152818
  40. Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: ICA-BASED EFFICIENT BLIND DEREVERBERATION AND ECHO CANCELLATION METHOD FOR BARGE-IN-ABLE ROBOT AUDITION, Proceedings of 2009 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2009), SS-L7.1, pp.3677-3680, (1178/2633), Taipei, Taiwan, April 19--24 (23), 2009. pdf doi:10.1145/10.1109//ICASSP.2009.4960424
  41. Hideki Kawahara, Ryuichi Nisimura, Toshio Irino, Masanori Morise, Toru Takahashi, Hideki Banno: TEMPORALLY VARIABLE MULTI-ASPECT AUDITORY MORPHING ENABLING EXTRAPOLATION WITHOUT OBJECTIVE AND PERCEPTUAL BREAKDOWN, Proceedings of 2009 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2009), pp. , April 23. pdf

o Academic Year 2008o

Thesis | Journal Papers | Book Chapters | International Conferences | Domestic Conferences | Patents

    Thesis

  1. Shun Nishide: Self-Organization of Invariants for Motion Generation based on Reliable Predictability, Ph.D Thesis, Feb. 2009.
  2. Hyun-Don Kim: Binaural Active Audition for Humanoid Robots, Ph.D Thesis, Sep. 2008.

  3. Takehiro Abe, MS Thesis, Feb. 2008.
  4. Satoshi Ikeda, MS Thesis, Feb. 2008.
  5. Hisashi Kanda, MS Thesis, Feb. 2008.
  6. Yuji Kubota, MS Thesis, Feb. 2008.
  7. Kaiping Wang, MS Thesis, Feb. 2008.

  8. Takuma Otsuka, BE Thesis, Feb. 2008.
  9. Wataru Hinoshita, BE Thesis, Feb. 2008.
  10. Kyoko Matsuyama, BE Thesis, Feb. 2008.
  11. Tadanori Yasuraoka, BE Thesis, Feb. 2008.
  12. Tatsuhiro Nakagawa, BE Thesis, Feb. 2008.

    Peer-reviewed Journal Papers

  13. Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: An Analysis-and-Synthesis Approach for Manipulating Pitch of a Musical Instrument Sound Considering Pitch-dependency of Timbral Characteristics, IPSJ Journal, Vol.50, No.3 (Mar., 2009) 1054-1066 IPSJ. pdf, D-Lib
  14. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-Domain Spoken Dialogue Systems, IPSJ Journal, Vol.50, No.2 (Feb., 2009) 488-500, IPSJ. pdf, D-Lib
  15. Masaharu Morise, Toru Takahashi, Hideki Kawahara, Toshio Irino: IEIC Trans. A, Vol.J92-A, No.3 (Mar. 2009).
  16. Shun Shiramatsu, Kazunori Komatani, Koiti Hasida, Tetsuya Ogata, Hiroshi G. Okuno: Game-Theoretic Model of Referential Coherence and Its Empirical Verification Using Large Japanese and English Corpora, ACM Transactions on Speech and Language Processing, Vol.5, No.3 (Oct. 2008) Article 6, ACM. pdf, doi:10.1145/1410358.1410360
  17. Hiromasa Fujihara, Masataka Goto, Hiroshi G. Okuno: An F0 Estimation Method of Vocal Part in Polyphonic Music by Using Statistical Modelling of Singing Voice and Viterbi Search, IPSJ Journal, Vol.49, No.10 (Oct. 2008) 3682-3693, IPSJ. pdf, D-Lib
  18. Kazunori Komatani, Satoshi Ikeda, Tetsuya Ogata, Hiroshi G. Okuno: Managing Out-of-Grammar Utterances by Topic Estimation with Domain Extensibility in Multi-Domain Spoken Dialogue Systems, Speech Communication, No.50 (2008) 836-870. doi:10.1016/j.specom.2008.05.010
  19. Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Audition using an Adaptive Filter Based on Independent Component Analysis, Journal of Robotic Society of Japan, Vol.26, No.6 (Sep. 2008) pp.529-536. 学会サーバ
  20. Yuichiro Fukubayashi, Kazunori Komatani, Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: WFST-based Language Understanding for Rapid Prototyping of Spoken Dialogue Systems, IPSJ Journal, Vol.49, No.8 (Aug. 2008) pp.2762-2772, Information Processing Society of Japan, pdf, Digital Library.
  21. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Predicting Object Dynamics from Visual Images through Active Sensing Experiences, Advanced Robotics, Vol.22, No.5 (May 2008) pp.527-546, doi:10.1163/156855308X294879, Online version, VSP and Robotics Society of Japan.
  22. Hiroshi G. Okuno, Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata: A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals, Journal of Acoustic Society of America, Vol.123, No.5 (May 2008) Pt.2, pp.3066-3067.
  23. Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nishimura, Hideki Banno, Toshio Irino: A temporally stable representation of power spectra of periodic signals and its application to F0 and periodicity estimation, Journal of Acoustic Society of America, Vol.123, No.5 (May 2008) Pt.2, pp.3074-3075.

    Book Chapters, Reviews

  24. Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SalienceGraph: Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA, T. B. Ho and Z-H. Zhou (Eds.): PRICAI-2008: Trends in Artificial Intelligence, 890-902, (84/234, 35.8%), Lecture Notes in Computer Science, Vol.5351, Springer-Verlag, Dec. 2008. doi:10.1007/978-3-540-89197-0_83
  25. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-Domain Spoken Dialogue Systems, Ngoc Thanh Nguyen,Leszek Borzemski,Adam Grzech,Moonis Ali (Eds.): New Frontiers in Applied Artificial Intelligence, pp.294-304, Lecture Notes in Artificial Intelligence, Vol.5027, June, 2008. doi:10.1007/978-3-540-69052-8_31
  26. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Vocal Imitation using Vocal Tract Model and Recurrent Neural Network, Masumi Ishikawa, Kenji Doya, Hiroyuki Miyamoto, Takeshi Yamakawa (Eds.): Neural Information Processing, 14th International Conference, ICONIP 2007, Revised Selected Papers, Part II, pp.222-232, Lecture Notes in Computer Science 4985, Springer-Verlag, June 2008. doi:10.1007/978-3-540-69162-4_24
  27. Tetsuya Ogata, Hideki Kojima, Hiroshi G. Okuno: Motion Emergence from Sound using Cross-Modal Mapping on Recurrent Neural Network, Aucouturier, J.-J. (ed.) Cheek to Chip: Dancing Robots and AI's Future, IEEE Intelligent Systems, Vol.23, No.2 (Apr. 2008), 74--84, doi:10.1109/MIS.2008.22

    Peer-reviewed Conference Papers

  28. Masato Onishi, Toru Takahashi, Toshio Irino, Hideki Kawahara: Vowel-based frequency alignment function design and recognition-based time alignment for automatic speech morphing, Proceedings of IEEE Workshop on Spoken Language Technology 2008 (SLT 2008), accepted, Goa, India, December, 15--18, 2008,
  29. Yuji Kubota, Masatoshi Yoshida, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking, Proceedings of IEEE International Symposium on Multimedia (ISM2008), pp.468-476 (acceptance rate for regular papers, 24%), Berkeley, Dec. 16. 2008. pdf doi:10.1109/ISM.2008.107
  30. Yuji Kubota, Shun Shiramatsu, Masatoshi Yoshida, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: 3D Auditory Scene Visualizer With Face Tracking: Design and Implementation For Auditory Awareness Compensation, Proceedings of 2nd International Symposium on Universal Communication (ISUC2008), pp.42-49, IEEE, Osaka, Dec. 15-16. 2008. pdf doi:10.1109/ISUC.2008.59
  31. Kazuhiro Nakadai, Hiroshi G. Okuno: Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: An Open Source Software System For Robot Audition HARK and Its Evaluation, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), pp.561-566, Daejeon, Korea, Dec. 3, 2008. pdf
  32. Kazumasa Murata, Kazuhiro Nakadai, Ryu Takeda, Hiroshi G. Okuno, Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino: A beat-tracking robot for human-robot interaction and its evaluation, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), pp.79-84, Daejeon, Korea, Dec. 2, 2008. pdf
  33. Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SalienceGraph: Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA, Proceedings of the Tenth Pacific Rim International Conference on Artificial Intelligence (PRICAI-08), 890-902, (84/234, 35.8%), Lecture Notes in Computer Science, Vol.5351, Springer-Verlag, Hanoi, Vienam, Dec. 15-19. 2008.
  34. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Analysis of Reliable Predictability based Motion Generation using RNNPB, Proceedings of Joint 4th International Conference on Soft Computing and Intelligent Systems and 9th International Symposium on advanced Intelligent Systems (SCIS & ISIS 2008) pp.305-310, Nagoya, JAPAN, September 17-21, 2008.
  35. Hideki Kawahara, Masanori Morise, Hideki Banno, Toru Takahashi, Ryuichi Nishimura, Toshio Irino: Spectral Envelope Recovery beyond the Nyquist Limit for High-Quality Manipulation of Speech Sounds, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.22-26, Brisbane, Sept. 24, 2008.
  36. Toru Takahashi, Shun'ichi Yamamoto, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:Soft Missing-Feature Mask Generation for Simultaneous Speech Recognition System in Robots, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.992-997, Brisbane, Sept. 24, 2008.
  37. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno:Predicting ASR Errors by Exploiting Barge-In Rate of Individual Users for Spoken Dialogue Systems, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.183--186, Brisbane, Sept. 2008.
  38. Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:Expanding Vocabulary for Recognizing User¥'s Abbreviations of Proper Nouns without Increasing ASR Error Rates in Spoken Dialogue Systems, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.187-190, Brisbane, Sept. 2008.
  39. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:Extensibility Verification of Robust Domain Selection against Out-of-Grammar Utterances in Multi-Domain Spoken Dialogue System, Proceedings of International Conference on Spoken Language Processing (Interspeech-2008), pp.487-490, Brisbane, Sept. 2008.
  40. Shun Nishide, Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Active Ssensing based Dynamical Object Feature Extraction, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), pp.1-7, TuAT1.1, IEEE, RSJ, Nice, 23 Sep. 2008. pdf doi:10.1109/IROS.2008.4650794
  41. Takeshi Mizumoto, Ryu Takeda, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:A Robot Listens to Music and Counts Its Beats Aloud by Separating Music from Counting Voice, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), 1538-1543, WeAT6.1 IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4650821
    Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist.
  42. Hyun-Don Kim, Jinsung Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Target Speech Detection and Separation for Humanoid Robot in Sparse Dialogue with Noisy Home Environments (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), 1705-1711, WeAT10.4 IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4650977
  43. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Segmenting Acoustic Signal with Articulatory Movement using Recurrent Neural Network for Phoneme Aquisition (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), 1712-1717, WeAT10.5 IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4651060
  44. Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:Barge-in-able Robot Audition Based on ICA and Missing Feature Theory under Semi-Blind Situation (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), 1718-1723, WeAT10.6, IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4650821
  45. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Evaluation of Two-Channel Sound Source Localization over Entire Azimuth Range for Moving Talker (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), pp.2197-2203 Sept. 2008 pdf IEEE, RSJ, Nice, Sept. 2008. doi:10.1109/IROS.2008.4650947
  46. Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotaka Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: A Robot Uses Its Own Microphone to Synchronize Its Steps to Musical Beats While Scatting and Singing, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008), pp.2459-, WeCT6.1, IEEE, RSJ, Nice, 24 Sep. 2008. pdf doi:10.1109/IROS.2008.4650596
    Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist.
  47. Kohei Sumi, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch Estimation, Proceedings of 9th International Conference on Musical Information Retreival (ISMIR-2008), 39-44, Philadelphia, 15 Sep. 2008. pdf
  48. Kohei Sumi, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch Estimation, Proceedings of 9th International Conference on Musical Information Retreival (ISMIR-2008), 39-44, Philadelphia, 15 Sep. 2008. pdf
  49. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source Separation based on Integrated Harmonic and Inharmonic Models, Proceedings of 9th International Conference on Musical Information Retreival (ISMIR-2008),133-138, Philadelphia, 15 Sep. 2008. pdf
  50. Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotake Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: A Robot Singer with Music Recognition Based on Real-Time Beat Tracking, Proceedings of 9th International Conference on Musical Information Retreival (ISMIR-2008), 199-204, Philadelphia, 15 Sep. 2008. pdf
  51. Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SSynthesis Approach for Manipulating Pitch of a Musical Instrument Sound with Considering Timbral Characteristics, Proceeding of the 11th International Conference on Digital Audio Effects (DAFx-08), 249-256, Espoo, Finland, Sep.1-4. 2008. pdf
  52. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-Domain Spoken Dialogue Systems Proceeding of the 21st International Conference on Industrial, Engineering and Other Applications of Applied Intelligence Systems (IEA/AIE-2008), pp.294-304, (acceptance rate is about 30%), LNAI 5027, Wroclaw, Poland, Jun. 18, 2008. doi:10.1007/978-3-540-69052-8_31
  53. Hiroshi G. Okuno, Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata: A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals, Proceedings of Acoustics'08, CD-ROM , 1pSCa8, June 30, 2008.
  54. Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nishimura, Hideki Banno, Toshio Irino: A temporally stable representation of power spectra of periodic signals and its application to F0 and periodicity estimation, Proceedings of Acoustics'08, CD-ROM , 1pSCc24, June 30, 2008.
  55. Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nishimura, Hideki Banno, Toshio Irino: A unified approach for F0 extraction and aperiodicity estimation based on a temporally stable power spectral representation, Proceedings of ISCA Tutorial and Research Workshop (ITRW) on "Speech Analysis and Processing for Knowledge Discovery", June 4, 2008, Aalborg, DK.
  56. Shun Nishide, Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Object Dynamics Prediction and Motion Generation based on Reliable Predictability, Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2008), 1608-1614, (May 20, 2008). pdf doi:10.1109/ROBOT.2008.4543431
  57. Kazuhiro Nakadai, Shun'ichi Yamamoto, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: A Robot Referee for Rock-Paper-Scissors Sound Games, Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2008), 3469--3474, (May 20, 2008). pdf doi:10.1109/ROBOT.2008.4543741
  58. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Two-Channel-Based Voice Activity Detection for Humanoid Robots in Noisy Home Environments, Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2008), 3495-3501, (May 20, 2008). pdf doi:10.1109/ROBOT.2008.4543745
  59. Hiroshi G. Okuno, Kazuhiro Nakadai: COMPUTATIONAL AUDITORY SCENE ANALYSIS AND ITS APPLICATION TO ROBOT AUDITION, (invited talk), Proceedings of Hands-free Speech Communication and Microphone Arrays (HSCMA-2008), pp.123-127, May 7, 2008, Trento, Italy. pdf doi:10.1109/HSCMA.2008.4538702
  60. Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nishimura, Toshio Irino, Hideki Banno: TANDEM-STRAIGHT: A Temporally Stable Power Spectral Representation for Periodic Signals and Applications to Interference-free Spectrum, F0, and Aperiodicity Estimation, Proceedings of 2008 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2008), pp.3933-3936, Las Vegas, Nevada, USA, March 30 - April 4, 2008.

    Patents

  61. Sound Source Separation System, Sound Source Separation Method, and Computer Program for Sound Source Separation, PCT/JP2008/057310, WO 2008/133097 Date of Open: 06.11.2008, Inventors: Katsutoshi Itoyama, Hiroshi Okuno, Masataka Goto. Assignee: Kyoto University, AIST.
  62. Moving object equipped with ultra-directional speaker, Patent No. US 7,424,118, Date of Patent: Sep. 9, 2008. Inventors: Kiyofumi Mori, Shunji Yoshida, Hiroshi Okuno, Kazuhiro Nakadai, Hiroshi Tsujino, PCT No.: PCT/JP2005/002043.
  63. Speech Recognition Apparatus, Application No. 20080167869. Filed: July 10, 2008. Inventors: Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto. PCT No.: PCT/JP05/22601.

o Academic Year 2007o

    Thesis

  1. Shun Shiramatsu: Salience-based Modeling of Discourse Context, Ph.D Thesis, Feb. 2008. pdf
  2. Shun'ichi Yamamoto: Real-Time Robot Audition Software Based on Missing Feature Theory for Multiple Simultaneous Talkers in Real Environments, Ph.D Thesis, Feb. 2008.
  3. Kazuyoshi Yoshii: Studies on Hybrid Music Recommendation Using Timbral and Rhythmic Features, Ph.D Thesis, Feb. 2008.

  4. Katsutoshi Itoyama: MS Thesis, Feb. 2008.
  5. Ryu Takeda MS Thesis, Feb. 2008.
  6. Yuichiro Fukubayashi: MS Thesis, Feb. 2008.
  7. Koichi Tokuda: MS Thesis, Feb. 2008.
  8. Ryunosuke Yokoya: MS Thesis, Feb. 2008.

  9. Kohei Sumi: BE Thesis, Feb. 2007.
  10. Masaki Katsumaru: BE Thesis, Feb. 2008.
  11. Hiroki Saito: BE Thesis, Feb. 2007.
  12. Zhang: BE Thesis, Feb. 2007.
  13. Takeshi Mizumoto: BE Thesis, Feb. 2007.

    Peer-reviewed Journal Papers

  14. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Simultaneous Realization of Score-informed Sound Source Separation of Polyphonic Musical Siganals and Constrained Parameter Estimation for Integrated Model of Harmonic and Inharmonic Structure, IPSJ Journal, Vol.49, No.3 (Mar., 2008) pp.1465-1479, Information Processing Society of Japan, Digital Library, pdf
  15. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: , Transactions of Human Interface Society, Vol.10, No.1 (Feb. 2008) pp.59-72.
  16. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Hybrid Collaborative and Content-based Music Recommendation Using Incrementally-trainable Probabilistic Generative Model, IEEE Transactions on Audio, Speech and Language Processing, Vol.16, No.2 (Feb. 2008) pp.435-447, pdf, doi:10.1109/TASL.2007.911503
  17. Shun Shiramatsu, Kazunori Komatani, Koiti Hasida, Tetsuya Ogata, Hiroshi G. Okuno: A Game-Theoretic Model of Referential Coherence and Its Statistical Verification Based on Large Japanese and English Corpora, Natural Language Processing, Vol.14, No.4 (Oct. 2007) pp.199-239.
  18. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Experience Based Imitation Using RNNPB, Advanced Robotics, Vol.21, No.12 (2007) pp.1351-1367, doi:10.1163/156855307781746106, Online version, VSP and Robotics Society of Japan.
  19. Chyon Hae Kim, Jun-ichi Idesawa, Tetsuya Ogata, Shigeki Sugano: Restraining of Noises in Self-Organizing Network Elements, Journal of Robotics Society of Japan, Vol.25, No.6 (Sep. 2007) pp.115-122. Digital Library
  20. Kazuhiro Nakadai, Hirofumi Nakashima, Masamitsu Murase, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: Tracking of Mulitiple Sound Sources by Integration of Robot-Embedded and In-Room Microphone Arrays, Journal of Robotics Society of Japan, Vol.25, No.6 (Sep. 2007) pp.181-191. Digital Library, pdf
  21. Jean-Marc Valin, Shun'ichi Yamamoto, Jean Rouat, Francois Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno: Robust Recognition of Simultaneous Speech By a Mobile Robot, IEEE Transactions on Robotics, Vol.23, No.4 (Aug. 2007) pp.742--752, pdf, doi:10.1109/TRO.2007.900612
  22. Hiroaki ARIE, Tetsuya Ogata, Jun TANI, and Shigeki SUGANO: Reinforcement learning of continuous motor sequence with hidden state, Advanced Robotics, Special Issue on Robotic Platforms for Research in Neuroscience, VSP and Robotics Society of Japan, Vol.21, No.10 (July 2007), pp.1215-1229.
  23. Taro Watanabe, Kenji Imamura, Eiichiro Sumita, Hiroshi G. Okuno: Statistical machine translation using hierarchical phrase alignment, Systems and Computers in Japan, Vol.38, No.6 (June 2007) pp.70-79, doi:10.1002/scj.20271
  24. Naoyuki Kanda, Kazunori Komatani, Mikio Nakano, Kazuhiro Nakadai, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: Robust Domain Selection Using Dialogue History in Multi-domain Spoken Dialogue Systems, IPSJ Journal, Vol.48, No.5 (May 2007) pp.1980-1989, IPSJ.

    Book Chapters, Articles

  25. Hiroshi G. Okuno, Tetsuya Ogata, Kazunori Komatani: Robot Audition from the viewpoint of Computational Auditory Scene Analysis, Informatics Education and Research for Knowledge-Creation Society Infrastructure (ICKS'08), pp.35-40, Jan. 2008. doi:10.1109/ICKS.2008.10
  26. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Structual Feature Extraction based on Active Sensing Experiences, Informatics Education and Research for Knowledge-Creation Society Infrastructure (ICKS'08), pp.209-212, Jan. 2008. doi:10.1109/ICKS.2008.9
  27. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Evaluation of Two-Channel-Based Sound Source Localization using 3D Moving Sound Creation Tool, Informatics Education and Research for Knowledge-Creation Society Infrastructure (ICKS'08), pp.210-216. doi:10.1109/ICKS.2008.25
  28. Koiti Hasida, Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Meaning Games, LENLS 2007 Postproceedings, accepted, LNCS Oct. 2007.
  29. Hiroshi G. Okuno, Moonis Ali (Eds.): New Trends in Applied Artificial Intelligence (IEA/AIE-2007), Lecture Notes in Computer Science, Vol.4570, Springer-Verlag, 14 Jun. 2007, XXI, 1194p. ISBN: 978-3-540-73322-5. doi:10.1007/978-3-540-73325-6
  30. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm and Particle Filter, New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570, pp.280-290, Springer-Verlag. Kyoto, Jun. 2007. doi:10.1007/978-3-540-73325-6_28
  31. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and MTF-based ASR, New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570, pp.384-394, Springer-Verlag. Kyoto, Jun. 2007. doi:10.1007/978-3-540-73325-6_38
  32. Hiroshi G. Okuno, Tetsuro Kitahara, Kazuyoshi Yoshii: Music Feature Extraction and Music Information Retrieval, IEE Journal, Vol.127, No.7 (Jul. 2007).
  33. Hiroshi G. Okuno, Hiroshi Mizoguchi: Information Integration for Robot Audition: the State-of-the-art and issues. SICE, Vol.46, No.6 (Jun. 2007) pp.415-419.
  34. Shun'ichi Yamamoto, Ryu Takeda, Hiroshi G. Okuno: Missing Feature Theory Based Automatic Speech Recognition and Its Application to Simultaneous Multiple Speaker Speech Recognition, SICE, Vol.46, No.6 (Jun. 2007) pp.447-452.
  35. Shinichi Ueno, Fumihiro Adachi, Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Bus Information System Based on User Models and Dynamic Generation of VoiceXML Scripts, New Frontiers in Artificial Intelligence (JSAI 2003/2004), LNAI 3609, pp.45-60, 2007. Springer-Verlag.

    Peer-reviewed Conference Papers

  36. Yuichiro Fukubayashi, Kazunori Komatani, Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: Rapid Prototyping of Robust Language Understanding Modules with Less Training Data for Spoken Dialogue Systems, Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP 2008), accepted, Jan. 2008, Hyderabad, India.
  37. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Implementation of A Robot Audition System for Automatic Speech Recognition of Simultaneous Speech, Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-2007), 111-116, acceptance rate (115/267), IEEE, Kyoto, Dec. 2007. pdf doi:10.1109/ASRU.2007.4430093.
  38. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Vocal Imitation using Vocal Tract Model and Recurrent Neural Network, Proceedings of International Conference on Neural Information Processing (ICONIP-2007), Vol.2, pp.222-232, Nov. 2007.
  39. Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Vocal Imitation Using Physical Vocal Tract Model, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1846-1851, IEEE, RSJ, San Diego, Oct. 2007. pdf doi:10.1109/IROS.2007.4399137.
  40. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Discovery of Other Individuals by Projecting a Self-Model Through Imitation, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1009-1014, IEEE, RSJ, San Diego, Oct. 2007. pdf doi:10.1109/IROS.2007.4399153.
  41. Kazuyoshi Yoshii, Kazuhiro Nakadai, Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: A Biped Robot that Keeps Steps in Time with Musical Beats while Listening to Music with Its Own Ears, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1743-1750, IEEE, RSJ, San Diego, Oct. 2007. pdf doi:10.1109/IROS.2007.4399244.
  42. Tetsuya Ogata, Masamitsu Murase, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno, Two-way Translation of Compound Sentences and Arm Motions by Recurrent Neural Networks, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1858-1863, IEEE, RSJ, San Diego, Oct. 2007. pdf doi:10.1109/IROS.2007.4399265.
  43. Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Exploiting Known Sound Sources to Improve ICA-based Robot Audition in Speech Separation and Recognition, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.1757-1762, IEEE, RSJ, San Diego, Oct. 2007. pdf
  44. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Auditory and Visual Integration based Localization and Tracking of Humans in Daily-life Environments, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), pp.2021-2027, IEEE, RSJ, San Diego, Oct. 2007. pdf
  45. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences, Proceedings of 8th International Conference on Musical Information Retreival (ISMIR-2007), long paper (15.8% of 214 submissions), pp.89-94, Vienna, Sep. 2007.
  46. Kazunori Komatani, Yuichiro Fukubayashi, Tetsuya Ogata, Hiroshi G. Okuno: Introducing Utterance Verification in Spoken Dialogue System to Improve Dynamic Help Generation for Novice Users, Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, pp.202-205, Sep. 2007
  47. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Topic Estimation with Domain Extensibility for Guiding User's Out-of-Grammar Utterance in Multi-Domain Spoken DIalogue Systems, Proceedings of International Conference on Spoken Language Processing (Interspeech-2007), pp.2561-2564,, Antwerp, Sep. 2007. pdf
  48. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Analyzing Temporal Transition of Real User's Behaviors in a Spoken Dialogue System, Proceedings of International Conference on Spoken Language Processing (Interspeech-2007), pp.142-145, Antwerp, Sep. 2007. pdf
  49. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Auditory and VIsual Integration based Localization and Tracking of Multiple Moving Sounds in Daily-life Environments, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man 2007), 399-404, IEEE, Jeju Island, Korea, Aug. 2007. pdf
  50. Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm and Particle Filter, New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570, pp.280-290, Springer-Verlag. Kyoto, Jun. 2007. doi:10.1007/978-3-540-73325-6_28
  51. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and MTF-based ASR, New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570, pp.384-394, Springer-Verlag. Kyoto, Jun. 2007. doi:10.1007/978-3-540-73325-6_38
  52. Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: INTEGRATION AND ADAPTATION OF HARMONIC AND INHARMONIC MODELS FOR SEPARATING POLYPHONIC MUSICAL SIGNALS, Proceedings of 2007 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2007), pp.57-60, Hawaii, April 2007, pp.57-60, (15.1% acceptance rate for lecture presentation) doi:10.1109/ICASSP.2007.366615
  53. Haruhiko Niwa, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Distance Estimation of Hidden Objects Based on Acoustical Holography by applying Acoustic Diffraction of Audible Sound, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2007), pp.423-428, (Apr. 2007). doi:10.1109/ROBOT.2007.363823
  54. Tetsuya Ogata, Shohei Matsumoto, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Human-Robot Cooperation using Quasi-symbols Generated by RNNPB Model, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2007), pp.2156-2161, (Apr. 2007). doi:10.1109/ROBOT.2007.363640
  55. Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Predicting Object Dynamics from Visual Images through Active Sensing Experiences, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2007), pp.2501-2506, (Apr. 2007). doi:10.1109/ROBOT.2007.363841
  56. Chyon Hae Kim, Tetsuya Ogata, Shigeki Sugano: Enhancement of Self Organizing Network Elements for Supervised Learning, Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2007), WeA3.5, (Apr. 2007).

    Patents

  57. Robot acoustic device and robot acoustic system Patent No. US 7,215,786. Date of Patent: May 8, 2007. Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano, Assignee: Japan Science and Technology Agency.

o Academic Year 2006o

    Peer-reviewed Journal Papers

  1. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Drumix: An Audio Player with Functions of Realtime Drum-Part Rearrangement for Active Music Listening, Journal of Information Proceeding Society of Japan, pp.1229-1239, Vol.48, No.3 (Mar. 2007), IPSJ. Vol.3 (2007), pp.134-144. DL
  2. Hyun-Don Kim, Jong-Suk Choi, and Munsang Kim: Human-robot interaction in real environments by audio-visual integration, International Journal of Control Automation and Systems, Vol.5, No.1 (Feb. 2007) pp.61-69.
  3. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music, Journal of Information Proceeding Society of Japan, Vol.48, No.1 (Jan. 2007), pp.214-226, IPSJ. IPSJ Digital Courier, Vol.3 (2007) pp.1-13.
  4. Shunsuke Kurotaki, Noriaki Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno, Hideharu Aamano: Sound Source Separation Filter for Robot Audition used by Dynamic Reconfigurable Device, DRP (in Japanese), IEICE Transaction on Information and Systems, Vol.J90-D, No.3, pp.897-907, Mar. 2007, IEICE. DL
  5. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata Hiroshi G,. Okuno: Simultaneous Speech Recognition based on Automatic Missing-Feature Mask Generation integrated with Sound Source Separation (in Japanese), Journal of Robotics Society of Japan, Vol.25, No.1 (Jan. 2007) pp.92-102. Digital Library, pdf
  6. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectral Templates with Harmonic Harmonic Structure Suppression, IEEE Transactions on Audio, Speech and Language Processing, Vol.15, No.1 (Jan. 2007) pp.333-345, pdf, doi:10.1109/TASL.2006.876754
  7. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps, EURASIP Journal on Applied Signal Processing, Special issue on Music Information Retrieval Based on Signal Processing, Vol.2007, Article ID 51979, 15 pages, 2007, doi:10.1155/2007/51979
  8. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Identification in Polyphonic Music: Feature Weighting Based on Mixed-Sound Template and Use of Musical Context (in Japanese), IEICE Transaction on Information and Systems, Vol.J89-D, No.12 (Dec. 2006), pp.2721-2733, IEICE.
  9. Naoyuki Kanda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Spoken Language Understinding Using Dialogue Context in Database Search (in Japanese), IPSJ Journal, Vol.47, No.6 (June 2006) pp.1802-1811, IPSJ. Paper in pdf
  10. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: A Singer Identification Method for Musical Pieces on the Basis of Accompaniment Sound Reduction and Reliable Frame Selection (in Japanese), IPSJ Journal, Vol.47, No.6 (June 2006) pp.1831-1843, IPSJ.
  11. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. OKuno: Missing Feature Theory Based Interface Between Sound Source Separation and Automatic Speech Recognition and Applying to Multiple Robots, (in Japanese), Journal of Human Interface Society, Vol.8, No.2 (Jun. 2006) pp.203-212.
  12. Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: A Privacy-Enhanced Access Control, Systems and Computers in Japan, (2006) A Privacy-Enhanced Access Control, Systems and Computers in Japan, Vol.37, No.5 (May 2006) pp.77-86. doi:10.1002/scj.10214
  13. Tenkai Kim, 尾形 哲也, Shigeki Sugano; ローカルルールに基づいた論理回路の自己組織アルゴリズム (in Japanese), Transaction on SICE, Vol.42, No.4 (Apr. 2006) pp.334-341.
  14. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improving Location-Based Speech Recognition of Simultaneous Speech Signals by Parameter Optimization with Genetic Algorithm (in Japanese), Human Interface, Vol.8, No.2 (Jun. 2006) pp.203-212.
  15. Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: A Privacy-Enhanced Access Control, Systems and Computers in Japan, (2006) A Privacy-Enhanced Access Control, Systems and Computers in Japan, Vol.37, No.5 (May 2006) pp.77-86. doi:10.1002/scj.10214

    Book Chapters, Survey Papers, and Articles

  16. Hiroaki Arie, Jun Namikawa, Tetsuya Ogata, Jun Tani, Shigeki SUGANO: Reinforcement Learning Algorithm with CTRNN in Continuous Action Space, Neural Information Processing (ICONIP-2006), Part I, LNCS 4232, pp.387-396. Oct. 2006. doi:10.1007/11893028_44
  17. Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition, PRICAI 2006: Trends in Artificial Intelligence, LNCS 4099, pp.484-494, accepted as regular paper for ORAL Presentation (14.1%), Springer-Verlag, Guilin, China, Aug. 2006. doi:10.1007/11801603_52
  18. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Genetic Algorithm based Improvement of Robot's Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals, Advances in Applied Artificial Intelligence (IEA/AIE-2006), LNAI 4031, pp.207-217, Springer-Verlag. Annecy, France, Jun. 2006. doi:10.1007/11779568_24

    Peer-reviewed Conference Papers

  19. Hiroshi G. Okuno, Tetsuya Ogata, Kazunori Komatani: Computational Auditory Scene Analysis and Its Application to Robot Audition: Five Years Experience, Proceedings of the 2nd International Conference on Informatics Research for Development of Knowledge Society Infrastructure (ICKS 2007), pp.69-76, Jan. 2007. doi:10.1109/ICKS.2007.7
  20. Shun Shiramatsu, Kazunori Komatani, Koiti Hasida, Tetsuya Ogata, Hiroshi G. Okuno: Meaning-Game-based Centering Model with Statistical Definition of Utility of Referential Expression and Its Verification Using Japanese and English Corpora, Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC2007), pp.121-126, Lisbon, Mar. 2007.
  21. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Musical Instrument Recognizer ``Instrogram'' and Its Application to Music Retrieval based on Instrumentation Similarity, Proceedings of IEEE International Symposium on Multimedia (ISM2006), pp.265-272, San Diego, Dec. 2006. doi:10.1109/ISM.2006.113
  22. Hiromasa Fujihara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic synchronization between lyrics and music CD recordings based on Viterbi alignment of segregated vocal signals, Proceedings of IEEE International Symposium on Multimedia (ISM2006), pp.257-264, San Diego, Dec. 2006. doi:10.1109/ISM.2006.38
  23. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences, Proceedings of 7th International Conference on Musical Information Retreival (ISMIR-2006), pp.296-301, Vancouver, CA, Sep. 2006. pdf
  24. Katsutoshi Itoyama, Tetsuro Kitahara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Feature Weighting in Automatic Transcription of Specified Part in Polyphonic Music, Proceedings of 7th International Conference on Musical Information Retreival (ISMIR-2006), pp.172-175, Vancouver, CA, Sep. 2006. pdf
  25. Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, Satoshi Kaijiri. Kentaro Yamada, Yuji Hasegawa, Hiroshi G. Okuno, Hiroshi Tsujino: Real-Time Tracking of Multiple Sound Sources by Integration of In-Room and Robot-Embedded Microphone Arrays, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 852-859, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.281737.
  26. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Missing-Feature based Speech Recognition for Two Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 878-885, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.281741, IEEE Robotics and Automation Society Japan Chapter Young Award RSJ/SICE Award for IROS 2006 Best Paper Nomination Finalist (2nd to 45th Place) at IROS-2007.
  27. Haruhiko Niwa, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Multiple Acoustical Holography Method for Localization of Objects in Broad Range using Audible Sound, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 1146-1151, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.281844
  28. Chyon Hae KIM, Tetsuya Ogata, Shigeki SUGANO: wEfficient Organization of Network Topology based on Reinforcement Signals, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 3154-3159, IEEE, RSJ, Beijing, China, Sep. 2006. pdf
  29. Yuki Suga, Chihiro Endo, Daizo Kobayashi, Takeshi Matsumoto, Tetsuya Ogata, Shigeki Sugano: User-Adaptive Human-Robot Interaction System using Interactive EC, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 3663-3668, IEEE, RSJ, Beijing, China, Sep. 2006. pdf
  30. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Experience Based Imitation Using RNNPB, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 3669-3674, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.281724.
  31. Jong-Suk Choi, Hyun-Don Kim, and Munsang Kim: Probabilistic Speaker Localization in Noisy Environment by Audio-Visual Integration, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 4704-4709, IEEE, RSJ, Beijing, China, Sep. 2006. pdf
  32. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Robot Audition System That Recognizes Simultaneous Speech in the Real World, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2006), 5333-5338, IEEE, RSJ, Beijing, China, Sep. 2006. pdf, doi:10.1109/IROS.2006.282037.
  33. Tetsuya Ogata, Yuya Hattori, Hideki Kojima, Kazunori Komatani, Hiroshi G. Okuno: Generation of Robot Motions from Environmental Sounds using Inter-modality Mapping by RNNPB, Proceedings of Sixth International Workshop on Epigenetic Robotics (EpiRobo-2006), 95-102, Paris, Sep., 2006.
  34. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Speaker Identification under Noisy Environments by using Harmonic Structure Extraction and Reliable Frame Weighting, Proceedings of International Conference on Spoken Language Processing (Interspeech-2006), 1459-1462, Pittsburgh, Sep. 2006. pdf
  35. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improving Speech Recognition of Two Simultaneous Speech Signals by Integrating ICA BSS and Automatic Missing Feature Mask Generation, Proceedings of International Conference on Spoken Language Processing (Interspeech-2006), 2302-2305, Pittsburgh, Sep. 2006. pdf
  36. Yuichiro Fukubayashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic Help Generation by Estimating User's Mental Model in Spoken Dialogue Systems, Proceedings of International Conference on Spoken Language Processing (Interspeech-2006), 1946-1949, Pittsburgh, Sep. 2006. pdf
  37. Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Leak Energy based Missing Feature Mask Generation for ICA and GSS and Its Evaluation with Simultaneous Speech Recognition, Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition (SAPA2006), pp.42-46, pdf
  38. Kazunori Komatani, Naoyuki Kanda, Mikio Nakano, Kazuhiro Nakadai, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: Multi-Domain Spoken Dialogue System with Extensibility and Robustness against Speech Recognition Errors, Proceedings of SIGdial Workshop on Discourse and Dialogue, 9-17, Aug. 2006
  39. Hiroshi G. Okuno: Computational Auditory Scene Analysis - Towards Listening to Several Thinkgs at Once -, 50th Anniversary Summit of Artificial Intelligence (ASAI50) workshop and abstract booklet, accepted for inclusion, Monte Verita, Switzerland, July 2006.
  40. Takuya Yoshioka, Takafumi Hikichi, Masato Miyoshi, Hiroshi G. Okuno: Robust Decomposition of Inverse Filter of Channel and Prediction Error Filter of Speech Signal for Dereverberation, Proceedings of the 14th European Signal Processing Conference (EUSIPCO 2006), CD-ROM Proceedings, Florence, 2006. pdf
  41. Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Robot Imitation from Active-Sensing Experiences, Proceedings of Fifth International Conference on Learning and Development (ICDL06), accepted, Bloomington, IN USA, May 2006.
  42. Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: AN ERROR CORRECTION FRAMEWORK BASED ON DRUM PATTERN PERIODICITY FOR IMPROVING DRUM SOUND DETECTION, Proceedings of 2006 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2006), Vol.V, pp.237-240, Toulouse, May 2006. pdf, doi:10.1109/ICASSP.2006.11661256 IEEE Kansai Chapter Young Researcher Award
  43. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: F0 ESTIMATION METHOD FOR SINGING VOICE IN POLYPHONIC AUDIO SIGNAL BASED ON STATISTICAL VOCAL MODEL AND VITERBI SEARCH, Proceedings of 2006 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2006), Vol.V, pp.253-256, Toulouse, May 2006. pdf, doi:10.1109/ICASSP.2006.1661260
  44. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrogram: A New Musical Instrument Recognition Technique Without Using Onset Detection Nor F0 Estimation, Proceedings of 2006 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2006), Vol.V, pp.229-232, Toulouse, May 2006. pdf, doi:10.1109/ICASSP.2006.1661254 IEEE Kansai Chapter Young Researcher Award
  45. Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, Satoshi Kaijiri. Kentaro Yamada, Yuji Hasegawa, Hiroshi G. Okuno, Hiroshi Tsujino: ROBUST TRACKING OF MULTIPLE SOUND SOURCES BY SPATIAL INTEGRATION OF ROOM AND ROBOT MICROPHONE ARRAYS, Proceedings of 2006 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2006), Vol.IV, pp.929-932, Toulouse, May 2006. pdf, doi:10.1109/ICASSP.2006.1661122
  46. Hyun-Don Kim, Jong-Suk Choi, and Munsang Kim: Speaker Localization among Multi-faces in Noisy Environment by Audio-Visual Integration, Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2006), 1305-1310, (May 2006). doi:10.1109/ICKS.2004.1313411

    Patents

  47. Speech Recongition Device, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto, European Patent: EP1691344, Publication Date: 08/16/2006, Application number: EP20040818533, Filing Date: 11/12/2004
  48. Method and Apparatus for Determining Sound Source, Patent No. US 7,035,418. Filing date: June 7, 2000. Issue date: Apr. 25, 2006. Inventors: Hiroshi Okuno, Hiroaki Kitano, Yukiko Nakagawa, Assignee: Japan Science and Technology Agency.
  49. Robot audiovisual system Patent No. US 7,016,505. Filing date: Nov 1, 2000. Issue date: Mar 21, 2006. Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano, Assignee: Japan Science and Technology Agency.

o Academic Year 2005o

    Peer-Reviewed Journal Papers

  1. Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Using Multiple Edit Distances to Automatically Grade Outputs from Machine Translation Systems, IEEE Transactions on Audio, Speech and Language Processing, Vol.14, No.2, (Mar. 2006) 393--402. doi:10.1109/TSA.2005.860770
  2. Mototaka Suzuki, Kuniaki Noda, Yuki Suga, Tetsuya Ogata, and Shigeki Sugano: Dynamic Perception after Visually-Guided Grasping by a Human-Like Autonomous Robot, Advanced Robotics, Vol.20, No.2 (Feb. 2006) 233-254. VSP and Robotics Society of Japan. doi:10.1163/156855306775525785
  3. Takuya Yoshioka, Takafumi Hikichi, Masato Miyoshi, Hiroshi G. Okuno: Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals, IEICE Trans. on Fundamentals of Electronics, Communications, and Computer Sciences, Vol.E89-A, No.1 (Jan. 2006) 240-247, IEICE.
  4. Tetsuya Ogata, Hayato Ohba, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Extracting Multi-Modal Dynamics of Objects using RNNPB, Journal of Robotics and Mechatronics, Vol.17, No.6 (Dec. 2005) 681-688, Special Issue on Human Modeling in Robotics.
  5. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Pitch-dependent identification of musical instrument sounds, Applied Intelligence, Vol.23, No.3, pp.267-275, Springer-Verlag (formerly Kluwer Publishers). doi:10.1007/s10489-005-4612-1
  6. Kenri Kodaka, Tetsuya Ogata, Hiroshi G. Okuno: Walking in Virtual Space with Entrainment Based on a Nonlinear Oscillator, Journal of Human Interface Society, Vol.7, No.4, 26-36, 2005.
  7. Shun Shiramatsu, Takashi Miyata, Hiroshi G. Okuno, Koiti Hasida: Dissolution of Centering Theory Based on Game Theory and Its Empirical Verification (in Japanese), Natural Language Processing, Vol.12, No.3 (July 2005) 91-110.
  8. Shunichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno: Missing Feature Theory Based Interface Between Sound Source Separation and Automatic Speech Recognition and Applying to Multiple Robots (in Japanese), Journal of Robotics Society of Japan, Vol.23, No.6 (Aug. 2005) 743-751. Digital Library, pdf
  9. Tetsuya Ogata, Shigeki Sugano, and Jun Tani: Open-end Human-Robot Interaction from the Dynamical Systems Perspective - Mutual Adaptation and Incremental Learning, Advanced Roboics, Vol.19. No.6, pp.651-670, VSP and Robotics Society of Japan. doi:10.1163/1568553054255655
  10. Katsuhisa Ishida, Tetsuro Kitahara, Masayuki Takeda: Improvisation Supporting System Using N-gram-based Melody Appropriateness Determination, IPSJ Journal, Vol.46, No.7 (July 2005) pp.1548-1559, IPSJ. Paper in html

    Book Chapters

  11. Masahiro Nisiyama, Hiroaki Kawashima, Takatsugu Hirayama, Takashi Matsuyama: Facial Expression Representation based on Timing Structures in Faces, Proceedings of IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2005), LNCS 3723, pp.139-153, Beijing, Oct. 2005.
  12. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Distance Based Dynamic Interaction of Humanoid Robot with Multiple People. Innovations in Applied Artificial Intelligence (IEA/AIE-2005) LNAI 3533, 111-120, Best paper award, Springer-Verlag. Bari, Italy, Jun. 2005. Paper pdf doi:10.1007/11504894_18
  13. Katsutoshi Uchiyama, Toshiaki Ohji, Mari Oka, Hiroshi G. Okuno, Hiroyuki Suzuki, Kenji Fukaya, Modjtaba Sadria, Hubert Durt Kokoro and Topos -- Bastions of Kokoro, Kyoto Inernational Culture Forum 2005, pp.62-73, Mar. 2006.

    Peer-Reviewed International Conference Papers

  14. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: INTER:D A Drum Sound Equalizer for Controlling Volume and Timbre of Druams, Proceedings of 2nd European Workshop on the Integration of Knowledge, Semantic and Digital Media Technologies (EWIMT 2005), accepted for oral presentation, EU Commission, IEE Savoy Place, London, Nov. 2005.
  15. Shun Shiramatsu, Kazunori Komatani, Takashi Miyata, Koiti Hasida, Hiroshi G. Okuno: Empirical Verification of Meaning-Game-based Generalization of Centering Theory with Large Japanese Corpus, Proceedings of the 19th Pacific Asia Conference on Language, Information, and Computation (PACLIC 19), 192-210, Taipei, Dec. 2005.
  16. Masahiro Nisiyama, Hiroaki Kawashima, Takatsugu Hirayama, Takashi Matsuyama: Facial Expression Representation based on Timing Structures in Faces, Proceedings of IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2005), LNCS 3723, pp.139-153, accepted, Beijing, Oct. 2005.
  17. Kenri Kodaka, Tetsuya Ogata, Hiroshi G. Okuno: Walking with Body-sense in Virtual Space Using the Nonlinear Oscillator, Proceedings of the International Conference on Systems, Man and Cybernetics (SMC-2005), IEEE, Hawaii, Oct. 10-12, 2005. Finalist for Best Student Paper doi:10.1109/ICSMC.2005.1571166
  18. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SINGER IDENTIFICATION BASED ON ACCOMPANIMENT SOUND REDUCTION AND RELIABLE FRAME SELECTION, Proceedings of 6th International Conference on Musical Information Retreival (ISMIR-2005), 329-336, London, Sep. 2005.
  19. Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: INSTRUMENT IDENTIFICATION IN POLYPHONIC MUSIC: FEATURE WEIGHTING WITH MIXED SOUNDS, PITCH-DEPENDENT TIMBRE MODELING, AND USE OF MUSICAL CONTEXT, Proceedings of 6th International Conference on Musical Information Retreival (ISMIR-2005), 558-563, London, Sep. 2005.
  20. Kazunori Komatani, Naoyuki Kanda, Tetsuya Ogata, Hiroshi G. Okuno: Contextual Constraints based on Dialogue Models in Database Search Task for Spoken Dialogue Systems, Proceedings of the Nineth European Conference on Speech Communication and Technology (Interspeech-2005), 877-880, Lisboa, Sep. 2005. Paper in PDF.
  21. Masamitsu Murase, Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Kentaro Yamada, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Multiple Moving Speaker Tracking by Microphone Array on Mobile Robot, Proceedings of the Nineth European Conference on Speech Communication and Technology (Interspeech-2005), 249-252, Lisboa, Sep. 2005. Paper in PDF.
  22. Tetsuro Kitahara, Katsuhisa Ishida, Masayuki Takeda: ism: Improvisation Supporting Systems with Melody Correction and Key Vibration, Proceedings of International Conference on Entertainment Computing (ICEC 2005), Mita, Hyogo, Sep. 2005.
  23. Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Jean Rouat, Francois Michaud, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Making A Robot Recognize Three Simultaneous Sentences in Real-Time, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.897-892, IEEE, RSJ, Edmonton, Aug. 2005. Paper in PDF. doi:10.1109/IROS.2005.1545094
  24. Syunsuke Kurotaki, Noriaki Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno, Hideharu Amano: Implementation of Active Direction-Pass Filter on Dynamically Reconfigurable Processor, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.515-520, IEEE, RSJ, Edmonton, Aug. 2005. Paper in PDF. doi:10.1109/IROS.2005.1545033
  25. Tsuyoshi Tasaki, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Spatially Mapping of Friendliness for Human-Robot Interaction, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.521-526, IEEE, RSJ, Edmonton, Aug. 2005. Paper in PDF. doi:10.1109/IROS.2005.1545034
  26. Mikio Nakano, Naoyuki Kanda, Yuji Hasegawa, Toyotaka Torii, Yohane Takeuchi, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno: A Two-Layer Model for Behavior and Dialogue Planning in Conversational Service Robots, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.1542-154, IEEE, RSJ, Edmonton, Aug. 2005. Paper in PDF. doi:10.1109/IROS.2005.1545198
  27. Tetsuya Ogata, Hayato Ohba, Kazunori Komatani, Jun Tani, Hiroshi G. Okuno: Extracting Multi-Modal Dynamics of Objects using RNNPB Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2005), pp.160-165, IEEE, RSJ, Edmonton, Aug. 2005. Paper in PDF. doi:10.1109/IROS.2005.1544975
  28. Kazunori Komatani, Ryoji Hamabe, Tetsuya Ogata, Hiroshi G. Okuno: Generating Confirmation to Distinguish Phonologically Confusing Word Pairs in Spoken Dialogue Systems Proceedings of 4th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems, pp.40-45, July 2005.
  29. Yuya Hattori, Hideki Kojima, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Gesture Generation from Environmental Sounds Using Inter-modality Mapping, Proceedings of Fifth International Workshop on Epigenetic Robotics (EpiRobo-2005), Nara, July 2005.
  30. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Distance Based Dynamic Interaction of Humanoid Robot with Multiple People. Innovations in Applied Artificial Intelligence: Eighteenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2005) LNAI 3533, 111-120, Best paper award, Springer-Verlag. Bari, Italy, Jun. 2005. Paper in pdf doi:10.1007/11504894_18
  31. Takuya Yoshioka, Takafumi Hikichi, Masato Miyoshi, Hiroshi G. Okuno: Blind Estimation of Room Resonances Using Popular, Classical, and Jazz Music. Proceedings of AES 118th Convenvion, Audio Engineering Society, Barcelona, Spain, May 28-31, 2005.
  32. Shun'ichi Yamamoto, Jean-Marc Valin Kazuhiro Nakadai, Hiroshi Tsujino, Jean Rouat, Francois Michaud, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory. Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2005), 1489-1494, IEEE, Barcelona, Apr. 2005.

    Patents

  33. Robot audiovisual system Patent No. US 6,967,455 Filing date: Mar 8, 2002 Issue date: Nov 22, 2005 Inventors: Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Okuno, Hiroaki Kitano Assignee: Japan Science and Technology Agency
  34. Speech Recongition Device, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto, Wipo Patent: WO/2005/048239, Application Number: PCT/JP2004/016883, Publication Date: 05/26/2005, Filing Date: 11/12/2004.

o Academic Year 2004o

    Thesis

  1. Yasuhiro Akiba: Automatic Evaluation Methods for Machine Translation Systems, Ph.D Thesis, Jan. 2005.

  2. Kazushi Ishihara: MS Thesis, Feb. 2005
  3. Kenri Kodaka: MS Thesis, Feb. 2005
  4. Shun'ichi Yamamoto: MS Thesis, Feb. 2005
  5. Ken Yamaguchi: MS Thesis, Feb. 2005
  6. Kazuyoshi Yoshii: Drum Sound Recognition for Polyphonic Audio Signals by Adaptation of Spectral Templates and Suppression of Harmonic Structure, MS Thesis, Feb. 2005

  7. Hayato Ohba: BE Thesis, Feb. 2005
  8. Taku Oya: BE Thesis, Feb. 2005
  9. Satoshi Kaijiri: BE Thesis, Feb. 2005
  10. Ryoji Hamabe: BE Thesis, Feb. 2005
  11. Masahiro Fujihara: BE Thesis, Feb. 2005
  12. Masamitsu Murase: BE Thesis, Feb. 2005

    Peer-Reviewed Journal Papers

  13. Tetsuya Ogata, Shigeki Sugano, and Jun Tani: Acquisition of Motion Primitives of Robot in Human-Navigation Task: Towards Human-Robot Interaction based on "Quasi-Symbol", Transactions of the Japanese Society for Artificial Intelligence, Vol.20, No.3, pp.188-196. Mar. 2005. Paper Online Journal doi:10.1527/tjsai.20.188
  14. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Shun'ichi Yamamoto, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance, Transactions of the Japanese Society for Artificial Intelligence, Vol.20, No.3, pp.209-219, Mar. 2005. Paper Online Journal doi:10.1527/tjsai.20.209
  15. Yasuhiro Akiba, Kenji Imamura, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Automatic Grader of MT Outputs in Colloquial Style by Using Multiple Edit Distance, (in Japanese), Transactions of the Japanese Society for Artificial Intelligence, Vol.20, No.3,pp.139-148 (2005). Paper Online Journal doi:10.1527/tjsai.20.139
  16. Kazushi Ishihara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Recognition of Onomatopoeia for Environmental Sounds, (in Japanese), Transactions of the Japanese Society for Artificial Intelligence, Vol.20, No.3, pp.229-236, March 2005. Paper Online Journal doi:10.1527/tjsai.20.229
  17. Teruhisa Misu, Kazunori Komatani, Youji Seita, Tatsuya Kawahara: 音声対話によるソフトウェアサポートタスクのための効果的な確認戦略, IEICE Transaction on Information and Systems, Vol.88-D2, No.3 (Mar. 2005) 499-508, IEICE, Paper in pdf
  18. Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance, User Modeling and User-Adapted Interaction, Special Issue on Language-Based Interaction: User Modeling and Adaptation, Vol.15, No.169-183, Kluwer, 2005. Abstract doi:10.1007/s11257-004-5659-0
  19. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Improvement of Recognition of Simultaneous Speech Signals Using AV Integration and Scattering Theory for Humanoid Robots, Speech Communication, Vol.44 (2004) 97--112, Elsevier, Oct. 2004. doi:10.1016/j.specom.2004.10.010
  20. Tino Lourens, Hiroshi G. Okuno, Hiroshi Tsujino: A computational model of monkey cortical grating cells. Biological Cybernetics, Vol.92, No.1 (Jan. 2005) 61--70. Springer-Verlag. Paper in pdf doi:10.1007/s00422-004-0522-2
  21. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Effects of increasing modalities in recognizing three simultaneous speeches, Speech Communication, Vol.43, No.4, pp.347-359, Sep. 2004. doi:10.1016/j.specom.2004.03.008
  22. Yasuhisa Hayakawa, Tetsuya Ogata, and Shigeki Sugano: Flexible Assembly Work Cooperating System based on Work State Identifications by Self-Organizing Map, IEEE/ASME Transactions on Mechatronics, Vol.9, No.3, accepted, Sept. 2004.
  23. Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G Okuno: User model for Adaptive Response Generation in Spoken Dialogue System, IEICE Transactions on Information and Systems, Vol.87-D2, No.10 (Oct. 2004) 1921-1928, IEICE. Paper in pdf
  24. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot, Applied Intelligence, Vol.20, No.3 (May/June, 2004), 253-266, doi:10.1023/B:APIN.0000021417.62541.e0, (accepted in Oct. 2002), Kluwer Publishers.
  25. Taro Watanabe, Kenji Imamura, Eiichiro Sumita, Hiroshi G. Okuno: Statistical machine translation using hierarchical phrase alignment. IEICE Transactions on Information and Systems, Vol.J87-D2, No.4 (Apr. 2004) 978-986, IEICE. Paper in pdf

    Peer-Reviewed International Conference Papers

  26. Hiroshi G. Okuno: Robot Audition: Its Issues and State of the Art (invited talk), Proceedings of 2nd International Symposium on Life Science, Kyoto, Feb. 2005.
  27. Tetsuya Ogata, Shigeki Sugano, and Jun Tani: Acquisition of Motion Primitives of Robot in Human-Navigation Task: Towards Human-Robot Interaction based on "Quasi-Symbol", Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems, 315-326, Kyoto, Nov. 2004.
  28. Tsuyoshi Tasaki, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Motion Control using Listener's Back-Channels and Head Gesture Information. Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems, 327-338, Kyoto, Nov. 2004.
  29. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic Communication of Humanoid Robot with multiple people based on Interaction Distance, Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems, 385-392, Kyoto, Nov. 2004.
  30. Yuki Suga, Hiroaki ARIE, Tetsuya Ogata, and Shigeki Sugano: Constructivist Approach to Human-Robot Emotional Communication: Design of Evolutionary Function for WAMOEBA-3, Proceedings of IEEE/RAS Interanational Conference on Humanoid Robots (Humanoids 2004), No.76, Los Angels, Nov. 2004.
  31. Yuki Suga, Tetsuya Ogata, and Shigeki Sugano: Development of Emotional Communication Robot, WAMOEBA-3, Proceedings of International Conference on Advanced Mechatronics (ICAM 2004), 413-418, Oct. 2004.
  32. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods Proceedings of 5th International Conference on Musical Information Retreival (ISMIR-2004), 184-191, Barcelona, Spain, Oct. 2004. Paper in pdf
  33. Takuya Yoshioka, Tetsuro Kitahara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Transcription with Concurrent Recognition of Chord Symbols and Boundaries. Proceedings of 5th International Conference on Musical Information Retreival (ISMIR-2004), 100-105, Barcelona, Spain, Oct. 2004. Paper in pdf
  34. Tsuyoshi Tasaki, Takeshi Yamaguchi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot Motion Control using Listener's Back-Channels and Head Gesture Information. Proceedings of 2004 International Conference on Spoken Language Processing (ICSLP-2004), 1033-1036, ASA, ASJ, and ESCA, Korea, Oct. 2004.
  35. Kazushi Ishihara, Yuya Hattori, Tomohiro Nakatani, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Disambiguation in Determining Phonemes of Sound-Imitation Words for Environmental Sound Recognition. Proceedings of 2004 International Conference on Spoken Language Processing (ICSLP-2004), 1485-1488, ASA, ASJ, and ESCA, Korea, Oct. 2004.
  36. Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing (SAPA-2004), accepted, ASA, ASJ, and ESCA, Korea, Oct. 2004.
  37. Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno: Assessment of General Applicability of Robot Audition System by Recognizing Three Simultaneous Speeches. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004), pp.2111-2116, IEEE, RSJ, Sendai, Sep. 2004. IEEE Kansai Chapter Young Researcher Award doi:10.1109/IROS.2005.1544975
  38. Tetsuya Ogata, Masaki Matsunaga, Shigeki Sugano, and Jun Tani: Human Robot Collaboration Using Behavioral Primitives, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004), pp.1592-1597, IEEE, RSJ, Sendai, Sep. 2004.
  39. Yuki SUGA, Tetsuya Ogata, and Shigeki Sugano: Aquisition of Reactive Motion for Communication Robots Using Interactive EC: Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2004), accepted, Sept. 2004. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004), pp.1198-1203, IEEE, RSJ, Sendai, Sep. 2004.
  40. Yoshihiro Sakamoto, Tetsuya Ogata, and Shigeki Sugano: Human-Robot Communication Using Multiple Recurrent Neural Networks, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004), pp.1574-1579, IEEE, RSJ, Sendai, Sep. 2004.
  41. Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic Communication of Humanoid Robot with multiple people based on Interaction Distance, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man-2004), 81-86, IEEE, Kurashiki, Sep. 2004. Paper in pdf doi:10.1109/ROMAN.2004.1374732
  42. Yuya Hattori, Kazushi Ishihara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Repeat Recognition for Environmental Sounds, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man-2004), 121-126, IEEE, Kurashiki, Sep. 2004. Paper in pdf doi:10.1109/ROMAN.2004.1374734
  43. Yusuke Akiwa, Yuki Suga, Tetsuya Ogata, and Shigeki Sugano: Imitation based Human-Robot Communication: Roles of Joint Attention and Motion Prediction, Proceedings of International Workshop on Robot and Human Interaction (Ro-Man-2004), 283-288, IEEE, Kurashiki, Sep. 2004.
  44. Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Using a Mixture of N Best Lists from Multiple MT Systems in Rank-Sum-Based Confidence Measure for MT Outputs. Proceedings of the 20th International Conference on Computational Linguistics (Coling-2004), 322-328, Geneva, Aug. 2004.
  45. Kazunori Komatani, Teruhisa Misu, Tatsuya Kawahara, Hiroshi G. Okuno: Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface. Proceedings of the 20th International Conference on Computational Linguistics (Coling-2004), 1100-1106, Geneva, Aug. 2004.
  46. Kazushi Ishihara, Tomohiro Nakatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Sound-Imitation Word Recognition from Environmental Sounds focusing on Ambiguity Problem in Determining Phonemes. PRICAI 2004: Trends in Artificial Intelligence (Proc. of Eighth Pacific Rim International Conference on Artificial Intelligence), LNAI 3157, pp.909-918, Springer-Verlag, Auckland, Aug. 2004.
  47. Ishida, Masayuki Takeda, Tetsuro Kitahara: ism: Improvisation Supporting Systems with Melody Correction, Proceedings of the International Symposium on Musical Acoustics (NIME2004), 177-180, Hamamatsu, Jun. 2004.
  48. Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Incremental Methods to Select Test Sentence for Evaluating Translation Ability. Proceedings of the fourth international conference on Language Resources and Evaluation (LREC-2004), pp.2015-2018, Lisbon, Portugal, May 2004. Paper in pdf
  49. Kazunori Komatani, Ryosuke Itoh, Tatsuya Kawahara, Hiroshi G. Okuno: Recognition of Emotional States in Spoken Dialogue with a Robot. Innovations in Applied Artificial Intelligence:, Seventeenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems, IEA/AIE-2004, LNAI 3029, 413-423, Springer-Verlag. Ottawa, May. 2004, Paper at Springer-Verlag
  50. Tetsuya Ogata, Jun Tani: Open-end Human Robot Interaction from the Dynamical Systems Perspective: Mutual Adaptation and Incremental Learning. Innovations in Applied Artificial Intelligence:, Seventeenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems, IEA/AIE-2004, LNAI 3029, 435-444, Springer-Verlag. Ottawa, May. 2004, Paper at Springer-Verlag
  51. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Category-level Identification of Non-registered Musical Instrument Sounds. Proceedings of 2004 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2004), Vol.IV, 253-256, Montreal, May 2004. Paper in pdf doi:10.1109/ICASSP.2004.1326811
  52. Yohei Sakuraba, Tetsuro Kitahara, Hiroshi G. Okuno: Comparing Features for Forming Music Streams in Automatic Music Transcription. Proceedings of 2004 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2004), Vol.IV, 273-276, Montreal, May 2004. Paper in pdf
  53. Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Toshio Yokoyama, Hiroshi G. Okuno: Improvement of Robot Audition by Interfacing Sound Source Separation and Automatic Speech Recognition with Missing Feature Theory, Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2004), 1517-1523, IEEE, New Orleans, May. 2004. Paper in pdf IEEE Robotics and Automation Society Japan Chapter Young Award
  54. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Acoustical-similarity-based Musical Instrument Hierarchy and Its Application to Musical Instrument Identification, Proceedings of the International Symposium on Musical Acoustics (ISMA2004), 297-300, Nara, Apr. 2004.

o Academic Year 2003o

    Thesis

  1. Taro Watanabe : Example-Based Statistical Machine Translation, Ph.D Thesis, Feb. 2004.

  2. Tetsuro Kitahara: , MS Thesis, Feb. 2004.
  3. Yohei Sakuraba: , MS Thesis, Feb. 2004.
  4. Mitsuhiro Sakuraba: , MS Thesis, Feb. 2004.

  5. Naoyuki Kanda: , BE Thesis, Feb. 2004.
  6. Tsuyoshi Tasaki: , BE Thesis, Feb. 2004.
  7. Shohei Matsumoto: , BE Thesis, Feb. 2004.
  8. Yuya Hattori: , BE Thesis, Feb. 2004.
  9. Takuya Yoshioka: , BE Thesis, Feb. 2004.

    Peer-Reviewed Journal Papers

  10. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Acoustic-feature-based Musical Instrument Hierarchy and Its Application to Category-level Recognition of Unknown Musical Instruments. IPSJ Journal, Vol.45, No.3 (Mar. 2004) pp.680-689, IPSJ. Paper in html
  11. Katsuhisa Ishida, Tetsuro Kitahara, Masayuki Takeda: N-gram Based Melody Correction for Improvisation, to Category-level Recognition of Unknown Musical Instruments. IPSJ Journal, Vol.45, No.3 (Mar. 2004) pp.680-689, IPSJ. Paper in html
  12. Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno, Michihiko Minoh: Belief Network based Disambiguation of Object Reference in Spoken Dialogue System. Transactions of the Japanese Society for Artificial Intelligence, Vol.19, No.1 F, pp.47-56 (2004). Paper Online Journal
  13. Taro Watanabe, Eiichiro Sumita, Hiroshi G. Okuno: Decoding Algorithms for Statisitcal Machine Translation Considering Generation Directions, IPSJ Journal, Vol.44, No.12 (Dec. 2003) 3202-3210, IPSJ. Paper in html
  14. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Musical Instrument Identification Considering Pitch-dependent Characteristics of Timbre: A Classifier Based on F0-dependent Multivariate Normal Distribution. IPSJ Journal, Vol.44, No.10 (Oct. 2003) 2448-2458, IPSJ. Paper in html
  15. Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroshi Mizoguchi, Hiroaki Kitano: Real-time Multiple Talker Tracking by Audio-Visual Integration for Humanoids: Integration of Active Audition nad Face Recognition. Journal of Robotics Society of Japan, Vol.21, No.5 (Jul. 2003), pp.517--525. Digital Library, pdf
  16. Kazunori Komatani, Hiroaki Kashima, Katsuaki Tanaka, Tatsuya Kawahara: Domain-independent Spoken Dialogue Platform for Database Query Using Key-phrase Spotting Based on Combined Language Model, IPSJ Journal, Vol.44, No.5 (May 2003) 1333-1342. Paper in html
  17. Hiroshi G. Okuno, Kazuhiro Nakadai, Active audition for humanoid robots that can listen to three simultaneous talkers. Journal of the Acoustical Society of America, Vol.113, No.4, Pt.2 of 2, Apr. 2003, pp.2230. Abstract at ASA.

    Survye Papers

  18. Hiroshi G. Okuno, Kazuhiro Nakadai: Robot Audition: its research topics and current status. Joho SHori, Vol.44, No.11 (Nov. 2003) pp.1138-1144, IPSJ. Article in html

    Peer-Reviewed International Conference Papers

  19. Hiroshi G. Okuno, Tetsuya Ogata, Kazunori Komatani: Computational Auditory Scene Analysis and Its Application to Robot Audition, Proceedings of the International Conference on Informatics Research for Development of Knowledge Society Infrastructure (ICKS 2004), pp.73-80, Mar. 2004, doi:10.1109/ICKS.2004.1313411
  20. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroaki Kitano: Applying Scattering Theory to Robot Audition System Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2003), 1147-1152, IEEE, Las Vegas, Oct. 2003. Paper in pdf
  21. Tetsuya Ogata, S. Sugano, Jun Tani: Interactive Learning in Human-Robot Collaboration, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2003), 162-167, IEEE, Las Vegas, Oct. 2003. Paper in pdf
  22. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroaki Kitano: Active audition based humanoid system and ist evaluation: Localization, Seperation and Recognition of Simultaneous Speech. Proceedings of IEEE/RSJ International Conference on Humanoids (Humanoids-2003), Springer-Verlag, IEEE, Munchen, Oct. 2003.
  23. Yohei Sakuraba, Hiroshi G. Okuno: Note Recognition of Polyphonic Music by Using Timbre Similarity and Direction Proximity. Proceedings of International Computer Music Conference (ICMC2003), 167-170, Singapore, Oct. 2003.
  24. Yasuhiro AKiba, Hiroshi G. Okuno: Experimental Comparison of MT Evaluation Methods: RED vs. BLEU. Proceedings of MT Summit IX, New Orleans, Sep. 2003.
  25. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Realizing Personality in Audio-Visually Triggered Non-verbal Behaviors. Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2003), 392-397, IEEE, Sep. 2003. Paper in pdf
  26. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Robot Recognizes Three Simultaneous Speech By Active Audition. Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2003), 398-403, IEEE, Sep. 2003. Paper in pdf
  27. Yasuhiro AKiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, and Hiroshi G. Okuno: Experimental Comparison of MT Evaluation Methods: RED vs. BLEU. Proceedings of MT Summit IX, 1-8, New Orleans, Sep. 2003. Paper in pdf
  28. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Improvement of Three Simultaneous Speech Recognition by Using AV Integration and Scattering Theory for Humanoid. Proceedings of Audio Visual Spoken Processing (AVSP-2003), 157-162, St. Jorioz, France, Sep. 2003. Paper in pdf
  29. Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: User Modeling in Spoken Dialogue Systems for Flexible Guidance Generation. Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003), 745-748, Geneva, Sep. 2003.
  30. Kazushi Ishihara, Yasushi Tsubota, Hiroshi G. Okuno: Automatic Transformation of Environmental Sounds into Sound-Imitation Words Based on Japanese Syllable Structure. Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003), 3185-3188, Geneva, Sep. 2003.
  31. Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Three Simultaneous Speech Recognition by Integration of Active Audition and Face Recognition for Humanoid, Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003), 2705-2708, Geneva, Sep. 2003.
  32. Tatsuya Kawahara, Ryosuke Ito, Kazunori Komatani: Spoken Dialogue System for Queries on Appliance Manuals using Hierarchical Confirmation Strategy. Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003), accepted for presentation, Geneva, Sep. 2003.
  33. Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: Flexible Guidance Generation using User Model in Spoken Dialogue Systems, Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL 2003), pp.256-263, Sapporo, Jul. 2003.
  34. Taro Watanabe, Eiichiro Sumita, and Hiroshi G. Okuno: Chunk-based statistical translation, Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL 2003), pp.303-310, Sapporo, Jul. 2003.
  35. Yasuhiro AKiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, and Hiroshi G. Okuno: A Statistical-Informmation-Based Selector of the Best among Multiple Outputs, Exhibition Brochure of the 41st Annual Meeting of the ACL (ACL 2003), 16, Sapporo, Jul. 2003.
  36. Yoji Kiyota, Sadao Kurohashi, Teruhisa Misu, Kazunori Komatani, Tatsuya Kawahara, Fuyuko Kido: Dialog Navigator''A Spoken Dialog Q-A System based on Large Text Knowledge Base. ACL03 Interactive Poster/Demo Session, pp.149--152 (Companion Volume), 2003.
  37. Kazunori Komatani, Fumihiro Adachi, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: Flexible Spoken Dialogue System based on User Models and Dynamic Generation of VoiceXML Scripts. 4th SIGdial Workshop on Discourse and Dialogue, pp.87--96, 2003.
  38. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution. Proceedings of 2003 International Conference on Muotimedia and Expo (ICME 2003), IEEE, Vol.III, pp.405-409, Baltimore, MD, Jul. 2003.
  39. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Pitch-dependent Musical Instrument Identification and Its Application to Musical Sound Ontology. In Chung, P,W.H., Hinde, C. and Ali, M. (Eds.) Developments in Applied Artificial Intelligence, LNAI 2718, 112-122, Springer-Verlag. Proceedings of Nineteenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2003), Loughborough, UK, Jun. 2003,
  40. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Design and Implementation of Personality of Humanoids in Human Humanoid Non-verbal Interaction. In Chung, P,W.H., Hinde, C. and Ali, M. (Eds.) Developments in Applied Artificial Intelligence, LNAI 2718, 662-673, Springer-Verlag. Proceedings of Nineteenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2003), Loughborough, UK, Jun. 2003,
  41. Hiroshi G. Okuno, Kazuhiro Nakadai: Real-time Sound Source Localization and Separation based on Active Audio-Visual Integration. In Jose Mira and Jose R. Alvarez (Eds.): Computational Methods in Neural Modeling, LNCS 2686, 118-125, Springer-Verlag. The Seventh International Work Conference on Artificial and Nataural Neural Networks, IWANN 2003, Proceedings, Part 1, Ma¥'{o}, Menorca,, Spain, June 2003, Paper in PDF
  42. Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution. Proceedings of 2003 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2003), Vol.5, pp.421--424, IEEE, Hong Kong, Apr. 2003. Paper in PDF

  43. Shun Tsuchiya (Ed.): "Encyclopedia AI", Feigenbaum, McCarthy. Kyoritsu Publishers, 2003.

o Academic Year 2002o

    Thesis

  1. Shinya Amano: Studies on Natural Language Processing for Kana-to-Kanji Conversion and Machine Translation, Ph.D Thesis, Feb. 2003.
  2. Kazunori Komatani: Spoken Dialogue Systems for Information Retrieval with Domain-Independent Dialogue Strategies, Ph.D Thesis, Oct. 2002.

  3. Ryosuke Ito: , MS Thesis, Feb. 2003.
  4. Takashi Sumiyoshi: , MS Thesis, Feb. 2003.
  5. Masahiro Hasegawa: , MS Thesis, Feb. 2003.
  6. Naofumi Yoshida: , MS Thesis, Feb. 2003.
  7. Ian R. Lane: Language Model Switching Based on Topic Detection for Multi-Domain Dialog Speech Recognition, MS Thesis, Feb. 2003.
  8. Yuha Aakita: , MS Thesis, Aug. 2002.

  9. Kazushi Ishihara: , BS Thesis, Feb. 2003.
  10. Tasuku Kitade: , BS Thesis, Feb. 2003.
  11. Teruhisa Misu: , BS Thesis, Feb. 2003.
  12. Shun-ichi Yamamoto: , BS Thesis, Feb. 2003.
  13. Kazuyoshi Yoshii: , BS Thesis, Feb. 2003.

    Peer-Reviewed Journal Papers

  14. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot. Applied Intelligence, Kluwer Publisher, accepted for publication, International Society for Applied Intelligence, 2003.
  15. Hiroshi G. Okuno, Kazuhiro Nakadai, Ken'ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano: Human-Robot Non-Verbal Interaction Empowered by Real-Time Auditory and Visual Multiple-Talker Tracking Advanced Robotics, Vol.17, No.2, pp.115-130, VSP and Robotics Society of Japan, 2003. doi:10.1163/156855303321165088 Online version, pdf
  16. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Issues in Humanoid Audition and Sound Source Localization by Active Audition. Transaction of JSAI, Vol.18, No.2 F, pp.103-110 (Mar. 2003). Paper Online Journal
  17. Kazunori Komatani, Tatsuya Kawahara: , IPSJ Journal, Vol.43, No.10, pp.3078--3086, 2002. Paper in html
  18. Ryosuke Ito, Kazunori Komatani, Tatsuya Kawahara: , IPSJ Journal, Vol.43, No.7, pp.2147--2154, 2002. Paper in html
  19. Masahiro Hasegawa, Yuya Akita, Tatsuya Kawahara: , IPSJ Journal, Vol.43, No.7, pp.2222-2229, 2002. Paper in html
  20. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Auditory and Visual Multiple-Speaker Tracking For Human-Robot Interaction. Journal of Robotics and Mechatronics, special issue on Human Robot Interaction, Vol.14, No.5 (2002) 479-489, Mechatronics Society of Japan.
  21. Kentaro Umesawa, Takamichi Saito, Hiroshi G. Okuno: , IPSJ Journal, Vol.43, No.8 (Aug. 2002) 1553-1562. Paper in html

  22. Yuasa, T. and Okuno, H.G. (Eds.): Advanced Lisp Technology, Advanced Information Processing Technology, Vol.4, Taylor and Francis Publishers, London, UK, May, 2002.

    Peer-Reviewed J ournal Papers

  23. Takamichi Saito, Toshiyuki Kitoh, Kentaro Umesawa, Hiroshi G. Okuno: Privacy-Enhanced SPKI Access Control on PKIX and Its Application to Web Server. Proc. of the Seventeenth International Conference on Advanced Information Networking and Applications (AINA 2003), 696--703, IEEE, Xi'an, China. Paper. doi:10.1109/AINA.2003.1192970
  24. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Auditory Fovea Based Speech Separation and Its Application to Dialog System. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2002), 1314-1319, IEEE, Geneva, Oct. 2002.
  25. Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno: BELIEF NETWORK BASED DISAMBIGUATION OF OBJECT REFERENCE IN SPOKEN DIALOGUE SYSTEM FOR ROBOT. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 170-176, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  26. Yasushi Tsubota, Tatsuya Kawahara, Hiroshi G. Okuno, Masatake Dantsuji: RECOGNITION AND VERIFICATION OF ENGLISH BY JAPANESE STUDENTS FOR COMPUTER-ASSISTED LANGUAGE LEARNING SYSTEM. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 1205-1208, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  27. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: AUDITORY FOVEA BASED SPEECH ENCHANCEMENT AND ITS APPLICATION TO HUMAN-ROBOT DIALOG SYSTEM. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 1817-1820, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  28. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: REAL-TIME SOUND SOURCE LOCALIZATION AND SEPARATION FOR ROBOT AUDITION. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), 193-196, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  29. Taro Watanabe, Eiichiro Sumita: Statistical Machine Translation Decoder Base On Phrase. Proceedings of 2002 International Conference on Spoken Language Processing (ICSLP-2002), Spec3Co2, ASA, ASJ, and ESCA, Denver, Sep. 2002.
  30. Kazunori Komatani, Tatsuya Kawahara, Ryosuke Ito, Hiroshi G. Okuno: Efficient Dialogue Strategy to Find Users' Intended Items from Information Query Results, Proceedings of the Nineteenth International Conference on Computational Linguistics (Coling-2002), Vol.1, pp.481-487, Aug. 2002.
  31. Taro Watanabe, Eiichiro Sumita: Bidirectional Decoding for Statistical Machine Translation. Proceedings of the Nineteenth International Conference on Computational Linguistics (Coling-2002), pp. Aug. 2002.
  32. Yasuhiro Akiba, Taro Watanabe, Eiichiro Sumita: Using Language and Translation Models to Select the Best among Outputs from Multiple MT Systems, Proceedings of the Nineteenth International Conference on Computational Linguistics (Coling-2002), pp. Aug. 2002.
  33. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Realizing Audio-Visually triggered ELIZA-like non-verbal Behaviors. In Ishizuka, M. and Slaney, J. (eds) PRICAI-2002 Topics in Artificial Intelligence (Seventh Pacific Rim International Conference on Artificial Intelligence), LNAI 2417, 552--562, Springer-Verlag, Tokyo, Aug. 2002. Paper in PDF
  34. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Exploiting Auditory Fovea in Humanoid-Human Interaction. Proceedings of Eighteenth National Conference on Artificial Intelligence (AAAI-2002), 431-438, AAAI, Edmonton, Aug. 2002. Paper in PDF
  35. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Non-Verbal Eliza-like Human Behaviors in Human-Robot Interaction through Real-Time Auditory and Visual Multiple-Talker Tracking. Proceedings of the Third International Workshop on Cognitive Robotics (CogRob-2002), AAAI, Edmonton, Jul. 2002. Paper in PDF
  36. Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Social Interaction of Humanoid Robot through Auditory and Visual Tracking. In Hendtlass, T., and Ali, M. (Eds.) Developments in Applied Artificial Intelligence, Proceedings of Eighteenth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2002), Cairns, Australia, June 2002, LNAI 2358, pp.725-735, Springer-Verlag. Paper in PDF
  37. Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno: Belief Network based Disambiguation of Word Reference in Spoken Dialogue System for Robot. Proceedings of ISCA Tutorial and Research Workshop on Multi-Modal Dialogue in Mobile Environments, Germany, Jun. 2002.
  38. Kazuhiro Nakadai, Ken'ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time speaker localization and speech separation by Audio-Visual Integration, Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2002), pp.1043-1049, IEEE, May 2002. Paper in PDF doi:10.1109/ROBOT.2002.1013493

o Academic Year 2001o

    Thesis

  1. Hirofumi Adachi: , MS Thesis, Feb. 2002.
  2. Yoko Yamakata: , MS Thesis, Feb. 2002.
  3. Raux Antoine Roland: Intelligibility Assessment and Adaptive Drill Generation for a Computer-Assisted Pronunciation Learning System, MS Thesis, Feb. 2002.

  4. Shinichi Ueno: , BS Thesis, Feb. 2002.
  5. Yohei Sakuraba: , BS Thesis, Feb. 2002.
  6. Kazuya Shitaoka: , BS Thesis, Feb. 2002.
  7. Masahiro Yokoo: , BS Thesis, Feb. 2002.

    Peer-Reviewed Journal Papers

  8. Hiroshi G. Okuno, Kazuhiro Nakadai, Lourens, T., Hiroaki Kitano: Sound and Visual Tracking by Active Audition. in Jin, Q., Li, J., Zhang, N., Cheng, J., Yu, C., and Noguchi, N (eds) Enabling Society with Information Technology pp.174-185, Springer-Verlag, 2002.
  9. Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: , Trans. IEICE, Vol.J84-D1, No.11 (Nov. 2001) pp.1553-1562, IEICE, Paper in pdf
  10. Kentaro Umesawa, Takamichi Saito, Hiroshi G. Okuno: , IPSJ Journal, Vol.42, No.8 (Aug. 2001) pp.2067-2076. TAF Telecom Technology Student Award
  11. Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Koichi Takeda, N. Minematsu, Shigeki Sagayama, Katsuya Itou, A. Ito, M. Yamamoto, A. Yamada, T.Utsuto, Kiyohiro Shikano: Japanese Dictation ToolKit -- 1999 version --, Journal of Acoustic Society of Japan, Vol.57, No.3, pp.210--214, 2001
  12. M. Mimura and Tatsuya Kawahara: Difference of acoustic modeling for read speech and dialogue speech, Acoustical Science & Technology, Vol.22, No.5, pp.373--374, 2001.

    Survey Papers

  13. Hiroshi G. Okuno, Kazuhiro Nakadai: , JSAJ, Vol.58, No.3 (Mar. 2002) pp.205-210.

  14. Hiroaki Kitano: Hiroshi G. Okuno, 諸橋 峰雄, 京田 耕司, Kazuhiro Nakadai : 『PCクラスタ構築法 − Linux クラスタベオウルフ』, 産業図書, 2001.

    Peer-Reviewed International Conference Papers
  15. Kazuhiro Nakadai, Ken'ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Active Human Tracking by Hierarchical Integration of Audition and Vision. Proceedings of IEEE-RAS International Conference on Humanoid Robots (Humanoids2001), pp.91-98, IEEE, Nov. 2001. Paper in PDF
  16. Kazuhiro Nakadai, Tatsuya Matsui, Hiroshi G. Okuno, Hiroaki Kitano: Epipolar Geometry Based Sound Localization and Extraction for Humanoid Audition. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2001), 1395-1401, IEEE and RSJ, Oct. 2001. Paper in PDF doi:10.1109/IROS.2001.977176
  17. Hiroshi G. Okuno, Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano: Human-Robot Interaction Through Real-Time Auditory and Visual Multiple-Talker Tracking Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2001), 1402-1409, IEEE and RSJ, Oct. 2001. Paper in PDF Nakamura Award for IROS-2001 Best Paper Nomination Finalist (2nd or 3rd Place) at IROS-2002 doi:10.1109/IROS.2001.977177
  18. Tino Lourens, Hiroshi G. Okuno, Hiroaki Kitano: Automatic Graph Extraction from Color Images. Proc. of 11th International Conference Image Analysis and Processing (ICIAP 2001), pp.302-308, Granada, Spain, June 2001.
  19. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Multiple Speaker Tracking by Multi-Modal Integration for Mobile Robots. Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.2643-2646, Sep. 2001. Paper in PDF
  20. Hiroshi G. Okuno, Kazuhiro Nakadai, T. Lourens, Hiroaki Kitano: Separating Three Simultaneous Speeches with Two Microphones by Integrating Auditory and Visual Processing. Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.1193-1196, Sep. 2001. Paper in PDF
  21. Akinobu Lee, Tatsuya Kawahara, and Kiyohiro Shikano: Gaussian mixture selection using context-independent HMM, Proc. IEEE-ICASSP, pp.69--72, 2001.
  22. Hiroaki Nanjo, Tatsuya Kawahara: Speaking-Rate Dependent Decoding and Adaptation for Spontaneous Lecture Speech Recognition, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), pp.725--728, 2002.
  23. Kazunori Komatani, K.Tanaka, H.Kashima, and Tatsuya Kawahara: Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model, Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.1319--1322, 2001.
  24. Akinobu Lee, Tatsuya Kawahara, and Kiyohiro Shikano: Julius -- an open source real-time large vocabulary recognition engine, Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.1691--1694, 2001
  25. Hiroaki Nanjo, Kazuomi Kato, and Tatsuya Kawahara: Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition, Proceedings of European Conforence on Speech Processing (Eurospeech 2001), pp.2531--2534, 2001
  26. Tatsuya Kawahara, Hiroaki Nanjo, and S.Furui: Automatic transcription of spontaneous lecture speech, Proc. IEEE workshop on Automatic Speech Recognition and Understanding, 2001.
  27. Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Auditory and Visual Multiple-Object Tracking for Robots. Proc. of 17th International Joint Conference on Artificial Intelligence (IJCAI-01) , 1425-1432, Seattle, Aug. 2001. 電気通信普及財団テレコム技術賞奨励賞 Paper in PDF
  28. Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Detection of Oriented Repetitive Alternating Patterns in Color Images -- A Computational Model of Monkey Grating Cells. Proc. of Sixth International Work-Conference on Artificial and Natural Neural Networks (IWANN2001), Granada, Spain, June 2001. LNCS 2084, 95-107, Springer-Verlag.
  29. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot, Proc. of 17th International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2001) , Budapest, Hungary, June 2001. Lecture Notes in Artificial Intelligence No.2070, 640-650, Springer. Best Paper Award (1st Place)
  30. Ian Frank, Kumiko Tanaka, Hiroshi G. Okuno, Jun'ichi Akita, Yukiko Nakagawa, K. Maeda, Kazuhiro Nakadai, Hiroaki Kitano: And The Fans are Going Wild! SIG plus MIKE. RoboCup 2000: Robot Soccer World Cup IV, Lecture Notes in Artificial Intelligence No.2019, 139-148, Springer-Verlag, May 2001.
  31. Yukiko Nakagawa, Hiroshi G. Okuno, Hiroaki Kitano: Bridging gap between small sized league and simulator league. RoboCup 2000: Robot Soccer World Cup IV, Lecture Notes in Artificial Intelligence No.2019, 209-218, Springer-Verlag, May 2001.
  32. Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: An Access Control with Handling Private Information Server. Proc. of the First International Workshop on Internet Computing and E-Commerce (ICEC01), IEEE, San Francisco, April 2001.

ACM author profile page fun


Last Update: Mon Nov 9 17:54:47 2009


(C) Copyleft All Wrongs Reserved, 2001-2009.