Publication and Awards of Okuno Laboratory
DBLP:
H.G. Okuno,
T. Ogata,
K. Komatani,
T. Takahashi,
S. Nishide,
R. Takeda,
K. Itoyama,
T. Yoshioka,
H. Fujihara,
T. Mizumoto
Katsumaru,
Yasuraoka,
OB:
S. Shiramatsu,
S. Ikeda,
H. Kanda
Y. Kubota
H-D. Kim,
S. Yamamoto,
K. Yoshii,
R. Yokoya
S. Naito,
T. Kitahara,
H. Niwa,
M. Yoshida,
T. Tasaki,
S. Matsumoto,
Valin,
Y. Akiba,
T. Watanabe,
K. Ishihara,
Kodaka,
M. Toda,
T. Misu,
I. Lane,
Y. Akita,
Y. Yamakata,
A. Raux,
Ito,
T. Kawahara,
Academic Year 2009
Thesis |
Journal Papers |
Book Chapters |
International Conferences |
Domestic Conferences |
Patents
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
Design and Implementation of Robot uAudition System "HARK",
Advanced Robotics, accepted, Sep. 2009,
VSP and Robotics Society of Japan.
-
Hyun-Don Kim,
Jinsung Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Target Speech Detection and Separation for Communication with
Humanoid Robots in Noisy Home Environments,
Advanced Robotics, in print,
VSP and Robotics Society of Japan.
doi:10.1163/016918609X12529300552105,
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Self-Organization of Dynamic Object Features based on Bi-Directional Training,
Advanced Robotics, Vol.23 (2009) 2035-2057.
doi:10.1163/016918609X12529289797027,
VSP and Robotics Society of Japan.
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Autonomous Motion Generation based on Reliable Predictability,
Journal of Robotics and Mechatronics,
special issue on Kukanchi Interactive Human-Space Design and Intelligence
Dedicated to Dr. Kazuo Tanie,
Vol.21, No.4 (2009) 478-488.
-
Ryu Takeda,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot audition by multi-channel input Independent Component Analysis (in Japanese),
Journal of Robotics Society of Japan, Vol.27, No.7/8 (2009) accepted.
-
Kazumasa Murata,
Kazuhiro Nakadai,
Ryu Takeda,
Hiroshi G. Okuno,
Yuji Hasegawa, Hiroshi Tsujino:
Musical Beat-Tracking for Robots and Its Application to A Music Robot,
Journal of Robotics Society of Japan, Vol.27, No.7/8 (2009) accepted.
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Simulation of Phoneme Aquisition Process (in Japanese),
Journal of Robotics Society of Japan, Vol.27, No.7/8 (2009) accepted.
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions
IPSJ Journal, Vol.50, No.7 (Jul. 2009) 1757-1767, IPSJ.
Journal of Information Processing, Vol.17 (2009) 191-201, IPSJ.
pdf
D-Library
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Binaural Active Audition for Humanoid Robots to Localize Speech over Entire
Azimuth Range,
Applied Bionics and Biomechanics, Special Issue on "Humanoid Robots",
accepted with minor modifications,
Taylor & Francis, Mar. 2009.
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Human Tracking System Integrating Sound and Face Localization using
EM Algorithm in Real Environments,
Advanced Robotics, Vol.23, No.6 (May 2009) 629-653,
doi:10.1163/156855309X431659,
VSP and Robotics Society of Japan.
-
Masaki Katsumaru,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Adjusting Occurence Probabilities of Automatically-Generated Abbreviated
Words in Spoken Dialogue Systems,
B.-C. Chien, T.-P. Hong, S.-M. Chen, M. Ali (Eds.):
Next-Generation Applied Intelligence, 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems,
Lecture Notes in Artificial Intelligence 5579, pp.481-490,
Tainan, Taiwan, Jun. 24-27, 2009.
doi:10.1007/978-3-642-02568-6_49
-
Shun Shiramatsu,
Yuji Kubota,
Kazunori Komatani,
Tetsuya Ogata,
Toru Takahashi,
Hiroshi G. Okuno:
Visualization-based Approaches to Support Context Sharing towards Public
Involment Support System,
Opportunities and Challenges for Next-Generation Applied Intelligence,
Studies in Computational Intelligence, Springer, Vol.214,
pp.111--117, Tainan, Taiwan, Jun. 24-27, 2009.
doi:10.1007/978-3-540-92814-0_18
-
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:
A Model of Temporally Changing User Behaviors in a Deployed Spoken
Dialogue System, G.-J. Houben et al. (Eds.):
UMAP 2009, First and Seventeenth International Conference on User Modeling, Adaptation, and Personalization,
Lecture Notes in Computer Science 5535, pp.408-414,
Trento, Italy, Jun. 22-26, 2009.
-
Akira Maezawa,
Katsutoshi Itoyama,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Bowed String Sequence Estimation of a Violin Based on Adaptive Audio Signal
Classification and Context-Dependent Error Correction,
Proceedings of IEEE International Symposium on Multimedia (ISM2009),
accepted for full paper presentation (acceptance rate for full papers, 19.6%),
San Diego, Dec. 14-16, 2009.
-
Takuma Ohtsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Voice quality manipulation for humanoid robots consistent with their head movements,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008),
accepted, IEEE, Paris, Dec. 7-10, 2009
-
Takumi Yoshida,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Automatic Speech Recognition Improved by Two-Layered Audio-Visual,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008),
accepted, IEEE, Paris, Dec. 7-10, 2009.
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Estimation of Reverberation Time with Robot Speech to Improve ICA-based Robot Audition,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008),
accepted, IEEE, Paris, Dec. 7-10, 2009.
-
Hiromasa Fujihara,
Masataka Goto,
Hiroshi G. Okuno:
A NOVEL FRAMEWORK FOR RECOGNIZING PHONEMES OF SINGING VOICE IN POLYPHONIC MUSIC,
Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), accepted,
Oct. 18-21, New Paltz, NY, 2009.
-
Takuya Yoshioka,
Hirokazu Kameoka, Tomohiro Nakatani,
Hiroshi G. Okuno:
Statistical models for speech dereverberation,
Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), accepted,
Oct. 18-21, New Paltz, NY, 2009.
-
Naoki Yasuraoka,
Takehiro Abe,
Katsutoshi Itoyama,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Changing Timbre and Phrase in Existing Musical Performances as You Like,
ACM Multimedia 2009, 203-212
(16% 22/138), Beijing, China, Oct. 19-24, 2009.
pdf,
doi:10.1145/1631272.1631302
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Step-size Parameter Adaptation of Multi-channel Semi-blind ICA with Piecewise Linear Model for Barge-in-able Robot Audition (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.2273-2282, (900/1650),
IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
pdf
-
Takuma Ohtsuka,
Kazumasa Murata,
iToru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Incremental Polyphonic Audio to Score Alignment using Beat Tracking for Singer Robots (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.2289-2296,
IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
pdf
-
Takeshi Mizumoto,
Hiroshi Tsujino,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Thereminist Robot: Development of a Robot Theremin Player with Feedforward and Feedback Arm Control based on a Theremin's Pitch Model (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.2297-2302,
IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
pdf
-
Toru Takahashi,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.2730-2735,
IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
pdf
-
Wataru Hinoshita,
Tetsuya Ogata,
Hideki Kozima,
Hisashi Kanda,
Toru Takahashi,
Hiroshi G. Okuno:
Emergence of Evolutional Interaction with Voice and Motion between Two Robots using RNN,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.4196-4291,
IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009.
pdf
-
Shun Nishide,
Tetsuhiro Nakagawa,
Tetsuya Ogata,
Jun Tani,
Toru Takaahashi,
Hiroshi G. Okuno:
Modeling Tool-Body Assimilation using Second-order Recurrent Neural Network,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.5376-5381, (900/1650),
IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009.
pdf
-
Hisashi Kanda,
Tetsuya Ogata,
Toru Takahashi,
Kazunori Komatani,
Hiroshi G. Okuno:
Phoneme Acquisition Model based on Vowel Imitation using Recurrent Neural Network,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.5388-5393,
IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009.
pdf
-
Kazunori Komatani,
Satoshi Ikeda,
Yuichiro Fukubayashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Ranking Help Message Candidates Based on Robust Grammar Verification Results
and Utterance History in Spoken Dialogue Systems,
Proceedings of the 10th SIGdial Workshop on Discourse and Dialogue (SigDial 2009),
314-321, Sep. 12, 2009.
-
Kyoko Matsuyama,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Enabling A User To Specify An Item At Any Time During System Enumeration,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2009), Mon-Ses2-P4-1, (57.7%),
Brighton, 6-10 Sep. 2009.
pdf
-
Masaki Katsumaru,
Mikio Nakano,
Kazunori Komatani,
Kotaro Funakoshi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improving Speech Understanding Accuracy with Limited Training
Data Using Multiple Language Models and Multiple Understanding Models,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2009), Thu-Ses1-P4-9, (57.7%), Brighton, 6-10 (10) Sep. 2009.
pdf
-
Hideki Kawahara,
Masanori Morise,
Toru Takahashi,
Hideki Banno, Ryuichi Nishimura, Toshio Irino:
Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2009), Thu-Ses1-P2-6, (57.7%),
Brighton, 6-10 Sep. 2009.
pdf
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hyun-Don Kim:
Robot Auditon: Missing Feature Theory Approach and Active Audition (Invited talk),
Proceeding of the 14th International Symposium of Robotics Research
(ISRR 2009), August 31 - September 3, 2009, Lucerne, Switzerland,
International Foundation of Robotics Research.
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
QUERY-BY-EXAMPLE MUSIC RETRIEVAL APPROACH BASED ON MUSICAL GENRE SHIFT BY CHANGING INSTRUMENT VOLUME,
Proceeding of the 12th International Conference on Digital Audio Effects
(DAFx-09), accepted,
Como, Italy, Sep.1-4. 2009.
-
Shun Shiramatsu,
Tadachika Ozano, Toramatsu Shintani,
Kazunori Komatani,
Tetsuya Ogata,
Toru Takahashi:
Hiroshi G. Okuno:
Development of a Meeting Browser towards Supporting Public Involvement,
Proceedings of International Conference on Computational Science and Engineering, Vol.4, 717-722 (Aug. 2008), IEEE
pdf,
doi:10.1145/10.1109/CSE.2009.362
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Analysis of Motion Searching based on Reliable Predictability using Recurrent Neural Network,
Proceedings of 2009 IEEE/ASME Conference on Advanced Intelligent Mechatronics (AIM 2009), 192-197,
Singapore, July 14-19, 2009.
doi:10.1145/10.1109/AIM.2009.5230015
-
Kazunori Komatani,
Alexander I. Rudnicky:
Predicting Barge-in Utterance Errors by using Implicitly-Supervised ASR Accuracy and Barge-in Rate per User,
Proceedings of the Fourth International Joint Conference on Natural Language
Processing (ACL-IJCNLP 2009), accepted as a short paper, Jul. 2009.
-
Masaki Katsumaru,
Mikio Nakano,
Kazunori Komatani,
Kotaro Funakoshi,
Hiroshi G. Okuno:
A Speech Understanding Framework that Uses Multiple Language Models and
Multiple Understanding Models,
Proceeding of the North American Chapter of the Association for
Computational Linguistics - Human Language Technologies (NAACL HLT)
2009 conference,
accepted, (73/180)
Boulder, CO, May 31 - Jun. 5, 2009.
-
Tetsuya Ogata,
Ryunosuke Yokoya,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Prediction and Imitation of Other's Motions by Reusing Own Forward-Inverse
Model in Robots,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2009), pp.4144-4149, (699/1624),
(May 12-17 (16), 2009), Kobe.
pdf
doi:10.1145/10.1109/ROBOT.2009.5152363
-
Hisashi Kanda,
Tetsuya Ogata,
Toru Takahashi,
Kazunori Komatani,
Hiroshi G. Okuno:
Continuous Vocal Imitation with Self-organized Vowel Spaces in
Recurrent Neural Network,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2009), pp.4438-4443,
(May 12-17 (16), 2009), Kobe.
pdf
doi:10.1145/10.1109/ROBOT.2009.5152818
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
ICA-BASED EFFICIENT BLIND DEREVERBERATION AND ECHO CANCELLATION METHOD FOR BARGE-IN-ABLE ROBOT AUDITION,
Proceedings of 2009 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2009),
SS-L7.1, pp.3677-3680, (1178/2633), Taipei, Taiwan, April 19--24 (23), 2009.
pdf
doi:10.1145/10.1109//ICASSP.2009.4960424
-
Hideki Kawahara,
Ryuichi Nisimura, Toshio Irino, Masanori Morise,
Toru Takahashi,
Hideki Banno:
TEMPORALLY VARIABLE MULTI-ASPECT AUDITORY MORPHING ENABLING EXTRAPOLATION WITHOUT OBJECTIVE AND PERCEPTUAL BREAKDOWN,
Proceedings of 2009 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2009),
pp. , April 23.
pdf
Academic Year 2008
Thesis |
Journal Papers |
Book Chapters |
International Conferences |
Domestic Conferences |
Patents
Thesis
- Shun Nishide:
Self-Organization of Invariants for Motion Generation based on
Reliable Predictability,
Ph.D Thesis, Feb. 2009.
- Hyun-Don Kim:
Binaural Active Audition for Humanoid Robots,
Ph.D Thesis, Sep. 2008.
-
Takehiro Abe, MS Thesis, Feb. 2008.
-
Satoshi Ikeda, MS Thesis, Feb. 2008.
-
Hisashi Kanda, MS Thesis, Feb. 2008.
-
Yuji Kubota, MS Thesis, Feb. 2008.
-
Kaiping Wang, MS Thesis, Feb. 2008.
-
Takuma Otsuka, BE Thesis, Feb. 2008.
-
Wataru Hinoshita, BE Thesis, Feb. 2008.
-
Kyoko Matsuyama, BE Thesis, Feb. 2008.
-
Tadanori Yasuraoka, BE Thesis, Feb. 2008.
-
Tatsuhiro Nakagawa, BE Thesis, Feb. 2008.
Peer-reviewed Journal Papers
-
Takehiro Abe,
Katsutoshi Itoyama,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
An Analysis-and-Synthesis Approach for Manipulating Pitch of a Musical
Instrument Sound Considering Pitch-dependency of Timbral Characteristics,
IPSJ Journal, Vol.50, No.3 (Mar., 2009) 1054-1066
IPSJ.
pdf,
D-Lib
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Integrating Topic Estimation and Dialogue History for Domain Selection in
Multi-Domain Spoken Dialogue Systems,
IPSJ Journal, Vol.50, No.2 (Feb., 2009) 488-500,
IPSJ.
pdf,
D-Lib
-
Masaharu Morise,
Toru Takahashi,
Hideki Kawahara,
Toshio Irino:
IEIC Trans. A, Vol.J92-A, No.3 (Mar. 2009).
-
Shun Shiramatsu,
Kazunori Komatani,
Koiti Hasida,
Tetsuya Ogata,
Hiroshi G. Okuno:
Game-Theoretic Model of Referential Coherence and Its Empirical
Verification Using Large Japanese and English Corpora,
ACM Transactions on Speech and Language Processing, Vol.5, No.3 (Oct. 2008) Article 6, ACM.
pdf,
doi:10.1145/1410358.1410360
-
Hiromasa Fujihara,
Masataka Goto,
Hiroshi G. Okuno:
An F0 Estimation Method of Vocal Part in Polyphonic Music by Using Statistical
Modelling of Singing Voice and Viterbi Search,
IPSJ Journal, Vol.49, No.10 (Oct. 2008) 3682-3693, IPSJ.
pdf,
D-Lib
-
Kazunori Komatani,
Satoshi Ikeda,
Tetsuya Ogata,
Hiroshi G. Okuno:
Managing Out-of-Grammar Utterances by Topic Estimation with Domain
Extensibility in Multi-Domain Spoken Dialogue Systems,
Speech Communication, No.50 (2008) 836-870.
doi:10.1016/j.specom.2008.05.010
-
Ryu Takeda,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Audition using an Adaptive Filter Based on Independent Component
Analysis,
Journal of Robotic Society of Japan, Vol.26, No.6 (Sep. 2008)
pp.529-536.
学会サーバ
-
Yuichiro Fukubayashi,
Kazunori Komatani,
Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
WFST-based Language Understanding for Rapid Prototyping of Spoken Dialogue
Systems,
IPSJ Journal,
Vol.49, No.8 (Aug. 2008) pp.2762-2772,
Information Processing Society of Japan,
pdf,
Digital Library.
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Predicting Object Dynamics from Visual Images through Active Sensing Experiences,
Advanced Robotics, Vol.22, No.5 (May 2008) pp.527-546,
doi:10.1163/156855308X294879,
Online version,
VSP and Robotics Society of Japan.
-
Hiroshi G. Okuno,
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata:
A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals,
Journal of Acoustic Society of America, Vol.123, No.5 (May 2008) Pt.2, pp.3066-3067.
-
Hideki Kawahara, Masanori Morise,
Toru Takahashi,
Ryuichi Nishimura, Hideki Banno, Toshio Irino:
A temporally stable representation of power spectra of periodic signals and
its application to F0 and periodicity estimation,
Journal of Acoustic Society of America, Vol.123, No.5 (May 2008) Pt.2, pp.3074-3075.
Book Chapters, Reviews
-
Shun Shiramatsu,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SalienceGraph: Visualizing Salience Dynamics of Written Discourse
by Using Reference Probability and PLSA,
T. B. Ho and Z-H. Zhou (Eds.): PRICAI-2008: Trends in Artificial
Intelligence, 890-902, (84/234, 35.8%),
Lecture Notes in Computer Science, Vol.5351, Springer-Verlag, Dec. 2008.
doi:10.1007/978-3-540-89197-0_83
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Integrating Topic Estimation and Dialogue History for Domain Selection
in Multi-Domain Spoken Dialogue Systems,
Ngoc Thanh Nguyen,Leszek Borzemski,Adam Grzech,Moonis Ali (Eds.):
New Frontiers in Applied Artificial Intelligence,
pp.294-304, Lecture Notes in Artificial Intelligence, Vol.5027,
June, 2008.
doi:10.1007/978-3-540-69052-8_31
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Vocal Imitation using Vocal Tract Model and Recurrent Neural Network,
Masumi Ishikawa, Kenji Doya, Hiroyuki Miyamoto, Takeshi Yamakawa (Eds.):
Neural Information Processing,
14th International Conference, ICONIP 2007, Revised Selected Papers,
Part II, pp.222-232,
Lecture Notes in Computer Science 4985, Springer-Verlag, June 2008.
doi:10.1007/978-3-540-69162-4_24
-
Tetsuya Ogata,
Hideki Kojima,
Hiroshi G. Okuno:
Motion Emergence from Sound using Cross-Modal Mapping on Recurrent
Neural Network,
Aucouturier, J.-J. (ed.) Cheek to Chip: Dancing Robots and AI's Future,
IEEE Intelligent Systems,
Vol.23, No.2 (Apr. 2008), 74--84,
doi:10.1109/MIS.2008.22
Peer-reviewed Conference Papers
-
Masato Onishi,
Toru Takahashi,
Toshio Irino,
Hideki Kawahara:
Vowel-based frequency alignment function design and recognition-based time
alignment for automatic speech morphing,
Proceedings of IEEE Workshop on Spoken Language Technology 2008 (SLT 2008), accepted, Goa, India, December, 15--18, 2008,
-
Yuji Kubota,
Masatoshi Yoshida,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Design and Implementation of 3D Auditory Scene Visualizer towards
Auditory Awareness with Face Tracking,
Proceedings of IEEE International Symposium on Multimedia (ISM2008),
pp.468-476 (acceptance rate for regular papers, 24%),
Berkeley, Dec. 16. 2008.
pdf
doi:10.1109/ISM.2008.107
-
Yuji Kubota,
Shun Shiramatsu,
Masatoshi Yoshida,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
3D Auditory Scene Visualizer With Face Tracking:
Design and Implementation For Auditory Awareness Compensation,
Proceedings of 2nd International Symposium on Universal Communication
(ISUC2008), pp.42-49, IEEE, Osaka, Dec. 15-16. 2008.
pdf
doi:10.1109/ISUC.2008.59
-
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
An Open Source Software System For Robot Audition HARK and Its Evaluation,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008),
pp.561-566, Daejeon, Korea, Dec. 3, 2008.
pdf
-
Kazumasa Murata,
Kazuhiro Nakadai,
Ryu Takeda,
Hiroshi G. Okuno,
Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino:
A beat-tracking robot for human-robot interaction and its evaluation,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008),
pp.79-84, Daejeon, Korea, Dec. 2, 2008.
pdf
-
Shun Shiramatsu,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SalienceGraph: Visualizing Salience Dynamics of Written Discourse
by Using Reference Probability and PLSA,
Proceedings of the Tenth Pacific Rim International Conference on
Artificial Intelligence (PRICAI-08), 890-902, (84/234, 35.8%),
Lecture Notes in Computer Science, Vol.5351, Springer-Verlag,
Hanoi, Vienam, Dec. 15-19. 2008.
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Analysis of Reliable Predictability based Motion Generation using RNNPB,
Proceedings of Joint 4th International Conference on Soft Computing and
Intelligent Systems and 9th International Symposium on advanced
Intelligent Systems (SCIS & ISIS 2008)
pp.305-310, Nagoya, JAPAN, September 17-21, 2008.
-
Hideki Kawahara,
Masanori Morise, Hideki Banno,
Toru Takahashi,
Ryuichi Nishimura, Toshio Irino:
Spectral Envelope Recovery beyond the Nyquist Limit for High-Quality Manipulation of Speech Sounds,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.22-26,
Brisbane, Sept. 24, 2008.
-
Toru Takahashi,
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:Soft Missing-Feature Mask Generation for Simultaneous Speech Recognition
System in Robots,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.992-997,
Brisbane, Sept. 24, 2008.
-
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:Predicting ASR Errors by Exploiting Barge-In Rate of Individual Users
for Spoken Dialogue Systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.183--186,
Brisbane, Sept. 2008.
-
Masaki Katsumaru,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:Expanding Vocabulary for Recognizing User¥'s Abbreviations of Proper Nouns
without Increasing ASR Error Rates in Spoken Dialogue Systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.187-190,
Brisbane, Sept. 2008.
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:Extensibility Verification of Robust Domain Selection against Out-of-Grammar
Utterances in Multi-Domain Spoken Dialogue System,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.487-490,
Brisbane, Sept. 2008.
-
Shun Nishide,
Tetsuya Ogata,
Ryunosuke Yokoya,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Active Ssensing based Dynamical Object Feature Extraction,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), pp.1-7, TuAT1.1,
IEEE, RSJ, Nice, 23 Sep. 2008.
pdf
doi:10.1109/IROS.2008.4650794
-
Takeshi Mizumoto,
Ryu Takeda,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:A Robot Listens to Music and Counts Its Beats Aloud by Separating Music
from Counting Voice,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), 1538-1543, WeAT6.1
IEEE, RSJ, Nice, 24 Sep. 2008.
pdf
doi:10.1109/IROS.2008.4650821
Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist.
-
Hyun-Don Kim,
Jinsung Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Target Speech Detection and Separation for Humanoid Robot in Sparse
Dialogue with Noisy Home Environments (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), 1705-1711, WeAT10.4
IEEE, RSJ, Nice, 24 Sep. 2008.
pdf
doi:10.1109/IROS.2008.4650977
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Segmenting Acoustic Signal with Articulatory Movement using
Recurrent Neural Network for Phoneme Aquisition (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), 1712-1717, WeAT10.5
IEEE, RSJ, Nice, 24 Sep. 2008.
pdf
doi:10.1109/IROS.2008.4651060
-
Ryu Takeda,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:Barge-in-able Robot Audition Based on ICA and Missing Feature Theory
under Semi-Blind Situation (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), 1718-1723, WeAT10.6,
IEEE, RSJ, Nice, 24 Sep. 2008.
pdf
doi:10.1109/IROS.2008.4650821
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Design and Evaluation of Two-Channel Sound Source Localization over
Entire Azimuth Range for Moving Talker (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), pp.2197-2203 Sept. 2008
pdf
IEEE, RSJ, Nice, Sept. 2008.
doi:10.1109/IROS.2008.4650947
-
Kazumasa Murata,
Kazuhiro Nakadai,
Kazuyoshi Yoshii,
Ryu Takeda,
Toyotaka Torii,
Hiroshi G. Okuno,
Yuji Hasegawa, Hiroshi Tsujino:
A Robot Uses Its Own Microphone to Synchronize Its Steps to Musical Beats
While Scatting and Singing,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), pp.2459-, WeCT6.1,
IEEE, RSJ, Nice, 24 Sep. 2008.
pdf
doi:10.1109/IROS.2008.4650596
Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist.
-
Kohei Sumi,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Chord Recognition Based on Probabilistic Integration of
Chord Transition and Bass Pitch Estimation,
Proceedings of 9th International Conference on Musical Information
Retreival (ISMIR-2008), 39-44,
Philadelphia, 15 Sep. 2008.
pdf
-
Kohei Sumi,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Chord Recognition Based on Probabilistic Integration of
Chord Transition and Bass Pitch Estimation,
Proceedings of 9th International Conference on Musical Information
Retreival (ISMIR-2008), 39-44,
Philadelphia, 15 Sep. 2008.
pdf
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source
Separation based on Integrated Harmonic and Inharmonic Models,
Proceedings of 9th International Conference on Musical Information
Retreival (ISMIR-2008),133-138,
Philadelphia, 15 Sep. 2008.
pdf
-
Kazumasa Murata,
Kazuhiro Nakadai,
Kazuyoshi Yoshii,
Ryu Takeda,
Toyotake Torii,
Hiroshi G. Okuno,
Yuji Hasegawa, Hiroshi Tsujino:
A Robot Singer with Music Recognition Based on Real-Time Beat Tracking,
Proceedings of 9th International Conference on Musical Information
Retreival (ISMIR-2008), 199-204,
Philadelphia, 15 Sep. 2008.
pdf
-
Takehiro Abe,
Katsutoshi Itoyama,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SSynthesis Approach for Manipulating Pitch of a Musical Instrument Sound
with Considering Timbral Characteristics,
Proceeding of the 11th International Conference on Digital Audio Effects
(DAFx-08), 249-256,
Espoo, Finland, Sep.1-4. 2008.
pdf
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Integrating Topic Estimation and Dialogue History for Domain Selection
in Multi-Domain Spoken Dialogue Systems
Proceeding of the 21st International Conference on Industrial, Engineering and Other Applications of Applied Intelligence Systems (IEA/AIE-2008),
pp.294-304, (acceptance rate is about 30%), LNAI 5027,
Wroclaw, Poland, Jun. 18, 2008.
doi:10.1007/978-3-540-69052-8_31
-
Hiroshi G. Okuno,
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata:
A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals,
Proceedings of Acoustics'08,
CD-ROM , 1pSCa8, June 30, 2008.
-
Hideki Kawahara, Masanori Morise,
Toru Takahashi,
Ryuichi Nishimura, Hideki Banno, Toshio Irino:
A temporally stable representation of power spectra of periodic signals and
its application to F0 and periodicity estimation,
Proceedings of Acoustics'08,
CD-ROM , 1pSCc24, June 30, 2008.
-
Hideki Kawahara, Masanori Morise,
Toru Takahashi,
Ryuichi Nishimura, Hideki Banno, Toshio Irino:
A unified approach for F0 extraction and aperiodicity estimation
based on a temporally stable power spectral representation,
Proceedings of ISCA Tutorial and Research Workshop (ITRW) on
"Speech Analysis and Processing for Knowledge Discovery",
June 4, 2008, Aalborg, DK.
-
Shun Nishide,
Tetsuya Ogata,
Ryunosuke Yokoya,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Object Dynamics Prediction and Motion Generation
based on Reliable Predictability,
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2008), 1608-1614,
(May 20, 2008).
pdf
doi:10.1109/ROBOT.2008.4543431
-
Kazuhiro Nakadai,
Shun'ichi Yamamoto,
Hiroshi G. Okuno,
Hirofumi Nakajima, Yuji Hasegawa,
Hiroshi Tsujino:
A Robot Referee for Rock-Paper-Scissors Sound Games,
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2008), 3469--3474,
(May 20, 2008).
pdf
doi:10.1109/ROBOT.2008.4543741
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Two-Channel-Based Voice Activity Detection for
Humanoid Robots in Noisy Home Environments,
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2008), 3495-3501,
(May 20, 2008).
pdf
doi:10.1109/ROBOT.2008.4543745
-
Hiroshi G. Okuno,
Kazuhiro Nakadai:
COMPUTATIONAL AUDITORY SCENE ANALYSIS AND ITS APPLICATION TO ROBOT AUDITION,
(invited talk),
Proceedings of Hands-free Speech Communication and Microphone Arrays (HSCMA-2008), pp.123-127, May 7, 2008, Trento, Italy.
pdf
doi:10.1109/HSCMA.2008.4538702
-
Hideki Kawahara, Masanori Morise,
Toru Takahashi,
Ryuichi Nishimura, Toshio Irino, Hideki Banno:
TANDEM-STRAIGHT: A Temporally Stable Power Spectral Representation for
Periodic Signals and Applications to Interference-free Spectrum, F0,
and Aperiodicity Estimation,
Proceedings of 2008 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2008),
pp.3933-3936, Las Vegas, Nevada, USA, March 30 - April 4, 2008.
Patents
-
Sound Source Separation System, Sound Source Separation Method, and
Computer Program for Sound Source Separation,
PCT/JP2008/057310, WO 2008/133097
Date of Open: 06.11.2008,
Inventors: Katsutoshi Itoyama, Hiroshi Okuno, Masataka Goto.
Assignee: Kyoto University, AIST.
-
Moving object equipped with ultra-directional speaker,
Patent No. US 7,424,118,
Date of Patent: Sep. 9, 2008.
Inventors: Kiyofumi Mori, Shunji Yoshida, Hiroshi Okuno, Kazuhiro Nakadai,
Hiroshi Tsujino,
PCT No.: PCT/JP2005/002043.
-
Speech Recognition Apparatus,
Application No. 20080167869.
Filed: July 10, 2008.
Inventors: Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto.
PCT No.: PCT/JP05/22601.
Academic Year 2007
Thesis
- Shun Shiramatsu:
Salience-based Modeling of Discourse Context,
Ph.D Thesis, Feb. 2008.
pdf
- Shun'ichi Yamamoto:
Real-Time Robot Audition Software Based on Missing Feature Theory
for Multiple Simultaneous Talkers in Real Environments,
Ph.D Thesis, Feb. 2008.
- Kazuyoshi Yoshii:
Studies on Hybrid Music Recommendation Using Timbral and Rhythmic Features,
Ph.D Thesis, Feb. 2008.
- Katsutoshi Itoyama:
MS Thesis, Feb. 2008.
- Ryu Takeda
MS Thesis, Feb. 2008.
- Yuichiro Fukubayashi:
MS Thesis, Feb. 2008.
- Koichi Tokuda:
MS Thesis,
Feb. 2008.
- Ryunosuke Yokoya:
MS Thesis, Feb. 2008.
- Kohei Sumi:
BE Thesis, Feb. 2007.
- Masaki Katsumaru:
BE Thesis, Feb. 2008.
- Hiroki Saito:
BE Thesis, Feb. 2007.
- Zhang:
BE Thesis, Feb. 2007.
- Takeshi Mizumoto:
BE Thesis, Feb. 2007.
Peer-reviewed Journal Papers
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Simultaneous Realization of Score-informed Sound Source Separation of
Polyphonic Musical Siganals and Constrained Parameter Estimation for
Integrated Model of Harmonic and Inharmonic Structure,
IPSJ Journal, Vol.49, No.3 (Mar., 2008) pp.1465-1479,
Information Processing Society of Japan,
Digital Library,
pdf
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
,
Transactions of Human Interface Society, Vol.10, No.1 (Feb. 2008) pp.59-72.
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Hybrid Collaborative and Content-based Music Recommendation Using
Incrementally-trainable Probabilistic Generative Model,
IEEE Transactions on Audio, Speech and Language Processing,
Vol.16, No.2 (Feb. 2008) pp.435-447,
pdf,
doi:10.1109/TASL.2007.911503
-
Shun Shiramatsu,
Kazunori Komatani,
Koiti Hasida,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Game-Theoretic Model of Referential Coherence and Its Statistical
Verification Based on Large Japanese and English Corpora,
Natural Language Processing, Vol.14, No.4 (Oct. 2007) pp.199-239.
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Experience Based Imitation Using RNNPB,
Advanced Robotics, Vol.21, No.12 (2007) pp.1351-1367,
doi:10.1163/156855307781746106,
Online version,
VSP and Robotics Society of Japan.
-
Chyon Hae Kim, Jun-ichi Idesawa,
Tetsuya Ogata,
Shigeki Sugano:
Restraining of Noises in Self-Organizing Network Elements,
Journal of Robotics Society of Japan, Vol.25, No.6 (Sep. 2007) pp.115-122.
Digital Library
-
Kazuhiro Nakadai,
Hirofumi Nakashima,
Masamitsu Murase,
Hiroshi G. Okuno,
Yuji Hasegawa, Hiroshi Tsujino:
Tracking of Mulitiple Sound Sources by Integration of Robot-Embedded and
In-Room Microphone Arrays,
Journal of Robotics Society of Japan, Vol.25, No.6 (Sep. 2007) pp.181-191.
Digital Library,
pdf
-
Jean-Marc Valin,
Shun'ichi Yamamoto,
Jean Rouat, Francois Michaud,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Robust Recognition of Simultaneous Speech By a Mobile Robot,
IEEE Transactions on Robotics,
Vol.23, No.4 (Aug. 2007) pp.742--752,
pdf,
doi:10.1109/TRO.2007.900612
-
Hiroaki ARIE,
Tetsuya Ogata,
Jun TANI, and Shigeki SUGANO:
Reinforcement learning of continuous motor sequence with hidden state,
Advanced Robotics,
Special Issue on Robotic Platforms for Research in Neuroscience,
VSP and Robotics Society of Japan, Vol.21, No.10 (July 2007), pp.1215-1229.
-
Taro Watanabe, Kenji Imamura, Eiichiro Sumita,
Hiroshi G. Okuno:
Statistical machine translation using hierarchical phrase alignment,
Systems and Computers in Japan, Vol.38, No.6 (June 2007) pp.70-79,
doi:10.1002/scj.20271
-
Naoyuki Kanda,
Kazunori Komatani,
Mikio Nakano,
Kazuhiro Nakadai,
Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robust Domain Selection Using Dialogue History in Multi-domain Spoken Dialogue Systems,
IPSJ Journal, Vol.48, No.5 (May 2007) pp.1980-1989, IPSJ.
Book Chapters, Articles
-
Hiroshi G. Okuno,
Tetsuya Ogata,
Kazunori Komatani:
Robot Audition from the viewpoint of Computational Auditory Scene Analysis,
Informatics Education
and Research for Knowledge-Creation Society Infrastructure (ICKS'08),
pp.35-40, Jan. 2008.
doi:10.1109/ICKS.2008.10
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Structual Feature Extraction based on Active Sensing Experiences,
Informatics Education
and Research for Knowledge-Creation Society Infrastructure (ICKS'08),
pp.209-212, Jan. 2008.
doi:10.1109/ICKS.2008.9
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Evaluation of Two-Channel-Based Sound Source Localization using
3D Moving Sound Creation Tool,
Informatics Education
and Research for Knowledge-Creation Society Infrastructure (ICKS'08),
pp.210-216.
doi:10.1109/ICKS.2008.25
-
Koiti Hasida,
Shun Shiramatsu,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Meaning Games,
LENLS 2007 Postproceedings, accepted,
LNCS
Oct. 2007.
-
Hiroshi G. Okuno,
Moonis Ali (Eds.):
New Trends in Applied Artificial Intelligence (IEA/AIE-2007),
Lecture Notes in Computer Science, Vol.4570, Springer-Verlag,
14 Jun. 2007, XXI, 1194p. ISBN: 978-3-540-73322-5.
doi:10.1007/978-3-540-73325-6
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm
and Particle Filter,
New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570,
pp.280-290, Springer-Verlag.
Kyoto, Jun. 2007.
doi:10.1007/978-3-540-73325-6_28
-
Ryu Takeda,
Shun'ichi Yamamoto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and
MTF-based ASR,
New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570,
pp.384-394, Springer-Verlag.
Kyoto, Jun. 2007.
doi:10.1007/978-3-540-73325-6_38
-
Hiroshi G. Okuno,
Tetsuro Kitahara,
Kazuyoshi Yoshii:
Music Feature Extraction and Music Information Retrieval,
IEE Journal,
Vol.127, No.7 (Jul. 2007).
-
Hiroshi G. Okuno,
Hiroshi Mizoguchi:
Information Integration for Robot Audition: the State-of-the-art and issues.
SICE,
Vol.46, No.6 (Jun. 2007) pp.415-419.
-
Shun'ichi Yamamoto,
Ryu Takeda,
Hiroshi G. Okuno:
Missing Feature Theory Based Automatic Speech Recognition and Its
Application to Simultaneous Multiple Speaker Speech Recognition,
SICE,
Vol.46, No.6 (Jun. 2007) pp.447-452.
-
Shinichi Ueno, Fumihiro Adachi,
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Bus Information System Based on User Models and Dynamic Generation
of VoiceXML Scripts,
New Frontiers in Artificial Intelligence (JSAI 2003/2004),
LNAI 3609, pp.45-60, 2007.
Springer-Verlag.
Peer-reviewed Conference Papers
-
Yuichiro Fukubayashi,
Kazunori Komatani,
Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
Rapid Prototyping of Robust Language Understanding Modules with Less
Training Data for Spoken Dialogue Systems,
Proceedings of the Third International Joint Conference on Natural Language
Processing (IJCNLP 2008),
accepted, Jan. 2008, Hyderabad, India.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Design and Implementation of A Robot Audition System for Automatic Speech Recognition of Simultaneous Speech,
Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-2007), 111-116, acceptance rate (115/267),
IEEE, Kyoto, Dec. 2007.
pdf
doi:10.1109/ASRU.2007.4430093.
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Vocal Imitation using Vocal Tract Model and Recurrent Neural Network,
Proceedings of International Conference on Neural Information Processing (ICONIP-2007),
Vol.2, pp.222-232, Nov. 2007.
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Vocal Imitation Using Physical Vocal Tract Model,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1846-1851,
IEEE, RSJ, San Diego, Oct. 2007.
pdf
doi:10.1109/IROS.2007.4399137.
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Discovery of Other Individuals by Projecting a Self-Model Through Imitation,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1009-1014,
IEEE, RSJ, San Diego, Oct. 2007.
pdf
doi:10.1109/IROS.2007.4399153.
-
Kazuyoshi Yoshii,
Kazuhiro Nakadai,
Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Biped Robot that Keeps Steps in Time with Musical Beats while Listening to Music with Its Own Ears,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1743-1750,
IEEE, RSJ, San Diego, Oct. 2007.
pdf
doi:10.1109/IROS.2007.4399244.
-
Tetsuya Ogata,
Masamitsu Murase,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno,
Two-way Translation of Compound Sentences and Arm Motions by Recurrent Neural Networks,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1858-1863,
IEEE, RSJ, San Diego, Oct. 2007.
pdf
doi:10.1109/IROS.2007.4399265.
-
Ryu Takeda,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Exploiting Known Sound Sources to Improve ICA-based Robot Audition in Speech Separation and Recognition,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1757-1762,
IEEE, RSJ, San Diego, Oct. 2007.
pdf
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Auditory and Visual Integration based Localization and Tracking of Humans in Daily-life Environments,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.2021-2027,
IEEE, RSJ, San Diego, Oct. 2007.
pdf
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Hybrid Collaborative and Content-based Music
Recommendation Using Probabilistic Model with Latent User Preferences,
Proceedings of 8th International Conference on Musical Information
Retreival (ISMIR-2007), long paper (15.8% of 214 submissions),
pp.89-94, Vienna, Sep. 2007.
-
Kazunori Komatani,
Yuichiro Fukubayashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Introducing Utterance Verification in Spoken Dialogue System to Improve Dynamic Help Generation for Novice Users,
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue,
pp.202-205, Sep. 2007
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Topic Estimation with Domain Extensibility for Guiding User's Out-of-Grammar
Utterance in Multi-Domain Spoken DIalogue Systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2007), pp.2561-2564,, Antwerp, Sep. 2007.
pdf
-
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Analyzing Temporal Transition of Real User's Behaviors in a Spoken
Dialogue System,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2007), pp.142-145, Antwerp, Sep. 2007.
pdf
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Auditory and VIsual Integration based Localization and Tracking of
Multiple Moving Sounds in Daily-life Environments,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man 2007), 399-404, IEEE, Jeju Island, Korea, Aug. 2007.
pdf
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm
and Particle Filter,
New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570,
pp.280-290, Springer-Verlag.
Kyoto, Jun. 2007.
doi:10.1007/978-3-540-73325-6_28
-
Ryu Takeda,
Shun'ichi Yamamoto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and
MTF-based ASR,
New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570,
pp.384-394, Springer-Verlag.
Kyoto, Jun. 2007.
doi:10.1007/978-3-540-73325-6_38
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
INTEGRATION AND ADAPTATION OF HARMONIC AND INHARMONIC MODELS FOR SEPARATING POLYPHONIC MUSICAL SIGNALS,
Proceedings of 2007 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2007),
pp.57-60, Hawaii, April 2007, pp.57-60,
(15.1% acceptance rate for lecture presentation)
doi:10.1109/ICASSP.2007.366615
-
Haruhiko Niwa,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Distance Estimation of Hidden Objects Based on
Acoustical Holography by applying Acoustic Diffraction of
Audible Sound,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2007), pp.423-428,
(Apr. 2007).
doi:10.1109/ROBOT.2007.363823
-
Tetsuya Ogata,
Shohei Matsumoto,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Human-Robot Cooperation using Quasi-symbols
Generated by RNNPB Model,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2007), pp.2156-2161,
(Apr. 2007).
doi:10.1109/ROBOT.2007.363640
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Predicting Object Dynamics from Visual Images
through Active Sensing Experiences,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2007), pp.2501-2506,
(Apr. 2007).
doi:10.1109/ROBOT.2007.363841
-
Chyon Hae Kim,
Tetsuya Ogata,
Shigeki Sugano:
Enhancement of Self Organizing Network Elements for Supervised Learning,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2007), WeA3.5,
(Apr. 2007).
Patents
-
Robot acoustic device and robot acoustic system
Patent No. US 7,215,786.
Date of Patent: May 8, 2007.
Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano,
Assignee: Japan Science and Technology Agency.
Academic Year 2006
Peer-reviewed Journal Papers
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Drumix: An Audio Player with Functions of Realtime Drum-Part
Rearrangement for Active Music Listening,
Journal of Information Proceeding Society of Japan, pp.1229-1239,
Vol.48, No.3 (Mar. 2007), IPSJ.
Vol.3 (2007), pp.134-144.
DL
-
Hyun-Don Kim,
Jong-Suk Choi,
and Munsang Kim:
Human-robot interaction in real environments by audio-visual integration,
International Journal of Control Automation and Systems,
Vol.5, No.1 (Feb. 2007) pp.61-69.
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music,
Journal of Information Proceeding Society of Japan,
Vol.48, No.1 (Jan. 2007), pp.214-226, IPSJ.
IPSJ Digital Courier,
Vol.3 (2007) pp.1-13.
-
Shunsuke Kurotaki, Noriaki Suzuki,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hideharu Aamano:
Sound Source Separation Filter for Robot Audition used by Dynamic
Reconfigurable Device, DRP
(in Japanese),
IEICE Transaction on Information and Systems,
Vol.J90-D, No.3, pp.897-907, Mar. 2007,
IEICE.
DL
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata
Hiroshi G,. Okuno:
Simultaneous Speech Recognition based on Automatic Missing-Feature Mask
Generation integrated with Sound Source Separation
(in Japanese),
Journal of Robotics Society of Japan, Vol.25, No.1 (Jan. 2007)
pp.92-102.
Digital Library,
pdf
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
Drum Sound Recognition for Polyphonic Audio Signals
by Adaptation and Matching of Spectral Templates
with Harmonic Harmonic Structure Suppression,
IEEE Transactions on Audio, Speech and Language Processing,
Vol.15, No.1 (Jan. 2007) pp.333-345,
pdf,
doi:10.1109/TASL.2006.876754
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrument Identification in Polyphonic Music:
Feature Weighting to Minimize Influence of Sound Overlaps,
EURASIP Journal on Applied Signal Processing,
Special issue on Music Information Retrieval Based on Signal Processing,
Vol.2007, Article ID 51979, 15 pages, 2007,
doi:10.1155/2007/51979
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrument Identification in Polyphonic Music: Feature Weighting Based on Mixed-Sound Template and Use of Musical Context
(in Japanese),
IEICE Transaction on Information and Systems, Vol.J89-D, No.12 (Dec. 2006), pp.2721-2733,
IEICE.
-
Naoyuki Kanda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Spoken Language Understinding Using Dialogue Context in Database Search
(in Japanese),
IPSJ Journal, Vol.47, No.6 (June 2006) pp.1802-1811, IPSJ.
Paper in pdf
-
Hiromasa Fujihara,
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Singer Identification Method for Musical Pieces on the Basis of
Accompaniment Sound Reduction and Reliable Frame Selection
(in Japanese),
IPSJ Journal, Vol.47, No.6 (June 2006) pp.1831-1843, IPSJ.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Ryu Takeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. OKuno:
Missing Feature Theory Based Interface Between Sound Source Separation and Automatic Speech Recognition and Applying to Multiple Robots,
(in Japanese),
Journal of Human Interface Society, Vol.8, No.2 (Jun. 2006) pp.203-212.
-
Takamichi Saito,
Kentaro Umesawa,
Hiroshi G. Okuno:
A Privacy-Enhanced Access Control,
Systems and Computers in Japan, (2006)
A Privacy-Enhanced Access Control,
Systems and Computers in Japan, Vol.37, No.5 (May 2006) pp.77-86.
doi:10.1002/scj.10214
-
Tenkai Kim,
尾形 哲也,
Shigeki Sugano;
ローカルルールに基づいた論理回路の自己組織アルゴリズム
(in Japanese),
Transaction on SICE, Vol.42, No.4 (Apr. 2006) pp.334-341.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Ryu Takeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improving Location-Based Speech Recognition of Simultaneous Speech Signals
by Parameter Optimization with Genetic Algorithm (in Japanese),
Human Interface, Vol.8, No.2 (Jun. 2006) pp.203-212.
-
Takamichi Saito,
Kentaro Umesawa,
Hiroshi G. Okuno:
A Privacy-Enhanced Access Control,
Systems and Computers in Japan, (2006)
A Privacy-Enhanced Access Control,
Systems and Computers in Japan, Vol.37, No.5 (May 2006) pp.77-86.
doi:10.1002/scj.10214
Book Chapters, Survey Papers, and Articles
-
Hiroaki Arie, Jun Namikawa,
Tetsuya Ogata,
Jun Tani, Shigeki SUGANO:
Reinforcement Learning Algorithm with CTRNN in Continuous Action Space,
Neural Information Processing (ICONIP-2006),
Part I, LNCS 4232, pp.387-396.
Oct. 2006.
doi:10.1007/11893028_44
-
Shun'ichi Yamamoto,
Ryu Takeda,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition,
PRICAI 2006: Trends in Artificial Intelligence,
LNCS 4099, pp.484-494, accepted as regular paper for ORAL Presentation (14.1%),
Springer-Verlag, Guilin, China, Aug. 2006.
doi:10.1007/11801603_52
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Ryu Takeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Genetic Algorithm based Improvement of Robot's Hearing Capabilities in
Separating and Recognizing Simultaneous Speech Signals,
Advances in Applied Artificial Intelligence (IEA/AIE-2006),
LNAI 4031, pp.207-217, Springer-Verlag.
Annecy, France, Jun. 2006.
doi:10.1007/11779568_24
Peer-reviewed Conference Papers
-
Hiroshi G. Okuno,
Tetsuya Ogata,
Kazunori Komatani:
Computational Auditory Scene Analysis and Its Application to Robot Audition:
Five Years Experience,
Proceedings of the 2nd International Conference on Informatics Research
for Development of Knowledge Society Infrastructure (ICKS 2007),
pp.69-76, Jan. 2007.
doi:10.1109/ICKS.2007.7
-
Shun Shiramatsu,
Kazunori Komatani,
Koiti Hasida,
Tetsuya Ogata,
Hiroshi G. Okuno:
Meaning-Game-based Centering Model with Statistical
Definition of Utility of Referential Expression and
Its Verification Using Japanese and English Corpora,
Proceedings of the 6th Discourse Anaphora and Anaphor Resolution
Colloquium (DAARC2007), pp.121-126,
Lisbon, Mar. 2007.
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Musical Instrument Recognizer ``Instrogram'' and Its Application to Music Retrieval based on Instrumentation Similarity,
Proceedings of IEEE International Symposium on Multimedia (ISM2006),
pp.265-272,
San Diego, Dec. 2006.
doi:10.1109/ISM.2006.113
-
Hiromasa Fujihara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic synchronization between lyrics and music CD recordings based on Viterbi alignment of segregated vocal signals,
Proceedings of IEEE International Symposium on Multimedia (ISM2006),
pp.257-264,
San Diego, Dec. 2006.
doi:10.1109/ISM.2006.38
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Hybrid Collaborative and Content-based Music
Recommendation Using Probabilistic Model with Latent User Preferences,
Proceedings of 7th International Conference on Musical Information
Retreival (ISMIR-2006), pp.296-301,
Vancouver, CA, Sep. 2006.
pdf
-
Katsutoshi Itoyama,
Tetsuro Kitahara,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Feature Weighting in Automatic Transcription of
Specified Part in Polyphonic Music,
Proceedings of 7th International Conference on Musical Information
Retreival (ISMIR-2006), pp.172-175,
Vancouver, CA, Sep. 2006.
pdf
-
Kazuhiro Nakadai,
Hirofumi Nakajima,
Masamitsu Murase,
Satoshi Kaijiri.
Kentaro Yamada, Yuji Hasegawa,
Hiroshi G. Okuno,
Hiroshi Tsujino:
Real-Time Tracking of Multiple Sound Sources by
Integration of In-Room and Robot-Embedded Microphone Arrays,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 852-859,
IEEE, RSJ, Beijing, China, Sep. 2006.
pdf,
doi:10.1109/IROS.2006.281737.
-
Ryu Takeda,
Shun'ichi Yamamoto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Missing-Feature based Speech Recognition for Two
Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 878-885,
IEEE, RSJ, Beijing, China, Sep. 2006.
pdf,
doi:10.1109/IROS.2006.281741,
IEEE Robotics and Automation Society Japan Chapter Young Award
RSJ/SICE Award for IROS 2006 Best Paper Nomination Finalist
(2nd to 45th Place) at IROS-2007.
-
Haruhiko Niwa,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Multiple Acoustical Holography Method for Localization of Objects
in Broad Range using Audible Sound,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 1146-1151,
IEEE, RSJ, Beijing, China, Sep. 2006.
pdf,
doi:10.1109/IROS.2006.281844
-
Chyon Hae KIM,
Tetsuya Ogata,
Shigeki SUGANO:
wEfficient Organization of Network Topology based on Reinforcement Signals,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 3154-3159,
IEEE, RSJ, Beijing, China, Sep. 2006.
pdf
-
Yuki Suga, Chihiro Endo, Daizo Kobayashi, Takeshi Matsumoto,
Tetsuya Ogata,
Shigeki Sugano:
User-Adaptive Human-Robot Interaction System using Interactive EC,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 3663-3668,
IEEE, RSJ, Beijing, China, Sep. 2006.
pdf
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Experience Based Imitation Using RNNPB,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 3669-3674,
IEEE, RSJ, Beijing, China, Sep. 2006.
pdf,
doi:10.1109/IROS.2006.281724.
-
Jong-Suk Choi,
Hyun-Don Kim,
and Munsang Kim:
Probabilistic Speaker Localization in Noisy Environment by
Audio-Visual Integration,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 4704-4709,
IEEE, RSJ, Beijing, China, Sep. 2006.
pdf
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Real-Time Robot Audition System That Recognizes Simultaneous Speech
in the Real World,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 5333-5338,
IEEE, RSJ, Beijing, China, Sep. 2006.
pdf,
doi:10.1109/IROS.2006.282037.
-
Tetsuya Ogata,
Yuya Hattori,
Hideki Kojima,
Kazunori Komatani,
Hiroshi G. Okuno:
Generation of Robot Motions from Environmental Sounds using Inter-modality
Mapping by RNNPB,
Proceedings of Sixth International Workshop on Epigenetic Robotics
(EpiRobo-2006), 95-102, Paris, Sep., 2006.
-
Hiromasa Fujihara,
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Speaker Identification under Noisy Environments by using Harmonic Structure
Extraction and Reliable Frame Weighting,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2006), 1459-1462,
Pittsburgh, Sep. 2006.
pdf
-
Ryu Takeda,
Shun'ichi Yamamoto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improving Speech Recognition of Two Simultaneous Speech Signals by
Integrating ICA BSS and Automatic Missing Feature Mask Generation,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2006), 2302-2305,
Pittsburgh, Sep. 2006.
pdf
-
Yuichiro Fukubayashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Dynamic Help Generation by Estimating User's Mental Model in Spoken Dialogue
Systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2006), 1946-1949,
Pittsburgh, Sep. 2006.
pdf
-
Shun'ichi Yamamoto,
Ryu Takeda,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Leak Energy based Missing Feature Mask Generation for ICA and GSS and Its
Evaluation with Simultaneous Speech Recognition,
Proceedings of ISCA Tutorial and Research Workshop on Statistical and
Perceptual Audition (SAPA2006),
pp.42-46,
pdf
-
Kazunori Komatani,
Naoyuki Kanda,
Mikio Nakano, Kazuhiro Nakadai, Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
Multi-Domain Spoken Dialogue System with Extensibility and Robustness
against Speech Recognition Errors,
Proceedings of SIGdial Workshop on Discourse and Dialogue,
9-17, Aug. 2006
-
Hiroshi G. Okuno:
Computational Auditory Scene Analysis
- Towards Listening to Several Thinkgs at Once -,
50th Anniversary Summit of Artificial Intelligence (ASAI50) workshop and abstract booklet,
accepted for inclusion, Monte Verita, Switzerland, July 2006.
-
Takuya Yoshioka,
Takafumi Hikichi, Masato Miyoshi,
Hiroshi G. Okuno:
Robust Decomposition of Inverse Filter of Channel and
Prediction Error Filter of Speech Signal for Dereverberation,
Proceedings of the 14th European Signal Processing Conference
(EUSIPCO 2006), CD-ROM Proceedings, Florence, 2006.
pdf
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Robot Imitation from Active-Sensing Experiences,
Proceedings of Fifth International Conference on Learning and Development
(ICDL06), accepted, Bloomington, IN USA, May 2006.
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
AN ERROR CORRECTION FRAMEWORK BASED ON DRUM PATTERN PERIODICITY FOR
IMPROVING DRUM SOUND DETECTION,
Proceedings of 2006 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2006),
Vol.V, pp.237-240, Toulouse, May 2006.
pdf,
doi:10.1109/ICASSP.2006.11661256
IEEE Kansai Chapter Young Researcher Award
-
Hiromasa Fujihara,
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
F0 ESTIMATION METHOD FOR SINGING VOICE IN POLYPHONIC AUDIO SIGNAL BASED ON
STATISTICAL VOCAL MODEL AND VITERBI SEARCH,
Proceedings of 2006 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2006),
Vol.V, pp.253-256, Toulouse, May 2006.
pdf,
doi:10.1109/ICASSP.2006.1661260
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrogram: A New Musical Instrument Recognition Technique Without Using
Onset Detection Nor F0 Estimation,
Proceedings of 2006 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2006),
Vol.V, pp.229-232, Toulouse, May 2006.
pdf,
doi:10.1109/ICASSP.2006.1661254
IEEE Kansai Chapter Young Researcher Award
-
Kazuhiro Nakadai,
Hirofumi Nakajima,
Masamitsu Murase,
Satoshi Kaijiri.
Kentaro Yamada, Yuji Hasegawa,
Hiroshi G. Okuno,
Hiroshi Tsujino:
ROBUST TRACKING OF MULTIPLE SOUND SOURCES BY SPATIAL INTEGRATION OF ROOM
AND ROBOT MICROPHONE ARRAYS,
Proceedings of 2006 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2006),
Vol.IV, pp.929-932, Toulouse, May 2006.
pdf,
doi:10.1109/ICASSP.2006.1661122
-
Hyun-Don Kim,
Jong-Suk Choi, and Munsang Kim:
Speaker Localization among Multi-faces in Noisy Environment by
Audio-Visual Integration,
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2006), 1305-1310,
(May 2006).
doi:10.1109/ICKS.2004.1313411
Patents
-
Speech Recongition Device,
Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto,
European Patent: EP1691344,
Publication Date: 08/16/2006,
Application number: EP20040818533,
Filing Date: 11/12/2004
-
Method and Apparatus for Determining Sound Source,
Patent No. US 7,035,418.
Filing date: June 7, 2000.
Issue date: Apr. 25, 2006.
Inventors: Hiroshi Okuno, Hiroaki Kitano, Yukiko Nakagawa,
Assignee: Japan Science and Technology Agency.
-
Robot audiovisual system
Patent No. US 7,016,505.
Filing date: Nov 1, 2000.
Issue date: Mar 21, 2006.
Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano,
Assignee: Japan Science and Technology Agency.
Academic Year 2005
Peer-Reviewed Journal Papers
-
Yasuhiro Akiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Using Multiple Edit Distances to
Automatically Grade Outputs from Machine Translation Systems,
IEEE Transactions on Audio, Speech and Language Processing,
Vol.14, No.2, (Mar. 2006) 393--402.
doi:10.1109/TSA.2005.860770
-
Mototaka Suzuki, Kuniaki Noda, Yuki Suga,
Tetsuya Ogata,
and Shigeki Sugano:
Dynamic Perception after Visually-Guided Grasping by a Human-Like
Autonomous Robot,
Advanced Robotics, Vol.20, No.2 (Feb. 2006) 233-254.
VSP and Robotics Society of Japan.
doi:10.1163/156855306775525785
-
Takuya Yoshioka,
Takafumi Hikichi, Masato Miyoshi,
Hiroshi G. Okuno:
Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals,
IEICE Trans. on Fundamentals of Electronics, Communications, and
Computer Sciences, Vol.E89-A, No.1 (Jan. 2006) 240-247,
IEICE.
-
Tetsuya Ogata,
Hayato Ohba,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Extracting Multi-Modal Dynamics of Objects using RNNPB,
Journal of Robotics and Mechatronics, Vol.17, No.6 (Dec. 2005)
681-688,
Special Issue on Human Modeling in Robotics.
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Pitch-dependent identification of musical instrument sounds,
Applied Intelligence, Vol.23, No.3, pp.267-275,
Springer-Verlag (formerly Kluwer Publishers).
doi:10.1007/s10489-005-4612-1
-
Kenri Kodaka,
Tetsuya Ogata,
Hiroshi G. Okuno:
Walking in Virtual Space with Entrainment Based on a Nonlinear Oscillator,
Journal of Human Interface Society, Vol.7, No.4, 26-36, 2005.
-
Shun Shiramatsu,
Takashi Miyata,
Hiroshi G. Okuno,
Koiti Hasida:
Dissolution of Centering Theory Based on Game Theory and
Its Empirical Verification
(in Japanese),
Natural Language Processing, Vol.12, No.3 (July 2005) 91-110.
-
Shunichi Yamamoto,
Kazuhiro Nakadai,
Hiroshi Tsujino,
Hiroshi G. Okuno:
Missing Feature Theory Based Interface Between Sound Source Separation and
Automatic Speech Recognition and Applying to Multiple Robots
(in Japanese),
Journal of Robotics Society of Japan, Vol.23, No.6 (Aug. 2005) 743-751.
Digital Library,
pdf
-
Tetsuya Ogata,
Shigeki Sugano, and Jun Tani:
Open-end Human-Robot Interaction from the Dynamical Systems Perspective
- Mutual Adaptation and Incremental Learning,
Advanced Roboics, Vol.19. No.6, pp.651-670,
VSP and Robotics Society of Japan.
doi:10.1163/1568553054255655
-
Katsuhisa Ishida,
Tetsuro Kitahara,
Masayuki Takeda:
Improvisation Supporting System Using N-gram-based Melody Appropriateness Determination,
IPSJ Journal, Vol.46, No.7 (July 2005) pp.1548-1559, IPSJ.
Paper in html
Book Chapters
-
Masahiro Nisiyama,
Hiroaki Kawashima, Takatsugu Hirayama,
Takashi Matsuyama:
Facial Expression Representation based on Timing Structures in Faces,
Proceedings of IEEE International Workshop on Analysis and Modeling of
Faces and Gestures (AMFG 2005), LNCS 3723, pp.139-153,
Beijing, Oct. 2005.
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Distance Based Dynamic Interaction of Humanoid Robot with Multiple People.
Innovations in Applied Artificial Intelligence (IEA/AIE-2005)
LNAI 3533, 111-120, Best paper award, Springer-Verlag.
Bari, Italy, Jun. 2005.
Paper pdf
doi:10.1007/11504894_18
-
Katsutoshi Uchiyama, Toshiaki Ohji, Mari Oka,
Hiroshi G. Okuno,
Hiroyuki Suzuki, Kenji Fukaya, Modjtaba Sadria, Hubert Durt
Kokoro and Topos -- Bastions of Kokoro,
Kyoto Inernational Culture Forum 2005, pp.62-73, Mar. 2006.
Peer-Reviewed International Conference Papers
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
INTER:D A Drum Sound Equalizer for Controlling Volume and Timbre of
Druams,
Proceedings of 2nd European Workshop on the Integration of
Knowledge, Semantic and Digital Media Technologies (EWIMT 2005),
accepted for oral presentation, EU Commission,
IEE Savoy Place, London, Nov. 2005.
-
Shun Shiramatsu,
Kazunori Komatani,
Takashi Miyata, Koiti Hasida,
Hiroshi G. Okuno:
Empirical Verification of Meaning-Game-based Generalization of
Centering Theory with Large Japanese Corpus,
Proceedings of the 19th Pacific Asia Conference on Language, Information,
and
Computation (PACLIC 19),
192-210, Taipei, Dec. 2005.
-
Masahiro Nisiyama,
Hiroaki Kawashima, Takatsugu Hirayama,
Takashi Matsuyama:
Facial Expression Representation based on Timing Structures in Faces,
Proceedings of IEEE International Workshop on Analysis and Modeling of
Faces and Gestures (AMFG 2005), LNCS 3723, pp.139-153,
accepted, Beijing, Oct. 2005.
-
Kenri Kodaka,
Tetsuya Ogata,
Hiroshi G. Okuno:
Walking with Body-sense in Virtual Space Using the Nonlinear Oscillator,
Proceedings of the International Conference on Systems, Man and Cybernetics
(SMC-2005), IEEE,
Hawaii, Oct. 10-12, 2005.
Finalist for Best Student Paper
doi:10.1109/ICSMC.2005.1571166
-
Hiromasa Fujihara,
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SINGER IDENTIFICATION BASED ON ACCOMPANIMENT SOUND REDUCTION AND RELIABLE FRAME SELECTION,
Proceedings of 6th International Conference on Musical Information
Retreival (ISMIR-2005), 329-336,
London, Sep. 2005.
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
INSTRUMENT IDENTIFICATION IN POLYPHONIC MUSIC: FEATURE WEIGHTING WITH MIXED SOUNDS, PITCH-DEPENDENT TIMBRE MODELING, AND USE OF MUSICAL CONTEXT,
Proceedings of 6th International Conference on Musical Information
Retreival (ISMIR-2005), 558-563,
London, Sep. 2005.
-
Kazunori Komatani,
Naoyuki Kanda,
Tetsuya Ogata,
Hiroshi G. Okuno:
Contextual Constraints based on Dialogue Models in Database Search Task
for Spoken Dialogue Systems,
Proceedings of the Nineth European Conference on
Speech Communication and Technology (Interspeech-2005), 877-880,
Lisboa, Sep. 2005.
Paper in PDF.
-
Masamitsu Murase,
Shun'ichi Yamamoto,
Jean-Marc Valin,
Kazuhiro Nakadai,
Kentaro Yamada,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Multiple Moving Speaker Tracking by Microphone Array on Mobile Robot,
Proceedings of the Nineth European Conference on
Speech Communication and Technology (Interspeech-2005), 249-252,
Lisboa, Sep. 2005.
Paper in PDF.
-
Tetsuro Kitahara,
Katsuhisa Ishida,
Masayuki Takeda:
ism: Improvisation Supporting Systems with Melody Correction and Key Vibration,
Proceedings of International Conference on Entertainment Computing
(ICEC 2005),
Mita, Hyogo, Sep. 2005.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Jean-Marc Valin,
Jean Rouat, Francois Michaud,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Making A Robot Recognize Three Simultaneous Sentences in Real-Time,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005),
pp.897-892,
IEEE, RSJ, Edmonton, Aug. 2005.
Paper in PDF.
doi:10.1109/IROS.2005.1545094
-
Syunsuke Kurotaki, Noriaki Suzuki,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hideharu Amano:
Implementation of Active Direction-Pass Filter on Dynamically Reconfigurable
Processor,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005),
pp.515-520,
IEEE, RSJ, Edmonton, Aug. 2005.
Paper in PDF.
doi:10.1109/IROS.2005.1545033
-
Tsuyoshi Tasaki,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Spatially Mapping of Friendliness for Human-Robot Interaction,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005),
pp.521-526,
IEEE, RSJ, Edmonton, Aug. 2005.
Paper in PDF.
doi:10.1109/IROS.2005.1545034
-
Mikio Nakano,
Naoyuki Kanda,
Yuji Hasegawa, Toyotaka Torii,
Yohane Takeuchi,
Kazuhiro Nakadai,
Hiroshi Tsujino,
Hiroshi G. Okuno:
A Two-Layer Model for Behavior and Dialogue Planning in Conversational Service Robots,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005), pp.1542-154,
IEEE, RSJ, Edmonton, Aug. 2005.
Paper in PDF.
doi:10.1109/IROS.2005.1545198
-
Tetsuya Ogata,
Hayato Ohba,
Kazunori Komatani,
Jun Tani,
Hiroshi G. Okuno:
Extracting Multi-Modal Dynamics of Objects using RNNPB
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005), pp.160-165,
IEEE, RSJ, Edmonton, Aug. 2005.
Paper in PDF.
doi:10.1109/IROS.2005.1544975
-
Kazunori Komatani,
Ryoji Hamabe,
Tetsuya Ogata,
Hiroshi G. Okuno:
Generating Confirmation to Distinguish Phonologically Confusing Word Pairs in Spoken Dialogue Systems
Proceedings of 4th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems, pp.40-45, July 2005.
-
Yuya Hattori,
Hideki Kojima,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Gesture Generation from Environmental Sounds Using
Inter-modality Mapping,
Proceedings of Fifth International Workshop on Epigenetic Robotics
(EpiRobo-2005), Nara, July 2005.
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Distance Based Dynamic Interaction of Humanoid Robot with Multiple People.
Innovations in Applied Artificial Intelligence:
Eighteenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems (IEA/AIE-2005)
LNAI 3533, 111-120, Best paper award, Springer-Verlag.
Bari, Italy, Jun. 2005.
Paper in pdf
doi:10.1007/11504894_18
-
Takuya Yoshioka,
Takafumi Hikichi, Masato Miyoshi,
Hiroshi G. Okuno:
Blind Estimation of Room Resonances Using Popular, Classical, and Jazz Music.
Proceedings of AES 118th Convenvion,
Audio Engineering Society, Barcelona, Spain, May 28-31, 2005.
-
Shun'ichi Yamamoto,
Jean-Marc Valin
Kazuhiro Nakadai,
Hiroshi Tsujino, Jean Rouat, Francois Michaud,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Enhanced Robot Speech Recognition Based on Microphone Array
Source Separation and Missing Feature Theory.
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2005), 1489-1494, IEEE,
Barcelona, Apr. 2005.
Patents
-
Robot audiovisual system
Patent No. US 6,967,455
Filing date: Mar 8, 2002
Issue date: Nov 22, 2005
Inventors: Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Okuno, Hiroaki Kitano
Assignee: Japan Science and Technology Agency
-
Speech Recongition Device,
Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto,
Wipo Patent: WO/2005/048239,
Application Number: PCT/JP2004/016883,
Publication Date: 05/26/2005,
Filing Date: 11/12/2004.
Academic Year 2004
Thesis
- Yasuhiro Akiba:
Automatic Evaluation Methods for Machine Translation Systems,
Ph.D Thesis, Jan. 2005.
- Kazushi Ishihara:
MS Thesis, Feb. 2005
- Kenri Kodaka:
MS Thesis, Feb. 2005
- Shun'ichi Yamamoto:
MS Thesis, Feb. 2005
- Ken Yamaguchi:
MS Thesis, Feb. 2005
- Kazuyoshi Yoshii: Drum Sound Recognition for Polyphonic Audio Signals
by Adaptation of Spectral Templates and Suppression of Harmonic Structure,
MS Thesis, Feb. 2005
- Hayato Ohba:
BE Thesis, Feb. 2005
- Taku Oya:
BE Thesis, Feb. 2005
- Satoshi Kaijiri:
BE Thesis, Feb. 2005
- Ryoji Hamabe:
BE Thesis, Feb. 2005
- Masahiro Fujihara:
BE Thesis, Feb. 2005
- Masamitsu Murase:
BE Thesis, Feb. 2005
Peer-Reviewed Journal Papers
-
Tetsuya Ogata,
Shigeki Sugano, and Jun Tani:
Acquisition of Motion Primitives of Robot in Human-Navigation Task:
Towards Human-Robot Interaction based on "Quasi-Symbol",
Transactions of the Japanese Society for Artificial Intelligence,
Vol.20, No.3, pp.188-196. Mar. 2005.
Paper Online Journal
doi:10.1527/tjsai.20.188
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Shun'ichi Yamamoto,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Dynamic Communication of Humanoid Robot with Multiple People Based on
Interaction Distance,
Transactions of the Japanese Society for Artificial Intelligence,
Vol.20, No.3, pp.209-219, Mar. 2005.
Paper Online Journal
doi:10.1527/tjsai.20.209
-
Yasuhiro Akiba,
Kenji Imamura, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Automatic Grader of MT Outputs in Colloquial Style by Using Multiple Edit
Distance, (in Japanese),
Transactions of the Japanese Society for Artificial Intelligence,
Vol.20, No.3,pp.139-148 (2005).
Paper Online Journal
doi:10.1527/tjsai.20.139
-
Kazushi Ishihara,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Recognition of Onomatopoeia for Environmental Sounds, (in Japanese),
Transactions of the Japanese Society for Artificial Intelligence,
Vol.20, No.3, pp.229-236, March 2005.
Paper Online Journal
doi:10.1527/tjsai.20.229
-
Teruhisa Misu,
Kazunori Komatani,
Youji Seita,
Tatsuya Kawahara:
音声対話によるソフトウェアサポートタスクのための効果的な確認戦略,
IEICE Transaction on Information and Systems, Vol.88-D2, No.3 (Mar. 2005) 499-508,
IEICE,
Paper in pdf
-
Kazunori Komatani,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G. Okuno:
User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance,
User Modeling and User-Adapted Interaction,
Special Issue on Language-Based Interaction: User Modeling and Adaptation,
Vol.15, No.169-183, Kluwer, 2005.
Abstract
doi:10.1007/s11257-004-5659-0
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroshi Tsujino:
Improvement of Recognition of Simultaneous Speech Signals Using AV Integration and Scattering Theory for Humanoid Robots,
Speech Communication, Vol.44 (2004) 97--112,
Elsevier, Oct. 2004.
doi:10.1016/j.specom.2004.10.010
-
Tino Lourens,
Hiroshi G. Okuno,
Hiroshi Tsujino:
A computational model of monkey cortical grating cells.
Biological Cybernetics,
Vol.92, No.1 (Jan. 2005) 61--70.
Springer-Verlag.
Paper in pdf
doi:10.1007/s00422-004-0522-2
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Effects of increasing modalities in recognizing three simultaneous speeches,
Speech Communication, Vol.43, No.4, pp.347-359,
Sep. 2004.
doi:10.1016/j.specom.2004.03.008
-
Yasuhisa Hayakawa,
Tetsuya Ogata,
and Shigeki Sugano:
Flexible Assembly Work Cooperating System based on Work State Identifications by Self-Organizing Map,
IEEE/ASME Transactions on Mechatronics,
Vol.9, No.3, accepted, Sept. 2004.
-
Kazunori Komatani,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G Okuno:
User model for Adaptive Response Generation in Spoken Dialogue System,
IEICE Transactions on Information and Systems, Vol.87-D2, No.10 (Oct. 2004) 1921-1928, IEICE.
Paper in pdf
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Tino Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot,
Applied Intelligence,
Vol.20, No.3 (May/June, 2004), 253-266,
doi:10.1023/B:APIN.0000021417.62541.e0,
(accepted in Oct. 2002),
Kluwer Publishers.
-
Taro Watanabe, Kenji Imamura, Eiichiro Sumita,
Hiroshi G. Okuno:
Statistical machine translation using hierarchical phrase alignment.
IEICE Transactions on Information and Systems,
Vol.J87-D2, No.4 (Apr. 2004) 978-986, IEICE.
Paper in pdf
Peer-Reviewed International Conference Papers
-
Hiroshi G. Okuno:
Robot Audition: Its Issues and State of the Art (invited talk),
Proceedings of 2nd International Symposium on Life Science,
Kyoto, Feb. 2005.
-
Tetsuya Ogata,
Shigeki Sugano, and Jun Tani:
Acquisition of Motion Primitives of Robot in Human-Navigation Task:
Towards Human-Robot Interaction based on "Quasi-Symbol",
Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems,
315-326, Kyoto, Nov. 2004.
-
Tsuyoshi Tasaki,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Motion Control using Listener's Back-Channels and Head Gesture Information.
Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems,
327-338, Kyoto, Nov. 2004.
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Dynamic Communication of Humanoid Robot with multiple people based on
Interaction Distance,
Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems,
385-392, Kyoto, Nov. 2004.
-
Yuki Suga, Hiroaki ARIE,
Tetsuya Ogata,
and Shigeki Sugano:
Constructivist Approach to Human-Robot Emotional Communication:
Design of Evolutionary Function for WAMOEBA-3,
Proceedings of IEEE/RAS Interanational Conference on Humanoid Robots (Humanoids 2004),
No.76, Los Angels, Nov. 2004.
-
Yuki Suga,
Tetsuya Ogata,
and Shigeki Sugano:
Development of Emotional Communication Robot, WAMOEBA-3,
Proceedings of International Conference on Advanced Mechatronics (ICAM 2004),
413-418, Oct. 2004.
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods
Proceedings of 5th International Conference on Musical Information
Retreival (ISMIR-2004), 184-191,
Barcelona, Spain, Oct. 2004.
Paper in pdf
-
Takuya Yoshioka,
Tetsuro Kitahara,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Chord Transcription with Concurrent Recognition
of Chord Symbols and Boundaries.
Proceedings of 5th International Conference on Musical Information
Retreival (ISMIR-2004), 100-105,
Barcelona, Spain, Oct. 2004.
Paper in pdf
-
Tsuyoshi Tasaki,
Takeshi Yamaguchi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Motion Control using Listener's Back-Channels and Head Gesture Information.
Proceedings of 2004 International Conference on Spoken
Language Processing (ICSLP-2004), 1033-1036, ASA, ASJ, and ESCA,
Korea, Oct. 2004.
-
Kazushi Ishihara,
Yuya Hattori,
Tomohiro Nakatani,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Disambiguation in Determining Phonemes of Sound-Imitation Words for Environmental Sound Recognition.
Proceedings of 2004 International Conference on Spoken
Language Processing (ICSLP-2004), 1485-1488, ASA, ASJ, and ESCA,
Korea, Oct. 2004.
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
Drum Sound Identification for Polyphonic Music Using Template Adaptation
and Matching Methods
Proceedings of ISCA Tutorial and Research Workshop
on Statistical and Perceptual Audio Processing (SAPA-2004), accepted, ASA, ASJ, and ESCA,
Korea, Oct. 2004.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Hiroshi Tsujino,
Hiroshi G. Okuno:
Assessment of General Applicability of Robot Audition System by Recognizing Three Simultaneous Speeches.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2004),
pp.2111-2116,
IEEE, RSJ, Sendai, Sep. 2004.
IEEE Kansai Chapter Young Researcher Award
doi:10.1109/IROS.2005.1544975
-
Tetsuya Ogata,
Masaki Matsunaga, Shigeki Sugano, and Jun Tani:
Human Robot Collaboration Using Behavioral Primitives,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2004),
pp.1592-1597,
IEEE, RSJ, Sendai, Sep. 2004.
-
Yuki SUGA,
Tetsuya Ogata,
and Shigeki Sugano:
Aquisition of Reactive Motion for Communication Robots Using Interactive EC:
Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2004), accepted, Sept. 2004.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2004),
pp.1198-1203,
IEEE, RSJ, Sendai, Sep. 2004.
-
Yoshihiro Sakamoto,
Tetsuya Ogata,
and Shigeki Sugano:
Human-Robot Communication Using Multiple Recurrent Neural Networks,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2004),
pp.1574-1579,
IEEE, RSJ, Sendai, Sep. 2004.
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Dynamic Communication of Humanoid Robot with multiple people based on
Interaction Distance,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man-2004), 81-86, IEEE, Kurashiki, Sep. 2004.
Paper in pdf
doi:10.1109/ROMAN.2004.1374732
-
Yuya Hattori,
Kazushi Ishihara,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Repeat Recognition for Environmental Sounds,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man-2004), 121-126, IEEE, Kurashiki, Sep. 2004.
Paper in pdf
doi:10.1109/ROMAN.2004.1374734
-
Yusuke Akiwa, Yuki Suga,
Tetsuya Ogata,
and Shigeki Sugano:
Imitation based Human-Robot Communication: Roles of Joint Attention and Motion Prediction,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man-2004), 283-288, IEEE, Kurashiki, Sep. 2004.
-
Yasuhiro Akiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Using a Mixture of N Best Lists from Multiple MT Systems in
Rank-Sum-Based Confidence Measure for MT Outputs.
Proceedings of the 20th International Conference
on Computational Linguistics (Coling-2004), 322-328,
Geneva, Aug. 2004.
-
Kazunori Komatani,
Teruhisa Misu,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface.
Proceedings of the 20th International Conference
on Computational Linguistics (Coling-2004), 1100-1106,
Geneva, Aug. 2004.
-
Kazushi Ishihara,
Tomohiro Nakatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Sound-Imitation Word Recognition from
Environmental Sounds focusing on Ambiguity Problem in Determining Phonemes.
PRICAI 2004: Trends in Artificial Intelligence
(Proc. of Eighth Pacific Rim International Conference on Artificial Intelligence),
LNAI 3157,
pp.909-918, Springer-Verlag,
Auckland, Aug. 2004.
-
Ishida,
Masayuki Takeda,
Tetsuro Kitahara:
ism: Improvisation Supporting Systems with Melody Correction,
Proceedings of the International Symposium on Musical Acoustics
(NIME2004), 177-180,
Hamamatsu, Jun. 2004.
-
Yasuhiro Akiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Incremental Methods to Select Test Sentence for Evaluating
Translation Ability.
Proceedings of the fourth international conference on Language Resources and Evaluation (LREC-2004), pp.2015-2018,
Lisbon, Portugal, May 2004.
Paper in pdf
-
Kazunori Komatani,
Ryosuke Itoh,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Recognition of Emotional States in Spoken Dialogue with a Robot.
Innovations in Applied Artificial Intelligence:,
Seventeenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems, IEA/AIE-2004,
LNAI 3029, 413-423, Springer-Verlag.
Ottawa, May. 2004,
Paper at Springer-Verlag
-
Tetsuya Ogata,
Jun Tani:
Open-end Human Robot Interaction from the Dynamical Systems Perspective:
Mutual Adaptation and Incremental Learning.
Innovations in Applied Artificial Intelligence:,
Seventeenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems, IEA/AIE-2004,
LNAI 3029, 435-444, Springer-Verlag.
Ottawa, May. 2004,
Paper at Springer-Verlag
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Category-level Identification of Non-registered Musical Instrument Sounds.
Proceedings of 2004 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2004), Vol.IV, 253-256,
Montreal, May 2004.
Paper in pdf
doi:10.1109/ICASSP.2004.1326811
-
Yohei Sakuraba,
Tetsuro Kitahara,
Hiroshi G. Okuno:
Comparing Features for Forming Music Streams in Automatic Music Transcription.
Proceedings of 2004 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2004), Vol.IV, 273-276,
Montreal, May 2004.
Paper in pdf
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Hiroshi Tsujino, Toshio Yokoyama,
Hiroshi G. Okuno:
Improvement of Robot Audition by Interfacing Sound Source Separation and Automatic Speech Recognition with Missing Feature Theory,
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2004), 1517-1523, IEEE,
New Orleans, May. 2004.
Paper in pdf
IEEE Robotics and Automation Society Japan Chapter Young Award
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Acoustical-similarity-based Musical Instrument Hierarchy and
Its Application to Musical Instrument Identification,
Proceedings of the International Symposium on
Musical Acoustics (ISMA2004), 297-300,
Nara, Apr. 2004.
Academic Year 2003
Thesis
- Taro Watanabe :
Example-Based Statistical Machine Translation,
Ph.D Thesis, Feb. 2004.
- Tetsuro Kitahara:
,
MS Thesis, Feb. 2004.
- Yohei Sakuraba:
,
MS Thesis, Feb. 2004.
- Mitsuhiro Sakuraba:
,
MS Thesis, Feb. 2004.
- Naoyuki Kanda:
,
BE Thesis, Feb. 2004.
- Tsuyoshi Tasaki:
,
BE Thesis, Feb. 2004.
- Shohei Matsumoto:
,
BE Thesis, Feb. 2004.
- Yuya Hattori:
,
BE Thesis, Feb. 2004.
- Takuya Yoshioka:
,
BE Thesis, Feb. 2004.
Peer-Reviewed Journal Papers
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Acoustic-feature-based Musical Instrument Hierarchy and Its Application
to Category-level Recognition of Unknown Musical Instruments.
IPSJ Journal, Vol.45, No.3 (Mar. 2004) pp.680-689, IPSJ.
Paper in html
-
Katsuhisa Ishida,
Tetsuro Kitahara,
Masayuki Takeda:
N-gram Based Melody Correction for Improvisation,
to Category-level Recognition of Unknown Musical Instruments.
IPSJ Journal, Vol.45, No.3 (Mar. 2004) pp.680-689, IPSJ.
Paper in html
-
Yoko Yamakata,
Tatsuya Kawahara,
Hiroshi G. Okuno,
Michihiko Minoh:
Belief Network based Disambiguation of Object Reference in Spoken Dialogue System.
Transactions of the Japanese Society for Artificial Intelligence,
Vol.19, No.1 F, pp.47-56 (2004).
Paper Online Journal
-
Taro Watanabe, Eiichiro Sumita,
Hiroshi G. Okuno:
Decoding Algorithms for Statisitcal Machine Translation Considering Generation
Directions,
IPSJ Journal, Vol.44, No.12 (Dec. 2003) 3202-3210, IPSJ.
Paper in html
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Musical Instrument Identification Considering Pitch-dependent
Characteristics of Timbre: A Classifier Based on F0-dependent Multivariate
Normal Distribution.
IPSJ Journal, Vol.44, No.10 (Oct. 2003) 2448-2458, IPSJ.
Paper in html
-
Kazuhiro Nakadai,
Ken-ichi Hidai,
Hiroshi G. Okuno,
Hiroshi Mizoguchi,
Hiroaki Kitano:
Real-time Multiple Talker Tracking by Audio-Visual Integration for Humanoids:
Integration of Active Audition nad Face Recognition.
Journal of Robotics Society of Japan, Vol.21, No.5 (Jul. 2003), pp.517--525.
Digital Library,
pdf
-
Kazunori Komatani,
Hiroaki Kashima, Katsuaki Tanaka,
Tatsuya Kawahara:
Domain-independent Spoken Dialogue Platform for Database Query Using
Key-phrase Spotting Based on Combined Language Model,
IPSJ Journal, Vol.44, No.5 (May 2003) 1333-1342.
Paper in html
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Active audition for humanoid robots that can listen to three
simultaneous talkers.
Journal of the Acoustical Society of America,
Vol.113, No.4, Pt.2 of 2, Apr. 2003, pp.2230.
Abstract at ASA.
Survye Papers
-
Hiroshi G. Okuno,
Kazuhiro Nakadai:
Robot Audition: its research topics and current status.
Joho SHori, Vol.44, No.11 (Nov. 2003) pp.1138-1144, IPSJ.
Article in html
Peer-Reviewed International Conference Papers
-
Hiroshi G. Okuno,
Tetsuya Ogata,
Kazunori Komatani:
Computational Auditory Scene Analysis and Its Application to Robot Audition,
Proceedings of the International Conference on Informatics Research
for Development of Knowledge Society Infrastructure (ICKS 2004),
pp.73-80, Mar. 2004,
doi:10.1109/ICKS.2004.1313411
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroaki Kitano:
Applying Scattering Theory to Robot Audition System
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2003),
1147-1152,
IEEE, Las Vegas, Oct. 2003.
Paper in pdf
-
Tetsuya Ogata,
S. Sugano, Jun Tani:
Interactive Learning in Human-Robot Collaboration,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2003),
162-167,
IEEE, Las Vegas, Oct. 2003.
Paper in pdf
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroaki Kitano:
Active audition based humanoid system and ist evaluation:
Localization, Seperation and Recognition of Simultaneous Speech.
Proceedings of IEEE/RSJ International Conference
on Humanoids (Humanoids-2003),
Springer-Verlag,
IEEE, Munchen, Oct. 2003.
-
Yohei Sakuraba,
Hiroshi G. Okuno:
Note Recognition of Polyphonic Music by Using
Timbre Similarity and Direction Proximity.
Proceedings of International Computer Music Conference (ICMC2003),
167-170, Singapore, Oct. 2003.
-
Yasuhiro AKiba,
Hiroshi G. Okuno:
Experimental Comparison of MT Evaluation Methods: RED vs. BLEU.
Proceedings of MT Summit IX,
New Orleans, Sep. 2003.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Realizing Personality in Audio-Visually Triggered Non-verbal
Behaviors.
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2003), 392-397, IEEE,
Sep. 2003.
Paper in pdf
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Robot Recognizes Three Simultaneous Speech By Active Audition.
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2003), 398-403, IEEE,
Sep. 2003.
Paper in pdf
-
Yasuhiro AKiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, and
Hiroshi G. Okuno:
Experimental Comparison of MT Evaluation Methods: RED vs. BLEU.
Proceedings of MT Summit IX,
1-8,
New Orleans, Sep. 2003.
Paper in pdf
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroshi Tsujino:
Improvement of Three Simultaneous Speech Recognition
by Using AV Integration and Scattering Theory for Humanoid.
Proceedings of Audio Visual Spoken Processing (AVSP-2003),
157-162, St. Jorioz, France, Sep. 2003.
Paper in pdf
-
Kazunori Komatani,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G. Okuno:
User Modeling in Spoken Dialogue Systems for Flexible Guidance Generation.
Proceedings of the Eighth European Conference on
Speech Communication and Technology (Eurospeech-2003),
745-748, Geneva, Sep. 2003.
-
Kazushi Ishihara,
Yasushi Tsubota,
Hiroshi G. Okuno:
Automatic Transformation of Environmental Sounds into
Sound-Imitation Words Based on Japanese Syllable Structure.
Proceedings of the Eighth European Conference on
Speech Communication and Technology (Eurospeech-2003),
3185-3188, Geneva, Sep. 2003.
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroshi Tsujino:
Three Simultaneous Speech Recognition by Integration of Active
Audition and Face Recognition for Humanoid,
Proceedings of the Eighth European Conference on
Speech Communication and Technology (Eurospeech-2003),
2705-2708, Geneva, Sep. 2003.
-
Tatsuya Kawahara,
Ryosuke Ito,
Kazunori Komatani:
Spoken Dialogue System for Queries on Appliance Manuals using Hierarchical Confirmation Strategy.
Proceedings of the Eighth European Conference on
Speech Communication and Technology (Eurospeech-2003),
accepted for presentation,
Geneva, Sep. 2003.
-
Kazunori Komatani,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Flexible Guidance Generation using User Model in Spoken Dialogue Systems,
Proceedings of the 41st Annual Meeting of the Association
for Computational Linguistics (ACL 2003),
pp.256-263, Sapporo, Jul. 2003.
-
Taro Watanabe, Eiichiro Sumita, and
Hiroshi G. Okuno:
Chunk-based statistical translation,
Proceedings of the 41st Annual Meeting of the Association
for Computational Linguistics (ACL 2003),
pp.303-310, Sapporo, Jul. 2003.
-
Yasuhiro AKiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, and
Hiroshi G. Okuno:
A Statistical-Informmation-Based Selector of the Best among Multiple Outputs,
Exhibition Brochure of the 41st Annual Meeting of the ACL (ACL 2003),
16,
Sapporo, Jul. 2003.
-
Yoji Kiyota, Sadao Kurohashi,
Teruhisa Misu,
Kazunori Komatani,
Tatsuya Kawahara,
Fuyuko Kido:
Dialog Navigator''A Spoken Dialog Q-A System based on Large Text Knowledge Base.
ACL03 Interactive Poster/Demo Session, pp.149--152 (Companion Volume), 2003.
-
Kazunori Komatani,
Fumihiro Adachi,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Flexible Spoken Dialogue System based on User Models and Dynamic Generation of VoiceXML Scripts.
4th SIGdial Workshop on Discourse and Dialogue, pp.87--96, 2003.
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Musical Instrument Identification based on F0-dependent
Multivariate Normal Distribution.
Proceedings of 2003 International Conference on
Muotimedia and Expo (ICME 2003), IEEE,
Vol.III, pp.405-409,
Baltimore, MD, Jul. 2003.
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Pitch-dependent Musical Instrument Identification and
Its Application to Musical Sound Ontology.
In Chung, P,W.H., Hinde, C. and Ali, M. (Eds.)
Developments in Applied Artificial Intelligence,
LNAI 2718, 112-122, Springer-Verlag.
Proceedings of Nineteenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems (IEA/AIE-2003),
Loughborough, UK, Jun. 2003,
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Design and Implementation of Personality of Humanoids
in Human Humanoid Non-verbal Interaction.
In Chung, P,W.H., Hinde, C. and Ali, M. (Eds.)
Developments in Applied Artificial Intelligence,
LNAI 2718, 662-673, Springer-Verlag.
Proceedings of Nineteenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems (IEA/AIE-2003),
Loughborough, UK, Jun. 2003,
-
Hiroshi G. Okuno,
Kazuhiro Nakadai:
Real-time Sound Source Localization and Separation based on
Active Audio-Visual Integration.
In Jose Mira and Jose R. Alvarez (Eds.):
Computational Methods in Neural Modeling,
LNCS 2686, 118-125, Springer-Verlag.
The Seventh International Work Conference on
Artificial and Nataural Neural Networks, IWANN 2003, Proceedings, Part 1,
Ma¥'{o}, Menorca,, Spain, June 2003,
Paper in PDF
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Musical Instrument Identification based on F0-dependent
Multivariate Normal Distribution.
Proceedings of 2003 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2003),
Vol.5, pp.421--424, IEEE, Hong Kong, Apr. 2003.
Paper in PDF
-
Shun Tsuchiya (Ed.):
"Encyclopedia AI",
Feigenbaum, McCarthy. Kyoritsu Publishers, 2003.
Academic Year 2002
Thesis
- Shinya Amano: Studies on Natural Language Processing for
Kana-to-Kanji Conversion and Machine Translation,
Ph.D Thesis, Feb. 2003.
- Kazunori Komatani: Spoken Dialogue Systems for Information Retrieval
with Domain-Independent Dialogue Strategies,
Ph.D Thesis, Oct. 2002.
- Ryosuke Ito:
,
MS Thesis, Feb. 2003.
- Takashi Sumiyoshi:
,
MS Thesis, Feb. 2003.
- Masahiro Hasegawa:
,
MS Thesis, Feb. 2003.
- Naofumi Yoshida:
,
MS Thesis, Feb. 2003.
- Ian R. Lane:
Language Model Switching Based on Topic Detection for Multi-Domain Dialog Speech Recognition,
MS Thesis, Feb. 2003.
- Yuha Aakita:
,
MS Thesis, Aug. 2002.
- Kazushi Ishihara:
,
BS Thesis, Feb. 2003.
- Tasuku Kitade:
,
BS Thesis, Feb. 2003.
- Teruhisa Misu:
,
BS Thesis, Feb. 2003.
- Shun-ichi Yamamoto:
,
BS Thesis, Feb. 2003.
- Kazuyoshi Yoshii:
,
BS Thesis, Feb. 2003.
Peer-Reviewed Journal Papers
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Tino Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot.
Applied Intelligence, Kluwer Publisher,
accepted for publication,
International Society for Applied Intelligence, 2003.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Ken'ichi Hidai,
Hiroshi Mizoguchi,
Hiroaki Kitano:
Human-Robot Non-Verbal Interaction Empowered by
Real-Time Auditory and Visual Multiple-Talker Tracking
Advanced Robotics, Vol.17, No.2, pp.115-130,
VSP and Robotics Society of Japan,
2003.
doi:10.1163/156855303321165088
Online version,
pdf
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Issues in Humanoid Audition and Sound Source Localization by Active Audition.
Transaction of JSAI, Vol.18, No.2 F, pp.103-110 (Mar. 2003).
Paper Online Journal
-
Kazunori Komatani,
Tatsuya Kawahara:
,
IPSJ Journal, Vol.43, No.10, pp.3078--3086, 2002.
Paper in html
-
Ryosuke Ito,
Kazunori Komatani,
Tatsuya Kawahara:
,
IPSJ Journal, Vol.43, No.7, pp.2147--2154, 2002.
Paper in html
-
Masahiro Hasegawa,
Yuya Akita,
Tatsuya Kawahara:
,
IPSJ Journal, Vol.43, No.7, pp.2222-2229, 2002.
Paper in html
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time Auditory and Visual Multiple-Speaker Tracking For
Human-Robot Interaction.
Journal of Robotics and Mechatronics,
special issue on Human Robot Interaction, Vol.14, No.5 (2002) 479-489,
Mechatronics Society of Japan.
-
Kentaro Umesawa,
Takamichi Saito,
Hiroshi G. Okuno:
,
IPSJ Journal, Vol.43, No.8 (Aug. 2002) 1553-1562.
Paper in html
-
Yuasa, T.
and
Okuno, H.G. (Eds.):
Advanced Lisp Technology,
Advanced Information Processing Technology, Vol.4,
Taylor and Francis Publishers, London, UK, May, 2002.
Peer-Reviewed J ournal Papers
-
Takamichi Saito,
Toshiyuki Kitoh,
Kentaro Umesawa,
Hiroshi G. Okuno:
Privacy-Enhanced SPKI Access Control on PKIX and Its Application to Web Server.
Proc. of the Seventeenth International Conference on Advanced
Information Networking and Applications (AINA 2003),
696--703, IEEE, Xi'an, China.
Paper.
doi:10.1109/AINA.2003.1192970
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Auditory Fovea Based Speech Separation and Its Application to
Dialog System.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2002), 1314-1319, IEEE,
Geneva, Oct. 2002.
-
Yoko Yamakata,
Tatsuya Kawahara,
Hiroshi G. Okuno:
BELIEF NETWORK BASED DISAMBIGUATION OF OBJECT REFERENCE
IN SPOKEN DIALOGUE SYSTEM FOR ROBOT.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 170-176, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Yasushi Tsubota,
Tatsuya Kawahara,
Hiroshi G. Okuno,
Masatake Dantsuji:
RECOGNITION AND VERIFICATION OF ENGLISH BY JAPANESE STUDENTS FOR
COMPUTER-ASSISTED LANGUAGE LEARNING SYSTEM.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 1205-1208, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
AUDITORY FOVEA BASED SPEECH ENCHANCEMENT AND ITS APPLICATION TO
HUMAN-ROBOT DIALOG SYSTEM.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 1817-1820, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
REAL-TIME SOUND SOURCE LOCALIZATION AND SEPARATION FOR ROBOT AUDITION.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 193-196, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Taro Watanabe, Eiichiro Sumita:
Statistical Machine Translation Decoder Base On Phrase.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), Spec3Co2, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Kazunori Komatani,
Tatsuya Kawahara,
Ryosuke Ito,
Hiroshi G. Okuno:
Efficient Dialogue Strategy to Find Users' Intended Items from Information
Query Results,
Proceedings of the Nineteenth International Conference
on Computational Linguistics (Coling-2002), Vol.1, pp.481-487,
Aug. 2002.
-
Taro Watanabe, Eiichiro Sumita:
Bidirectional Decoding for Statistical Machine Translation.
Proceedings of the Nineteenth International Conference
on Computational Linguistics (Coling-2002), pp.
Aug. 2002.
-
Yasuhiro Akiba, Taro Watanabe, Eiichiro Sumita:
Using Language and Translation Models to Select the Best among Outputs
from Multiple MT Systems,
Proceedings of the Nineteenth International Conference
on Computational Linguistics (Coling-2002), pp.
Aug. 2002.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Realizing Audio-Visually triggered ELIZA-like non-verbal Behaviors.
In Ishizuka, M. and Slaney, J. (eds)
PRICAI-2002 Topics in Artificial Intelligence
(Seventh Pacific Rim International Conference on Artificial Intelligence),
LNAI 2417, 552--562,
Springer-Verlag, Tokyo, Aug. 2002.
Paper in PDF
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Exploiting Auditory Fovea in Humanoid-Human Interaction.
Proceedings of Eighteenth National Conference on
Artificial Intelligence (AAAI-2002), 431-438,
AAAI, Edmonton, Aug. 2002.
Paper in PDF
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Non-Verbal Eliza-like Human Behaviors in Human-Robot Interaction
through Real-Time Auditory and Visual Multiple-Talker Tracking.
Proceedings of the Third International Workshop on
Cognitive Robotics (CogRob-2002),
AAAI, Edmonton, Jul. 2002.
Paper in PDF
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Social Interaction of Humanoid Robot through Auditory and Visual Tracking.
In Hendtlass, T., and Ali, M. (Eds.)
Developments in Applied Artificial Intelligence,
Proceedings of Eighteenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems (IEA/AIE-2002),
Cairns, Australia, June 2002,
LNAI 2358, pp.725-735, Springer-Verlag.
Paper in PDF
-
Yoko Yamakata,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Belief Network based
Disambiguation of Word Reference in Spoken Dialogue System for Robot.
Proceedings of ISCA Tutorial and Research Workshop on
Multi-Modal Dialogue in Mobile Environments, Germany, Jun. 2002.
-
Kazuhiro Nakadai,
Ken'ichi Hidai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time speaker localization and speech separation
by Audio-Visual Integration,
Proceedings of IEEE-RAS International Conference
on Robots and Automation (ICRA-2002), pp.1043-1049, IEEE,
May 2002.
Paper in PDF
doi:10.1109/ROBOT.2002.1013493
Academic Year 2001
Thesis
- Hirofumi Adachi:
,
MS Thesis, Feb. 2002.
- Yoko Yamakata:
,
MS Thesis, Feb. 2002.
- Raux Antoine Roland:
Intelligibility Assessment and Adaptive Drill Generation
for a Computer-Assisted Pronunciation Learning System,
MS Thesis, Feb. 2002.
- Shinichi Ueno:
,
BS Thesis, Feb. 2002.
- Yohei Sakuraba:
,
BS Thesis, Feb. 2002.
- Kazuya Shitaoka:
,
BS Thesis, Feb. 2002.
- Masahiro Yokoo:
,
BS Thesis, Feb. 2002.
Peer-Reviewed Journal Papers
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Lourens, T.,
Hiroaki Kitano:
Sound and Visual Tracking by Active Audition.
in Jin, Q., Li, J., Zhang, N., Cheng, J., Yu, C., and Noguchi, N (eds)
Enabling Society with Information Technology
pp.174-185, Springer-Verlag, 2002.
- Takamichi Saito,
Kentaro Umesawa,
Hiroshi G. Okuno:
,
Trans. IEICE, Vol.J84-D1, No.11 (Nov. 2001)
pp.1553-1562, IEICE,
Paper in pdf
-
Kentaro Umesawa,
Takamichi Saito,
Hiroshi G. Okuno:
,
IPSJ Journal, Vol.42, No.8 (Aug. 2001) pp.2067-2076.
TAF Telecom Technology Student Award
-
Tatsuya Kawahara,
Akinobu Lee, Tetsunori Kobayashi, Koichi Takeda,
N. Minematsu, Shigeki Sagayama, Katsuya Itou, A. Ito, M. Yamamoto,
A. Yamada, T.Utsuto, Kiyohiro Shikano:
Japanese Dictation ToolKit -- 1999 version --,
Journal of Acoustic Society of Japan, Vol.57, No.3, pp.210--214, 2001
-
M. Mimura and
Tatsuya Kawahara:
Difference of acoustic modeling for read speech and dialogue speech,
Acoustical Science & Technology, Vol.22, No.5, pp.373--374, 2001.
Survey Papers
-
Hiroshi G. Okuno,
Kazuhiro Nakadai:
,
JSAJ, Vol.58, No.3 (Mar. 2002) pp.205-210.
- Hiroaki Kitano:
Hiroshi G. Okuno,
諸橋 峰雄,
京田 耕司,
Kazuhiro Nakadai :
『PCクラスタ構築法 − Linux クラスタベオウルフ』,
産業図書, 2001.
Peer-Reviewed International Conference Papers
-
Kazuhiro Nakadai,
Ken'ichi Hidai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time Active Human Tracking by Hierarchical Integration of
Audition and Vision.
Proceedings of IEEE-RAS International Conference
on Humanoid Robots (Humanoids2001), pp.91-98, IEEE,
Nov. 2001.
Paper in PDF
-
Kazuhiro Nakadai,
Tatsuya Matsui,
Hiroshi G. Okuno,
Hiroaki Kitano:
Epipolar Geometry Based Sound Localization and Extraction
for Humanoid Audition.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2001),
1395-1401, IEEE and RSJ, Oct. 2001.
Paper in PDF
doi:10.1109/IROS.2001.977176
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Ken-ichi Hidai,
Hiroshi Mizoguchi,
Hiroaki Kitano:
Human-Robot Interaction Through
Real-Time Auditory and Visual Multiple-Talker Tracking
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2001), 1402-1409,
IEEE and RSJ, Oct. 2001.
Paper in PDF
Nakamura Award for IROS-2001 Best Paper Nomination Finalist
(2nd or 3rd Place) at IROS-2002
doi:10.1109/IROS.2001.977177
-
Tino Lourens,
Hiroshi G. Okuno,
Hiroaki Kitano:
Automatic Graph Extraction from Color Images.
Proc. of 11th International Conference
Image Analysis and Processing (ICIAP 2001), pp.302-308,
Granada, Spain, June 2001.
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time Multiple Speaker Tracking by Multi-Modal Integration
for Mobile Robots.
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.2643-2646, Sep. 2001.
Paper in PDF
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
T. Lourens,
Hiroaki Kitano:
Separating Three Simultaneous Speeches with Two Microphones by
Integrating Auditory and Visual Processing.
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.1193-1196, Sep. 2001.
Paper in PDF
-
Akinobu Lee,
Tatsuya Kawahara,
and Kiyohiro Shikano:
Gaussian mixture selection using context-independent HMM,
Proc. IEEE-ICASSP, pp.69--72, 2001.
-
Hiroaki Nanjo,
Tatsuya Kawahara:
Speaking-Rate Dependent Decoding and Adaptation for Spontaneous Lecture Speech Recognition,
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), pp.725--728, 2002.
-
Kazunori Komatani,
K.Tanaka, H.Kashima, and
Tatsuya Kawahara:
Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model,
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.1319--1322, 2001.
-
Akinobu Lee,
Tatsuya Kawahara,
and Kiyohiro Shikano:
Julius -- an open source real-time large vocabulary recognition engine,
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.1691--1694, 2001
-
Hiroaki Nanjo,
Kazuomi Kato, and
Tatsuya Kawahara:
Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition,
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.2531--2534, 2001
-
Tatsuya Kawahara,
Hiroaki Nanjo,
and S.Furui:
Automatic transcription of spontaneous lecture speech,
Proc. IEEE workshop on Automatic Speech Recognition and Understanding,
2001.
-
Kazuhiro Nakadai,
Ken-ichi Hidai,
Hiroshi Mizoguchi,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time Auditory and Visual Multiple-Object Tracking for Robots.
Proc. of 17th International Joint Conference on Artificial Intelligence
(IJCAI-01)
, 1425-1432, Seattle, Aug. 2001.
電気通信普及財団テレコム技術賞奨励賞
Paper in PDF
-
Tino Lourens,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Detection of Oriented Repetitive Alternating Patterns in Color
Images -- A Computational Model of Monkey Grating Cells.
Proc. of Sixth International Work-Conference
on Artificial and Natural Neural Networks (IWANN2001),
Granada, Spain, June 2001.
LNCS 2084, 95-107, Springer-Verlag.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Tino Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot,
Proc. of 17th International Conference on Industrial and Engineering
Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2001)
,
Budapest, Hungary, June 2001.
Lecture Notes in Artificial Intelligence No.2070, 640-650, Springer.
Best Paper Award (1st Place)
-
Ian Frank,
Kumiko Tanaka,
Hiroshi G. Okuno,
Jun'ichi Akita,
Yukiko Nakagawa,
K. Maeda,
Kazuhiro Nakadai,
Hiroaki Kitano:
And The Fans are Going Wild! SIG plus MIKE.
RoboCup 2000: Robot Soccer World Cup IV,
Lecture Notes in Artificial Intelligence No.2019, 139-148,
Springer-Verlag, May 2001.
-
Yukiko Nakagawa,
Hiroshi G. Okuno,
Hiroaki Kitano:
Bridging gap between small sized league and simulator league.
RoboCup 2000: Robot Soccer World Cup IV,
Lecture Notes in Artificial Intelligence No.2019, 209-218,
Springer-Verlag, May 2001.
- Takamichi Saito,
Kentaro Umesawa,
Hiroshi G. Okuno:
An Access Control with Handling Private Information Server.
Proc. of the First International Workshop on
Internet Computing and E-Commerce (ICEC01),
IEEE, San Francisco, April 2001.
Last Update: Mon Nov 9 17:54:47 2009
Copyleft All Wrongs Reserved, 2001-2009.