Publication and Awards of Okuno Laboratory
DBLP:
H.G. Okuno,
K. Itoyama,
S. Nishide,
T. Mizumoto,
L.K. Cahier,
E-H. Kim,
T. Otsuka,
A. Lim,
T. Itohara,
K. Nagira,
OB:
T. Ogata,
T. Takahashi,
N. Yamakawa,
Y. Hirasawa,
N. Nishikawa,
R. Takeda,
W. Hinoshita,
A. Maezawa,
N. Yasuraoka,
K. Matsuyama,
H. Awano,
K. Komatani,
T. Yoshioka,
H. Fujihara,
Katsumaru,
S. Shiramatsu,
S. Ikeda,
H. Kanda
Y. Kubota
Sumi,
H-D. Kim,
S. Yamamoto,
K. Yoshii,
R. Yokoya
S. Naito,
T. Kitahara,
H. Niwa,
M. Yoshida,
T. Tasaki,
S. Matsumoto,
Valin,
Y. Akiba,
T. Watanabe,
K. Ishihara,
Kodaka,
M. Toda,
T. Misu,
I. Lane,
Y. Akita,
Y. Yamakata,
A. Raux,
Ito,
T. Kawahara,
Google Scnholar,
Microsoft Academic Search,
odysa
Academic Year 2013
Thesis |
Journal Papers |
Book Chapters |
International Conferences |
Domestic Conferences |
Patents
-
Tsuyoshi Tasaki,
Tetsuya Ogata,
Hiroshi G. Okuno:
The Iteraction between a Robt and Multiple People based on Spatially Mapping
of Friendliness and Motion Parameters,
Advanced Robotics, accepted with minor modification,
Jul. 2, 2013.
-
Ui-Hyun Kim,
Hiroshi G. Okuno:
Improved Binaural Sound Localization and Tracking for Unknown Time-
Varying Number of Speakers,
Advanced Robotics, accepted, Dec. 28, 2012.
in print, Vo.27, Issue 17 (Nov. 2013).
Published Online on 1 July 2013.
doi: 10.1080/01691864.2013.812177
-
Keisuke Nakamura,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
A real-tome super-resolution robot audition system that improves the robustness
of simultaneous speech recognition
Advanced Robotics, Vol.27 Issue 12, pp.933-945,
Published Online on 10 May 2013.
doi:10.1080/01691864.2013.797139
-
Daichi Sakaue,
Katsutoshi Itoyama,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robust Multipitch Analyzer against Initialization based on
Latent Harmonic Allocation using Overtone Corpus,
Journal of Information Processing, Vol.21, No.2 (Apr. 2013),
pp.246-256, IPSJ, Jan. 2013.
doi:10.2197/ipsjjip.21.246
-
Ui-Hyun Kim,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Improved Sound Source Localization and Front-Back Disambiguation for
Humanoid Robots with Two Ears,
Recent Trends in Applied Artificial Intelligence,
Lecture Notes in Computer Science, Vol.7906, pp.282-291,
Springer. June 17-21, Amsterdam, The Netherland.
(Acceptance rate for long papers: 34.7%).
The Best Paper Award (1/103 papers)
doi: 10.1007/978-3-642-38577-3_29
-
YangYang Huang,
Takuma Otsuka,
Hiroshi G. Okuno:
A Speaker Dialization System with Robust Speaker Localization and Voice
Activity Detection,
Comtemprary Challenges and solutions in Applied Artificial Intelligence,
Studies in Computational Intelligence, Vol. 489 (2013) pp.77-82,
Springer. June 17-21, Amsterdam, The Netherland.
doi: 10.1007/978-3-319-00651-2_11
-
Mayumi J. Hikita,
Hiroshi G. Okuno:
PROPOSAL OF INTERNATIONAL CONFERENCE PROMOTION --- Destination Branding and Risk Management by a network of conference centres ---,
Proceedings of the First International Conference of Serviceology,
accepted,
Oct. 16-18, AIST Tokyo Waterfront, Japan.
-
Kenta Mochizuki,
Shun Nishide,
Hiroshi G. Okuno,
Tetsuya Ogata:
DevelopmentalHuman-Robot Imitation Learning of Drawing with a Neuro Dynamical
System,
Proceeding of IEEE International Conference on Systems, Man, and Cybernetics (SMC 2011), part SMC: Human-Machine,
accepted, Manchester, UK, 13-16 Oct. 2013.
-
Yoshinori Bando,
Takeshi Muzumoto,
Katsutoshi Itoyama,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Posture Estimation of Horse-Shaped Robot using Microphone Array Localization,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2013), accepted, (acceptance rate 43%),
IEEE, RSJ, Tokyo Big Sight, Japan, 3-7 Nov. 2013.
-
Kotaro Furukawa,
Keita Okutani,
Kohei Nagira,
Takuma Otsuka,
Katsutoshi Itoyama,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Noise Correlation Matrix Estimation for Improving Sound Source Localization
by Multirotor UAV,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2013), accepted, (acceptance rate 43%),
IEEE, RSJ, Tokyo Big Sight, Japan, 3-7 Nov. 2013.
-
Naoki Hirayama,
Koichiro Yoshino,
Katsutoshi Itoyama,
Shunsuke Mori,
Hiroshi G. Okuno:
Automatic Estimation of Dialect Mixing Ratio for Dialect Speech Recognition,
Proceedings of International Conference on Spoken Language Processing
(Interspeech 2013), (acceptance rate 52%), Aug. , 2013.
Lyon, France.
-
Ui-Hyun Kim,
Hiroshi G. Okuno:
Robust Localization and Tracking of Multiple Speakers in Real Environments for Binaural Robot Audition,
Proceedings of the 14th International Workshop on Image and Audio Analysis
for Multimedia Interactive Services (WIA2MIS 2013),
pp.1-4, Paris, July 3-5, 2013.
-
Kazuki Yazawa,
Daichi Sakaue,
Kohei Nagira,
Katsutoshi Itoyama,
Hiroshi G. Okuno:
AUDIO-BASED GUITAR TABLATURE TRANSCRIPTION USING MULTIPITCH ANALYSIS AND PLAYABILITY CONSTRAINTS,
Proceedings of 2013 International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2013),
AASP-P.1.4, pp.196-200.
Vancouver, Canada, May 26-31.
-
Daichi Sakaue,
Takuma Otsuka,
Katsutoshi Itoyama,
Hiroshi G. Okuno:
INITIALIZATION-ROBUST BAYESIAN MULTIPITCH ANALYZER BASED ON PSYCHOACOUSTICAL AND MUSICAL CRITERIA,
Proceedings of 2013 International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2013),
AASP-P1.10, pp.226-230,
Vancouver, Canada, May 26-31.
-
Naoyuki Kanda,
Katsutoshi Itoyama:
Hiroshi G. Okuno:
MULTIPLE INDEX COMBINATION FOR JAPANESE SPOKEN TERM DETECTION WITH OPTIMUM INDEX SELECTION BASED ON OOV-REGION CLASSIFIER,
Proceedings of 2013 International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2013),
SLP-P5.7, pp.8540-8544,
Vancouver, Canada, May 26-31.
-
Tetsuya Ogata,
Hiroshi G. Okuno:
Integration of behaviors and languages with a hierarchal structure self-organized in a neuro-dynamial model,
IEEE Symposium Series on Computational Intelligence 2013, accepted
Singapore, Apr. 16-19, 2013.
-
Randy Gomez, Keisuke Nakamura,
Kazuhiro Nakadai,
Ui-Hyun Kim,
Hiroshi G. Okuno,
Tatsuya Kawahara:
Hands-Free Human Robot Communication Robust to Speaker's Radial Position,
Proceedings of 2013 IEEE International Conference on Robots and Automation
(ICRA 2013), accepted (acceptance rate 39%),
Karlsruhe, Germany, May 6-10, 2013.
-
Yoshiaki Bando,
Takuma Otsuka,
Takeshi Mizumoto,
Katsutoshi Itoyama,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
ホース型ロボットのマイクロホンアレイを用いた姿勢推定,
日本ロボット学会第31回学術講演会, , 首都大学東京, Sep. 4-6, 2013.
-
古川 孝太郎,
大塚 琢馬,
糸山 克寿,
中臺 一博,
奥乃 博:
Multirotor UAV を用いた音源定位のための雑音相関行列推定,
日本ロボット学会第31回学術講演会, , 首都大学東京, Sep. 4-6, 2013.
-
西牟田 勇哉,
平山 直樹,
大塚 琢馬,
杉山 治,
糸山 克寿,
奥乃 博:
HARKを用いたロボットクイズ司会者HATTACK25 の開発,
日本ロボット学会第31回学術講演会, , 首都大学東京, Sep. 4-6, 2013.
-
音楽音響信号生成システム,
発明者: 安部 武宏, 安良岡 直希, 糸山 克寿, 奥乃 博,
特許第5283289号,登録日2013 (平成25年) 6月7日.
特願2011-500614号, 出願日2011年8月9日, 国立大学法人京都大学.
-
Musical score position estimating apparatus, musical score position estimating method, and musical score position estimating program.
Inventors: Kazuhiro Nakadai, Takuma Otsuka, Hiroshi Okuno,
Patent No. US 8,440,901,
Date of Patent: May 14, 2013.
Date of Application: Mar. 1, 2011.
-
Audio Source Detection System,
Inventors: Hiroshi Tsujino, Kazuhiro Nakadai, Hiroshi Okuno, Takeshi Mizumoto, and Ikkyu Aihara.
Patent No. US 8,416,957,
Date of Patent: Apr. 9, 2013.
Date of Application: Dec. 4, 2009.
Academic Year 2012
Thesis |
Journal Papers |
Book Chapters |
International Conferences |
Domestic Conferences |
Patents
-
Daichi Sakaue,
Katsutoshi Itoyama,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robust Multipitch Analyzer against Initialization based on
Latent Harmonic Allocation using Overtone Corpus,
Journal of Information Processing, accepted,
IPSJ, Jan. 2013.
-
Kohei Nagira,
Takuma Otsuka,
Hiroshi G. Okuno:
Nonparametric Bayesina Sparse Factor Analysis for Frequency Doain Blind Source
Separation without Pearmuation Ambiguity,
EURASIP Journal on Audio, Speech, and Music Processing,
2013, 2013:3.
doi:10.1186/1687-4722-2013-4
目次 (学会サーバ)
-
Kazunori Komatani,
Mikio Nakano,
Masaki Katsumaru,
Kotaro Funakoshi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Allocation of Training Data for Speech Understanding based on
Multiple Model Combinations,
IEICE Transactions on Information and Systems,
Vol.E95-D, No.9 (Sep. 2012) pp.2298-2307.
pdf (学会サーバ)
-
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Long-Term Analysis of User Behaviors in Deployed Spoken Dialogue System
Dialogue & Discourse,
conditionally accepted pending required revisions, May 21, 2012.
-
Akira Maezawa,
Katsutoshi Itoyama,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automated Violin Fingering Transcription Through Analysis of an Audio Recording,
Computer Music Journal, Fall 2012, Vol.36, No.3, Pages 57-72
Posted Online August 14, 2012
doi:10.1162/COMJ_a_00129
(Free at MIT Press)
-
Shun Nishide,
Jun Tani,
Toru Takahashi,
Hiroshi G. Okuno,
Tetsuya Ogata:
Tool-Body Assimilation of Humanoid Robt using Neuro-Dynamical System,
IEEE Transactions on Autonomous Mental Development,
Vol.4, Issue:2 (june 2012) pp.139-149, 2012.
doi:10.1109/TAMD.2011.2177660
-
Angelica Lim:
Musical Robots and Interactive Multimodal Systems,
Book Review,
International Journal of Synthetic Emotions,
Vol. 3, No. 2 (2012) 84-86.
doi: 10.4018/jse.2012070105
-
Kohei Nagira,
Takuma Otsuka,
Tetsuya Ogata,
Hiroshi G. Okuno:
Infinite Sparse Factor Analysis for Blind Source Separation in Reverberant Environments,
SSPR/SPR2012,
Lecture Notes in Computer Science, Vol. 7626, pp.638-647,
Nov. 7-9. Hiroshima, Japan.
doi: 10.1007/978-3-642-34166-3_70
-
Angelica Lim,
Hiroshi G. Okuno:
Using speech data to recognize emotion in human gait,
Human Behavior Understanding,
A.A. Salah et al. (Eds): Human Behavior Understanding 2012,
Lecture Notes in Computer Science, Vol.7559, pp.52-64, Springer,
Algarve, Portgul, October 7, 2012.
acceptance rate 42%.
Abstract
doi: 10.1007/978-3-642-34014-7_5
-
Katsutoshi Itoyama,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Chord Recognition Based on Probabilistic Integration of Acoustic Features and Chord Transition,
He Jiang et al. (Eds.):
Advanced Research in Applied Artificial Intelligence,
IEA/AIE-2012, pp.58-67,
LNAI Vol.7345. Springer. June 9-12, Dalian, China.
doi: 10.1007/978-3-642-31087-4_7
-
Takeshi Mizumoto,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Adaptive Pitch Control for Robot Thereminist using Unscented Kalman Filter.
H. Jiang, M. Ali, and M. Li (Eds.), Modern Advances in Intelligent
Systems and Tools, Studies in In Computational Intelligence,
pp.19-24, Springer. June 9-12, Dalian, China, 2012. (IEA/AIE-2012)
doi: 10.1007/978-3-642-30732-4_3
-
Takeshi Mizumoto,
Hiromitsu Awano,
Yoshiaki Bando,
Playing with 3D Printer (in Japanese)
Joho Shori ,
Vol.53, No.8 (Aug. 2012)
Information Processing Society of Japan,
DL
-
Hiroshi G. Okuno:
Preface, Special Section for Summer Vacation,
Joho Shori ,
Vol.53, No.8 (Aug. 2012)
Information Processing Society of Japan,
DL
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Takeshi Mizumoto:
Sensing technology for listening to several things at once,
IEICE Magazine,
Vol.95, No.5 (May 2012) pp.401-404, IEICE.
-
Masataka Goto,
Hiroshi G. Okuno,
Preface
Information Processiong, Special Issue on Present and Future of CGM,
Vol.53, No.5 (May 2012) pp.464-465,
IPSJ
DL
-
Naoki Hirayama,
Shinsuke Mori,
Hiroshi G. Okuno:
Statistical Method of Building Dialect Language Models for ASR Systems,
Proceedings of the 24th International Conference
on Computational Linguistics (Coling-2012), 1179-1194,
Mumbai, India, Dec. 8-15, 2012.
-
Tatsuhiko Itohara,
Kazuhiro Nakadai,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improvement of Audio-Visual Score Following in Robot Ensemble with Human Guitarist,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2012), accepted as oral (acceptance rate 57% = 133/233),
IEEE, Osaka, Nov. 30 - Dec. 1, 2012.
-
Hiroshi G. Okuno:
Human Robot Interaction through Robot Audition,
Keynote,
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2012)
Workshop on Motivational Aspect of Robotics in Physical Therapy,
Vilamoura, Algarve, Portgul, October 12, 2012.
-
Keita Mochizuki,
Harumitsu Nobuta,
Shun Nishide,
Hiroshi G. Okuno,
Tetsuya Ogata:
Developmental Human-Robot Imitation Learning with Phased Structuring in
Neuro Dynamical Systems,
Proceedings of IROS-2012 Workshop on Cognitive Neuroscience Robotics,
Pos-3, 6 pages, IEEE, RSJ, Vilamoura, Algarve, Portgul, October 12, 2012.
-
Yuki Yamaguchi,
Harumitsu Nobuta,
Shun Nishide,
Hiroshi G. Okuno,
Tetsuya Ogata:
Developmental Human-Robot Imitation Learning with Phased Structuring in
Neuro Dynamical Systems,
Proceedings of IROS-2012 Workshop on Cognitive Neuroscience Robotics,
PM2-3, 6 pages, IEEE, RSJ, Vilamoura, Algarve, Portgul, October 12, 2012.
-
Takeshi Mizumoto,
Tetsuya Ogata,
Hiroshi G. Okuno:
Who is the leader in a multiperson ensemble? ---Multiperson human-robot ensemble model with leaderness---,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2012), pp.1413-1419 (812/1801=45.0%),
IEEE, RSJ, Vilamoura, Algarve, Portgul, October 7-12, 2012.
doi: 10.1109/IROS.2012.6385782
-
Joao Lobato Oliveira, Gokhan Ince, Keisuke Nakamura,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Luis Paulo Reis, Fabien Gouyon,
Live Assessment of Beat Tracking for Robot Audition,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2012), pp.992-997 (812/1801=45.0%),
IEEE, RSJ, Vilamoura, Algarve, Portgul, October 7-12, 2012.
doi 10.1109/IROS.2012.6386100
-
Takuma Otsuka,
Katsutoshi Ishiguro, Hiroshi Sawada,
Hiroshi G. Okuno:
Unified Auditory Functions based on Bayesian Topic Model,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2012), pp.2370-2376 (812/1801=45.0%),
IEEE, RSJ, Vilamoura, Algarve, Portgul, October 7-12, 2012.
doi: 10.1109/IROS.2012.6385787
-
Yusuke Yamamura,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Unified Auditory Functions based on Bayesian Topic Model,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2012), pp.2364-2369 (812/1801=45.0%),
IEEE, RSJ, Vilamoura, Algarve, Portgul, October 7-12, 2012.
doi: 10.1109/IROS.2012.6385765
-
Hiroshi G. Okuno:
Human Robot Interaction through Robot Audition
Keynote,
Workshop on Motivational Aspect of Robotics in Physical Therapy.
IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2012), Vilamoura, Algarve, Portgul, October 12, 2012.
-
Daichi Sakaue,
Takuma Otsuka,
Katsutoshi Itoyama,
Hiroshi G. Okuno:
Bayesian Nonnegative Harmonic-Temporal Factorization
and Its Application to Multipitch Analysis
Proceedings of 13th International Society for Musical Information
Retrieval Conference (ISMIR-2012),
pp.91-96, Porto, Portgul, Oct, 2012.
-
Joao Lobato Oliveira, Gokhan Ince, Keisuke Nakamura,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Luis Paulo Reis, Fabien Gouyon,
An Active Audition Framework for Auditory-driven HRI: Application to Interactive Robot Dancing,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man-2012), pp.1078-1085, IEEE, Paris, Sep 9-13, 2012.
doi: 10.1109/ROMAN.2012.6343892
-
Takuya Yoshioka,
Daichi Sakaue:
Log-normal matrix factorization with application to speech-music separation,
Proceedings of SAPA-SCALE Conference 2012,
pp.80-85, Portland, OR, 7-8 September, 2012.
-
Ikkyu Aihara,
Takeshi Mizumoto,
Takuma Otsuka,
Hiromitsu Awano,
Hiroshi G. Okuno,
Kazuyuki Aihara:
Possible Functions of Call Alternation in Frog Choruses,
Tenth International Congress of Neuroethology,
accepted,
Aug. 5-10, 2012, University of Maryland, College Park, MD, USA.
(poster)
doi: 10.3389/conf.fnbeh.2012.27.00267
-
Takeshi Mizumoto,
Hiromitsu Awano,
Ikkyu Aihara,
Takuma Otsuka,
Hiroshi G. Okuno:
Sound imaging system for visualizing multiple sound sources from two species,
Tenth International Congress of Neuroethology,
accepted,
Aug. 5-10, 2012, University of Maryland, College Park, MD, USA.
(poster)
doi: 10.3389/conf.fnbeh.2012.27.00247
-
Takuma Otsuka,
Katsuhiko Ishiguro, Hiroshi Sawada,
Hiroshi G. Okuno:
Bayesian Unification of Sound Source Localization and Separation with Permutation Resolution,
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence
(AAAI-12),
2038-2045 (26%, 294/1129), July 22-26 (26), 2012, Toronto, Canada.
AAAI server
-
Harumitsu Nobuta,
Kenta Kawamoto, Kuniaki Noda, Kohtaro Sabe,
Hiroshi G. Okuno,
Tetsuya Ogata:
Body area segmentation from visual scene based on predictability of neuro-dynamical system,
Proc. of the 2012 International Joint Conference on Neural Networks
(IJCNN 2012), pp.1-8, Brisbane, Australia, June 10-15, 2012.
doi:10.1109/IJCNN.2012.6252530
-
Shun Nishide,
Jun Tani,
Hiroshi G. Okuno,
Tetsuya Ogata:
Self-organization of Object Features Representing Motion Using Multiple Timescales Recurrent Neural Network,
Proc. of the 2012 International Joint Conference on Neural Networks
(IJCNN 2012), pp.1-8, Brisbane, Australia, June 10-15, 2012.
doi:10.1109/IJCNN.2012.6252714
-
Luis-Kenzo Furuya Cahier,
Tetsuya Ogata,
Hiroshi G. Okuno:
Incremental Probabilistic Geometry Estimation for Robot Scene Understanding,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2012),
pp.3265-3630, (acceptance rate 40%), May 14-18, 2012, St. Paul, MN.
doi: 10.1109/ICRA.2012.6225343
-
Speech recognition system and method for generating a mask of the system.
Inventors: Kazuhiro Nakadai, Toru Takahashi, Hiroshi Okuno,
Patent No. US 8,392,185,
Date of Patent: Mar. 5, 2013.
Date of Application: Aug. 19, 2009.
-
Reverberation suppressing apparatus and reverberation suppressing method .
Inventors: Kazuhiro Nakadai, Hirofumi Nakajima, Hiroshi Okuno, and Ryu Takeda
Patent No. US 8,391,505,
Date of Patent: Mar. 5, 2013.
Date of Application: Jun. 1, 2010.
-
Musical piece recommendation system and method .
Inventors: Masataka Goto, Kazuyoshi Yoshii, Hiroshi Okuno,
Patent No. US 8,370,277,
Date of Patent: Feb. 5, 2013.
Date of Application: Jul. 31, 2008.
-
音源分離システム, 音源分離方法及び音源分離用コンピュータプログラム,
発明者: 糸山 克寿, 奥乃 博, 後藤 真孝,
特許第5201602号, 登録日平成25年2月22日.
特願2009-511801号, 2009年4月14日.
-
音声認識装置及び音声認識装置のマスク生成法,
発明者: 中臺 一博, 高橋 徹, 奥乃 博,
特許第5180928号, 登録日平成25年1月18日.
特開2010-49249号, 2010年3月4日.
特願2009-185164号, 2009年8月7日.
-
音源分離システム,
発明者: 武田 龍, 中臺 一博, 辻野 広司, 奥乃 博,
特許第5178370号, 登録日平成25年1月18日.
特開2009-42754号, 2009年2月26日.
特願2008-191382号, 2008年7月24日.
-
音源追跡システム,方法,およびロボット,
発明者: 中臺 一博, 辻野 広司, 長谷川 雄二, 奥乃 博,
特許第5170440号, 登録日平成25年1月11日.
WO2007/129731, 2007年11月15日.
特願2008-514510号, 2007年5月9日. PCT/JP2007/059599
-
音源定位システム及び音源定位方法,
発明者: 中臺 一博, 奥乃 博, 大塚 琢馬,
特開2013-44950号, 2013年3月4日.
特願2011-182774号, 2011年8月24日.
-
Sound source tracking system, method and robot.
Inventors: Kazuhiro Nakadai, Hiroshi Tsujino, yuji Hasegawa, Hiroshi Okuno,
Patent No. US 8,155,331,
Date of Patent: Apr. 10, 2012.
Date of Application: May 9, 2007.
-
文単位検索方法, 文単位検索装置, コンピュータプログラム, 記憶媒体, 及び文書記憶装置,
発明者: 白松 俊, 駒谷 和範, 奥乃 博,
特許第5167546号, 登録日平成25年1月11日.
特願2008-530812号, 2007年3月16日.
出願者: 京都大学
-
ロボット,
発明者: 中臺 一博, 長谷川 雄二, 辻野 広司, 村田 和真, 武田 龍, 奥乃 博,
特許第5150573号, 登録日平成24年12月7日.
特開2010-026513号, 2010年2月4日.
特願2009-166049号, 2009年7月14日.
-
音分離装置、及び、それを備えたカメラユニット.
発明者:梅田 修志, 堀邊 隆介, 奥乃 博, 高橋 徹.
特開2012-238964号, 2012年12月6日.
特願2011-105404号, 2011年5月10日.
-
音楽音響信号と歌詞の時間的対応付けを自動で行うシステム及び方法,
発明者: 藤原 弘将, 奥乃 博, 後藤 真孝,
特許第5131904号,登録日: 2012年11月16日.
特開2008-134606号, 2008年6月12日.
特願2007-233682号, 2007年9月10日.
-
Language Understanding Device.
Inventors: Mikio Nakano, Hiroshi Okuno, Kazunori Komatani, Yuichiro Fukubayashi, Kotaro Funakoshi.
Patent No. US 8,244,522,
Date of Patent: Aug. 14, 2012.
Date of Application: May 20, 2008.
-
Sound source separation system, sound source separation method, and computer program for sound source separation.
Inventors: Katsutoshi Itoyama, Hiroshi Okuno, and Masataka Goto.
Patent No. US 8,239,052,
Date of Patent: Aug. 7, 2012.
Date of Application: Apr. 14, 2008.
-
楽譜位置推定装置、及び楽譜位置推定方法,
発明者: 中臺 一博, 大塚 琢馬, 奥乃 博
特開2012-168538号, 2012年9月6日.
特願2012-29802号, 2012年2月14日
-
言語理解装置,
発明者: 中野 幹生, 奥乃 博, 福林 雄一朗, 船越 孝太郎,
特許第50664834号,登録日: 2012年8月17日.
特開2008-293019号, 2008年12月4日.
特願2008-134401号, 2008年5月22日.
-
Sound source tracking system, method and robot」
Inventors: Kazuhiro Nakadai, Hiroshi Tsujino, Yuji Hasegawa, and Hiroshi Okuno.
Patent No. US 8,155,331,
Date of Patent: Apr. 10, 2012.
Date of Application: May 9, 2007.
Academic Year 2011
Thesis |
Journal Papers |
Book Chapters |
International Conferences |
Domestic Conferences |
Patents
- Yasuharu Hirasawa:
Under-Determined Blind Speech Separation Using a GMM-Based Sound Spectral Model.
Master Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2012.
- Naoki Nishikawa:
音楽情報検索のための歌詞と音響特徴量を用いた楽曲印象軌跡推定.
Master Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2012.
- Shinpei Aso:
歌声話声自動識別及び歌声の話声自動変換への応用.
Master Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2012.
- Angelica Lim:
Design and Implementation of Emotions for Humanoid Robots based on the
Modality-independent DESIRE Model.
Master Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2012.
- Hideki Takano:
Automatic CHord Recognition System with Adaptation to Modulation
Using Kye Slelection by Reliability,
MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Aug. 2011.
- Nobuhide Yamakawa:
Sound Source Recognition of Impulsive Sound Events using
Matching Pursuit and Formant-wave Function and
Audio Feature Design and Sound Source Adaptation for
Their COnversion of Sound-Imitation Words,
MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Aug. 2011.
-
Angelica Lim,
Takeshi Mizumoto,
Tetsuya Ogata,
Hiroshi G. Okuno:
A musical robot that synchronizes with a co-player using non-verbal cues,
Advanced Robotics, Special Issue on Cutting Edge of Robtics in Japan,
Vol.26 (2012) pp.363-381.
doi:10.1163/156855311X614626
-
Angelica Lim,
Tetsuya Ogata,
Hiroshi G. Okuno:
Towards expressive musical robots: A cross-modal framework
for emotional gesture, voice and music,
EURASIP Journal on Audio, Speech, and Music Processing,
2012:3, Published: 17 January 2012.
doi:10.1186/1687-4722-2012-3
-
Tatsuhiko Itohara,
Takuma Otsuka, Takeshi Muzumoto,
Angelica Lim,
Tetsuya Ogata,
Hiroshi G. Okuno:
A multi-modal tempo and beat tracking system based on
audio-visual information from live guitar performances.
EURASIP Journal on Audio, Speech, and Music Processing,
2012, 2012:6, Published: 20 January 2012.
doi:10.1186/1687-4722-2012-6
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Efficient Blind Dereverberation and Echo Cancellation based on Independent Component Analysis for Actual Acoustic Signals,
Neural Computation,
Vol.24, Issue 1 (Jan. 2012) pp. 234-272, MIT Press.
doi:10.1162/NECO_a_00219
Posted online Dec. 9, 2011. IF: 2.290
-
Tsuyoshi Tasaki, Fumio Ozaki, Nobuto Matsuhira,
Tetsuya Ogata,
Hiroshi G. Okuno:
People Detection Based on Spatial Mapping of Friendliness and Floor Boundary
Points for a Mobile Navigation Robot,
Journal of Robotics, vol. 2011, Article ID 683975, 10 pages, 2011.
doi:10.1155/2011/683975
Hindawi Publishing
Corp.
-
Kazunori Komatani,
Kyoto Matsuyama,
Ryu Takeda,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Spoken Dialogue System that Uses Information on Locutionary Acts to Interprete
User Utterances,
IPSJ Journal, Vol.52, No.12 (Dec., 2011) pp.3374-3385, IPSJ.
,
DL
-
Shinsuke Mori,
Kazunori Komatani,
Masaki Katsumaru,
Tetsuya Okgata,
Hiroshi G. Okuno:
Automatic Vocabulary Expansion for Abbreviation Recognition in Spoken Dialogue
System,
IPSJ Journal, Vol.52, No.12 (Dec., 2011) pp.3398-3407, IPSJ.
,
DL
-
Angelica Lim,
Takeshi Mizumoto,
Takuma Otsuka,
Luis-Kenzo Furuya Cahier,
Tetsuya Ogata,
Hiroshi G. Okuno:
Musical Robot Co-Player: Real-time Syncrhorinzation with a Human Flutist
Recognizing Visual Start and End Cues,
IPSJ Journal, Vol.52, No.12 (Dec., 2011) pp.3599-3610, IPSJ.
,
DL
-
Naoki Yasuraoka,
Takuya Yoshioka,
Katsutoshi Itoyama,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Musical Sound Separation and Synthesis Using Harmonic/Inharmonic GMM and NMF
for Phrase Replacing System,
IPSJ Journal, Vol.52, No.12 (Dec., 2011) pp.3839-3852, IPSJ.
,
DL
-
Hiromasa Fujihara,
Masataka Goto,
Jun Ogata,
Hiroshi G. Okuno:
LyricSynchronizer: Automatic Synchronization Method Between Musical Audio Signals and Lyrics,
IEEE Journal of Selected Topics in Signal Processing, Vol.5, No.6
(Oct. 2011) pp.1252-1261,
doi:10.1109/JSTSP.2011.2159577
-
Tetsuya Ogata,
Tetsuo Sawaragi, Tadahiro Taniguchi:
Preface,
Advanced Robotics, Vol.25, No.17 (2011) pp. 2125-2126.
10.1163/016918611X59476
-
Zhang Yang,
Tetsuya Ogata,
Shun Nishide,
Toru Takahashi,
Hiroshi G. Okuno:
Classification of Known and Unknown Environmental Sounds based on
Self-organized Space using Recurrent Neural Network,
Advanced Robotics, Vol.25, No.17 (2011) pp. 2127-2141.
10.1163/016918611X595017
-
Shun Nishide,
Jun Tani,
Hiroshi G. Okuno,
Tetsuya Ogata:
Towards Written Text Recognition based on Handwriting Experiences
using Recurrent Neural Network,
Advanced Robotics, Vol.25, No.7 (2011)
pp.2173-2187.
10.1163/016918611X595026
-
Takeshi Mizumoto,
Ikkyu Aihara,
Takuma Otsuka,
Ryu Takeda,
Kazuyuki Aihara,
Hiroshi G. Okuno:
Sound imaging of nocturnal animal calls in their natural habitat,
Journal of Comparative Physiology A:
Neuroethology, Sensory, Neural, and Behavioral Physiology,
Vol.197, No.9, 915-921,
Online First, 17 May 2011.
doi:10.1007/s00359-011-0652-7
,
html,
Supplementary material at MetaPress.
-
Wataru Hinoshita,
Horiaki Arie, Jun Tani,
Hiroshi G. Okuno,
Tetsuya Ogata:
Emergence of Hierarchical Structure mirroring Linguistic Composition
in a Recurrent Neural Network,
Neural Networks, Vol.24, Issue 4 (May 2011) pages 311-320, Elsevier.
doi:10.1016/j.neunet.2010.12.006
-
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Development of Robot Auditon Open-Sourced Software HARK (in Japanese),
Digital Practice,
Vol.2, No.2 (Apr. 2011) pp.133-140, IPSJ.
-
Kohei Sumi,
Katsutoshi Itoyama,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Chord Recognition Based on Integration of Chord and Bass Pitches
Features (in Japanese),
IPSJ Journal, Vol.52, No.4 (Apr. 2011) pp.1803-1812.
Information Processing Society of Japan.
,
DL
-
Kohei Nagira,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Complex Extension of Infinite Sparse Factor Analysis for Blind Source
Separation of Speech Signals,
F. Theis et al. (Eds.): LVA/ICA 2012,
Lecture Notes in Computer Science 7191, Springer-Verlag, pp.388-396, 2012.
Tel-Aviv, Israel, Mar. 12-15, 2012.
-
Yasuharu Hirasawa,
Naoki Yasuraoka,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
A GMM Sound Source Model for Blind Speech Separation in Under-determined
Condisions,
F. Theis et al. (Eds.): LVA/ICA 2012,
Lecture Notes in Computer Science 7191, Springer-Verlag, pp.446-453, 2012.
Tel-Aviv, Israel, Mar. 12-15, 2012.
-
Hiromitsu Awano,
Shun Nishide,
Hiroaki, Arie, June Tani,
Hiroshi G. Okuno,
Tetsuya Ogata:
Use of a Sparse Structure to Improve Learning Performance of
Recurrent Neural Networks,
Proceeding of 18th International Conference on Systems, Man, and Cybernetics (ICONIP 2011),
Part III, pp.323-331,
Lecture Notes in Computer Science 7064, Springer-Verlag, Shanghei, Nov. 13-17, 2011.
-
Hiromitsu Awano,
Shun Nishide,
Hiroaki, Arie, June Tani,
Hiroshi G. Okuno,
Tetsuya Ogata:
Use of a Sparse Structure to Improve Learning Performance of
Recurrent Neural Networks,
Proceeding of 18th International Conference on Systems, Man, and Cybernetics (ICONIP 2011),
Part III, pp.323-331,
Lecture Notes in Computer Science 7064, Springer-Verlag, Shanghei, Nov. 13-17, 2011.
-
Kazunori Komatani,
Kyoko Matsuyama,
Ryu Takeda,
Tetsuya Ogata,
Hiroshi G. Okuno:
Evaluation of Spoken Dialogue System that uses Utterance Timing
to Interprete User Utterances,
R.L-C. Delgado and T. Kobayashi (Eds.):
Proceedings of International Workshop on Spoken Dialogue Systems
(IWSDS2011), pp.315-326, Springer,
Sep. 2011.
doi:10.1007/978-1-4614-1335-6
-
Yasuharu Hirasawa,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot with Two Ears Listens to More Than Two Simultaneous Utterances by Exploiting Harmonic Structures,
K.G. Mehrotra et al. (Eds.): IEA/AIE-2011,
Part I, LNAI 6703, pp.348-358. Springer.
Syracuse, NY, June 28 - July 1, 2011.
-
Nobuhide Yamakawa,
Toru Takahashi,
Tetsuro Kitahara,
Tetsuya Ogata,
Hiroshi G. Okuno:
Environmental Sound Recognition for Robot Audition using Matching-pursuit,
,
K.G. Mehrotra et al. (Eds.): IEA/AIE-2011,
Part II, LNAI 6704, pp.1-10. Springer,
Syracuse, NY, June 28 - July 1, 2011.
-
Zhang Yang,
Tetsuya Ogata,
Shun Nishide,
Toru Takahashi,
Hiroshi G. Okuno:
Cluster Self-organization of Known and Unknown Environmental Sounds
using Recurrent Neural Network,
T. Honkela, W. Duch, M. A. Girolami, S. Kaski (Eds.):
Artificial Neural Networks and Machine Learning - ICANN 2011,
LNCS 6791, pp.167-175, Springer,
(58%, 108/185), Espoo, Finland, June 14-17, 2011.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hyun-Don Kim:
Robot Audition: Missing Feature Theory Approach and Active Audition,
K. Pradalier, R. Siegward, and G. Hirzinger (Eds.): Robotics Research,
STAR 70, pp.227-244, Springer-Verlag.
doi:10.1007/978-3-642-19457-3_14
-
Luis-Kenzo Furuya Cahier,
Tetsuya Ogata,
Hiroshi G. Okuno:
Incremental Probabilistic Geometry Estimation for Robot Scene Understanding,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2012),
accepted (acceptance rate 40%), May 14-18, 2012, St. Paul, MN.
-
Daichi Sakaue,
Katsutoshi Itoyama,
Tetsuya Ogata,
Hiroshi G. Okuno:
INITIALIZATION-ROBUST MULTIPITCH ESTIMATION BASED ON LATENT HARMONIC ALLOCATION USING OVERTONE CORPUS,
Proceedings of 2012 International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2012),
accepted, IEEE, Kyoto, Japan, March 25-30, 2012.
-
Harumitsu Nobuta,
Shun Nishide,
Hiroshi G. Okuno,
Tetsuya Ogata:
Identification of self-body based on dynamic predictability using
neuro-dynamical system,
Proceedings of 2011 IEEE/SICE International Symposium on
System Integration (SII2011),
accepted,
Dec. 20-22, 2011, Kyoto.
-
Shotaro Sano,
Shun Nishide,
Hiroshi G. Okuno,
Tetsuya Ogata:
Predicting Listener Back-Channels for Human-Agent Interaction using
Neuro-dynamical Model,
Proceedings of 2011 IEEE/SICE International Symposium on
System Integration (SII2011),
accepted,
Dec. 20-22, 2011, Kyoto.
-
Naoki Nishikawa,
Hiromasa Fujihara,
Masataka Goto,
Katsutoshi Itoyama,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Musical Mood Trajectory Estimation Method Using Lyrics and
Acoustic Features,
Proceedings of International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies (MIRUM'11),
51-56, ACM, Nov. 28 - Dec. 1, 2011, Scottsdale, AZ.
-
Angelica Lim,
Tetsuya Ogata,
Hiroshi G. Okuno:
Converting emotional voice to motion for robot telepresence,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2011),
accepted as oral (acceptance rate 17.4% = 28/190),
IEEE, Bled, Slovenia, Oct. 26-28, 2011.
-
Takuma Otsuka,
Kazuhiro Nakadai,
Tetsuya Ogata,
Hiroshi G. Okuno:
Incremental Bayesian Audio-to-Score Alignment with
Flexible Harmonic Structure Models,
Proceedings of 12th International Society for Musical Information
Retrieval Conference (ISMIR-2011), accepte
Miami, FL, Oct. 24-28. 2011.
-
Angelica Lim,
Takeshi Muzumoto,
Takuma Otsuka,
Tatsuhiko Itohara,
Kazuhiro Nakadai,
Tetsuya Ogata,
Hiroshi G. Okuno:
More cowbell! A musical ensemble with the NAO thereminist,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2011), IROS 2011 Standard Platform Demo,
IEEE, RSJ, San Francisco, 25-30 Sep. 2011.
-
Tatsuhiko Itohara,
Takeshi Muzumoto,
Takuma Otsuka,
Tetsuya Ogata,
Hiroshi G. Okuno:
Particle-filter Based Audio-visual Beat-tracking for Music Robot Ensemble with Human Guitarist,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2011), pp.118-124, (790/2459 =32.1%),
IEEE, RSJ, San Francisco, 25-30 (26) Sep. 2011.
doi:10.1109/IROS.2011.6094773
IEEE Robotics and Automation Society Japan Chapter Young Award
-
Eui-Hyun Kim,
Takeshi Muzumoto,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improvement of Speaker Localization by Considering Multipath Interference of Sound Wave for Binaural Robot Audition,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2011), pp.2910-2915, (790/2459 =32.1%),
IEEE, RSJ, San Francisco, 25-30 Sep. 2011.
doi:10.1109/IROS.2011.6094778
-
Shun Nishide,
Jun Tani,
Hiroshi G. Okuno,
Tetsuya Ogata:
Handwriting Prediction Based Character Recognition using
Recurrent Neural Network,
Proceeding of IEEE International Conference on Systems, Man, and Cybernetics (SMC 2011),
pp.2549-2554, June 2011.
doi:10.1109/ICSMC.2011.6084060
-
Takuma Otsuka,
Kazuhiro Nakadai,
Tetsuya Ogata,
Hiroshi G. Okuno:
Bayesian Extension of MUSIC for Sound Source Localization and
Tracking,
Proceedings of International Conference on Spoken Language Processing
(Interspeech 2011), pp.3109-3112, (oral), Aug. 30, 2011.
Florence, Italy.
-
Yasuharu Hirasawa,
Naoki Yasuraoka,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Fast and simple iterative algorithm of Lp-norm minimization
for under-determined speech separation,
Proceedings of International Conference on Spoken Language Processing
(Interspeech 2011), pp.1745-1748,
Florence, Italy, Aug. 29, 2011.
-
Shun Nishide,
Hiroshi G. Okuno,
Tetsuya Ogata,
Jun Tani:
Handwriting prediction based character recognition using recurrent neural network
Proceeding of IEEE International Conference on Systems, Man, and Cybernetics (SMC 2011),
pp. 2549-2554, Anchorage, Oct. 9-12, 2010.
-
Mikio Nakano, Shun Sato,
Kazunori Komatani,
Kyoto Matsuyama,
Kotaro Funakoshi,
Hiroshi G. Okuno:
A Two-Stage Domain Selection Framework for Extensible Multi-Domain Spoken Dialogue Systems,
Proceedings of the 12th SIGDIAL Meeting on Discourse and Dialogue
(SIGDIAL 2011), pp.18-29, accepted as an oral presentation,
June 17-18, 2011, Portland, OR, USA.
Best paper award nomination finalist (4 papers)
-
Katsutoshi Itoyama,
Masataka Goto,
Tetsuya Ogata,
Hiroshi G. Okuno:
SIMULTANEOUS PROCESSING OF SOUND SOURCE SEPARATION AND MUSICAL INSTRUMENT IDENTIFICATION USING BAYESIAN SPECTRAL MODELING,
Proceedings of 2011 International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2011),
pp.3816-3819, Poster, IEEE, Prague, Czech Republic, May 22-28, 2011.
doi:10.1109/ICASSP.2011.5947187
-
Akira Maezawa,
Hiroshi G. Okuno,
Tetsuya Ogata,
Masataka Goto:
POLYPHONIC AUDIO-TO-SCORE ALIGNMENT BASED ON BAYESIAN LATENT HARMONIC ALLOCATION HIDDEN MARKOV MODEL,
Proceedings of 2011 International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2011),
pp.3816-3819, Poster, IEEE, Prague, Czech Republic, May 22-28, 2011.
-
Naoki Yasuraoka,
Hirokazu Kameoka,
Takuya Yoshioka,
Hiroshi G. Okuno:
I-DIVERGENCE-BASED DEREVERBERATION METHOD WITH AUXILIARY FUNCTION APPROACH,
Proceedings of 2011 International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2011),
pp.369-372, Poster, IEEE, Prague, Czech Republic, May 22-28, 2011.
-
Takeshi Mizumoto,
Takami Yoshida,
Kazuhiro Nakadai,
Ryu Takeda,
Takuma Otsuka,
Toru Takahashi,
Hiroshi G. Okuno:
Design and Implementation of Selectable Sound Separation on a
Texai Telepresence System using HARK,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2011), pp.2130-2137,
May 9-13 (10), 2011, Shanghai, China.
Academic Year 2010
Thesis |
Journal Papers |
Book Chapters |
International Conferences |
Domestic Conferences |
Patents
- Ryu Takeda:
A Unified Framework of Blind Separation, Blind Dereverberation and Self-Voice
Cancellation for Real-Time Robot Audition,
Ph.D Thesis (Supervisor: Prof. Hiroshi G. Okuno), Jan. 2011.
- Katsutoshi Itoyama:
,
Ph.D Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
- Takuma Otsuka:
Real-time Audio-to-Score Alignment
using Particle Filter for Co-player Robots,
MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
- Wataru Hinoshita
Cognitive Integration of Language and Sensory-Motor System for Robots
using Neuro-Dynamical Models,
MS Thesis (Supervisor: Assoc. Prof. Tetsuya Ogata), Feb. 2011.
- Akira Maezawa
Score-Aided Inference of Classical Music Interpretation,
MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
- Kyoko Matsuyama
MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
- Naoki Yasuraoka
Musical Audio Signal Modeling Based on Harmonic-Domain
Parametric NMF and I-Divergence-Based Dereveberation for
Application to Phrase Replacing System,
MS Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
- Tatsuya Itohara
Particle Filter-Based Audio-Visual Beat Tracking for Music Robot Ensemble
with Human Guitarist,
BE Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
- Shotaro Sano
Prediction of Back Channel Timing using Neurodynamical Model,
BE Thesis (Supervisor: Assoc. Prof. Tetsuya Ogata), Feb. 2011.
- Kohei Nagira
Blind Source Separation of Actual Speech Signals in Time-Frequency Domain
using isFA,
BE Thesis (Supervisor: Prof. Hiroshi G. Okuno), Feb. 2011.
- Harumitsu Nobuta
Identification of self body and acquisition of body scheme based on
dynamic predictability using neuro-dynamical system,
BE Thesis (Supervisor: Assoc. Prof. Tetsuya Ogata), Feb. 2011.
- Zhang Yang:
Prediction and Classification of Environmental Sounds using Recurrent
Neural Network,
Master Thesis (Supervisor: Assoc. Prof. Tetsuya Ogata), Sep. 2010.
-
Ikkyu Aihara,
Ryu Takeda,
Takeshi Mizumoto,
Takuma Otsuka,
Toru Takahashi,
Hiroshi G. Okuno,
Kazuyuki Aihara:
Complex and Transitive Synchronization in a Frustrated System of
Calling Frogs,
Physical Review E,
Vol.83, Issue 3. 031913 (2011) [5 pages],
21 Mar. 2011.
doi:10.1103/PhysRevE.83.031913
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Query-by-Example Music Information Retrieval by Score-Informed Source Separation
and Remixing Technologies,
EURASIP Journal on Advances in Signal Processing,
Vol.2010, Article ID 172961, 14 pages, Hindawi Pub., Jan. 2011.
online page,
doi:10.1155/2010/172961
-
Takuma Otsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Real-Time Audio-to-Score Alignment Using Particle Filter for Co-player Music Robots,
EURASIP Journal on Advances in Signal Processing,
Vol.2011, Article ID 384651, 13 pages, 2011, Hindawi Pub.
online page,
doi:10.1155/2011/384651
-
Takuya Yoshioka,
Tomohiro Nakatani, Masato Miyoshi,
Hiroshi G. Okuno:
Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization,
IEEE Transactions on Audio, Speech and Language Processing,
Vol.19, Issue 1 (Jan. 2011) pp.69-84, IEEE.
doi:10.1109/TASL.2010.2045183
-
Mikio Nakano, Yuji Hasegawa, Kotaro Funakoshi, Yohane Takeuchi, Toyotaka Torii,
Kazuhiro Nakadai,
Naoyuki Kandai,
Kazunori Komatani,
Hiroshi G. Okuno,
Hiroshi Tsujino:
A multi-expert model for dialogue and behavior control of conversational
robots and agents.
Knowledge-Based Systems, Vol.24, No.2 (Mar. 2011) pp.248-256, Elsevier.
doi:10.1016/j.knosys.2010.08.004,
Preprint
-
Kazunori Komatani,
Yuichiro Fukubayashi,
Satoshi Ikeda,
Tetsuya Ogata,
Hiroshi G. Okuno:
Selecting Help Messages by Using Robust Grammar Verification for
Handling Out-of-Grammar Utterances in Spoken Dialogue Systems.
IEICE Transactions Information and Systems, Vol.E93-D, No.12 (Dec. 2010) pp.3359-3367.
doi:10.1587/transinf.E93.D.3359
-
Hiromasa Fujihara,
Masataka Goto,
Hiroshi G. Okuno,
Simultaneous Estimation of Fundamental Frequency of vocal and Vowel Phonems
in Polyphonic Music
(in Japanese),
Journal of Information Processing Society of Japan,
Vol.51, No.10 (Oct. 2010) pp.1995-2006.
DL
-
Takeshi Mizumoto,
Hiroshi Tsujino,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno,
Development of a Theremin Player Robot Based on Arm-Position-to-Pitch and
-Volume Models
(in Japanese),
Journal of Information Processing Society of Japan,
Vol.51, No.10 (Oct. 2010) pp.2007-2019.
DL
-
Takami Yoshida,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
An Improvement in Auido-Visual Voice Activity Detection for Automatic Speech Recognition,
Journal of Robotic Society of Japan, Vol.28, No.8 (Oct. 2010) pp.970-977.
Abstract
-
Tetsuya Ogata,
Shun Nishide,
Hideki Kozima,
Kazunori Komatani,
Hiroshi G. Okuno:
Inter-modality Mapping in Robot with Recurrent Neural Network,
Pattern Recognition Letters, Vol.31, Issue 12 (Sep. 2010) 1560-1569.
doi:10.1016/j.patrec.2010.05.002
-
Wataru Hinoshita,
Tetsuya Ogata,
Hideki Kojima,
Toru Takahashi,
Hiroshi G. Okuno:
Journal of Robotic Society of Japan, Vol.28, No.4 (Apr. 2010) 532-543.
Abstract
-
Takuya Yoshioka,
Tomohiro Nakatani, Masato Miyoshi,
Hiroshi G. Okuno:
Blind Separation and Dereverberation of Speech Mixtures by
Joint Optimization,
IEEE Transactions on Audio, Speech and Language Processing,
in print, IEEE, Mar. 2010.
doi:10.1109/TASL.2010.2045183
-
Masaki Katsumaru,
Mikio Nakano,
Kazunori Komatani,
Kotaro Funakoshi, Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improving Speech Understanding Accuracy by Using Multiple Language Models and Multiple Language Understanding Models,
IEICE Trans D, Special Issue on Information Explosion,
Vol.J93-D, No.6 (June 2010) 879-888, IEICE.
PDF at IEICE server
-
Kazuhiro Nakadai,
Toru Takahashi,
Hiroshi G. Okuno,
Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
Design and Implementation of Robot Audition System "HARK"
- Open Source Software for Listening to Three Simulteaneous Speakers,
Advanced Robotics, Vol.24 (2019) 739-761,
VSP and Robotics Society of Japan.
doi:10.1163/016918610X493561
-
Tetsuya Ogata,
Wataru Hinoshita:
Symbolic Processes in Multi-modara Communication between Robots,
System/Control/Information, Vol.54, No.11 (Nov. 2011) pp.434-439.
-
Wataru Hinoshita,
Horiaki Arie, Jun Tani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Recognition and Generation of Sentences through Self-organizing Linguistic Hierarchy using MTRNN,
Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.):
Trends in Applied Intelligent Systems
,
LNAI 6098, 42-51,
Cordoba, Spain, June 1-4 (2), 2010.
-
Takuma Otsuka,
Takeshi Mizumoto,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Music-ensemble robot that is capable of playing the theremin while listening to the accompanied music,
Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.):
Trends in Applied Intelligent Systems
,
LNAI 6096, 102-112,
Cordoba, Spain, June 1-4 (2), 2010.
Best paper award,
Heisei 22nd Year C\amp; C Young Researcher Best Paper Award
-
Akira Maezawa,
Katsutoshi Itoyama,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Violin Fingering Estimation Based on Violin Pedagogical Fingering Model Constrained by Bowed Sequence Estimation from Audio Input,
Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.):
Trends in Applied Intelligent Systems
,
LNAI 6098, 249-259,
Cordoba, Spain, June 1-4 (3), 2010.
-
Kyoko Matsuyama,
Kazunori Komatani,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing,
Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.):
Trends in Applied Intelligent Systems
,
LNAI 6097, 585-594,
Cordoba, Spain, June 1-4 (3), 2010.
-
Shun Shiramatsu,
Jun Takasaki, Tatiana Zidrasco, Tadachika Ozono, Toramatsu Shintani,
Hiroshi G. Okuno:
System for Supporting Web-based Public Debate Using Transcripts of Face-to-face Meeting,
Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.):
Trends in Applied Intelligent Systems
,
LNAI 6098, 311-320,
Cordoba, Spain, June 1-4 (4), 2010.
-
Takumi Yoshida,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
An Improvement in Auido-Visual Voice Activity Detection for Automatic Speech Recognition,
Nicol\'{a}s Garc\'{\i}a-Pedrajas, Francisco Herrera, Colin Fyfe, Jos\'{e} Manuel Ben\'{\i}tez, and Moonis Ali (Eds.):
Trends in Applied Intelligent Systems
,
LNAI 6096, 51-61,
Cordoba, Spain, June 1-4 (2), 2010.
-
Zhang Yang,
Tetsuya Ogata,
Shun Nishide,
Toru Takahashi,
Hiroshi G. Okuno:
Method of Discriminating Known and Unknown Environmental Sounds using Recurrent Neural Network,
Proceedings of oint 5th International Conference on Soft Computing and
Intelligent Systems and 11th International Symposium on advanced Intelligent
Systems (SCIS & ISIS 2010), pp. 378-383, Okayama, JAPAN, December 8-12, 2010.
-
Takuma Otsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Two-level Synchronization using Particle Filter for Co-player Music Robots,
Proceedings of IEEE/RSJ-2010 Workshop on Robots and Musical Expression,
CD-ROM, Oct. 18, 2010, Taipei, Taiwan.
-
Takeshi Mizumoto,
Angelica Lim,
Takuma Otsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno
:
Integration of flutist gesture recognition and beat tracking for human-robot
ensemble,
Proceedings of IEEE/RSJ-2010 Workshop on Robots and Musical Expression,
CD-ROM, Oct. 18, 2010, Taipei, Taiwan.
-
Angelica Lim,
Takeshi Mizumoto,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Programming by Playing and Approaches for Expressive Robot Performances,
Proceedings of IEEE/RSJ-2010 Workshop on Robots and Musical Expression,
CD-ROM, Oct. 18, 2010, Taipei, Taiwan.
-
Yasuharu Hirasawa,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Exploiting Harmonic Structures to Improve Separating Simultaneous Speech in Under-Determined Conditions
(Invited paper),
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2010), pp.450-457 (49.1%), TuBT12.5,
IEEE, RSJ, Taipei, 18-22 Oct. 2010.
doi:10.1109/IROS.2010.5651078
IEEE Robotics and Automation Society Japan Chapter Young Award
-
Toru Takahashi,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
An Improvement in Automatic Speech Recognition Using Soft Missing Feature Masks for Robot Audition,
(Invited paper),
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2010), pp.964-969, TuCT12.2,
IEEE, RSJ, Taipei, 18-22 Oct. 2010.
-
Takumi Yoshida,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Two-Layered Audio-Visual Speech Recognition for Robots in Noisy Environments
(Invited paper),
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2010), pp.988-993, TuCT12.6,
IEEE, RSJ, Taipei, 18-22 Oct. 2010.
doi:10.1109/IROS.2010.5651205
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Speedup and Performance Improvement of ICA-based Robot Audition by Parallel and
Resampling-based Block-wise Processing
(Invited paper),
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2010), pp.1949-1954 (49.1%), TuET12.1,
IEEE, RSJ, Taipei, 18-22 Oct. 2010.
doi:10.1109/IROS.2010.5652757
-
Takeshi Mizumoto,
Takuma Otsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Human-Robot Ensemble between Robot Thereminst and Human Percussionist using Coupled Oscillator Model,
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2010), pp.1957-1962, TuET12.2,
IEEE, RSJ, Taipei, 18-22 Oct. 2010.
doi:10.1109/IROS.2010.5650364
-
Angelica Lim,
Takeshi Mizumoto,
Lois-Kenzo Cahier,
Takuma Otsuka,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Musical Accompaniment: Integrating Audio and Visual Cues for Real-time Synchronization with a Human Flutist
(Invited paper),
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2010), pp.1964-1969, TuET12.3,
IEEE, RSJ, Taipei, 18-22 Oct. 2010.
doi:10.1109/IROS.2010.5650427
IROS-2011NTF Award for Entertainment Robots and Systems.
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Toru Takahashi,
Kazunori Komatani,
Hiroshi G. Okuno:
Motion Generation Based on Reliable Predictability using Self-organized Object Features
(Invited paper),
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and
Systems (IROS-2010), pp.3453-3458, WeCT13.2,
IEEE, RSJ, Taipei, 18-22 Oct. 2010.
doi:10.1109/IROS.2010.5652609
-
Nobuhide Yamakawa,
Tetsuro Kitahara,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Effects of modelling within- and between-frame temporal
variations in power spectra on non-verbal sound recognition,
Proceedings of International Conference on Spoken Language Processing
(Interspeech 2010), 2342-2345, (oral),
Makuhari, 29 Sep. 2010.
-
Kyoko Matsuyama,
Kazunori Komatani,
Ryu Takeda,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Analyzing User Utterances in Barge-in-able Spoken Dialogue System for Improving Identification Accuracy
Proceedings of International Conference on Spoken Language Processing
(Interspeech 2010), 3050-3053, (acceptance rate 58.2%),
Makuhari, 30 Sep. 2010.
-
Hideki Kawahara, Masanori Morise,
Toru Takahashi,
Hideki Banno, Ryuichi Nishimura, Toshio Irino:
Simplification and extension of non-periodic excitation source representation
for high-quality speech manipulation systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech 2010), 38-41, (oral),
Makuhari, 27 Sep. 2010.
-
Kazunori Komatani,
Hiroshi G. Okuno:
Online Error Detection of Barge-In Utterances by Using Individual
Users' Utterance Histories in Spoken Dialogue System,
Proceedings of the 11th SIGDIAL Meeting on Discourse and Dialogue
(SIGDIAL 2010), 289-296,
Tokyo, Sep. 24-25, 2010.
at SIGDial Server.
-
Hiromitsu Awano,
Tetsuya Ogata,
Shun Nishide,
Toru Takahashi,
Kazunori Komatani,
Hiroshi G. Okuno:
Human-Robot Cooperation in Arrangement of Objects Using Confidence Measure of
Neuro-dynamcal Systems,
Proceeding of IEEE International Conference on Systems, Man, adn Cybernetics (SMC 2010),
accepted, June 2010.
-
Shimpei Aso,
Takuya Saitou, Masataka Goto,
Katsutoshi Itoyama,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SpeakBySinging: Converting Singing Voices to Speaking Voices While Retaining Voice Timbre,
Proceedings of the 13th International Conference on Digital Audio Effects
(DAFx-10), 114-121, Graz,
October, 2010,
PDF at DAFx
-
Kazunori Komatani,
Masaki Katsumaru,
Mikio Nakano,
Kotaro Funakoshi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Allocation of Training Data for Rapid Prototyping of
Speech Understanding based on Multiple Model Combination,
Proceedings of COLING 2010, accepted as poster presentation,
(acceptance rate 42%), Beijing, China, September, 2010.
-
Shun Shiramatsu,
Tadachika Ozono, Toramatsu Shintani,
Hiroshi G. Okuno:
A Corpus-based Analysis of Coreferential Recency Effect in Japanese Discourse
for Tracking Dynamic Topic.
Proceedings of the 9th IEEE/ACIS International Conference on Computer
and Information Science (ACIS-ICIS 2010), 645-650, Yamagata, Japan, Aug. 2010.
doi:10.1109/ICIS.2010.65
-
Akira Maezawa,
Katsutoshi Itoyama,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Query-by-Conducting: An interface to retrieve classical-music
interpretations by real-time tempo input,
Proceedings of 11th International Society for Musical Information
Retrieval Conference (ISMIR-2010), 477-482,
Dresden, Aug. 2010.
PDF at ISMIR
-
Ikkyu Aihara,
Takeshi Mizumoto,
Ryu Takeda,
Takuma Otsuka,
Toru Takahashi,
Kazuyuki Aihara,
Hiroshi G. Okuno:
Frustration in Synchronized Calling Behavior of Japanese Tree Frogs,
International Conference on Nyuroethology,
Aug. 5-7 (6-7), 2010, Salamanca, Spain.
(poster)
-
Takeshi Mizumoto,
Ikkyu Aihara,
Takuma Otsuka,
Ryu Takeda,
Kazuyuki Aihara,
Hiroshi G. Okuno:
Sound imaging system for visualizing spatio-temporal behavior of calling
nocturnal animals,
International Conference on Nyuroethology,
Aug. 5-7 (6-7), 2010, Salamanca, Spain.
(poster)
-
Takuma Otsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Design and Implementation of Two-level Synchronization for Interactive Music Robot,
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence
(AAAI-10),
1138-1244 (26.9%, 264/982), July 11-15 (15), 2010, Atlanta, GA.
-
Toru Takahashi,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improvement in Listening Capability for Humanoid Robot HRP-2.
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2010), 470-475, (847/2062),
May 3-8 (4), 2010, Anchorage, Aalaska, USA.
doi:10.1109/ROBOT.2010.5509830
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Upper-limit Evaluation of a Robot Audition based on ICA-BSS in Multi-source, Barge-in and Highly Reveberant Conditions.
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2010), 4366-4371, (847/2062),
May 3-8, 2010, Anchorage, Aalaska, USA.
doi:10.1109/ROBOT.2010.5509891
-
Angelica Lim,
Takeshi Mizumoto,
Louis-Kenzo Cahier,
Takuma Otsuka,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Multimodal gesture recognition for robot musical accompaniment,
28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
-
Louis-Kenzo Cahier,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno
:
Probabilistic polygonal mesh for 3D SLAM,
28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
-
Nobuhide Yakamawa,
Toru Takahashi,
Tetsuro Kitahara,
Tetsuya Ogata,
Hiroshi G. Okuno
:
ロボット聴覚のための Matching-Pursuit による環境音の分離音認識,
28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
-
平澤 恭治,
高橋 徹,
尾形 哲也,
奥乃 博
:
調波構造を用いた L1 ノルム最小化に基づく劣決定音源分離手法の性能評価,
28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
-
Takuma Otsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno.
:
Predictive Score Following user Particle Filter for Music Robots,
28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
-
水本 武志,
中臺 一博,
大塚 琢馬,
高橋 徹,
尾形 哲也,
奥乃 博
:
打楽器とロボットの合奏のための結合振動子モデルに基づく打撃時刻予想手法,
28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
-
武田 龍,
中臺 一博,
高橋 徹,
尾形 哲也,
奥乃 博
:
リサンプル-ブロック処理と並列化に基づく ICA の実時間実装
,
28th Annual Convention of RSJ, Nagoya Institute of Technology, Sep. 2010.
-
Shimpei Aso,
Takeshi Saitou,
Masataka Goto,
Katsutoshi Itoyama,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SpeakBySinging: A Speaking Voice Synthesis System Onverting Singing Voices to
Speaking Voices,
SIGMUS, Vol.2010-MUS-86, No.2, pp.
IPSJ, Jul. 2010.
-
Akira Maezawa,
Hiroshi G. Okuno:
Query-by-Conducting: A classical-music interpretation retrieval interface
based on tempo similarity,
SIGMUS, Vol.2010-MUS-86, No.2, pp.
IPSJ, Jul. 2010.
-
Naoki Yasuraoka,
Katsutoshi Itoyama,
Takuya Yoshioka,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Phrase Replacing System for Polyphonic MUsic Waveforms,
音楽情報科学研究会, Vol.2010-MUS-, No., pp.
情報処理学会, Jul. 2010.
-
Kyoko Matsuyama,
Kazunori Komatani,
Ryu Takeda,
Tetsuya Ogata,
Hiroshi G. Okuno:
Analysis of User Utterances and Application to Identify User's Referent in
Barge-in-able Spoken Dialogue System,
SIGMUS, Vol.2010-SLP-86, No.2, pp.
IPSJ, Jul. 2010.
-
Sound location Estimation System,
Inventors: Hiroshi Tsujino, Kazuhiro Nakadai, Hiroshi G. Okuno,
Takeshi Mizumoto, Kazuyuki Aihara,
Issued: No.2010-133964, June 17, 2010.
Filed: No.2009-277075, Dec. 4, 2009.
Academic Year 2009
Thesis |
Journal Papers |
Book Chapters |
International Conferences |
Domestic Conferences |
Patents
- Hiromasa Fujihara:
Statistical Modeling for Recognizing Singing Voices in Polyphonic Music,
Ph.D Thesis, Feb. 2010.
- Takuya Yoshioka:
Speech Enhancement in Reverberatn Environments
Feb. 2010.
Ph.D Thesis, Feb. 2010.
- Masaki Katsumaru:
複数の言語モデルと言語理解モデルによる音声理解の高精度化とそのラピッドプロトタイピングへの適用,
MS Thesis, Feb. 2010.
- Takeshi Mizumoto:
ロボットによるテルミン演奏のための音高・音量特性のモデル化とフィードフォワード制御,
MS Thesis, Feb. 2010.
- Soramichi Akiyama:
文法検証を統合したPOMDPによる対話管理,
BE Thesis, Feb.10, 2009.
- Shinpei Aso:
音韻長・F0・振幅の制御により歌声を話声に変換する話声合成システム SpeakBySinging,
BE Thesis, Feb.10, 2009.
- Akimitsu Awano:
人間とロボットの作業確信度を利用した協調物体配置システム,
BE Thesis, Feb.10, 2009.
- Kyoji Hirasawa:
調波構造を用いた音源分離によるマイク数以上の同時発話認識,
BE Thesis, Feb.10, 2009.
-
Toru Takahashi,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Soft missing-feature mask generation for robot audition,
PALADYN Journal of Behavioral Robotics,
Vol.1, No.1 (Mar. 2010) pp. 37-47, doi:10.2478/s13230-010-0005-1
-
Takuma Otsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Voice-awareness control for a humanoid robot consistent with its body posture
and movements,
PALADYN Journal of Behavioral Robotics,
Vol.1, No.1 (Mar. 2010) pp.80-88, doi:10.2478/s13230-010-0009-x
-
Hiromasa Fujihara,
Masataka Goto,
Tetsuro Kitahara
Hiroshi G. Okuno:
A Modeling of Singing Voice Robust to Accompaniment Sounds and Its
Application to Singer Identification and Vocal-Timbre-Similarity-Based
Music Information Retrieval,
IEEE Transactions on Audio, Speech and Language Processing,
Vol.18, No.3 (Mar. 2010) pp. 638 - 648, IEEE.
doi:10.1109/TASL.2010.2041386
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Binaural active audition for humanoid robots to localise speech over entire
azimuth range,
Applied Bionics and Biomechanics, Special Issue on "Humanoid Robots",
Vol.6, Issue 3-4 (Sep. 2009) pp.355-368,
Taylor & Francis 2009.
doi:10.1080/11762320903007430
-
Hyun-Don Kim,
Jinsung Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Target Speech Detection and Separation for Communication with
Humanoid Robots in Noisy Home Environments,
Advanced Robotics, Vol.23, No.15 (2009) 2093-2111,
VSP and Robotics Society of Japan.
doi:10.1163/016918609X12529300552105
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Self-Organization of Dynamic Object Features based on Bi-Directional Training,
Advanced Robotics, Vol.23, No.15 (2009) 2035-2057.
doi:10.1163/016918609X12529289797027
VSP and Robotics Society of Japan.
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Autonomous Motion Generation based on Reliable Predictability,
Journal of Robotics and Mechatronics,
special issue on Kukanchi Interactive Human-Space Design and Intelligence
Dedicated to Dr. Kazuo Tanie,
Vol.21, No.4 (2009) 478-488.
-
Ryu Takeda,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot audition by multi-channel input Independent Component Analysis (in Japanese).
Journal of Robotics Society of Japan, Vol.27, No.7 (July, 2009) 782-792.
,
at RSJ server.
-
Kazumasa Murata,
Kazuhiro Nakadai,
Ryu Takeda,
Hiroshi G. Okuno,
Yuji Hasegawa, Hiroshi Tsujino:
Musical Beat-Tracking for Robots and Its Application to A Music Robot,
Journal of Robotics Society of Japan, Vol.27, No.7 (July, 2009) 793-801.
,
at RSJ server.
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Simulation of Phoneme Aquisition Process (in Japanese),
Journal of Robotics Society of Japan, Vol.27, No.7 (2009) 902-813.
,
at RSJ server.
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions,
IPSJ Journal,
Vol.50, No.7 (Jul. 2009) 1757-1767, IPSJ.
Journal of Information Processing, Vol.17 (2009) 191-201, IPSJ.
D-Library
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Human Tracking System Integrating Sound and Face Localization Using
an Expection-Maximization Algorithm in Real Environments,
Advanced Robotics, Vol.23, No.6 (May 2009) 629-653,
doi:10.1163/156855309X431659
VSP and Robotics Society of Japan.
-
Masaki Katsumaru,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Adjusting Occurrence Probabilities of Automatically-Generated Abbreviated
Words in Spoken Dialogue Systems,
B.-C. Chien, T.-P. Hong, S.-M. Chen, M. Ali (Eds.):
Next-Generation Applied Intelligence, 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems,
Lecture Notes in Artificial Intelligence 5579, pp.481-490,
Tainan, Taiwan, Jun. 24-27, 2009.
doi:10.1007/978-3-642-02568-6_49
-
Shun Shiramatsu,
Yuji Kubota,
Kazunori Komatani,
Tetsuya Ogata,
Toru Takahashi,
Hiroshi G. Okuno:
Visualization-based Approaches to Support Context Sharing towards Public
Involment Support System,
Opportunities and Challenges for Next-Generation Applied Intelligence,
Studies in Computational Intelligence, Springer, Vol.214,
pp.111--117, Tainan, Taiwan, Jun. 24-27, 2009.
doi:10.1007/978-3-540-92814-0_18
-
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:
A Model of Temporally Changing User Behaviors in a Deployed Spoken
Dialogue System,
G.-J. Houben et al. (Eds.):
UMAP 2009, First and Seventeenth International Conference on User Modeling, Adaptation, and Personalization,
Lecture Notes in Computer Science 5535, pp.408-414,
Trento, Italy, Jun. 22-26, 2009.
doi:10.1007/978-3-642-02247-0_45
-
Naoki Yasuraoka,
Takuya Yoshioka,
Tomohiro Nakatani, Aatsushi Nakamura,
Hiroshi G. Okuno:
MUSIC DEREVERBERATION USING HARMONIC STRUCTURE SOURCE MODEL AND WIENER FILTER,
Proceedings of 2010 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2010),
pp.53-56, (lecture), Dallus, March, 2010.
-
Takuya Yoshioka,
Tomohiro Nakatani,
Hiroshi G. Okuno:
NOISY SPEECH ENHANCEMENT BASED ON PRIOR KNOWLEDGE ABOUT SPECTRAL ENVELOPE AND HARMONIC STRUCTURE,
Proceedings of 2010 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2010),
pp.4270-4273, (lecture+poster 48.8\%), Dallus, March, 2010.
-
Akira Maezawa,
Katsutoshi Itoyama,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Bowed String Sequence Estimation of a Violin Based on Adaptive Audio Signal
Classification and Context-Dependent Error Correction,
Proceedings of IEEE International Symposium on Multimedia (ISM2009),
pp.9-16, (acceptance rate for full papers, 19.6%),
San Diego, Dec. 14-16, 2009.
doi:10.1109/ISM.2009.30
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Estimation of Reverberation Time with Robot Speech to Improve ICA-based Robot Audition,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2009),
pp.250-355, IEEE, Paris, Dec. 7-10, 2009.
.
doi:10.1109/ICHR.2009.5379572
-
Takuma Otsuka,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Voice quality manipulation for humanoid robots consistent with their head movements,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2009),
pp.405-410, IEEE, Paris, Dec. 7-10, 2009.
.
doi:10.1109/ICHR.2009.5379569
-
Takumi Yoshida,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Automatic Speech Recognition Improved by Two-Layered Audio-Visual,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008),
pp.604-609, IEEE, Paris, Dec. 7-10, 2009.
.
doi:10.1109/ICHR.2009.5379586
-
Hiromasa Fujihara,
Masataka Goto,
Hiroshi G. Okuno:
A NOVEL FRAMEWORK FOR RECOGNIZING PHONEMES OF SINGING VOICE IN POLYPHONIC MUSIC,
Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), pp.17-20,
Oct. 18-21, New Paltz, NY, 2009.
doi:10.1109/IROS.2009.5354527
-
Takuya Yoshioka,
Hirokazu Kameoka, Tomohiro Nakatani,
Hiroshi G. Okuno:
Statistical models for speech dereverberation,
Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), pp.145-148,
Oct. 18-21, New Paltz, NY, 2009.
doi:10.1109/IROS.2009.5354489
-
Naoki Yasuraoka,
Takehiro Abe,
Katsutoshi Itoyama,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Changing Timbre and Phrase in Existing Musical Performances as You Like,
Proceedings of the ACM International Confernece on Multimedia (ACM Multimedia 2009), 203-212
(16% 22/138), Beijing, China, Oct. 19-24, 2009.
,
doi:10.1145/1631272.1631302
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Step-size Parameter Adaptation of Multi-channel Semi-blind ICA with Piecewise Linear Model for Barge-in-able Robot Audition (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.2273-2282, (900/1650),
IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
doi:10.1109/IROS.2009.5354527
-
Takuma Otsuka,
Kazumasa Murata,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Incremental Polyphonic Audio to Score Alignment using Beat Tracking for Singer Robots (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.2289-2296,
IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
doi:10.1109/IROS.2009.5354637
-
Takeshi Mizumoto,
Hiroshi Tsujino,
Toru Takahashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Thereminist Robot: Development of a Robot Theremin Player with Feedforward and Feedback Arm Control based on a Theremin's Pitch Model (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.2297-2302,
IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
doi:10.1109/IROS.2009.5354473
-
Toru Takahashi,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.2730-2735,
IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
doi:10.1109/IROS.2009.5354201
-
Wataru Hinoshita,
Tetsuya Ogata,
Hideki Kozima,
Hisashi Kanda,
Toru Takahashi,
Hiroshi G. Okuno:
Emergence of Evolutional Interaction with Voice and Motion between Two Robots using RNN,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.4196-4291,
IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009.
doi:10.1109/IROS.2009.5354887
-
Shun Nishide,
Tetsuhiro Nakagawa,
Tetsuya Ogata,
Jun Tani,
Toru Takahashi,
Hiroshi G. Okuno:
Modeling Tool-Body Assimilation using Second-order Recurrent Neural Network,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.5376-5381, (900/1650),
IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009.
doi:10.1109/IROS.2009.5354655
-
Hisashi Kanda,
Tetsuya Ogata,
Toru Takahashi,
Kazunori Komatani,
Hiroshi G. Okuno:
Phoneme Acquisition Model based on Vowel Imitation using Recurrent Neural Network,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2009), pp.5388-5393,
IEEE, RSJ, St. Louis, 12-14 (14) Oct. 2009.
doi:10.1109/IROS.2009.5354825
-
Kazunori Komatani,
Satoshi Ikeda,
Yuichiro Fukubayashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Ranking Help Message Candidates Based on Robust Grammar Verification Results
and Utterance History in Spoken Dialogue Systems,
Proceedings of the 10th SIGdial Workshop on Discourse and Dialogue (SigDial 2009),
314-321, Sep. 12, 2009.
-
Kyoko Matsuyama,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Enabling A User To Specify An Item At Any Time During System Enumeration,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2009), Mon-Ses2-P4-1, (57.7%),
Brighton, 6-10 Sep. 2009.
-
Masaki Katsumaru,
Mikio Nakano,
Kazunori Komatani,
Kotaro Funakoshi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improving Speech Understanding Accuracy with Limited Training
Data Using Multiple Language Models and Multiple Understanding Models,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2009), Thu-Ses1-P4-9, (57.7%), Brighton, 6-10 (10) Sep. 2009.
-
Hideki Kawahara,
Masanori Morise,
Toru Takahashi,
Hideki Banno, Ryuichi Nishimura, Toshio Irino:
Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2009), Thu-Ses1-P2-6, (57.7%),
Brighton, 6-10 Sep. 2009.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hyun-Don Kim:
Robot Auditon: Missing Feature Theory Approach and Active Audition (Invited talk),
Proceeding of the 14th International Symposium of Robotics Research
(ISRR 2009), August 31 - September 3, 2009, Lucerne, Switzerland,
International Foundation of Robotics Research.
Springer STAR series.
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
QUERY-BY-EXAMPLE MUSIC RETRIEVAL APPROACH BASED ON MUSICAL GENRE SHIFT BY CHANGING INSTRUMENT VOLUME,
Proceeding of the 12th International Conference on Digital Audio Effects
(DAFx-09), accepted,
Como, Italy, Sep.1-4. 2009.
-
Shun Shiramatsu,
Tadachika Ozono, Toramatsu Shintani,
Kazunori Komatani,
Tetsuya Ogata,
Toru Takahashi,
Hiroshi G. Okuno:
Development of a Meeting Browser towards Supporting Public Involvement,
Proceedings of the 12th IEEE International Conference on Computational
Science and Engineering (CSE-09), pp.717-722, Vancouver, Canada,
Aug., 2009.
doi:10.1109/CSE.2009.362
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Analysis of Motion Searching based on Reliable Predictability using Recurrent Neural Network,
Proceedings of 2009 IEEE/ASME Conference on Advanced Intelligent Mechatronics (AIM 2009), 192-197, Singapore, July 14-19, 2009.
doi:10.1145/10.1109/AIM.2009.5230015
-
Kazunori Komatani,
Alexander I. Rudnicky:
Predicting Barge-in Utterance Errors by using Implicitly-Supervised ASR Accuracy and Barge-in Rate per User,
Proceedings of the Fourth International Joint Conference on Natural Language
Processing (ACL-IJCNLP 2009), pp.89-92, Jul. 2009.
-
Masaki Katsumaru,
Mikio Nakano,
Kazunori Komatani,
Kotaro Funakoshi,
Hiroshi G. Okuno:
A Speech Understanding Framework that Uses Multiple Language Models and
Multiple Understanding Models,
Proceeding of the North American Chapter of the Association for
Computational Linguistics - Human Language Technologies (NAACL HLT)
2009 Conference,
pp.133-136, (40%),
Boulder, CO, May 31 - Jun. 5, 2009.
-
Tetsuya Ogata,
Ryunosuke Yokoya,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Prediction and Imitation of Other's Motions by Reusing Own Forward-Inverse
Model in Robots,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2009), pp.4144-4149, (699/1624),
(May 12-17 (16), 2009), Kobe.
doi:10.1145/10.1109/ROBOT.2009.5152363
-
Hisashi Kanda,
Tetsuya Ogata,
Toru Takahashi,
Kazunori Komatani,
Hiroshi G. Okuno:
Continuous Vocal Imitation with Self-organized Vowel Spaces in
Recurrent Neural Network,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2009), pp.4438-4443,
(May 12-17 (16), 2009), Kobe.
doi:10.1145/10.1109/ROBOT.2009.5152818
-
Ryu Takeda,
Kazuhiro Nakadai,
Toru Takahashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
ICA-BASED EFFICIENT BLIND DEREVERBERATION AND ECHO CANCELLATION METHOD FOR BARGE-IN-ABLE ROBOT AUDITION,
Proceedings of 2009 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2009),
SS-L7.1, pp.3677-3680, (1178/2633), Taipei, Taiwan, April 19--24 (23), 2009.
doi:10.1145/10.1109//ICASSP.2009.4960424
-
Hideki Kawahara,
Ryuichi Nisimura, Toshio Irino, Masanori Morise,
Toru Takahashi,
Hideki Banno:
TEMPORALLY VARIABLE MULTI-ASPECT AUDITORY MORPHING ENABLING EXTRAPOLATION WITHOUT OBJECTIVE AND PERCEPTUAL BREAKDOWN,
Proceedings of 2009 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2009),
pp. , April 23.
-
Robotics visual and auditory system,
Patent No. US 7,526,361,
Date of Patent: Apr. 28, 2009.
Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano,
PCT No.: PCT/JP02/08827.
Academic Year 2008
Thesis |
Journal Papers |
Book Chapters |
International Conferences |
Domestic Conferences |
Patents
Thesis
- Shun Nishide:
Self-Organization of Invariants for Motion Generation based on
Reliable Predictability,
Ph.D Thesis, Feb. 2009.
- Hyun-Don Kim:
Binaural Active Audition for Humanoid Robots,
Ph.D Thesis, Sep. 2008.
-
Takehiro Abe, MS Thesis, Feb. 2008.
-
Satoshi Ikeda, MS Thesis, Feb. 2008.
-
Hisashi Kanda, MS Thesis, Feb. 2008.
-
Yuji Kubota, MS Thesis, Feb. 2008.
-
Kaiping Wang, MS Thesis, Feb. 2008.
-
Takuma Otsuka, BE Thesis, Feb. 2008.
-
Wataru Hinoshita, BE Thesis, Feb. 2008.
-
Kyoko Matsuyama, BE Thesis, Feb. 2008.
-
Tadanori Yasuraoka, BE Thesis, Feb. 2008.
-
Tatsuhiro Nakagawa, BE Thesis, Feb. 2008.
Peer-reviewed Journal Papers
-
Takehiro Abe,
Katsutoshi Itoyama,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
An Analysis-and-Synthesis Approach for Manipulating Pitch of a Musical
Instrument Sound Considering Pitch-dependency of Timbral Characteristics,
IPSJ Journal, Vol.50, No.3 (Mar., 2009) 1054-1066
IPSJ.
,
D-Lib
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Integrating Topic Estimation and Dialogue History for Domain Selection in
Multi-Domain Spoken Dialogue Systems,
IPSJ Journal, Vol.50, No.2 (Feb., 2009) 488-500,
IPSJ.
,
D-Lib
-
Masaharu Morise,
Toru Takahashi,
Hideki Kawahara,
Toshio Irino:
IEIC Trans. A, Vol.J92-A, No.3 (Mar. 2009).
-
Shun Shiramatsu,
Kazunori Komatani,
Koiti Hasida,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Game-Theoretic Model of Referential Coherence and Its Empirical
Verification Using Large Japanese and English Corpora,
ACM Transactions on Speech and Language Processing, Vol.5, No.3 (Oct. 2008) Article 6, ACM.
,
doi:10.1145/1410358.1410360
-
Hiromasa Fujihara,
Masataka Goto,
Hiroshi G. Okuno:
An F0 Estimation Method of Vocal Part in Polyphonic Music by Using Statistical
Modelling of Singing Voice and Viterbi Search,
IPSJ Journal, Vol.49, No.10 (Oct. 2008) 3682-3693, IPSJ.
,
D-Lib
-
Chyon Hae Kin,
Tetsuya Ogata,
Shigeki Sugano:
Reinforcement Signal Propagation Algorithm for Logic Circuit,
Journal of Robotics and Mechatronics,
Vol.20, No.5 (Oct. 2008)
pp757-774.
-
Kazunori Komatani,
Satoshi Ikeda,
Tetsuya Ogata,
Hiroshi G. Okuno:
Managing out-of-grammar utterances by topic estimation with domain
extensibility in multi-domain spoken dialogue systems,
Speech Communication, Vol.50, No.10 (2008) 836-870.
doi:10.1016/j.specom.2008.05.010
-
Ryu Takeda,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Audition using an Adaptive Filter Based on Independent Component
Analysis,
Journal of Robotic Society of Japan, Vol.26, No.6 (Sep. 2008)
pp.529-536.
Digital Library
-
Yuki Suga,
Tetsuya Ogata,
Shigeki Sugano:
Human-Adaptive Robot Interaction using Interactive EC with
Human-Machine Hybrid Evaluation,
Journal of Robotics and Mechatronics,
Vol.20, No.4 (Aug. 2008) pp.610-620.
-
Hayeong Jeong,
Shun Shiramatsu,
Kiyoshi Kobayashi, and Tsuyoshi Hatori:
Discourse Analysis of Public Debates Using Corpus Linguistic Methodologies,
Journal of Computers, Vol.3, No.8 (Aug. 2008) pp.58--68.
-
Yuichiro Fukubayashi,
Kazunori Komatani,
Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
WFST-based Language Understanding for Rapid Prototyping of Spoken Dialogue
Systems,
IPSJ Journal,
Vol.49, No.8 (Aug. 2008) pp.2762-2772,
Information Processing Society of Japan,
,
Digital Library.
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Predicting Object Dynamics from Visual Images through Active Sensing Experiences,
Advanced Robotics, Vol.22, No.5 (May 2008) pp.527-546,
doi:10.1163/156855308X294879
Online version,
VSP and Robotics Society of Japan.
-
Hiroshi G. Okuno,
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata:
A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals,
Journal of Acoustic Society of America, Vol.123, No.5 (May 2008) Pt.2, pp.3066-3067.
-
Hideki Kawahara, Masanori Morise,
Toru Takahashi,
Ryuichi Nishimura, Hideki Banno, Toshio Irino:
A temporally stable representation of power spectra of periodic signals and
its application to F0 and periodicity estimation,
Journal of Acoustic Society of America, Vol.123, No.5 (May 2008) Pt.2, pp.3074-3075.
Book Chapters, Reviews
-
Shun Shiramatsu,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SalienceGraph: Visualizing Salience Dynamics of Written Discourse
by Using Reference Probability and PLSA,
T. B. Ho and Z-H. Zhou (Eds.): PRICAI-2008: Trends in Artificial
Intelligence, 890-902, (84/234, 35.8%),
Lecture Notes in Computer Science, Vol.5351, Springer-Verlag, Dec. 2008.
doi:10.1007/978-3-540-89197-0_83
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Integrating Topic Estimation and Dialogue History for Domain Selection
in Multi-Domain Spoken Dialogue Systems,
Ngoc Thanh Nguyen,Leszek Borzemski,Adam Grzech,Moonis Ali (Eds.):
New Frontiers in Applied Artificial Intelligence,
pp.294-304, Lecture Notes in Artificial Intelligence, Vol.5027,
June, 2008.
doi:10.1007/978-3-540-69052-8_31
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Vowel Imitation using Vocal Tract Model and Recurrent Neural Network,
Masumi Ishikawa, Kenji Doya, Hiroyuki Miyamoto, Takeshi Yamakawa (Eds.):
Neural Information Processing,
14th International Conference, ICONIP 2007, Revised Selected Papers,
Part II, pp.222-232,
Lecture Notes in Computer Science 4985, Springer-Verlag, June 2008.
doi:10.1007/978-3-540-69162-4_24
-
Tetsuya Ogata,
Hideki Kojima,
Hiroshi G. Okuno:
Motion Emergence from Sound using Cross-Modal Mapping on Recurrent
Neural Network,
Aucouturier, J.-J. (ed.) Cheek to Chip: Dancing Robots and AI's Future,
IEEE Intelligent Systems,
Vol.23, No.2 (Apr. 2008), 74--84,
doi:10.1109/MIS.2008.22
Peer-reviewed Conference Papers
-
Masato Onishi,
Toru Takahashi,
Toshio Irino,
Hideki Kawahara:
Vowel-based frequency alignment function design and recognition-based time
alignment for automatic speech morphing,
Proceedings of IEEE Workshop on Spoken Language Technology 2008 (SLT 2008), accepted, Goa, India, December, 15--18, 2008,
-
Yuji Kubota,
Masatoshi Yoshida,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Design and Implementation of 3D Auditory Scene Visualizer towards
Auditory Awareness with Face Tracking,
Proceedings of IEEE International Symposium on Multimedia (ISM2008),
pp.468-476 (acceptance rate for regular papers, 24%),
Berkeley, Dec. 16. 2008.
doi:10.1109/ISM.2008.107
-
Yuji Kubota,
Shun Shiramatsu,
Masatoshi Yoshida,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
3D Auditory Scene Visualizer With Face Tracking:
Design and Implementation For Auditory Awareness Compensation,
Proceedings of 2nd International Symposium on Universal Communication
(ISUC2008), pp.42-49, IEEE, Osaka, Dec. 15. 2008.
doi:10.1109/ISUC.2008.59
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
An Open Source Software System For Robot Audition HARK and Its Evaluation,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008),
pp.561-566, Daejeon, Korea, Dec. 3, 2008.
doi:10.1109/ICHR.2008.4756031
-
Kazumasa Murata,
Kazuhiro Nakadai,
Ryu Takeda,
Hiroshi G. Okuno,
Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino:
A Beat-Tracking Robot for Human-Robot Interaction and Its Evaluation,
Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008),
pp.79-84, Daejeon, Korea, Dec. 2, 2008.
doi:10.1109/ICHR.2008.4755935
-
Shun Shiramatsu,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SalienceGraph: Visualizing Salience Dynamics of Written Discourse
by Using Reference Probability and PLSA,
Proceedings of the Tenth Pacific Rim International Conference on
Artificial Intelligence (PRICAI-08), 890-902, (84/234, 35.8%),
Lecture Notes in Computer Science, Vol.5351, Springer-Verlag,
Hanoi, Vienam, Dec. 15-19. 2008.
doi:10.1007/978-3-540-89197-0_83
-
Hiroshi G. Okuno:
Computational Auditory Scene Analysis and Its Application to Robot Audition
(Invited Talk),
Proceedings of the Second International Symposium on Robotics and Artificial
Intelligence,
University of Electro-Communication and Shanghai Jiao Tong University, 9 Oct. 2008.
-
Ikkyu Aihara:
Synchronization and Frustration in Calling Behavior of Japanese Tree Frogs,
Dynamics Days Asia Pacific 5 (DDAP5), September, 2008.(oral)
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Analysis of Reliable Predictability based Motion Generation using RNNPB,
Proceedings of Joint 4th International Conference on Soft Computing and
Intelligent Systems and 9th International Symposium on advanced
Intelligent Systems (SCIS & ISIS 2008),
pp.305-310, Nagoya, JAPAN, September 17-21, 2008.
-
Hideki Kawahara,
Masanori Morise, Hideki Banno,
Toru Takahashi,
Ryuichi Nishimura, Toshio Irino:
Spectral Envelope Recovery beyond the Nyquist Limit for High-Quality Manipulation of Speech Sounds,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.22-26,
Brisbane, Sept. 24, 2008.
-
Toru Takahashi,
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Soft Missing-Feature Mask Generation for Simultaneous Speech Recognition
System in Robots,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.992-997,
Brisbane, Sept. 24, 2008.
-
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Predicting ASR Errors by Exploiting Barge-In Rate of Individual Users
for Spoken Dialogue Systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.183--186,
Brisbane, Sept. 2008.
-
Masaki Katsumaru,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno
Expanding Vocabulary for Recognizing User\'s Abbreviations of Proper Nouns
without Increasing ASR Error Rates in Spoken Dialogue Systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.187-190,
Brisbane, Sept. 2008.
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:Extensibility Verification of Robust Domain Selection against Out-of-Grammar
Utterances in Multi-Domain Spoken Dialogue System,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2008), pp.487-490,
Brisbane, Sept. 2008.
-
Shun Nishide,
Tetsuya Ogata,
Ryunosuke Yokoya,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Active Ssensing based Dynamical Object Feature Extraction,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), pp.1-7, TuAT1.1,
IEEE, RSJ, Nice, 23 Sep. 2008.
doi:10.1109/IROS.2008.4650794
-
Takeshi Mizumoto,
Ryu Takeda,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Robot Listens to Music and Counts Its Beats Aloud by Separating Music
from Counting Voice,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), 1538-1543, WeAT6.1
IEEE, RSJ, Nice, 24 Sep. 2008.
doi:10.1109/IROS.2008.4650821
Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist.
-
Hyun-Don Kim,
Jinsung Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Target Speech Detection and Separation for Humanoid Robot in Sparse
Dialogue with Noisy Home Environments (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), 1705-1711, WeAT10.4
IEEE, RSJ, Nice, 24 Sep. 2008.
doi:10.1109/IROS.2008.4650977
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Segmenting Acoustic Signal with Articulatory Movement using
Recurrent Neural Network for Phoneme Aquisition (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), 1712-1717, WeAT10.5
IEEE, RSJ, Nice, 24 Sep. 2008.
doi:10.1109/IROS.2008.4651060
-
Ryu Takeda,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Barge-in-able Robot Audition Based on ICA and Missing Feature Theory
under Semi-Blind Situation (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), 1718-1723, WeAT10.6,
IEEE, RSJ, Nice, 24 Sep. 2008.
doi:10.1109/IROS.2008.4650821
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Design and Evaluation of Two-Channel Sound Source Localization over
Entire Azimuth Range for Moving Talker (Invited paper),
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), pp.2197-2203 Sept. 2008
IEEE, RSJ, Nice, Sept. 2008.
doi:10.1109/IROS.2008.4650947
-
Kazumasa Murata,
Kazuhiro Nakadai,
Kazuyoshi Yoshii,
Ryu Takeda,
Toyotaka Torii,
Hiroshi G. Okuno,
Yuji Hasegawa, Hiroshi Tsujino:
A Robot Uses Its Own Microphone to Synchronize Its Steps to Musical Beats
While Scatting and Singing,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2008), pp.2459-2464, WeCT6.1,
IEEE, RSJ, Nice, 24 Sep. 2008.
doi:10.1109/IROS.2008.4650596
Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist.
-
Kohei Sumi,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Chord Recognition Based on Probabilistic Integration of
Chord Transition and Bass Pitch Estimation,
Proceedings of 9th International Conference on Musical Information
Retrieval (ISMIR-2008), 39-44,
Philadelphia, 15 Sep. 2008.
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source
Separation based on Integrated Harmonic and Inharmonic Models,
Proceedings of 9th International Conference on Musical Information
Retrieval (ISMIR-2008),133-138,
Philadelphia, 15 Sep. 2008.
-
Kazumasa Murata,
Kazuhiro Nakadai,
Kazuyoshi Yoshii,
Ryu Takeda,
Toyotake Torii,
Hiroshi G. Okuno,
Yuji Hasegawa, Hiroshi Tsujino:
A Robot Singer with Music Recognition Based on Real-Time Beat Tracking,
Proceedings of 9th International Conference on Musical Information
Retrieval (ISMIR-2008), 199-204,
Philadelphia, 15 Sep. 2008.
-
Takehiro Abe,
Katsutoshi Itoyama,
Kazuyoshi Yoshii,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Synthesis Approach for Manipulating Pitch of a Musical Instrument Sound
with Considering Timbral Characteristics,
Proceeding of the 11th International Conference on Digital Audio Effects
(DAFx-08), 249-256,
Espoo, Finland, Sep.1-4. 2008.
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Integrating Topic Estimation and Dialogue History for Domain Selection
in Multi-Domain Spoken Dialogue Systems,
Proceeding of the 21st International Conference on Industrial,
Engineering and Other Applications of Applied Intelligence Systems (IEA/AIE-2008),
pp.294-304, (acceptance rate is about 30%), LNAI 5027,
Wroclaw, Poland, Jun. 18, 2008.
doi:10.1007/978-3-540-69052-8_31
-
Hiroshi G. Okuno,
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata:
A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals,
Proceedings of Acoustics'08,
CD-ROM , 1pSCa8, June 30, 2008.
-
Hideki Kawahara, Masanori Morise,
Toru Takahashi,
Ryuichi Nishimura, Hideki Banno, Toshio Irino:
A temporally stable representation of power spectra of periodic signals and
its application to F0 and periodicity estimation,
Proceedings of Acoustics'08,
CD-ROM , 1pSCc24, June 30, 2008.
-
Hideki Kawahara, Masanori Morise,
Toru Takahashi,
Ryuichi Nishimura, Hideki Banno, Toshio Irino:
A unified approach for F0 extraction and aperiodicity estimation
based on a temporally stable power spectral representation,
Proceedings of ISCA Tutorial and Research Workshop (ITRW) on
"Speech Analysis and Processing for Knowledge Discovery",
June 4, 2008, Aalborg, DK.
-
Shun Nishide,
Tetsuya Ogata,
Ryunosuke Yokoya,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Object Dynamics Prediction and Motion Generation
based on Reliable Predictability,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2008), 1608-1614,
(May 20, 2008).
doi:10.1109/ROBOT.2008.4543431
-
Kazuhiro Nakadai,
Shun'ichi Yamamoto,
Hiroshi G. Okuno,
Hirofumi Nakajima, Yuji Hasegawa,
Hiroshi Tsujino:
A Robot Referee for Rock-Paper-Scissors Sound Games,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2008), 3469--3474,
(May 20, 2008).
doi:10.1109/ROBOT.2008.4543741
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Two-Channel-Based Voice Activity Detection for
Humanoid Robots in Noisy Home Environments,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2008), 3495-3501,
(May 20, 2008).
doi:10.1109/ROBOT.2008.4543745
-
Hiroshi G. Okuno,
Kazuhiro Nakadai:
COMPUTATIONAL AUDITORY SCENE ANALYSIS AND ITS APPLICATION TO ROBOT AUDITION
(invited talk),
Proceedings of Hands-free Speech Communication and Microphone Arrays (HSCMA-2008), pp.123-127, May 7, 2008, Trento, Italy.
doi:10.1109/HSCMA.2008.4538702
-
Hideki Kawahara,
Masanori Morise,
Toru Takahashi,
Ryuichi Nishimura, Toshio Irino, Hideki Banno:
TANDEM-STRAIGHT: A Temporally Stable Power Spectral Representation for
Periodic Signals and Applications to Interference-free Spectrum, F0,
and Aperiodicity Estimation,
Proceedings of 2008 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2008),
pp.3933-3936, Las Vegas, Nevada, USA, March 30 - April 4, 2008.
Patents
-
Sound Source Separation System, Sound Source Separation Method, and
Computer Program for Sound Source Separation,
PCT/JP2008/057310, WO 2008/133097
Date of Open: 06.11.2008,
Inventors: Katsutoshi Itoyama, Hiroshi Okuno, Masataka Goto.
Assignee: Kyoto University, AIST.
-
Moving object equipped with ultra-directional speaker,
Patent No. US 7,424,118,
Date of Patent: Sep. 9, 2008.
Inventors: Kiyofumi Mori, Shunji Yoshida, Hiroshi Okuno, Kazuhiro Nakadai,
Hiroshi Tsujino,
PCT No.: PCT/JP2005/002043.
-
Speech Recognition Apparatus,
Application No. 20080167869.
Filed: July 10, 2008.
Inventors: Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto.
PCT No.: PCT/JP05/22601.
Academic Year 2007
Thesis
- Shun Shiramatsu:
Salience-based Modeling of Discourse Context,
Ph.D Thesis, Feb. 2008.
- Shun'ichi Yamamoto:
Real-Time Robot Audition Software Based on Missing Feature Theory
for Multiple Simultaneous Talkers in Real Environments,
Ph.D Thesis, Feb. 2008.
- Kazuyoshi Yoshii:
Studies on Hybrid Music Recommendation Using Timbral and Rhythmic Features,
Ph.D Thesis, Feb. 2008.
- Katsutoshi Itoyama:
MS Thesis, Feb. 2008.
- Ryu Takeda
MS Thesis, Feb. 2008.
- Yuichiro Fukubayashi:
MS Thesis, Feb. 2008.
- Koichi Tokuda:
MS Thesis,
Feb. 2008.
- Ryunosuke Yokoya:
MS Thesis, Feb. 2008.
- Kohei Sumi:
BE Thesis, Feb. 2007.
- Masaki Katsumaru:
BE Thesis, Feb. 2008.
- Hiroki Saito:
BE Thesis, Feb. 2007.
- Zhang:
BE Thesis, Feb. 2007.
- Takeshi Mizumoto:
BE Thesis, Feb. 2007.
Peer-reviewed Journal Papers
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Simultaneous Realization of Score-informed Sound Source Separation of
Polyphonic Musical Siganals and Constrained Parameter Estimation for
Integrated Model of Harmonic and Inharmonic Structure,
IPSJ Journal, Vol.49, No.3 (Mar., 2008) pp.1465-1479,
Information Processing Society of Japan,
Digital Library,
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
,
Transactions of Human Interface Society, Vol.10, No.1 (Feb. 2008) pp.59-72.
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Hybrid Collaborative and Content-based Music Recommendation Using
Incrementally-trainable Probabilistic Generative Model,
IEEE Transactions on Audio, Speech and Language Processing,
Vol.16, No.2 (Feb. 2008) pp.435-447,
,
doi:10.1109/TASL.2007.911503
-
Shun Shiramatsu,
Kazunori Komatani,
Koiti Hasida,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Game-Theoretic Model of Referential Coherence and Its Statistical
Verification Based on Large Japanese and English Corpora,
Natural Language Processing, Vol.14, No.4 (Oct. 2007) pp.199-239.
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Experience Based Imitation Using RNNPB,
Advanced Robotics, Vol.21, No.12 (2007) pp.1351-1367,
doi:10.1163/156855307781746106
Online version,
VSP and Robotics Society of Japan.
-
Chyon Hae Kim, Jun-ichi Idesawa,
Tetsuya Ogata,
Shigeki Sugano:
Restraining of Noises in Self-Organizing Network Elements,
Journal of Robotics Society of Japan, Vol.25, No.6 (Sep. 2007) pp.115-122.
Digital Library
-
Kazuhiro Nakadai,
Hirofumi Nakashima,
Masamitsu Murase,
Hiroshi G. Okuno,
Yuji Hasegawa, Hiroshi Tsujino:
Tracking of Mulitiple Sound Sources by Integration of Robot-Embedded and
In-Room Microphone Arrays,
Journal of Robotics Society of Japan, Vol.25, No.6 (Sep. 2007) pp.181-191.
,
at RSJ server.
-
Jean-Marc Valin,
Shun'ichi Yamamoto,
Jean Rouat, Francois Michaud,
Kazuhiro Nakadai,
Hiroshi G. Okuno:
Robust Recognition of Simultaneous Speech By a Mobile Robot,
IEEE Transactions on Robotics,
Vol.23, No.4 (Aug. 2007) pp.742--752.
,
doi:10.1109/TRO.2007.900612
-
Hiroaki Arie,
Tetsuya Ogata,
Jun Tani, and Shigeki Sugano:
Reinforcement learning of continuous motor sequence with hidden state,
Advanced Robotics,
Special Issue on Robotic Platforms for Research in Neuroscience,
VSP and Robotics Society of Japan, Vol.21, No.10 (July 2007), pp.1215-1229.
Online version
doi:10.1163/156855307781389365
-
Taro Watanabe,
Kenji Imamura, Eiichiro Sumita,
Hiroshi G. Okuno:
Statistical machine translation using hierarchical phrase alignment,
Systems and Computers in Japan, Vol.38, No.6 (June 2007) pp.70-79,
doi:10.1002/scj.20271
-
Naoyuki Kanda,
Kazunori Komatani,
Mikio Nakano,
Kazuhiro Nakadai,
Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robust Domain Selection Using Dialogue History in Multi-domain Spoken Dialogue Systems,
IPSJ Journal, Vol.48, No.5 (May 2007) pp.1980-1989, IPSJ.
Book Chapters, Articles
-
Hiroshi G. Okuno,
Tetsuya Ogata,
Kazunori Komatani:
Robot Audition from the viewpoint of Computational Auditory Scene Analysis,
Informatics Education
and Research for Knowledge-Creation Society Infrastructure (ICKS'08),
pp.35-40, Jan. 2008.
doi:10.1109/ICKS.2008.10
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Structual Feature Extraction based on Active Sensing Experiences,
Informatics Education
and Research for Knowledge-Creation Society Infrastructure (ICKS'08),
pp.209-212, Jan. 2008.
doi:10.1109/ICKS.2008.9
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Evaluation of Two-Channel-Based Sound Source Localization using
3D Moving Sound Creation Tool,
Informatics Education
and Research for Knowledge-Creation Society Infrastructure (ICKS'08),
pp.210-216.
doi:10.1109/ICKS.2008.25
-
Koiti Hasida,
Shun Shiramatsu,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Meaning Games,
LENLS 2007 Postproceedings, accepted,
LNCS
Oct. 2007.
-
Hiroshi G. Okuno,
Moonis Ali (Eds.):
New Trends in Applied Artificial Intelligence (IEA/AIE-2007),
Lecture Notes in Computer Science, Vol.4570, Springer-Verlag,
14 Jun. 2007, XXI, 1194p. ISBN: 978-3-540-73322-5.
doi:10.1007/978-3-540-73325-6
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm
and Particle Filter,
New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570,
pp.280-290, Springer-Verlag.
Kyoto, Jun. 2007.
doi:10.1007/978-3-540-73325-6_28
-
Ryu Takeda,
Shun'ichi Yamamoto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and
MTF-based ASR,
New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570,
pp.384-394, Springer-Verlag.
Kyoto, Jun. 2007.
doi:10.1007/978-3-540-73325-6_38
-
Hiroshi G. Okuno,
Tetsuro Kitahara,
Kazuyoshi Yoshii:
Music Feature Extraction and Music Information Retrieval,
IEE Journal,
Vol.127, No.7 (Jul. 2007).
-
Hiroshi G. Okuno,
Hiroshi Mizoguchi:
Information Integration for Robot Audition: the State-of-the-art and issues,
SICE,
Vol.46, No.6 (Jun. 2007) pp.415-419.
-
Shun'ichi Yamamoto,
Ryu Takeda,
Hiroshi G. Okuno:
Missing Feature Theory Based Automatic Speech Recognition and Its
Application to Simultaneous Multiple Speaker Speech Recognition,
SICE,
Vol.46, No.6 (Jun. 2007) pp.447-452.
-
Shinichi Ueno, Fumihiro Adachi,
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Bus Information System Based on User Models and Dynamic Generation
of VoiceXML Scripts,
New Frontiers in Artificial Intelligence (JSAI 2003/2004),
LNAI 3609, pp.45-60, 2007.
Springer-Verlag.
doi:10.1007/978-3-540-71009-7_4
Peer-reviewed Conference Papers
-
Yuichiro Fukubayashi,
Kazunori Komatani,
Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
Rapid Prototyping of Robust Language Understanding Modules with Less
Training Data for Spoken Dialogue Systems,
Proceedings of the Third International Joint Conference on Natural Language
Processing (IJCNLP 2008), pp.210-216, Jan. 2008, Hyderabad, India.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Design and Implementation of A Robot Audition System for Automatic Speech Recognition of Simultaneous Speech,
Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-2007), 111-116, acceptance rate (115/267),
IEEE, Kyoto, Dec. 2007.
doi:10.1109/ASRU.2007.4430093
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Vocal Imitation using Vocal Tract Model and Recurrent Neural Network,
Proceedings of International Conference on Neural Information Processing (ICONIP-2007),
Vol.2, pp.222-232, Nov. 2007.
-
Hisashi Kanda,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Vocal Imitation Using Physical Vocal Tract Model,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1846-1851,
IEEE, RSJ, San Diego, Oct. 2007.
doi:10.1109/IROS.2007.4399137
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Discovery of Other Individuals by Projecting a Self-Model Through Imitation,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1009-1014,
IEEE, RSJ, San Diego, Oct. 2007.
doi:10.1109/IROS.2007.4399153
-
Kazuyoshi Yoshii,
Kazuhiro Nakadai,
Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Biped Robot that Keeps Steps in Time with Musical Beats while Listening to Music with Its Own Ears,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1743-1750,
IEEE, RSJ, San Diego, Oct. 2007.
doi:10.1109/IROS.2007.4399244
-
Tetsuya Ogata,
Masamitsu Murase,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno,
Two-way Translation of Compound Sentences and Arm Motions by Recurrent Neural Networks,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1858-1863,
IEEE, RSJ, San Diego, Oct. 2007.
doi:10.1109/IROS.2007.4399265
-
Ryu Takeda,
Kazuhiro Nakadai,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Exploiting Known Sound Sources to Improve ICA-based Robot Audition in Speech Separation and Recognition,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.1757-1762,
IEEE, RSJ, San Diego, Oct. 2007.
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Auditory and Visual Integration based Localization and Tracking of Humans in Daily-life Environments,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2007), pp.2021-2027,
IEEE, RSJ, San Diego, Oct. 2007.
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Hybrid Collaborative and Content-based Music
Recommendation Using Probabilistic Model with Latent User Preferences,
Proceedings of 8th International Conference on Musical Information
Retrieval (ISMIR-2007), long paper (15.8% of 214 submissions),
pp.89-94, Vienna, Sep. 2007.
-
Kazunori Komatani,
Yuichiro Fukubayashi,
Tetsuya Ogata,
Hiroshi G. Okuno:
Introducing Utterance Verification in Spoken Dialogue System to Improve Dynamic Help Generation for Novice Users,
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue,
pp.202-205, Sep. 2007
-
Satoshi Ikeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Topic Estimation with Domain Extensibility for Guiding User's Out-of-Grammar
Utterance in Multi-Domain Spoken DIalogue Systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2007), pp.2561-2564,, Antwerp, Sep. 2007.
-
Kazunori Komatani,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Analyzing Temporal Transition of Real User's Behaviors in a Spoken
Dialogue System,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2007), pp.142-145, Antwerp, Sep. 2007.
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Auditory and VIsual Integration based Localization and Tracking of
Multiple Moving Sounds in Daily-life Environments,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man 2007), 399-404, IEEE, Jeju Island, Korea, Aug. 2007.
-
Hyun-Don Kim,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm
and Particle Filter,
New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570,
pp.280-290, Springer-Verlag.
Kyoto, Jun. 2007.
doi:10.1007/978-3-540-73325-6_28
-
Ryu Takeda,
Shun'ichi Yamamoto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and
MTF-based ASR,
New Trends in Applied Artificial Intelligence (IEA/AIE-2007), LNAI 4570,
pp.384-394, Springer-Verlag.
Kyoto, Jun. 2007.
doi:10.1007/978-3-540-73325-6_38
-
Katsutoshi Itoyama,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
INTEGRATION AND ADAPTATION OF HARMONIC AND INHARMONIC MODELS FOR SEPARATING POLYPHONIC MUSICAL SIGNALS,
Proceedings of 2007 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2007),
pp.57-60, Hawaii, April 2007, pp.57-60,
(15.1% acceptance rate for lecture presentation)
doi:10.1109/ICASSP.2007.366615
-
Haruhiko Niwa,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Distance Estimation of Hidden Objects Based on
Acoustical Holography by applying Acoustic Diffraction of
Audible Sound,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2007), pp.423-428,
(Apr. 2007).
doi:10.1109/ROBOT.2007.363823
-
Tetsuya Ogata,
Shohei Matsumoto,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Human-Robot Cooperation using Quasi-symbols
Generated by RNNPB Model,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2007), pp.2156-2161,
(Apr. 2007).
doi:10.1109/ROBOT.2007.363640
-
Shun Nishide,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Predicting Object Dynamics from Visual Images
through Active Sensing Experiences,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2007), pp.2501-2506,
(Apr. 2007).
doi:10.1109/ROBOT.2007.363841
-
Chyon Hae Kim,
Tetsuya Ogata,
Shigeki Sugano:
Enhancement of Self Organizing Network Elements for Supervised Learning,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2007), WeA3.5,
(Apr. 2007).
Patents
-
Robot acoustic device and robot acoustic system
Patent No. US 7,215,786.
Date of Patent: May 8, 2007.
Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano,
Assignee: Japan Science and Technology Agency.
Academic Year 2006
Peer-reviewed Journal Papers
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Drumix: An Audio Player with Functions of Realtime Drum-Part
Rearrangement for Active Music Listening,
IPSJ Journal, Vol.48, No.3 (Mar. 2007), 1229-1239, IPSJ,
IPSJ Digital Courier,
Vol.3 (2007), pp.134-144.
DL
-
Hyun-Don Kim,
Jong-Suk Choi,
and Munsang Kim:
Human-robot interaction in real environments by audio-visual integration,
International Journal of Control Automation and Systems,
Vol.5, No.1 (Feb. 2007) pp.61-69.
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music,
IPSJ Journal,
Vol.48, No.1 (Jan. 2007) pp.214-226, IPSJ.
IPSJ Digital Courier,
Vol.3 (2007) pp.1-13.
-
Shunsuke Kurotaki, Noriaki Suzuki,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hideharu Aamano:
Sound Source Separation Filter for Robot Audition used by Dynamic
Reconfigurable Device, DRP
(in Japanese),
IEICE Transaction on Information and Systems,
Vol.J90-D, No.3, pp.897-907, Mar. 2007,
IEICE.
DL
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G,. Okuno:
Simultaneous Speech Recognition based on Automatic Missing-Feature Mask
Generation integrated with Sound Source Separation
(in Japanese),
Journal of Robotics Society of Japan, Vol.25, No.1 (Jan. 2007)
pp.92-102.
at RSJ server.
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
Drum Sound Recognition for Polyphonic Audio Signals
by Adaptation and Matching of Spectogram Templates
with Harmonic Structure Suppression,
IEEE Transactions on Audio, Speech and Language Processing,
Vol.15, No.1 (Jan. 2007) pp.333-345,
,
doi:10.1109/TASL.2006.876754
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrument Identification in Polyphonic Music:
Feature Weighting to Minimize Influence of Sound Overlaps,
EURASIP Journal on Applied Signal Processing,
Special issue on Music Information Retrieval Based on Signal Processing,
Vol.2007, Article ID 51979, 15 pages, 2007,
doi:10.1155/2007/51979
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrument Identification in Polyphonic Music: Feature Weighting Based on Mixed-Sound Template and Use of Musical Context
(in Japanese),
IEICE Transaction on Information and Systems, Vol.J89-D, No.12 (Dec. 2006), pp.2721-2733,
IEICE.
-
Naoyuki Kanda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Spoken Language Understinding Using Dialogue Context in Database Search
(in Japanese),
IPSJ Journal, Vol.47, No.6 (June 2006) pp.1802-1811, IPSJ.
-
Hiromasa Fujihara,
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
A Singer Identification Method for Musical Pieces on the Basis of
Accompaniment Sound Reduction and Reliable Frame Selection
(in Japanese),
IPSJ Journal, Vol.47, No.6 (June 2006) pp.1831-1843, IPSJ.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Ryu Takeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. OKuno:
Missing Feature Theory Based Interface Between Sound Source Separation and Automatic Speech Recognition and Applying to Multiple Robots,
(in Japanese),
Journal of Human Interface Society, Vol.8, No.2 (Jun. 2006) pp.203-212.
-
Takamichi Saito,
Kentaro Umesawa,
Hiroshi G. Okuno:
A Privacy-Enhanced Access Control,
Systems and Computers in Japan, (2006)
A Privacy-Enhanced Access Control,
Systems and Computers in Japan, Vol.37, No.5 (May 2006) pp.77-86.
doi:10.1002/scj.10214
-
Tenkai Kim,
尾形 哲也,
Shigeki Sugano;
ローカルルールに基づいた論理回路の自己組織アルゴリズム
(in Japanese),
Transaction on SICE, Vol.42, No.4 (Apr. 2006) pp.334-341.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Ryu Takeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improving Location-Based Speech Recognition of Simultaneous Speech Signals
by Parameter Optimization with Genetic Algorithm (in Japanese),
Human Interface, Vol.8, No.2 (Jun. 2006) pp.203-212.
-
Takamichi Saito,
Kentaro Umesawa,
Hiroshi G. Okuno:
A Privacy-Enhanced Access Control,
Systems and Computers in Japan, (2006)
A Privacy-Enhanced Access Control,
Systems and Computers in Japan, Vol.37, No.5 (May 2006) pp.77-86.
doi:10.1002/scj.10214
Book Chapters, Survey Papers, and Articles
-
Hiroaki Arie, Jun Namikawa,
Tetsuya Ogata,
Jun Tani, Shigeki Sugano:
Reinforcement Learning Algorithm with CTRNN in Continuous Action Space,
Neural Information Processing (ICONIP-2006),
Part I, LNCS 4232, pp.387-396.
Oct. 2006.
doi:10.1007/11893028_44
-
Shun'ichi Yamamoto,
Ryu Takeda,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition,
PRICAI 2006: Trends in Artificial Intelligence,
LNCS 4099, pp.484-494, accepted as regular paper for ORAL Presentation (14.1%),
Springer-Verlag, Guilin, China, Aug. 2006.
doi:10.1007/11801603_52
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Ryu Takeda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Genetic Algorithm based Improvement of Robot's Hearing Capabilities in
Separating and Recognizing Simultaneous Speech Signals,
Moonis Ali, Richard Dapoigny (Eds.):
Advances in Applied Artificial Intelligence (IEA/AIE-2006),
LNAI 4031, pp.207-217, Springer-Verlag.
Annecy, France, Jun. 2006.
doi:10.1007/11779568_24
Peer-reviewed Conference Papers
-
Hiroshi G. Okuno,
Tetsuya Ogata,
Kazunori Komatani:
Computational Auditory Scene Analysis and Its Application to Robot Audition:
Five Years Experience,
Proceedings of the 2nd International Conference on Informatics Research
for Development of Knowledge Society Infrastructure (ICKS 2007),
pp.69-76, Jan. 2007.
doi:10.1109/ICKS.2007.7
-
Shun Shiramatsu,
Kazunori Komatani,
Koiti Hasida,
Tetsuya Ogata,
Hiroshi G. Okuno:
Meaning-Game-based Centering Model with Statistical
Definition of Utility of Referential Expression and
Its Verification Using Japanese and English Corpora,
Proceedings of the 6th Discourse Anaphora and Anaphor Resolution
Colloquium (DAARC2007), pp.121-126,
Lisbon, Mar. 2007.
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Musical Instrument Recognizer ``Instrogram'' and Its Application to Music Retrieval based on Instrumentation Similarity,
Proceedings of IEEE International Symposium on Multimedia (ISM2006),
pp.265-272,
San Diego, Dec. 2006.
doi:10.1109/ISM.2006.113
-
Hiromasa Fujihara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic synchronization between lyrics and music CD recordings based on Viterbi alignment of segregated vocal signals,
Proceedings of IEEE International Symposium on Multimedia (ISM2006),
pp.257-264,
San Diego, Dec. 2006.
doi:10.1109/ISM.2006.38
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Hybrid Collaborative and Content-based Music
Recommendation Using Probabilistic Model with Latent User Preferences,
Proceedings of 7th International Conference on Musical Information
Retrieval (ISMIR-2006), pp.296-301,
Vancouver, CA, Sep. 2006.
-
Katsutoshi Itoyama,
Tetsuro Kitahara,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Feature Weighting in Automatic Transcription of
Specified Part in Polyphonic Music,
Proceedings of 7th International Conference on Musical Information
Retrieval (ISMIR-2006), pp.172-175,
Vancouver, CA, Sep. 2006.
-
Kazuhiro Nakadai,
Hirofumi Nakajima,
Masamitsu Murase,
Satoshi Kaijiri,
Kentaro Yamada, Yuji Hasegawa,
Hiroshi G. Okuno,
Hiroshi Tsujino:
Real-Time Tracking of Multiple Sound Sources by
Integration of In-Room and Robot-Embedded Microphone Arrays,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 852-859,
IEEE, RSJ, Beijing, China, Sep. 2006.
,
doi:10.1109/IROS.2006.281737
-
Ryu Takeda,
Shun'ichi Yamamoto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Missing-Feature based Speech Recognition for Two
Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 878-885,
IEEE, RSJ, Beijing, China, Sep. 2006.
,
doi:10.1109/IROS.2006.281741
IEEE Robotics and Automation Society Japan Chapter Young Award,
RSJ/SICE Award for IROS 2006 Best Paper Nomination Finalist
(2nd to 5th Place) at IROS-2007.
-
Haruhiko Niwa,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Multiple Acoustical Holography Method for Localization of Objects
in Broad Range using Audible Sound,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 1146-1151,
IEEE, RSJ, Beijing, China, Sep. 2006.
,
doi:10.1109/IROS.2006.281844
-
Chyon Hae Kim,
Tetsuya Ogata,
Shigeki Sugano:
Efficient Organization of Network Topology based on Reinforcement Signals,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 3154-3159,
IEEE, RSJ, Beijing, China, Sep. 2006.
-
Yuki Suga, Chihiro Endo, Daizo Kobayashi, Takeshi Matsumoto,
Tetsuya Ogata,
Shigeki Sugano:
User-Adaptive Human-Robot Interaction System using Interactive EC,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 3663-3668,
IEEE, RSJ, Beijing, China, Sep. 2006.
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Experience Based Imitation Using RNNPB,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 3669-3674,
IEEE, RSJ, Beijing, China, Sep. 2006.
,
doi:10.1109/IROS.2006.281724
-
Jong-Suk Choi,
Hyun-Don Kim,
and Munsang Kim:
Probabilistic Speaker Localization in Noisy Environment by
Audio-Visual Integration,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 4704-4709,
IEEE, RSJ, Beijing, China, Sep. 2006.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Real-Time Robot Audition System That Recognizes Simultaneous Speech
in the Real World,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2006), 5333-5338,
IEEE, RSJ, Beijing, China, Sep. 2006.
,
doi:10.1109/IROS.2006.282037
-
Tetsuya Ogata,
Yuya Hattori,
Hideki Kojima,
Kazunori Komatani,
Hiroshi G. Okuno:
Generation of Robot Motions from Environmental Sounds using Inter-modality
Mapping by RNNPB,
Proceedings of Sixth International Workshop on Epigenetic Robotics
(EpiRobo-2006), 95-102, Paris, Sep., 2006.
-
Hiromasa Fujihara,
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Speaker Identification under Noisy Environments by using Harmonic Structure
Extraction and Reliable Frame Weighting,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2006), 1459-1462,
Pittsburgh, Sep. 2006.
-
Ryu Takeda,
Shun'ichi Yamamoto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Improving Speech Recognition of Two Simultaneous Speech Signals by
Integrating ICA BSS and Automatic Missing Feature Mask Generation,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2006), 2302-2305,
Pittsburgh, Sep. 2006.
-
Yuichiro Fukubayashi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Dynamic Help Generation by Estimating User's Mental Model in Spoken Dialogue
Systems,
Proceedings of International Conference on Spoken Language Processing
(Interspeech-2006), 1946-1949,
Pittsburgh, Sep. 2006.
-
Shun'ichi Yamamoto,
Ryu Takeda,
Kazuhiro Nakadai,
Mikio Nakano, Hiroshi Tsujino,
Jean-Marc Valin,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Leak Energy based Missing Feature Mask Generation for ICA and GSS and Its
Evaluation with Simultaneous Speech Recognition,
Proceedings of ISCA Tutorial and Research Workshop on Statistical and
Perceptual Audition (SAPA2006),
pp.42-46,
-
Kazunori Komatani,
Naoyuki Kanda,
Mikio Nakano, Kazuhiro Nakadai, Hiroshi Tsujino,
Tetsuya Ogata,
Hiroshi G. Okuno:
Multi-Domain Spoken Dialogue System with Extensibility and Robustness
against Speech Recognition Errors,
Proceedings of SIGdial Workshop on Discourse and Dialogue,
9-17, Aug. 2006
-
Hiroshi G. Okuno:
Computational Auditory Scene Analysis
- Towards Listening to Several Thinkgs at Once -,
50th Anniversary Summit of Artificial Intelligence (ASAI50) workshop and abstract booklet,
accepted for inclusion, Monte Verita, Switzerland, July 2006.
-
Takuya Yoshioka,
Takafumi Hikichi, Masato Miyoshi,
Hiroshi G. Okuno:
Robust Decomposition of Inverse Filter of Channel and
Prediction Error Filter of Speech Signal for Dereverberation,
Proceedings of the 14th European Signal Processing Conference
(EUSIPCO 2006), CD-ROM Proceedings, Florence, 2006.
-
Ryunosuke Yokoya,
Tetsuya Ogata,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Robot Imitation from Active-Sensing Experiences,
Proceedings of Fifth International Conference on Learning and Development
(ICDL06), accepted, Bloomington, IN USA, May 2006.
-
Kazuyoshi Yoshii,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
AN ERROR CORRECTION FRAMEWORK BASED ON DRUM PATTERN PERIODICITY FOR
IMPROVING DRUM SOUND DETECTION,
Proceedings of 2006 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2006),
Vol.V, pp.237-240, Toulouse, May 2006.
,
doi:10.1109/ICASSP.2006.11661256
IEEE Kansai Chapter Young Researcher Award
-
Hiromasa Fujihara,
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
F0 ESTIMATION METHOD FOR SINGING VOICE IN POLYPHONIC AUDIO SIGNAL BASED ON
STATISTICAL VOCAL MODEL AND VITERBI SEARCH,
Proceedings of 2006 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2006),
Vol.V, pp.253-256, Toulouse, May 2006.
,
doi:10.1109/ICASSP.2006.1661260
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Instrogram: A New Musical Instrument Recognition Technique Without Using
Onset Detection Nor F0 Estimation,
Proceedings of 2006 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2006),
Vol.V, pp.229-232, Toulouse, May 2006.
,
doi:10.1109/ICASSP.2006.1661254
IEEE Kansai Chapter Young Researcher Award
-
Kazuhiro Nakadai,
Hirofumi Nakajima,
Masamitsu Murase,
Satoshi Kaijiri.
Kentaro Yamada, Yuji Hasegawa,
Hiroshi G. Okuno,
Hiroshi Tsujino:
ROBUST TRACKING OF MULTIPLE SOUND SOURCES BY SPATIAL INTEGRATION OF ROOM
AND ROBOT MICROPHONE ARRAYS,
Proceedings of 2006 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2006),
Vol.IV, pp.929-932, Toulouse, May 2006.
,
doi:10.1109/ICASSP.2006.1661122
-
Hyun-Don Kim,
Jong-Suk Choi, and Munsang Kim:
Speaker Localization among Multi-faces in Noisy Environment by
Audio-Visual Integration,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2006), 1305-1310,
(May 2006).
10.1109/ROBOT.2006.1641889
Patents
-
Speech Recongition Device,
Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto,
European Patent: EP1691344,
Publication Date: 08/16/2006,
Application number: EP20040818533,
Filing Date: 11/12/2004
-
Method and Apparatus for Determining Sound Source,
Patent No. US 7,035,418.
Filing date: June 7, 2000.
Issue date: Apr. 25, 2006.
Inventors: Hiroshi Okuno, Hiroaki Kitano, Yukiko Nakagawa,
Assignee: Japan Science and Technology Agency.
-
Robot audiovisual system
Patent No. US 7,016,505.
Filing date: Nov 1, 2000.
Issue date: Mar 21, 2006.
Inventors: Kazuhiro Nakadai, Hiroshi Okuno, Hiroaki Kitano,
Assignee: Japan Science and Technology Agency.
Academic Year 2005
Peer-Reviewed Journal Papers
-
Yasuhiro Akiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Using Multiple Edit Distances to
Automatically Grade Outputs from Machine Translation Systems,
IEEE Transactions on Audio, Speech and Language Processing,
Vol.14, No.2, (Mar. 2006) 393--402.
doi:10.1109/TSA.2005.860770
-
Mototaka Suzuki, Kuniaki Noda, Yuki Suga,
Tetsuya Ogata,
and Shigeki Sugano:
Dynamic Perception after Visually-Guided Grasping by a Human-Like
Autonomous Robot,
Advanced Robotics, Vol.20, No.2 (Feb. 2006) 233-254.
VSP and Robotics Society of Japan.
doi:10.1163/156855306775525785
-
Takuya Yoshioka,
Takafumi Hikichi, Masato Miyoshi,
Hiroshi G. Okuno:
Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals,
IEICE Trans. on Fundamentals of Electronics, Communications, and
Computer Sciences, Vol.E89-A, No.1 (Jan. 2006) 240-247,
IEICE.
-
Tetsuya Ogata,
Hayato Ohba,
Jun Tani,
Kazunori Komatani,
Hiroshi G. Okuno:
Extracting Multi-Modal Dynamics of Objects using RNNPB,
Journal of Robotics and Mechatronics,
Vol.17, No.6 (Dec. 2005) pp.681-688,
Special Issue on Human Modeling in Robotics.
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Pitch-dependent identification of musical instrument sounds,
Applied Intelligence, Vol.23, No.3, pp.267-275,
Springer-Verlag (formerly Kluwer Publishers).
doi:10.1007/s10489-005-4612-1
-
Kenri Kodaka,
Tetsuya Ogata,
Hiroshi G. Okuno:
Walking in Virtual Space with Entrainment Based on a Nonlinear Oscillator,
Journal of Human Interface Society, Vol.7, No.4, 26-36, 2005.
-
Shun Shiramatsu,
Takashi Miyata,
Hiroshi G. Okuno,
Koiti Hasida:
Dissolution of Centering Theory Based on Game Theory and
Its Empirical Verification
(in Japanese),
Natural Language Processing, Vol.12, No.3 (July 2005) 91-110.
-
Shunichi Yamamoto,
Kazuhiro Nakadai,
Hiroshi Tsujino,
Hiroshi G. Okuno:
Missing Feature Theory Based Interface Between Sound Source Separation and
Automatic Speech Recognition and Applying to Multiple Robots
(in Japanese),
Journal of Robotics Society of Japan, Vol.23, No.6 (Aug. 2005) 743-751.
,
at RSJ server.
-
Tetsuya Ogata,
Shigeki Sugano, and Jun Tani:
Open-end Human-Robot Interaction from the Dynamical Systems Perspective
- Mutual Adaptation and Incremental Learning,
Advanced Roboics, Vol.19. No.6, pp.651-670,
VSP and Robotics Society of Japan.
doi:10.1163/1568553054255655
-
Katsuhisa Ishida,
Tetsuro Kitahara,
Masayuki Takeda:
Improvisation Supporting System Using N-gram-based Melody Appropriateness Determination,
IPSJ Journal, Vol.46, No.7 (July 2005) pp.1548-1559, IPSJ.
in html
Book Chapters
-
Masahiro Nisiyama,
Hiroaki Kawashima, Takatsugu Hirayama,
Takashi Matsuyama:
Facial Expression Representation based on Timing Structures in Faces,
Proceedings of IEEE International Workshop on Analysis and Modeling of
Faces and Gestures (AMFG 2005), LNCS 3723, pp.139-153,
Beijing, Oct. 2005.
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Distance Based Dynamic Interaction of Humanoid Robot with Multiple People.
Innovations in Applied Artificial Intelligence (IEA/AIE-2005)
LNAI 3533, 111-120, Best paper award, Springer-Verlag.
Bari, Italy, Jun. 2005.
Paper
doi:10.1007/11504894_18
-
Katsutoshi Uchiyama, Toshiaki Ohji, Mari Oka,
Hiroshi G. Okuno,
Hiroyuki Suzuki, Kenji Fukaya, Modjtaba Sadria, Hubert Durt
Kokoro and Topos -- Bastions of Kokoro,
Kyoto Inernational Culture Forum 2005, pp.62-73, Mar. 2006.
Peer-Reviewed International Conference Papers
-
Shun Shiramatsu,
Kazunori Komatani,
Takashi Miyata, Koiti Hasida,
Hiroshi G. Okuno:
Empirical Verification of Meaning-Game-based Generalization of
Centering Theory with Large Japanese Corpus,
Proceedings of the 19th Pacific Asia Conference on Language, Information,
and
Computation (PACLIC 19),
192-210, Taipei, Dec. 2005.
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
INTER:D A Drum Sound Equalizer for Controlling Volume and Timbre of
Druams,
Proceedings of 2nd European Workshop on the Integration of
Knowledge, Semantic and Digital Media Technologies (EWIMT 2005),
pp.205--212, EU Commission,
IEE Savoy Place, London, Nov. 2005.
IEEE
-
Masahiro Nisiyama,
Hiroaki Kawashima, Takatsugu Hirayama,
Takashi Matsuyama:
Facial Expression Representation based on Timing Structures in Faces,
Proceedings of IEEE International Workshop on Analysis and Modeling of
Faces and Gestures (AMFG 2005), LNCS 3723, pp.139-153,
accepted, Beijing, Oct. 2005.
-
Kenri Kodaka,
Tetsuya Ogata,
Hiroshi G. Okuno:
Walking with Body-sense in Virtual Space Using the Nonlinear Oscillator,
Proceedings of the International Conference on Systems, Man and Cybernetics
(SMC-2005), 324--329, IEEE,
Hawaii, Oct. 10-12, 2005.
Finalist for Best Student Paper
doi:10.1109/ICSMC.2005.1571166
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
AdaMast: A Drum Sound Recognizer based on Adaptation and Matching of
Spectrogram Templates,
Proceedings of
MIREX 2005,
London, Sep. 2005.
Paper .
Best in Class Award.
-
Hiromasa Fujihara,
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
SINGER IDENTIFICATION BASED ON ACCOMPANIMENT SOUND REDUCTION AND RELIABLE FRAME SELECTION,
Proceedings of 6th International Conference on Musical Information
Retrieval (ISMIR-2005), 329-336,
London, Sep. 2005.
-
Tetsuro Kitahara,
Masataka Goto,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
INSTRUMENT IDENTIFICATION IN POLYPHONIC MUSIC: FEATURE WEIGHTING WITH MIXED SOUNDS, PITCH-DEPENDENT TIMBRE MODELING, AND USE OF MUSICAL CONTEXT,
Proceedings of 6th International Conference on Musical Information
Retrieval (ISMIR-2005), 558-563,
London, Sep. 2005.
-
Kazunori Komatani,
Naoyuki Kanda,
Tetsuya Ogata,
Hiroshi G. Okuno:
Contextual Constraints based on Dialogue Models in Database Search Task
for Spoken Dialogue Systems,
Proceedings of the Nineth European Conference on
Speech Communication and Technology (Interspeech-2005), 877-880,
Lisboa, Sep. 2005.
.
-
Masamitsu Murase,
Shun'ichi Yamamoto,
Jean-Marc Valin,
Kazuhiro Nakadai,
Kentaro Yamada,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Multiple Moving Speaker Tracking by Microphone Array on Mobile Robot,
Proceedings of the Nineth European Conference on
Speech Communication and Technology (Interspeech-2005), 249-252,
Lisboa, Sep. 2005.
.
-
Tetsuro Kitahara,
Katsuhisa Ishida,
Masayuki Takeda:
ism: Improvisation Supporting Systems with Melody Correction and Key Vibration,
Proceedings of International Conference on Entertainment Computing
(ICEC 2005),
Mita, Hyogo, Sep. 2005.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Jean-Marc Valin,
Jean Rouat, Francois Michaud,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Making A Robot Recognize Three Simultaneous Sentences in Real-Time,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005),
pp.4040-4045,
IEEE, RSJ, Edmonton, Aug. 2005.
.
doi:10.1109/IROS.2005.1545094
-
Syunsuke Kurotaki, Noriaki Suzuki,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hideharu Amano:
Implementation of Active Direction-Pass Filter on Dynamically Reconfigurable
Processor,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005),
pp.3175-3180,
IEEE, RSJ, Edmonton, Aug. 2005.
.
doi:10.1109/IROS.2005.1545033
-
Tsuyoshi Tasaki,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Spatially Mapping of Friendliness for Human-Robot Interaction,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005),
pp.1277-1282,
IEEE, RSJ, Edmonton, Aug. 2005.
.
doi:10.1109/IROS.2005.1545034
-
Mikio Nakano,
Naoyuki Kanda,
Yuji Hasegawa, Toyotaka Torii,
Yohane Takeuchi,
Kazuhiro Nakadai,
Hiroshi Tsujino,
Hiroshi G. Okuno:
A Two-Layer Model for Behavior and Dialogue Planning in Conversational Service Robots,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005), pp.3329-3335,
IEEE, RSJ, Edmonton, Aug. 2005.
.
doi:10.1109/IROS.2005.1545198
-
Tetsuya Ogata,
Hayato Ohba,
Kazunori Komatani,
Jun Tani,
Hiroshi G. Okuno:
Extracting Multi-Modal Dynamics of Objects using RNNPB
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2005), pp.966-971,
IEEE, RSJ, Edmonton, Aug. 2005.
.
doi:10.1109/IROS.2005.1544975
-
Kazunori Komatani,
Ryoji Hamabe,
Tetsuya Ogata,
Hiroshi G. Okuno:
Generating Confirmation to Distinguish Phonologically Confusing Word Pairs in Spoken Dialogue Systems
Proceedings of 4th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems, pp.40-45, July 2005.
-
Yuya Hattori,
Hideki Kojima,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Gesture Generation from Environmental Sounds Using
Inter-modality Mapping,
Proceedings of Fifth International Workshop on Epigenetic Robotics
(EpiRobo-2005), 139-140, Nara, July 2005.
-
Takuya Yoshioka,
Takafumi Hikichi, Masato Miyoshi,
Hiroshi G. Okuno:
Blind Estimation of Room Resonances Using Popular, Classical, and Jazz Music.
Proceedings of AES 118th Convenvion, 6632,
Audio Engineering Society, Barcelona, Spain, May 28-31, 2005.
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Distance Based Dynamic Interaction of Humanoid Robot with Multiple People.
Innovations in Applied Artificial Intelligence:
Eighteenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems (IEA/AIE-2005)
LNAI 3533, 111-120, Best paper award, Springer-Verlag.
Bari, Italy, Jun. 2005.
Paper
doi:10.1007/11504894_18
-
Shun'ichi Yamamoto,
Jean-Marc Valin
Kazuhiro Nakadai,
Hiroshi Tsujino, Jean Rouat, Francois Michaud,
Tetsuya Ogata,
Kazunori Komatani,
Hiroshi G. Okuno:
Enhanced Robot Speech Recognition Based on Microphone Array
Source Separation and Missing Feature Theory.
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2005), 1477-1482, IEEE,
Barcelona, Apr. 2005.
Patents
-
Robot audiovisual system
Patent No. US 6,967,455
Filing date: Mar 8, 2002
Issue date: Nov 22, 2005
Inventors: Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Okuno, Hiroaki Kitano
Assignee: Japan Science and Technology Agency
-
Speech Recongition Device,
Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi Okuno, Shunichi Yamamoto,
Wipo Patent: WO/2005/048239,
Application Number: PCT/JP2004/016883,
Publication Date: 05/26/2005,
Filing Date: 11/12/2004.
Academic Year 2004
Thesis
- Yasuhiro Akiba:
Automatic Evaluation Methods for Machine Translation Systems,
Ph.D Thesis, Jan. 2005.
- Kazushi Ishihara:
MS Thesis, Feb. 2005
- Kenri Kodaka:
MS Thesis, Feb. 2005
- Shun'ichi Yamamoto:
MS Thesis, Feb. 2005
- Ken Yamaguchi:
MS Thesis, Feb. 2005
- Kazuyoshi Yoshii: Drum Sound Recognition for Polyphonic Audio Signals
by Adaptation of Spectral Templates and Suppression of Harmonic Structure,
MS Thesis, Feb. 2005
- Hayato Ohba:
BE Thesis, Feb. 2005
- Taku Oya:
BE Thesis, Feb. 2005
- Satoshi Kaijiri:
BE Thesis, Feb. 2005
- Ryoji Hamabe:
BE Thesis, Feb. 2005
- Masahiro Fujihara:
BE Thesis, Feb. 2005
- Masamitsu Murase:
BE Thesis, Feb. 2005
Peer-Reviewed Journal Papers
-
Tetsuya Ogata,
Shigeki Sugano, and Jun Tani:
Acquisition of Motion Primitives of Robot in Human-Navigation Task:
Towards Human-Robot Interaction based on "Quasi-Symbol",
Transactions of the Japanese Society for Artificial Intelligence,
Vol.20, No.3, pp.188-196. Mar. 2005.
Online Journal
doi:10.1527/tjsai.20.188
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Shun'ichi Yamamoto,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Dynamic Communication of Humanoid Robot with Multiple People Based on
Interaction Distance,
Transactions of the Japanese Society for Artificial Intelligence,
Vol.20, No.3, pp.209-219, Mar. 2005.
Online Journal
doi:10.1527/tjsai.20.209
-
Yasuhiro Akiba,
Kenji Imamura, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Automatic Grader of MT Outputs in Colloquial Style by Using Multiple Edit
Distance, (in Japanese),
Transactions of the Japanese Society for Artificial Intelligence,
Vol.20, No.3,pp.139-148 (2005).
Online Journal
doi:10.1527/tjsai.20.139
-
Kazushi Ishihara,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Recognition of Onomatopoeia for Environmental Sounds, (in Japanese),
Transactions of the Japanese Society for Artificial Intelligence,
Vol.20, No.3, pp.229-236, March 2005.
Online Journal
doi:10.1527/tjsai.20.229
-
Teruhisa Misu,
Kazunori Komatani,
Youji Seita,
Tatsuya Kawahara:
IEICE Transaction on Information and Systems,
Vol.88-D2, No.3 (Mar. 2005) 499-508,
IEICE,
-
Kazunori Komatani,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G. Okuno:
User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance,
User Modeling and User-Adapted Interaction,
Special Issue on Language-Based Interaction: User Modeling and Adaptation,
Vol.15, No.1-2 (2005) pp.169-183, Kluwer.
Abstract
doi:10.1007/s11257-004-5659-0
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroshi Tsujino:
Improvement of Recognition of Simultaneous Speech Signals Using AV Integration and Scattering Theory for Humanoid Robots,
Speech Communication, Vol.44, Issues 1-4 (Oct. 2004) 97--112,
Elsevier.
doi:10.1016/j.specom.2004.10.010
-
Tino Lourens,
Hiroshi G. Okuno,
Hiroshi Tsujino:
A computational model of monkey cortical grating cells.
Biological Cybernetics,
Vol.92, No.1 (Jan. 2005) 61--70.
Springer-Verlag.
doi:10.1007/s00422-004-0522-2
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,Kazuhiro Nakadai,
Hiroaki Kitano:
Effects of increasing modalities in recognizing three simultaneous speeches,
Speech Communication, Vol.43, Issues 4, pp.347-359,
Sep. 2004.
doi:10.1016/j.specom.2004.03.008
-
Yasuhisa Hayakawa,
Tetsuya Ogata,
and Shigeki Sugano:
Flexible Assembly Work Cooperating System based on Work State Identifications by Self-Organizing Map,
IEEE/ASME Transactions on Mechatronics,
Vol.9, No.3, accepted, Sept. 2004.
-
Kazunori Komatani,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G Okuno:
User model for Adaptive Response Generation in Spoken Dialogue System,
IEICE Transactions on Information and Systems, Vol.87-D2, No.10 (Oct. 2004) 1921-1928, IEICE.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Kazuhiro Nakadai,
Tino Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot,
Applied Intelligence,
Vol.20, No.3 (May/June, 2004), 253-266,
doi:10.1023/B:APIN.0000021417.62541.e0,
(accepted in Oct. 2002),
Kluwer Publishers.
-
Taro Watanabe, Kenji Imamura, Eiichiro Sumita,
Hiroshi G. Okuno:
Statistical machine translation using hierarchical phrase alignment,
IEICE Transactions on Information and Systems,
Vol.J87-D2, No.4 (Apr. 2004) 978-986, IEICE.
Book Chapters
-
Kazushi Ishihara,
Tomohiro Nakatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Sound-Imitation Word Recognition from
Environmental Sounds focusing on Ambiguity Problem in Determining Phonemes,
PRICAI 2004: Trends in Artificial Intelligence
(Proc. of Eighth Pacific Rim International Conference on Artificial Intelligence),
LNAI 3157,
pp.909-918, Springer-Verlag,
Auckland, Aug. 2004.
doi:10.1007/b99563
html
-
Kazunori Komatani,
Ryosuke Itoh,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Recognition of Emotional States in Spoken Dialogue with a Robot,
Innovations in Applied Artificial Intelligence:,
Seventeenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems, IEA/AIE-2004,
LNAI 3029, 413-423, Springer-Verlag.
Ottawa, May. 2004,
Springer-Verlag
-
Tetsuya Ogata,
Jun Tani:
Open-end Human Robot Interaction from the Dynamical Systems Perspective:
Mutual Adaptation and Incremental Learning.
Innovations in Applied Artificial Intelligence:,
Seventeenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems, IEA/AIE-2004,
LNAI 3029, 435-444, Springer-Verlag.
Ottawa, May. 2004,
Springer-Verlag
Peer-Reviewed International Conference Papers
-
Hiroshi G. Okuno:
Robot Audition: Its Issues and State of the Art (invited talk),
Proceedings of 2nd International Symposium on Life Science,
Kyoto, Feb. 2005.
-
Tetsuya Ogata,
Shigeki Sugano, and Jun Tani:
Acquisition of Motion Primitives of Robot in Human-Navigation Task:
Towards Human-Robot Interaction based on "Quasi-Symbol",
Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems,
315-326, Kyoto, Nov. 2004.
-
Tsuyoshi Tasaki,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Motion Control using Listener's Back-Channels and Head Gesture Information,
Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems,
327-338, Kyoto, Nov. 2004.
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Dynamic Communication of Humanoid Robot with multiple people based on
Interaction Distance,
Proceedings of 2nd International Workshop on Man-Machine Symbiotic Systems,
385-392, Kyoto, Nov. 2004.
-
Yuki Suga, Hiroaki ARIE,
Tetsuya Ogata,
and Shigeki Sugano:
Constructivist Approach to Human-Robot Emotional Communication:
Design of Evolutionary Function for WAMOEBA-3,
Proceedings of IEEE/RAS Interanational Conference on Humanoid Robots (Humanoids 2004),
No.76, Los Angels, Nov. 2004.
-
Yuki Suga,
Tetsuya Ogata,
and Shigeki Sugano:
Development of Emotional Communication Robot, WAMOEBA-3,
Proceedings of International Conference on Advanced Mechatronics (ICAM 2004),
413-418, Oct. 2004.
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods,
Proceedings of 5th International Conference on Musical Information
Retrieval (ISMIR-2004),
184-191,
Barcelona, Spain, Oct. 2004.
-
Takuya Yoshioka,
Tetsuro Kitahara,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Automatic Chord Transcription with Concurrent Recognition
of Chord Symbols and Boundaries,
Proceedings of 5th International Conference on Musical Information
Retrieval (ISMIR-2004),
100-105,
Barcelona, Spain, Oct. 2004.
-
Tsuyoshi Tasaki,
Takeshi Yamaguchi,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Robot Motion Control using Listener's Back-Channels and Head Gesture Information,
Proceedings of 2004 International Conference on Spoken
Language Processing (ICSLP-2004),
1033-1036, ASA, ASJ, and ESCA,
Korea, Oct. 2004.
-
Kazushi Ishihara,
Yuya Hattori,
Tomohiro Nakatani,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Disambiguation in Determining Phonemes of Sound-Imitation Words for Environmental Sound Recognition,
Proceedings of 2004 International Conference on Spoken
Language Processing (ICSLP-2004),
1485-1488, ASA, ASJ, and ESCA,
Korea, Oct. 2004.
-
Kazuyoshi Yoshii,
Masataka Goto,
Hiroshi G. Okuno:
Drum Sound Identification for Polyphonic Music Using Template Adaptation
and Matching Methods,
Proceedings of ISCA Tutorial and Research Workshop
on Statistical and Perceptual Audio Processing (SAPA-2004),
accepted, ASA, ASJ, and ESCA,
Korea, Oct. 2004.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Hiroshi Tsujino,
Hiroshi G. Okuno:
Assessment of General Applicability of Robot Audition System by Recognizing Three Simultaneous Speeches,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2004),
pp.2111-2116,
IEEE, RSJ, Sendai, Sep. 2004.
IEEE Kansai Chapter Young Researcher Award
doi:10.1109/IROS.2004.1389721
-
Tetsuya Ogata,
Masaki Matsunaga, Shigeki Sugano, and Jun Tani:
Human Robot Collaboration Using Behavioral Primitives,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2004), pp.1592-1597,
IEEE, RSJ, Sendai, Sep. 2004.
doi:10.1109/IROS.2004.1389721
-
Yuki SUGA,
Tetsuya Ogata,
and Shigeki Sugano:
Aquisition of Reactive Motion for Communication Robots Using Interactive EC:
Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2004), accepted, Sept. 2004.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2004),
pp.1198-1203,
IEEE, RSJ, Sendai, Sep. 2004.
doi:10.1109/IROS.2004.1389721
-
Yoshihiro Sakamoto,
Tetsuya Ogata,
and Shigeki Sugano:
Human-Robot Communication Using Multiple Recurrent Neural Networks,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2004),
pp.1574-1579,
IEEE, RSJ, Sendai, Sep. 2004.
-
Tsuyoshi Tasaki,
Shohei Matsumoto,
Hayato Ohba,
Mitsuhiko Toda,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Dynamic Communication of Humanoid Robot with multiple people based on
Interaction Distance,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man-2004), 71-76, IEEE, Kurashiki, Sep. 2004.
doi:10.1109/ROMAN.2004.1374732
-
Yuya Hattori,
Kazushi Ishihara,
Kazunori Komatani,
Tetsuya Ogata,
Hiroshi G. Okuno:
Repeat Recognition for Environmental Sounds,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man-2004), 83-88, IEEE, Kurashiki, Sep. 2004.
doi:10.1109/ROMAN.2004.1374734
-
Yusuke Akiwa, Yuki Suga,
Tetsuya Ogata,
and Shigeki Sugano:
Imitation based Human-Robot Communication: Roles of Joint Attention and Motion Prediction,
Proceedings of International Workshop on Robot and Human Interaction
(Ro-Man-2004), 283-288, IEEE, Kurashiki, Sep. 2004.
-
Yasuhiro Akiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Using a Mixture of N Best Lists from Multiple MT Systems in
Rank-Sum-Based Confidence Measure for MT Outputs,
Proceedings of the 20th International Conference
on Computational Linguistics (Coling-2004),
322-328,
Geneva, Aug. 2004.
-
Kazunori Komatani,
Teruhisa Misu,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface,
Proceedings of the 20th International Conference
on Computational Linguistics (Coling-2004),
1100-1106,
Geneva, Aug. 2004.
-
Katsuhisa Ishida,
Masayuki Takeda,
Tetsuro Kitahara:
ism: Improvisation Supporting Systems with Melody Correction,
Proceedings of the International Symposium on Musical Acoustics
(NIME2004), 177-180,
Hamamatsu, Jun. 2004.
-
Yasuhiro Akiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Incremental Methods to Select Test Sentence for Evaluating
Translation Ability,
Proceedings of the fourth international conference on Language Resources and Evaluation (LREC-2004),
pp.2015-2018,
Lisbon, Portugal, May 2004.
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Category-level Identification of Non-registered Musical Instrument Sounds,
Proceedings of 2004 International Conference on Acoustics,
Speech and Signal Processing (ICASSP'2004), Vol.IV, 253-256,
Montreal, May 2004.
doi:10.1109/ICASSP.2004.1326811
-
Yohei Sakuraba,
Tetsuro Kitahara,
Hiroshi G. Okuno:
Comparing Features for Forming Music Streams in Automatic Music Transcription,
Proceedings of 2004 International Conference on Acoustics,
Speech and Signal Processing (ICASSP'2004), Vol.IV, 273-276,
Montreal, May 2004.
-
Shun'ichi Yamamoto,
Kazuhiro Nakadai,
Hiroshi Tsujino, Toshio Yokoyama,
Hiroshi G. Okuno:
Improvement of Robot Audition by Interfacing Sound Source Separation and Automatic Speech Recognition with Missing Feature Theory,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2004), 1517-1523, IEEE,
New Orleans, May. 2004.
IEEE Robotics and Automation Society Japan Chapter Young Award
doi:10.1109/ROBOT.2004.1308039
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Acoustical-similarity-based Musical Instrument Hierarchy and
Its Application to Musical Instrument Identification,
Proceedings of the International Symposium on
Musical Acoustics (ISMA2004), 297-300,
Nara, Apr. 2004.
Academic Year 2003
Thesis
- Taro Watanabe :
Example-Based Statistical Machine Translation,
Ph.D Thesis, Feb. 2004.
- Tetsuro Kitahara:
,
MS Thesis, Feb. 2004.
- Yohei Sakuraba:
,
MS Thesis, Feb. 2004.
- Mitsuhiro Sakuraba:
,
MS Thesis, Feb. 2004.
- Naoyuki Kanda:
,
BE Thesis, Feb. 2004.
- Tsuyoshi Tasaki:
,
BE Thesis, Feb. 2004.
- Shohei Matsumoto:
,
BE Thesis, Feb. 2004.
- Yuya Hattori:
,
BE Thesis, Feb. 2004.
- Takuya Yoshioka:
,
BE Thesis, Feb. 2004.
Peer-Reviewed Journal Papers
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Acoustic-feature-based Musical Instrument Hierarchy and Its Application
to Category-level Recognition of Unknown Musical Instruments,
IPSJ Journal,
Vol.45, No.3 (Mar. 2004) pp.680-689, IPSJ.
in html
-
Katsuhisa Ishida,
Tetsuro Kitahara,
Masayuki Takeda:
N-gram Based Melody Correction for Improvisation,
to Category-level Recognition of Unknown Musical Instruments.
IPSJ Journal, Vol.45, No.3 (Mar. 2004) pp.680-689, IPSJ.
in html
-
Yoko Yamakata,
Tatsuya Kawahara,
Hiroshi G. Okuno,
Michihiko Minoh:
Belief Network based Disambiguation of Object Reference in Spoken Dialogue System,
Transactions of the Japanese Society for Artificial Intelligence,
Vol.19, No.1 F, pp.47-56 (2004).
Online Journal
-
Taro Watanabe, Eiichiro Sumita,
Hiroshi G. Okuno:
Decoding Algorithms for Statisitcal Machine Translation Considering Generation
Directions,
IPSJ Journal, Vol.44, No.12 (Dec. 2003) 3202-3210, IPSJ.
in html
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Musical Instrument Identification Considering Pitch-dependent
Characteristics of Timbre: A Classifier Based on F0-dependent Multivariate
Normal Distribution,
IPSJ Journal,
Vol.44, No.10 (Oct. 2003) 2448-2458, IPSJ.
in html
-
Kazuhiro Nakadai,
Ken-ichi Hidai,
Hiroshi G. Okuno,
Hiroshi Mizoguchi,
Hiroaki Kitano:
Real-time Multiple Talker Tracking by Audio-Visual Integration for Humanoids:
Integration of Active Audition nad Face Recognition,
Journal of Robotics Society of Japan,
Vol.21, No.5 (Jul. 2003), pp.517--525.
,
at RSJ server.
-
Kazunori Komatani,
Hiroaki Kashima, Katsuaki Tanaka,
Tatsuya Kawahara:
Domain-independent Spoken Dialogue Platform for Database Query Using
Key-phrase Spotting Based on Combined Language Model,
IPSJ Journal, Vol.44, No.5 (May 2003) 1333-1342.
in html
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Active audition for humanoid robots that can listen to three
simultaneous talkers.
Journal of the Acoustical Society of America,
Vol.113, No.4, Pt.2 of 2, Apr. 2003, pp.2230.
Abstract at ASA.
Book Chapters and Survye Papers
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Pitch-dependent Musical Instrument Identification and
Its Application to Musical Sound Ontology,
In Chung,
P,W.H., Hinde, C. and Ali, M. (Eds.)
Developments in Applied Artificial Intelligence,
LNAI 2718, 112-122, Springer-Verlag.
Proceedings of Nineteenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems (IEA/AIE-2003),
Loughborough, UK, Jun. 2003,
doi:10.1007/3-540-45034-3_12
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Design and Implementation of Personality of Humanoids
in Human Humanoid Non-verbal Interaction,
In Chung, P,W.H., Hinde, C. and Ali, M. (Eds.)
Developments in Applied Artificial Intelligence,
LNAI 2718, 662-673, Springer-Verlag.
Proceedings of Nineteenth International Conference on
Industrial and Engineering Applications of Artificial Intelligence and
Expert Systems (IEA/AIE-2003),
Loughborough, UK, Jun. 2003,
doi:10.1007/3-540-45034-3_67
-
Hiroshi G. Okuno,
Kazuhiro Nakadai:
Real-time Sound Source Localization and Separation based on
Active Audio-Visual Integration,
In Jose Mira and Jose R. Alvarez (Eds.):
Computational Methods in Neural Modeling,
LNCS 2686, 118-125, Springer-Verlag.
The Seventh International Work Conference on
Artificial and Nataural Neural Networks, IWANN 2003, Proceedings, Part 1,
Ma\'{o}, Menorca,, Spain, June 2003,
doi:10.1007/3-540-44868-3_16
-
Hiroshi G. Okuno,
Kazuhiro Nakadai:
Robot Audition: its research topics and current status,
Joho SHori,
Vol.44, No.11 (Nov. 2003) pp.1138-1144, IPSJ.
Article in html
Peer-Reviewed International Conference Papers
-
Hiroshi G. Okuno,
Tetsuya Ogata,
Kazunori Komatani:
Computational Auditory Scene Analysis and Its Application to Robot Audition,
Proceedings of the International Conference on Informatics Research
for Development of Knowledge Society Infrastructure (ICKS 2004),
pp.73-80, Mar. 2004,
doi:10.1109/ICKS.2004.1313411
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroaki Kitano:
Applying Scattering Theory to Robot Audition System,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2003),
1147-1152, IEEE, Las Vegas, Oct. 2003.
-
Tetsuya Ogata,
Shigeki Sugano, Jun Tani:
Interactive Learning in Human-Robot Collaboration,
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2003),
162-167,
IEEE, Las Vegas, Oct. 2003.
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroaki Kitano:
Active audition based humanoid system and ist evaluation:
Localization,
Seperation and Recognition of Simultaneous Speech.
Proceedings of IEEE/RSJ International Conference
on Humanoids (Humanoids-2003),
Springer-Verlag,
IEEE, Munchen, Oct. 2003.
-
Yohei Sakuraba,
Hiroshi G. Okuno:
Note Recognition of Polyphonic Music by Using
Timbre Similarity and Direction Proximity.
Proceedings of International Computer Music Conference (ICMC2003),
167-170, Singapore, Oct. 2003.
-
Yasuhiro AKiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto,
Hiroshi G. Okuno:
Experimental Comparison of MT Evaluation Methods: RED vs. BLEU.
Proceedings of MT Summit IX, 1-8,
New Orleans, Sep. 2003.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Realizing Personality in Audio-Visually Triggered Non-verbal
Behaviors.
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2003), 392-397, IEEE,
Sep. 2003.
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Robot Recognizes Three Simultaneous Speech By Active Audition.
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2003), 398-403, IEEE,
Sep. 2003.
-
Yasuhiro AKiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, and
Hiroshi G. Okuno:
Experimental Comparison of MT Evaluation Methods: RED vs. BLEU.
Proceedings of MT Summit IX,
1-8,
New Orleans, Sep. 2003.
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroshi Tsujino:
Improvement of Three Simultaneous Speech Recognition
by Using AV Integration and Scattering Theory for Humanoid.
Proceedings of Audio Visual Spoken Processing (AVSP-2003),
157-162, St. Jorioz, France, Sep. 2003.
-
Kazunori Komatani,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G. Okuno:
User Modeling in Spoken Dialogue Systems for Flexible Guidance Generation.
Proceedings of the Eighth European Conference on
Speech Communication and Technology (Eurospeech-2003),
745-748, Geneva, Sep. 2003.
-
Kazushi Ishihara,
Yasushi Tsubota,
Hiroshi G. Okuno:
Automatic Transformation of Environmental Sounds into
Sound-Imitation Words Based on Japanese Syllable Structure.
Proceedings of the Eighth European Conference on
Speech Communication and Technology (Eurospeech-2003),
3185-3188, Geneva, Sep. 2003.
-
Kazuhiro Nakadai,
Daisuke Matsuura,
Hiroshi G. Okuno,
Hiroshi Tsujino:
Three Simultaneous Speech Recognition by Integration of Active
Audition and Face Recognition for Humanoid,
Proceedings of the Eighth European Conference on
Speech Communication and Technology (Eurospeech-2003),
2705-2708, Geneva, Sep. 2003.
-
Tatsuya Kawahara,
Ryosuke Ito,
Kazunori Komatani:
Spoken Dialogue System for Queries on Appliance Manuals using Hierarchical Confirmation Strategy.
Proceedings of the Eighth European Conference on
Speech Communication and Technology (Eurospeech-2003),
accepted for presentation,
Geneva, Sep. 2003.
-
Kazunori Komatani,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Flexible Guidance Generation using User Model in Spoken Dialogue Systems,
Proceedings of the 41st Annual Meeting of the Association
for Computational Linguistics (ACL 2003),
pp.256-263, Sapporo, Jul. 2003.
-
Taro Watanabe, Eiichiro Sumita, and
Hiroshi G. Okuno:
Chunk-based statistical translation,
Proceedings of the 41st Annual Meeting of the Association
for Computational Linguistics (ACL 2003),
pp.303-310, Sapporo, Jul. 2003.
-
Yasuhiro AKiba,
Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, and
Hiroshi G. Okuno:
A Statistical-Informmation-Based Selector of the Best among Multiple Outputs,
Exhibition Brochure of the 41st Annual Meeting of the ACL (ACL 2003),
16,
Sapporo, Jul. 2003.
-
Yoji Kiyota, Sadao Kurohashi,
Teruhisa Misu,
Kazunori Komatani,
Tatsuya Kawahara,
Fuyuko Kido:
Dialog Navigator''A Spoken Dialog Q-A System based on Large Text Knowledge Base.
ACL03 Interactive Poster/Demo Session, pp.149--152 (Companion Volume), 2003.
-
Kazunori Komatani,
Fumihiro Adachi,
Shinichi Ueno,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Flexible Spoken Dialogue System based on User Models and Dynamic Generation of VoiceXML Scripts.
4th SIGdial Workshop on Discourse and Dialogue, pp.87--96, 2003.
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Musical Instrument Identification based on F0-dependent
Multivariate Normal Distribution.
Proceedings of 2003 International Conference on
Muotimedia and Expo (ICME 2003), IEEE,
Vol.III, pp.405-409,
Baltimore, MD, Jul. 2003.
-
Tetsuro Kitahara,
Masataka Goto,
Hiroshi G. Okuno:
Musical Instrument Identification based on F0-dependent
Multivariate Normal Distribution.
Proceedings of 2003 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'2003),
Vol.5, pp.421--424, IEEE, Hong Kong, Apr. 2003.
-
Shun Tsuchiya (Ed.):
"Encyclopedia AI",
Feigenbaum, McCarthy. Kyoritsu Publishers, 2003.
Academic Year 2002
Thesis
- Shinya Amano: Studies on Natural Language Processing for
Kana-to-Kanji Conversion and Machine Translation,
Ph.D Thesis, Feb. 2003.
- Kazunori Komatani: Spoken Dialogue Systems for Information Retrieval
with Domain-Independent Dialogue Strategies,
Ph.D Thesis, Oct. 2002.
- Ryosuke Ito:
,
MS Thesis, Feb. 2003.
- Takashi Sumiyoshi:
,
MS Thesis, Feb. 2003.
- Masahiro Hasegawa:
,
MS Thesis, Feb. 2003.
- Naofumi Yoshida:
,
MS Thesis, Feb. 2003.
- Ian R. Lane:
Language Model Switching Based on Topic Detection for Multi-Domain Dialog Speech Recognition,
MS Thesis, Feb. 2003.
- Yuha Aakita:
,
MS Thesis, Aug. 2002.
- Kazushi Ishihara:
,
BS Thesis, Feb. 2003.
- Tasuku Kitade:
,
BS Thesis, Feb. 2003.
- Teruhisa Misu:
,
BS Thesis, Feb. 2003.
- Shun-ichi Yamamoto:
,
BS Thesis, Feb. 2003.
- Kazuyoshi Yoshii:
,
BS Thesis, Feb. 2003.
Peer-Reviewed Journal Papers
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Tino Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot,
Applied Intelligence,
Vol.20, No.3 (May/June, 2004), 253-266,
doi:10.1023/B:APIN.0000021417.62541.e0
(accepted in Oct. 2002),
Kluwer Publishers.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Tino Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot.
Applied Intelligence, Kluwer Publisher,
accepted for publication,
International Society for Applied Intelligence, 2003.
doi:10.1007/3-540-45324-5_19
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Ken'ichi Hidai,
Hiroshi Mizoguchi,
Hiroaki Kitano:
Human-Robot Non-Verbal Interaction Empowered by
Real-Time Auditory and Visual Multiple-Talker Tracking
Advanced Robotics,
Vol.17, No.2, pp.115-130,
VSP and Robotics Society of Japan,
2003.
doi:10.1163/156855303321165088
Online version,
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Issues in Humanoid Audition and Sound Source Localization by Active Audition.
Transaction of JSAI, Vol.18, No.2 F, pp.103-110 (Mar. 2003).
Online Journal
-
Kazunori Komatani,
Tatsuya Kawahara:
,
IPSJ Journal, Vol.43, No.10, pp.3078--3086, 2002.
in html
-
Ryosuke Ito,
Kazunori Komatani,
Tatsuya Kawahara:
,
IPSJ Journal, Vol.43, No.7, pp.2147--2154, 2002.
in html
-
Masahiro Hasegawa,
Yuya Akita,
Tatsuya Kawahara:
,
IPSJ Journal, Vol.43, No.7, pp.2222-2229, 2002.
html
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time Auditory and Visual Multiple-Speaker Tracking For
Human-Robot Interaction.
Journal of Robotics and Mechatronics,
special issue on Human Robot Interaction, Vol.14, No.5 (2002) 479-489,
Mechatronics Society of Japan.
-
Kentaro Umesawa,
Takamichi Saito,
Hiroshi G. Okuno:
,
IPSJ Journal, Vol.43, No.8 (Aug. 2002) 1553-1562.
in html
Books, Book Chapters and Survye Papers
-
Yuasa, T.
and
Okuno, H.G. (Eds.):
Advanced Lisp Technology,
Advanced Information Processing Technology, Vol.4,
Taylor and Francis Publishers, London, UK, May, 2002.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Realizing Audio-Visually triggered ELIZA-like non-verbal Behaviors.
In Ishizuka, M. and Slaney, J. (eds)
PRICAI-2002 Topics in Artificial Intelligence,
LNAI 2417, 552--562,
Springer-Verlag, Tokyo, Aug. 2002.
doi:10.1007/3-540-45683-X_59
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Social Interaction of Humanoid Robot through Auditory and Visual Tracking.
In Hendtlass, T., and Ali, M. (Eds.)
Developments in Applied Artificial Intelligence (IEA/AIE-2002),
Cairns, Australia, June 2002,
LNAI 2358, pp.725-735, Springer-Verlag.
doi:10.1007/3-540-48035-8_70
Peer-Reviewed Conference Papers
-
Takamichi Saito,
Toshiyuki Kitoh,
Kentaro Umesawa,
Hiroshi G. Okuno:
Privacy-Enhanced SPKI Access Control on PKIX and Its Application to Web Server.
Proc. of the Seventeenth International Conference on Advanced
Information Networking and Applications (AINA 2003),
696--703, IEEE, Xi'an, China.
Paper.
doi:10.1109/AINA.2003.1192970
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Auditory Fovea Based Speech Separation and Its Application to
Dialog System.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2002), 1320-1325, IEEE,
Geneva, Oct. 2002.
doi:10.1109/IRDS.2002.1043937
-
Yoko Yamakata,
Tatsuya Kawahara,
Hiroshi G. Okuno:
BELIEF NETWORK BASED DISAMBIGUATION OF OBJECT REFERENCE
IN SPOKEN DIALOGUE SYSTEM FOR ROBOT.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 170-176, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Yasushi Tsubota,
Tatsuya Kawahara,
Hiroshi G. Okuno,
Masatake Dantsuji:
RECOGNITION AND VERIFICATION OF ENGLISH BY JAPANESE STUDENTS FOR
COMPUTER-ASSISTED LANGUAGE LEARNING SYSTEM.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 1205-1208, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
AUDITORY FOVEA BASED SPEECH ENCHANCEMENT AND ITS APPLICATION TO
HUMAN-ROBOT DIALOG SYSTEM.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 1817-1820, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
REAL-TIME SOUND SOURCE LOCALIZATION AND SEPARATION FOR ROBOT AUDITION.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 193-196, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Taro Watanabe,
Eiichiro Sumita:
Statistical Machine Translation Decoder Base On Phrase.
Proceedings of 2002 International Conference on Spoken
Language Processing (ICSLP-2002), 1889-1892, Spec3Co2, ASA, ASJ, and ESCA,
Denver, Sep. 2002.
-
Kazunori Komatani,
Tatsuya Kawahara,
Ryosuke Ito,
Hiroshi G. Okuno:
Efficient Dialogue Strategy to Find Users' Intended Items from Information
Query Results,
Proceedings of the Nineteenth International Conference
on Computational Linguistics (Coling-2002), Vol.1, pp.481-487,
Aug. 2002.
-
Taro Watanabe,
Eiichiro Sumita:
Bidirectional Decoding for Statistical Machine Translation.
Proceedings of the Nineteenth International Conference
on Computational Linguistics (Coling-2002), pp.1079-
Aug. 2002.
-
Yasuhiro Akiba,
Taro Watanabe,
Eiichiro Sumita:
Using Language and Translation Models to Select the Best among Outputs
from Multiple MT Systems,
Proceedings of the Nineteenth International Conference
on Computational Linguistics (Coling-2002), pp.
Aug. 2002.
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Exploiting Auditory Fovea in Humanoid-Human Interaction.
Proceedings of Eighteenth National Conference on
Artificial Intelligence (AAAI-2002), 431-438,
AAAI, Edmonton, Aug. 2002.
AAAI Contents
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Hiroaki Kitano:
Non-Verbal Eliza-like Human Behaviors in Human-Robot Interaction
through Real-Time Auditory and Visual Multiple-Talker Tracking.
Proceedings of the Third International Workshop on
Cognitive Robotics (CogRob-2002),
AAAI, Edmonton, Jul. 2002.
-
Yoko Yamakata,
Tatsuya Kawahara,
Hiroshi G. Okuno:
Belief Network based
Disambiguation of Word Reference in Spoken Dialogue System for Robot.
Proceedings of ISCA Tutorial and Research Workshop on
Multi-Modal Dialogue in Mobile Environments, Germany, Jun. 2002.
-
Kazuhiro Nakadai,
Ken'ichi Hidai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time speaker localization and speech separation
by Audio-Visual Integration,
Proceedings of IEEE-RAS International Conference
on Robotics and Automation (ICRA-2002), pp.1043-1049, IEEE,
May 2002.
doi:10.1109/ROBOT.2002.1013493
Academic Year 2001
Thesis
- Hirofumi Adachi:
,
MS Thesis, Feb. 2002.
- Yoko Yamakata:
,
MS Thesis, Feb. 2002.
- Raux Antoine Roland:
Intelligibility Assessment and Adaptive Drill Generation
for a Computer-Assisted Pronunciation Learning System,
MS Thesis, Feb. 2002.
- Shinichi Ueno:
,
BS Thesis, Feb. 2002.
- Yohei Sakuraba:
,
BS Thesis, Feb. 2002.
- Kazuya Shitaoka:
,
BS Thesis, Feb. 2002.
- Masahiro Yokoo:
,
BS Thesis, Feb. 2002.
Peer-Reviewed Journal Papers
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Lourens, T.,
Hiroaki Kitano:
Sound and Visual Tracking by Active Audition.
in Jin, Q., Li, J., Zhang, N., Cheng, J., Yu, C., and Noguchi, N (eds)
Enabling Society with Information Technology
pp.174-185, Springer-Verlag, 2002.
-
Takamichi Saito,
Kentaro Umesawa,
Hiroshi G. Okuno:
,
Trans. IEICE, Vol.J84-D1, No.11 (Nov. 2001)
pp.1553-1562, IEICE,
-
Kentaro Umesawa,
Takamichi Saito,
Hiroshi G. Okuno:
,
IPSJ Journal, Vol.42, No.8 (Aug. 2001) pp.2067-2076.
TAF Telecom Technology Student Award
-
Tatsuya Kawahara,
Akinobu Lee, Tetsunori Kobayashi, Koichi Takeda,
N. Minematsu, Shigeki Sagayama, Katsuya Itou, A. Ito, M. Yamamoto,
A. Yamada, T.Utsuto, Kiyohiro Shikano:
Japanese Dictation ToolKit -- 1999 version --,
Journal of Acoustic Society of Japan, Vol.57, No.3, pp.210--214, 2001
-
M. Mimura and
Tatsuya Kawahara:
Difference of acoustic modeling for read speech and dialogue speech,
Acoustical Science & Technology, Vol.22, No.5, pp.373--374, 2001.
Translated Books, Book Chapters and Survey Papers
-
Hiroshi G. Okuno,
Kazuhiro Nakadai:
,
JSAJ, Vol.58, No.3 (Mar. 2002) pp.205-210.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Tino Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot,
Engineering of Intelligent Systems: 14th International Conference on Industrial and Engineering
Applications of Artificial Intelligence and Expert Systems, IEA/AIE-2001,
Proceedings,
Budapest, Hungary, June 2001.
LNAI 2070, 640-650, Springer.
Best Paper Award (1st Place)
doi:10.1007/3-540-45034-3_67
-
Tino Lourens,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Detection of Oriented Repetitive Alternating Patterns in Color
Images -- A Computational Model of Monkey Grating Cells.
Connectionist Models of Neurons, Learning Processes and
Artificial Intelligence: Sixth International Work-Conference
on Artificial and Natural Neural Networks, IWANN 2001, Proceeding, Part I,
Granada, Spain, June 2001.
LNCS 2084, 95-107, Springer-Verlag.
doi:10.1007/3-540-45720-8_12/
-
Ian Frank,
Kumiko Tanaka,
Hiroshi G. Okuno,
Jun'ichi Akita,
Yukiko Nakagawa,
K. Maeda,
Kazuhiro Nakadai,
Hiroaki Kitano:
And The Fans are Going Wild! SIG plus MIKE.
RoboCup 2000: Robot Soccer World Cup IV,
LNAI 2019, 139-148,
Springer-Verlag, May 2001.
doi:10.1007/3-540-45324-5_12
-
Yukiko Nakagawa,
Hiroshi G. Okuno,
Hiroaki Kitano:
Bridging gap between small sized league and simulator league.
RoboCup 2000: Robot Soccer World Cup IV,
LNAI 2019, 209-218,
Springer-Verlag, May 2001.
doi:10.1007/3-540-45324-5_19
- Hiroaki Kitano:
Hiroshi G. Okuno,
Mineo Morohashi,
Koji Kyoda,
Kazuhiro Nakadai :
PC Cluster -- Beowulf
Sangyo Publisher, 2001.
Peer-Reviewed International Conference Papers
-
Kazuhiro Nakadai,
Ken'ichi Hidai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time Active Human Tracking by Hierarchical Integration of
Audition and Vision.
Proceedings of IEEE-RAS International Conference
on Humanoid Robots (Humanoids2001), pp.91-98, IEEE,
Nov. 2001.
-
Kazuhiro Nakadai,
Tatsuya Matsui,
Hiroshi G. Okuno,
Hiroaki Kitano:
Epipolar Geometry Based Sound Localization and Extraction
for Humanoid Audition.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2001),
1395-1401, IEEE and RSJ, Oct. 2001.
doi:10.1109/IROS.2001.977176
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Ken-ichi Hidai,
Hiroshi Mizoguchi,
Hiroaki Kitano:
Human-Robot Interaction Through
Real-Time Auditory and Visual Multiple-Talker Tracking
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2001), 1402-1409,
IEEE and RSJ, Oct. 2001.
Nakamura Award for IROS-2001 Best Paper Nomination Finalist
(2nd or 3rd Place) at IROS-2002
doi:10.1109/IROS.2001.977177
-
Tino Lourens,
Hiroshi G. Okuno,
Hiroaki Kitano:
Automatic Graph Extraction from Color Images.
Proc. of 11th International Conference
Image Analysis and Processing (ICIAP 2001), pp.302-308,
Granada, Spain, June 2001.
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time Multiple Speaker Tracking by Multi-Modal Integration
for Mobile Robots.
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.2643-2646, Sep. 2001.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
T. Lourens,
Hiroaki Kitano:
Separating Three Simultaneous Speeches with Two Microphones by
Integrating Auditory and Visual Processing.
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.1193-1196, Sep. 2001.
-
Akinobu Lee,
Tatsuya Kawahara,
and Kiyohiro Shikano:
Gaussian mixture selection using context-independent HMM,
Proc. IEEE-ICASSP, pp.69--72, 2001.
-
Hiroaki Nanjo,
Tatsuya Kawahara:
Speaking-Rate Dependent Decoding and Adaptation for Spontaneous Lecture Speech Recognition,
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), pp.725--728, 2002.
-
Kazunori Komatani,
K.Tanaka, H.Kashima, and
Tatsuya Kawahara:
Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model,
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.1319--1322, 2001.
-
Akinobu Lee,
Tatsuya Kawahara,
and Kiyohiro Shikano:
Julius -- an open source real-time large vocabulary recognition engine,
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.1691--1694, 2001
-
Hiroaki Nanjo,
Kazuomi Kato, and
Tatsuya Kawahara:
Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition,
Proceedings of European Conforence on
Speech Processing (Eurospeech 2001),
pp.2531--2534, 2001
-
Tatsuya Kawahara,
Hiroaki Nanjo,
and S.Furui:
Automatic transcription of spontaneous lecture speech,
Proc. IEEE workshop on Automatic Speech Recognition and Understanding,
2001.
-
Kazuhiro Nakadai,
Ken-ichi Hidai,
Hiroshi Mizoguchi,
Hiroshi G. Okuno,
Hiroaki Kitano:
Real-Time Auditory and Visual Multiple-Object Tracking for Robots.
Proc. of 17th International Joint Conference on Artificial Intelligence
(IJCAI-01)
, 1425-1432, Seattle, Aug. 2001.
電気通信普及財団テレコム技術賞奨励賞
-
Tino Lourens,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Detection of Oriented Repetitive Alternating Patterns in Color
Images -- A Computational Model of Monkey Grating Cells.
Proc. of Sixth International Work-Conference
on Artificial and Natural Neural Networks (IWANN2001),
Granada, Spain, June 2001.
LNCS 2084, 95-107, Springer-Verlag.
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Tino Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot,
Proc. of 17th International Conference on Industrial and Engineering
Applications of Artificial Intelligence and Expert Systems (IEA/AIE-2001)
,
Budapest, Hungary, June 2001.
Lecture Notes in Artificial Intelligence No.2070, 640-650, Springer.
Best Paper Award (1st Place)
-
Ian Frank,
Kumiko Tanaka,
Hiroshi G. Okuno,
Jun'ichi Akita,
Yukiko Nakagawa,
K. Maeda,
Kazuhiro Nakadai,
Hiroaki Kitano:
And The Fans are Going Wild! SIG plus MIKE.
RoboCup 2000: Robot Soccer World Cup IV,
Lecture Notes in Artificial Intelligence No.2019, 139-148,
Springer-Verlag, May 2001.
-
Yukiko Nakagawa,
Hiroshi G. Okuno,
Hiroaki Kitano:
Bridging gap between small sized league and simulator league.
RoboCup 2000: Robot Soccer World Cup IV,
Lecture Notes in Artificial Intelligence No.2019, 209-218,
Springer-Verlag, May 2001.
- Takamichi Saito,
Kentaro Umesawa,
Hiroshi G. Okuno:
An Access Control with Handling Private Information Server.
Proc. of the First International Workshop on
Internet Computing and E-Commerce (ICEC01),
IEEE, San Francisco, April 2001.
Before Academic Year 2000
Peer-Reviewed Journal Papers
-
Takahide Hoshide, Masayoshi Nose, Hisazumi Tsuchida, Kisaku Fujimoto,
Hiroshi G. Okuno:
Adaptive real-time planning for multimedia communication services by
multiagent system,
Electronics and Communications in Japan (Part I: Communications),
Vol.84, No.2 (Feb. 2001) pp.90-98, Weiley.
doi:10.1002/1520-6424(200102)84:2<90::AID-ECJA10>3.0.CO;2-C
-
Hiroshi G. Okuno,
Tomohiro Nakatani, Takeshi Kawabata:
Listening to two simultaneous speeches,
Speech Communication, Vol.27, Issue 3-4 (Apr. 1999) pp.299-310,
doi:10.1016/S0167-6393(98)00080-6
-
Hiroshi G. Okuno,
Shin-ichi Minato, Hideki Isozaki:
On the propertyies of combination set operations,
Information Processing Letters,
Vol.66, Issue 4 (May 1998) pp.195-199, Elsevier.
doi:10.1016/S0020-0190(98)00067-2
-
Hiroshi G. Okuno:
Experience of parallel AI programming with parallel Lisp,
Future Generation Computer Systems, Vol.7, Issues 2-3 (April 1992)
pp.211-219,
doi:10.1016/0167-739X(92)90008-Y
-
Hiroshi G. Okuno,
Nobuyasu Osato,
Ikuo Takeuchi:
Firmware approach to fast Lisp interpreter,
SIGMICRO Newsletter,
Vol.19, Issue 1-2 (June 1988) pp.5-11, ACM,
ACM DL
-
Ikuo Takeuchi,
Hiroshi G. Okuno,
Nobuyasu Osato:
TAO: a harmonic mean of Lisp, Prolog and Smalltalk,
SIGPLAN Notices, Vol.18, Issue 7 (July 1983) pp.65-74,
ACM DL
Books, Book Chapters and Survey Papers
-
Hiroshi G. Okuno,
Koji M. Kyoda, Mineo Morohashi, and
Hiroaki Kitano:
Initial Assessment of ERATO-1 Beowulf-Class Cluster,
Ito, T. and Yuasa, T. (eds.)
Parallel and Distributed Computing in Symbolic and
Irregular Applications,
World Scientific Publishing, 372--383, 2000.
-
Kazuhiro Nakadai,
Lourens, T.,
Hiroshi G. Okuno,
Hiroaki Kitano:
Humanoid Active Audition System Improved by The Cover Acoustics.
In Mizoguchi, R. and Slaney, J. (eds)
"PRICAI 2000 Topics in Artificial Intelligence"
Lecture Notes in Artificial Intelligence 1886, 544--554,
Springer-Verlag, Melborne, Aug. 2000.
doi:10.1007/3-540-44533-1_55
-
Hiroshi G. Okuno,
Tomohiro Nakatani, Takeshi Kawabata:
Cocktail-Party Effect with Computational Auditory Scene AAnalysis
-- Preliminary Report --
Advances in Human Factors/Ergonomics, Vol.20, Part2, 1995, pp.503-508,
doi:10.1016/S0921-2647(06)80266-2
Peer-Reviewed International Conference Papers
-
Hiroshi G. Okuno,
Kazuhiro Nakadai,
T. Lourens,
Hiroaki Kitano:
Sound and Visual Tracking for Humanoid.
Proceedings of 2000 International Conference on
Information Society in the 21st Century: Emerging Technologies and
New Challenges, Univ. of Aizu, 254--261, Aizu-Wakamatsu, Nov. 2000.
-
Kazuhiro Nakadai,
Matsui, T.,
Hiroshi G. Okuno,
Hiroaki Kitano:
Active Audition System and Humanoid Exterior Design.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2000),
IEEE, 1453--1461, Takamatsu, Nov. 2000.
doi:10.1109/IROS.2000.893225
-
Hiroaki Kitano,
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Sabisch, T., and Matsui, T.:
Design and Architecture of SIG the Humanoid: An Experiemntal
Platformfor Integratind Perception in RoboCup Humanoid Challenge.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2000), IEEE, 181--190,
Takamatsu, Nov. 2000.
doi:10.1109/IROS.2000.894602
-
Fermin, I., Ishiguro, H.,
Hiroshi G. Okuno,
Hiroaki Kitano:
A Framework for Integrating Sensory Information in a Humanoid Robot.
Proceedings of IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS-2000), IEEE, 1748--1753,
Takamatsu, Nov. 2000.
doi:10.1109/IROS.2000.894602
-
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Humanoid Active Audition System.
Proceedings of First IEEE-RAS Internationa l Conference on
Humanoid Robots (Humanoids2000),
Cambridge, Sep. 2000.
-
T. Lourens,
Kazuhiro Nakadai,
Hiroshi G. Okuno,
Hiroaki Kitano:
Selective Attention by Integration of Vision and Audition.
Proceedings of First IEEE-RAS Internationa l Conference on
Humanoid Robots (Humanoids2000),
Cambridge, Sep. 2000.
-
Frank, I., Tanaka-Ishii, K.,
Hiroshi G. Okuno,Nakagawa, Y., Meada, K.,
Kazuhiro Nakadai,
Hiroaki Kitano:
And The Fans are Going Wild! SIG plus MIKE.
Proceedings of the Fourth International Workshop on
RoboCup (RoboCup-2000), LNCS 2019, 267--276,
Springer-Verlag, Melbourne, Aug. 2000.
-
Nakagawa, Y.,
Hiroshi G. Okuno,
Hiroaki Kitano:
Bridging gap between small sized league and simulator league.
Proceedings of the Fourth International Workshop on
RoboCup (RoboCup-2000), LNCS 2019, 1--10,
Springer-Verlag Melborne, Aug. 2000.
-
Kazuhiro Nakadai,
T. Lourens,
Hiroshi G. Okuno,
Hiroaki Kitano:
Active Audition for Humanoid.
Proceedings of the Seventeenth National Conference on
Artificial Intelligence (AAAI-2000), 832--839,
AAAI, Austin, Aug. 2000.
AAAI Contents
-
Yukiko Nakagawa,
Hiroshi G. Okuno,
Hiroaki Kitano:
Using Vision to Improve Sound Source Separation,
Proceedings of the Sixteenth National Conference on
Artificial Intelligence (AAAI-1999), 768--773,
AAAI, Orlando, Aug. 1999.
AAAI Contents
-
Tomohiro Nakatani,
Hiroshi G. Okuno:
Sound Ontology for Computational Auditory Scence Analysis,
Proceedings of the Fifteenth National Conference on
Artificial Intelligence (AAAI-1998), 1004--1009,
AAAI, Madison, Aug. 1998.
AAAI Contents
-
Hiroshi G. Okuno,
Tomohiro Nakatani, Takeshi Kawabata:
Understanding Three Simultaneous Speeches
Proceedings of the Fifteenth International Joint Conference on
Artificial Intelligence (IJCAI-1997), Vol.1, pp.30--35,
IJCAI, Nagoya, Aug. 1997.
IJCAI Contents
-
Hiroshi G. Okuno,
Tomohiro Nakatani, Takeshi Kawabata:
Interfacing Sound Stream Segregation to Automatic Speech Recognition -- Preliminary Results on Listening to Several Sounds Simultaneously,
Proceedings of the Thirteenth National Conference on
Artificial Intelligence (AAAI-1996), 1082--1089,
AAAI, Portland, Aug. 1996.
AAAI Contents
-
Tomohiro Nakatani,
Hiroshi G. Okuno,
Takeshi Kawabata:
Residue-Driven Architecture for Computational Auditory Scene Analysis
Proceedings of the Thirteenth International Joint Conference on
Artificial Intelligence (IJCAI-1995), 165--174,
IJCAI, Montreal, Aug. 1997.
IJCAI Contents
-
Tomohiro Nakatani,
Hiroshi G. Okuno,
Takeshi Kawabata:
Auditory Stream Segregation in Auditory Scene Analysis with a
Multi-Agent System,
Proceedings of the Twelfth National Conference on
Artificial Intelligence (AAAI-1994), 1082--1089,
AAAI, Seattle, Aug. 1994.
AAAI Contents
-
Saito, T., Umesawa, K.,
Hiroshi G. Okuno:
Privacy Enhanced Access Control by SPKI,
Proc. of the Seventh International Conference on
Parallel and Distributed Systems:
International Workshop on Next-Generation Internet
Technologies and Applications 2000 (NGITA00), 301--306, IEEE,
Iwate, July. 2000.
doi:10.1109/PADSW.2000.884605
-
Saito, T., Umesawa, K.,
Hiroshi G. Okuno:
Privacy-Enhanced Access Control by SPKI and Its Application to Web Server.
Proc. of Ninth IEEE International Workshops on
Enabling Technologies: Infrastructure for Collaborative Enterprises,
IEEE, 201--206, NIST, Maryland, June 2000.
doi:10.1109/ENABL.2000.883729
-
Hiroaki Kitano,
Hiroshi G. Okuno,
Kazuhiro Nakadai,
Fermin, I., Sabisch, T., Nakagawa, Y., and Matsui, T.:
Designing a huymanoid head for RoboCup challenge.
Proceedings of the Fourth International Conference
on Autonomous Agents (Agents2000), ACM, pp.17--18,
Barcelona, June 2000.
ACM DL
-
Tomohiro Nakatani, Masataka Goto,
Hiroshi G. Okuno:
Localization by harmonic structure and its application to harmonic sound
stream segregation,
Proceedings of 1996 International Conference on Acoustics, Speech,
and Signal Processing (ICASSP-1996), Vol.2, 653-656, ASA, ASJ, and ESCA,
Atlanta,
doi:10.1109/ICASSP.1996.543205
-
Hiroshi G. Okuno,
Tomohiro Nakatani, Takeshi Kawabata:
A new speech enhancement: speech stream segregation,
Proceedings of 1996 International Conference on Spoken
Language Processing (ICSLP-1996), Vol.4, 2356-2359, ASA, ASJ, and ESCA,
Yokohama,
doi:10.1109/ICSLP.1996.607281
-
Tomohiro Nakatani, Takeshi Kawabata,
Hiroshi G. Okuno:
A Computational Model of Sound Stream Segregation with Multi-Agent Paradigm,
Proceedings of 1995 International Conference on
Acoustics, Speech and Signal Processing (ICASSP'1995),
Vol.4, pp.2671--2674, IEEE, 1995.
doi:10.1109/ICASSP.1995.480111
-
Hiroshi G. Okuno,
Anoop Gupta:
Parallel execution of OPS5 in QLISP,
Proceedings of the Fourth Conference on Artificial Intelligence Applications,
pp.268-273, IEEE, 1988.
doi:10.1109/CAIA.1988.196114
-
Hiroshi G. Okuno,
Nobuyasu Osato,
Ikuo Takeuchi:
Firmware approach to fast Lisp interpreter,
Proceedings of the 20th Annual Workshop on Microprogramming (MICRO-20),
pp.1-11, ACM, Boulder, 1987.
ACM DL
-
Hiroshi G. Okuno,
Ikuo Takeuchi,
Nobuyasu Osato, Yasushi Hibino, Kazufumi Watanabe:
TAO: A fast interpreter-centered system on LISP machine ELIS
Proceedings of the 1984 ACM Symposium on LISP and functional programming (LFP'84)
Austin, Texas, pp.140-149, 1984.
ACM DL
Last modified: Wed Aug 15 23:23:58 JST 2012
Copyleft All Wrongs Reserved, 2001-2010.