Gerasimos Potamianos: Publications and Patents per Research Area / Topic


Published work is clustered into three broad categories. First are listed the most recent works on perception technologies for ambient intelligence in smart spaces, conducted as part of European Projects CHIL, DICIT, and NETCARITY (2004-) as well as work on audio-visual speech processing (1997-). This is followed by earlier work (up to 1998) on language modeling, statistical image analysis, and signal processing / filter design.



    A. AUDIO-VISUAL PERCEPTION TECHNOLOGIES FOR AMBIENT INTELLIGENCE IN SMART SPACES

    General:

  1. A. Waibel, R. Stiefelhagen, R. Carlson, J. Casas, J. Kleindienst, L. Lamel, O. Lanz, D. Mostefa, M. Omologo, F. Pianesi, L. Polymenakos, G. Potamianos, J. Soldatos, G. Sutschet, and J. Terken, Computers in the Human Interaction Loop, To Appear In: Handbook of Ambient Intelligence and Smart Environments, H. Nakashima, J. Augusto, and H. Aghajan (Eds.), Springer, 2009.
  2. Far-Field Speech Processing:

  3. G. Potamianos, L. Lamel, M. Wolfel, J. Huang, E. Marcheret, C. Barras, J. McDonough, J. Hernando, D. Macho, and C. Nadeu, Automatic Speech Recognition, To Appear In: Computers in the Human Interaction Loop, A. Waibel and R. Stieflhagen (Eds.), Ch. 6, Springer, 2009.
  4. J. Huang, E. Marcheret, K. Visweswariah, and G. Potamianos, The IBM RT07 evaluation systems for speaker diarization on lecture meetings, in Multimodal Technologies for Perception of Humans: International Evaluation Workshops CLEAR 2007 and RT 2007, Baltimore, Maryland, 2007, LNCS vol. 4625, pp. 497-508, Springer, Berlin, 2008.
  5. J. Huang, E. Marcheret, K. Visweswariah, V. Libal, and G. Potamianos, The IBM Rich Transcription Spring 2007 speech-to-text systems for lecture meetings, in Multimodal Technologies for Perception of Humans: International Evaluation Workshops CLEAR 2007 and RT 2007, Baltimore, Maryland, 2007, LNCS vol. 4625, pp. 429-441, Springer, Berlin, 2008.
  6. J. Huang, E. Marcheret, K. Visweswariah, V. Libal, and G. Potamianos, Detection, diarization, and transcription of far-field lecture speech, Proc. Conf. Int. Speech Comm. Assoc. (Interspeech), pp. 2161-2164, Antwerp, Belgium, 2007.
  7. E. Marcheret, G. Potamianos, K. Visweswariah, and J. Huang, The IBM RT06s evaluation system for speech activity detection in CHIL seminars, Proc. RT06s Evaluation Works. - held with Joint Works. on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), S. Renals, S. Bengio, and J.G. Fiscus (Eds.), LNCS 4299, pp. 323-335, Washington DC, 2006.
  8. J. Huang, M. Westphal, S. Chen, O. Siohan, D. Povey, V. Libal, A. Soneiro, H. Schulz, T. Ross, and G. Potamianos, The IBM Rich Transcription Spring 2006 speech-to-text system for lecture meetings, Proc. RT06s Evaluation Works. - held with Joint Works. on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), S. Renals, S. Bengio, and J.G. Fiscus (Eds.), LNCS 4299, pp. 432-443, Washington DC, 2006.
  9. S.M. Chu, E. Marcheret, and G. Potamianos, Automatic speech recognition and speech activity detection in the CHIL smart room, Proc. Joint Works. on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), LNCS vol. 3869, pp. 332-343, Edinburgh, United Kingdom, 2005.
  10. E. Marcheret, K. Visweswariah, and G. Potamianos, Speech activity detection fusing acoustic phonetic and energy features, Proc. Europ. Conf. Speech Comm. Technol. (Interspeech), pp. 241-244, Lisbon, Portugal, 2005.
  11. D. Macho, J. Padrell, A. Abad, C. Nadeu, J. Hernando, J. McDonough, M. Wolfel, U. Klee, M. Omologo, A. Brutti, P. Svaizer, G. Potamianos, and S.M. Chu, Automatic speech activity detection, source localization, and speech recognition on the CHIL seminar corpus, Proc. Int. Conf. Multimedia Expo (ICME), Amsterdam, The Netherlands, 2005.
  12. Conversational Interaction:

  13. G. Potamianos, J. Huang, E. Marcheret, V. Libal, R. Balchandran, M. Epstein, L. Seredi, M. Labsky, L. Ures, M. Black, and P. Lucey, Far-field multimodal speech processing and conversational interaction in smart spaces, Proc. Joint Work. Hands-Free Speech Communication and Microphone Arrays (HSCMA), Trento, Italy, 2008.
  14. R. Balchandran, M. Epstein, G. Potamianos, and L. Seredi, A multi-modal spoken dialog system for interactive TV, Proc. Int. Conf. Multimodal Interfaces (ICMI) - Demo Papers, pp. 191-192, Chania, Greece, 2008.
  15. Acoustic / Multimodal Scene Analysis for Event Detection / Activity Classification:

  16. V. Libal, B. Ramabhadran, N. Mana, F. Pianesi, P. Chippendale, O. Lanz, and G. Potamianos, Multimodal classification of activities of daily living inside smart homes, Proc. Int. Works. Ambient Assisted Living (IWAAL), LNCS vol. 5518, Part II, pp. 687-694, Salamanca, Spain, 2009.
  17. X. Zhuang, J. Huang, G. Potamianos, and M. Hasegawa-Johnson, Acoustic fall detection using Gaussian mixture models and GMM supervectors, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), pp. 69-72, Taipei, Taiwan, 2009.
  18. J. Huang, X. Zhuang, V. Libal, and G. Potamianos, Long-time span acoustic activity analysis from far-field sensors in smart homes, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), pp. 4173-4176, Taipei, Taiwan, 2009.
  19. A Leone, G. Diraco, C. Distante, P. Siciliano, M. Malfatti, L. Gonzo, M. Grassi, A. Lombardi, G. Rescio, P. Malcovati, V. Libal, J. Huang, and G. Potamianos, A multi-sensor approach for people fall detection in home environment, Proc. Work. Multi-Camera and Multi-Modal Sensor Fusion (M2SFA2), Held in Conjunction with the: 10th Europ. Conf. Computer Vision (ECCV), Marseille, France, 2008.
  20. M. Grassi, A Leone, M. Malfatti, A. Lombardi, G. Rescio, G. Diraco, C. Distante, V. Libal, J. Huang, G. Potamianos, P. Malcovati, L. Gonzo, and P. Siciliano, A hardware-software framework for high-reliability people fall detection, Proc. 7th IEEE Conf. on Sensors (SENSORS), pp. 1328-1331, Lecce, Italy, 2008.
  21. Computer Vision:

  22. K. Bernardin, R. Stiefelhagen, A. Pmevmatikakis, O. Lanz, A. Brutti, J. Casas, and G. Potamianos, Person Tracking, To Appear In: Computers in the Human Interaction Loop, A. Waibel and R. Stieflhagen (Eds.), Ch. 3, Springer, 2009.
  23. A. Tyagi, J.W. Davis, and G. Potamianos, Steepest descent for efficient covariance tracking, Proc. IEEE Work. Motion and Video Computing (WMVC), Copper Mountain, Colorado, 2008.
  24. Z. Zhang, G. Potamianos, A.W. Senior, and T.S. Huang, Joint face and head tracking inside multi-camera smart rooms, Signal, Image and Video Processing, vol. 1, pp. 163-178, 2007.
  25. A. Tyagi, M. Keck, J.W. Davis, and G. Potamianos, Kernel-based 3D tracking, Proc. IEEE Int. Work. Visual Surveillance (VS/CVPR), Minneapolis, Minnesota, 2007.
  26. A. Tyagi, G. Potamianos, J.W. Davis, and S.M. Chu, Fusion of multiple camera views for kernel-based 3D tracking, Proc. IEEE Works. Motion and Video Computing (WMVC), Austin, Texas, 2007.
  27. G. Potamianos and Z. Zhang, A joint system for single-person 2D-face and 3D-head tracking in CHIL seminars, Proc. CLEAR Evaluation Works., LNCS vol. 4122, Southampton, United Kingdom, 2006.
  28. Z. Zhang, G. Potamianos, S.M. Chu, J. Tu, and T.S. Huang, Person tracking in smart rooms using dynamic programming and adaptive subspace learning, Proc. Int. Conf. Multimedia Expo. (ICME), pp. 2061-2064, Toronto, Canada, 2006.
  29. A.W. Senior, G. Potamianos, S. Chu, Z. Zhang, and A. Hampapur, A comparison of multicamera person-tracking algorithms, Proc. IEEE Int. Works. Visual Surveillance (VS/ECCV), Graz, Austria, 2006.
  30. Z. Zhang, G. Potamianos, M. Liu, and T. Huang, Robust multi-view multi-camera face detection inside smart rooms using spatio-temporal dynamic programming, Proc. Int. Conf. Automatic Face and Gesture Recog. (FGR), Southampton, United Kingdom, 2006.
  31. Z. Zhang, G. Potamianos, A. Senior, S. Chu, and T. Huang, A joint system for person tracking and face detection, Proc. Int. Works. Human-Computer Interaction (ICCV 2005 Works. on HCI), pp. 47-59, Beijing, China, 2005.
  32. Corpora:

  33. D. Mostefa, N. Moreau, K. Choukri, G. Potamianos, S.M. Chu, A. Tyagi, J.R. Casas, J. Turmo, L. Christoforetti, F. Tobia, A. Pnevmatikakis, V. Mylonakis, F. Talantzis, S. Burger, R. Stiefelhagen, K. Bernardin, and C. Rochet, The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms, Journal of Language Resources and Evaluation, vol. 41, pp. 389-407, 2008.
  34. B. AUDIO-VISUAL SPEECH PROCESSING

    Overview Papers on Audio-Visual Speech Processing and Recognition:

  35. G. Potamianos, C. Neti, J. Luettin, and I. Matthews, Audio-Visual Automatic Speech Recognition: An Overview, To Appear: Audio-Visual Speech Processing, E. Vatikiotis-Bateson, G. Bailly, and P. Perrier (Eds.), MIT Press, ISBN: 0-26-222078-4, 2009.
  36. G. Potamianos, Audio-Visual Speech Recognition, Short Article, Encyclopedia of Language and Linguistics, Second Edition, (Speech Technology Section - Computer Understanding of Speech), K. Brown (Ed. In Chief), Elsevier, Oxford, United Kingdom, ISBN: 0-08-044299-4, 2006.
  37. P.S. Aleksic, G. Potamianos, and A.K. Katsaggelos, Exploiting Visual Information in Automatic Speech Processing, In: Handbook of Image and Video Processing, Second Edition, Al. Bovic (Ed.), ch. 10.8, pp. 1263-1289, Elsevier Academic Press, Burlington, MA, ISBN: 0-12-119792-1, 2005.
  38. G. Potamianos, C. Neti, G. Gravier, A. Garg, and A.W. Senior, Recent advances in the automatic recognition of audio-visual speech, Invited, Proceedings of the IEEE, vol. 91, no. 9, pp. 1306-1326, 2003.
  39. G. Potamianos, C. Neti, and S. Deligne, Joint audio-visual speech processing for recognition and enhancement, Proc. Works. Audio-Visual Speech Process., pp. 95-104, St. Jorioz, France, 2003.
  40. C. Neti, G. Iyengar, G. Potamianos, A. Senior, and B. Maison, Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction, Proc. Int. Conf. Spoken Language Process. (ICSLP), vol. III, pp. 11-14, Beijing, China, 2000.
  41. Audio-Visual Speech Recognition Systems, Experiments, Data:

  42. J. Huang, G. Potamianos, J. Connell, and C. Neti. Audio-visual speech recognition using an infrared headset, Speech Communication, vol. 44, no. 4, pp. 83-96, 2004.
  43. G. Potamianos, C. Neti, J. Huang, J.H. Connell, S. Chu, V. Libal, E. Marcheret, N. Haas, and J. Jiang, Towards practical deployment of audio-visual speech recognition, Invited, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 3, pp. 777-780, Montreal, Canada, 2004.
  44. J. Huang, G. Potamianos, and C. Neti, Improving audio-visual speech recognition with an infrared headset, Proc. Works. Audio-Visual Speech Process. (AVSP), pp. 175-178, St. Jorioz, France, 2003.
  45. G. Potamianos and C. Neti, Audio-visual speech recognition in challenging environments, Proc. Eur. Conf. Speech Comm. Tech. (Eurospeech), pp. 1293-1296, Geneva, Switzerland, 2003.
  46. J.H. Connell, N. Haas, E. Marcheret, C. Neti, G. Potamianos, and S. Velipasalar, A real-time prototype for small-vocabulary audio-visual ASR, Proc. Int. Conf. Multimedia Expo (ICME), vol. II, pp. 469-472, Baltimore, MD, 2003.
  47. C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, and D. Vergyri, Large-vocabulary audio-visual speech recognition: A summary of the Johns Hopkins Summer 2000 Workshop, Proc. Works. Multimedia Signal Process. (MMSP), pp. 619-624, Cannes, France, 2001.
  48. G. Potamianos, C. Neti, G. Iyengar, and E. Helmuth, Large-vocabulary audio-visual speech recognition by machines and humans, Proc. Europ. Conf. Speech Comm. Technol. (Eurospeech), pp. 1027-1030, Aalborg, Denmark, 2001.
  49. G. Potamianos and C. Neti, Automatic speechreading of impaired speech, Proc. Works. Audio-Visual Speech Process. (AVSP), pp. 177-182, Aalborg, Denmark, 2001.
  50. C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, J. Sison, A. Mashari, and J. Zhou, Audio-Visual Speech Recognition, Final Workshop 2000 Report, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, 2000.
  51. G. Potamianos and A. Potamianos, Speaker adaptation for audio-visual speech recognition, Proc. Europ. Speech Comm. Technol. (Eurospeech), vol. 3, pp. 1291-1294, Budapest, Hungary, 1999.
  52. G. Potamianos, E. Cosatto, H.P. Graf, and D.B. Roe, Speaker independent audio-visual database for bimodal ASR, Proc. Europ. Tutorial Research Work. Audio-Visual Speech Process. (AVSP), pp. 65-68, Rhodes, Greece, 1997.
  53. Audio-Visual Fusion:

  54. S.M. Chu, V. Goel, E. Marcheret, and G. Potamianos, Method for Likelihood Computation in Multi-Stream HMM Based Speech Recognition, Patent No.: US007480617B2, Jan. 20, 2009.
  55. J.H. Connell, N. Haas, E. Marcheret, C.V. Neti, and G. Potamianos, Audio-Only Backoff in Audio-Visual Speech Recognition System, Patent No.: US007251603B2, July 31, 2007.
  56. E. Marcheret, V. Libal, and G. Potamianos, Dynamic stream weight modeling for audio-visual speech recognition, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 4, pp. 945-948, Honolulu, HI, 2007.
  57. E. Marcheret, S.M. Chu, V. Goel, and G. Potamianos, Efficient likelihood computation in multi-stream HMM based audio-visual speech recognition, Proc. Int. Conf. Spoken Lang. Process. (ICSLP), Jeju Island, Korea, 2004.
  58. S.M. Chu, V. Libal, E. Marcheret, C. Neti, and G. Potamianos, Multistage information fusion for audio-visual speech recognition, Proc. Int. Conf. Multimedia Expo (ICME), Taipei, Taiwan, 2004.
  59. A. Garg, G. Potamianos, C. Neti, and T.S. Huang, Frame-dependent multi-stream reliability indicators for audio-visual speech recognition, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. I, pp. 24-27, Hong Kong, China, 2003.
  60. G. Gravier, G. Potamianos, and C. Neti, Asynchrony modeling for audio-visual speech recognition, Proc. Human Language Technology Conference (HLT), pp. 1-6, San Diego, CA, 2002.
  61. G. Gravier, S. Axelrod, G. Potamianos, and C. Neti, Maximum entropy and MCE based HMM stream weight estimation for audio-visual ASR, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), pp. 853-856, Orlando, FL, 2002.
  62. G. Potamianos, J. Luettin, and C. Neti, Hierarchical discriminant features for audio-visual LVCSR, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 1, pp. 165-168, Salt Lake City, UT, 2001.
  63. J. Luettin, G. Potamianos, and C. Neti, Asynchronous stream modeling for large-vocabulary audio-visual speech recognition, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 1, pp. 169-172, Salt Lake City, UT, 2001.
  64. H. Glotin, D. Vergyri, C. Neti, G. Potamianos, and J. Luettin, Weighting schemes for audio-visual fusion in speech recognition, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 1, pp. 173-176, Salt Lake City, UT, 2001.
  65. G. Potamianos and C. Neti, Stream confidence estimation for audio-visual speech recognition, Proc. Int. Conf. Spoken Language Process. (ICSLP), vol. III, pp. 746-749, Beijing, China, 2000.
  66. G. Potamianos and H.P. Graf, Discriminative training of HMM stream exponents for audio-visual speech recognition, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 6, pp. 3733-3736, Seattle, WA, 1998.
  67. Non-Frontal AVASR:

  68. P. Lucey, G. Potamianos, and S. Sridharan, Visual Speech Recognition Across Multiple Views, To Appear In: Visual Speech Recognition: Lip Segmentation and Mapping, A. Wee-Chung Liew and S. Wang (Eds.), Information Science Publishing Press, 2009.
  69. P. Lucey, G. Potamianos, and S. Sridharan, Patch-based analysis of visual speech from multiple views, Proc. Int. Conf. Auditory-Visual Speech Process. (AVSP), pp. 69-73, Tangalooma, Australia, 2008.
  70. P. Lucey, G. Potamianos, and S. Sridharan, A unified approach to multi-pose audio-visual ASR, Proc. Conf. Int. Speech Comm. Assoc. (Interspeech), pp. 650-653, Antwerp, Belgium, 2007.
  71. P. Lucey, G. Potamianos, and S. Sridharan, An extended pose-invariant lipreading system, Proc. Work. Audio-Visual Speech Process. (AVSP), pp. 176-180, Hilvarenbeek, The Netherlands, 2007.
  72. G. Potamianos and P. Lucey, Audio-visual ASR from multiple views inside smart rooms, Proc. Int. Conf. Multisensor Fusion and Integration for Intelligent Systems (MFI), pp. 35-40, Heidelberg, Germany, 2006.
  73. P. Lucey and G. Potamianos, Lipreading using profile versus frontal views, Proc. Works. Multimedia Signal Process. (MMSP), pp. 24-28, Victoria, Canada, 2006.
  74. Other Audio-Visual Speech Technologies:

  75. K. Kumar, J. Navratil, E. Marcheret, V. Libal, and G. Potamianos, Robust audio-visual speech synchrony detection by generalized bimodal linear prediction, (To Appear:) Proc. Conf. Int. Speech Comm. Assoc. (Interspeech), Brighton, United Kingdom, 2009.
  76. K. Kumar, J. Navratil, E. Marcheret, V. Libal, G. Ramaswamy, and G. Potamianos, Audio-visual speech synchronization detection using a bimodal linear prediction model, Proc. IEEE Comp. Soc. Works. Biometrics, Held in Association with CVPR, Miami Beach, Florida, 2009.
  77. S. Deligne, C.V. Neti, and G. Potamianos, Audio-Visual Codebook Dependent Cepstral Normalization, Patent No.: US007319955B2, Jan. 15, 2008.
  78. U.V. Chaudhari, C. Neti, G. Potamianos, and G.N. Ramaswamy, Automated Decision Making Using Time-Varying Stream Reliability Prediction, Patent No.: US007228279B2, June 5, 2007.
  79. V. Libal, J. Connell, G. Potamianos, and E. Marcheret, An embedded system for in-vehicle visual speech activity detection, Int. Work. Multimedia Signal Process. (MMSP), pp. 255-258, Chania, Greece, 2007.
  80. P. de Cuetos, G.R. Iyengar, C.V. Neti, and G. Potamianos, System and Method for Microphone Activation Using Visual Speech Cues, Patent No.: US006754373B1, June 22, 2004.
  81. E. Cosatto, H.P. Graf, G. Potamianos, and J. Schroeter, Audio-Visual Selection Process for the Synthesis of Photo-Realistic Talking-Head Animations, Patent No.: US006654018B1, Nov. 25, 2003.
  82. U.V. Chaudhari, G.N. Ramaswamy, G. Potamianos, and C. Neti, Information fusion and decision cascading for audio-visual speaker recognition based on time varying stream reliability prediction, Proc. Int. Conf. Multimedia Expo (ICME), vol. III, pp. 9-12, Baltimore, MD, July 2003.
  83. U.V. Chaudhari, G.N. Ramaswamy, G. Potamianos, and C. Neti, Audio-visual speaker recognition using time-varying stream reliability prediction, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. V, pp. 712-715, Hong Kong, China, 2003.
  84. S. Deligne, G. Potamianos, and C. Neti, Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization), Int. Conf. Spoken Lang. Process., pp. 1449-1452, Denver, CO, 2002.
  85. R. Goecke, G. Potamianos, and C. Neti, Noisy audio feature enhancement using audio-visual speech data, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), pp. 2025-2028, Orlando, FL, 2002
  86. E. Cosatto, G. Potamianos, and H.P. Graf, Audio-visual unit selection for the synthesis of photo-realistic talking-heads, Proc. Int. Conf. Multimedia Expo (ICME), vol. II, pp. 619-622, New York, 2000.
  87. Visual Feature Extraction:

  88. G. Potamianos and P. Scanlon, Exploiting lower face symmetry in appearance-based automatic speechreading, Proc. Works. Audio-Visual Speech Process. (AVSP), pp. 79-84, Vancouver Island, Canada, 2005.
  89. P. Scanlon, G. Potamianos, V. Libal, and S.M. Chu, Mutual information based visual feature selection for lipreading, Proc. Int. Conf. Spoken Lang. Process. (ICSLP), pp. Jeju Island, Korea, 2004.
  90. G. Potamianos, C. Neti, G. Iyengar, A.W. Senior, and A. Verma, A cascade visual front end for speaker independent automatic speechreading, Int. J. Speech Technology, vol. 4, pp. 193-208, 2001.
  91. G. Potamianos and C. Neti, Improved ROI and within frame discriminant features for lipreading, Proc. Int. Conf. Image Process. (ICIP), vol. III, pp. 250-253, Thessaloniki, Greece, 2001.
  92. G. Iyengar, G. Potamianos, C. Neti, T. Faruquie, and A. Verma, Robust detection of visual ROI for automatic speechreading, Proc. Works. Multimedia Signal Process. (MMSP), pp. 79-84, Cannes, France, 2001.
  93. I. Matthews, G. Potamianos, C. Neti, and J. Luettin, A comparison of model and transform-based visual features for audio-visual LVCSR, Proc. Int. Conf. Multimedia Expo (ICME), Tokyo, Japan, 2001.
  94. G. Potamianos, A. Verma, C. Neti, G. Iyengar, and S. Basu, A cascade image transform for speaker independent automatic speechreading, Proc. Int. Conf. Multimedia Expo (ICME), vol. II, pp. 1097-1100, New York, NY, 2000.
  95. G. Potamianos and H.P. Graf, Linear discriminant analysis for speechreading, Proc. Works. Multimedia Signal Process., pp. 221-226, Los Angeles, CA, 1998.
  96. G. Potamianos, H.P. Graf, and E. Cosatto, An image transform approach for HMM based automatic lipreading, Proc. Int. Conf. Image Process. (ICIP), vol. III, pp. 173-177, Chicago, IL, 1998.
  97. Face Detection and Tracking:

  98. J. Jiang, G. Potamianos, and G. Iyengar, Improved face finding in visually challenging environments, Proc. Int. Conf. Multimedia Expo (ICME), Amsterdam, The Netherlands, 2005.
  99. J. Jiang, G. Potamianos, H. Nock, G. Iyengar, and C. Neti, Improved face and feature finding for audio-visual speech recognition in visually challenging environments, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 5, pp. 873-876, Montreal, Canada, 2004.
  100. E. Cosatto, H.P. Graf, and G. Potamianos, Robust Multi-Modal Method for Recognizing Objects, Patent No.: US006118887A, Sep. 12, 2000.
  101. H.P. Graf, E. Cosatto, and G. Potamianos, Machine vision of faces and facial features, Proc. R.I.E.C. Int. Symp. Design Archit. Inform. Process. Systems Based Brain Inform. Princ., pp. 48-53, Sendai, Japan, 1998.
  102. H.P. Graf, E. Cosatto, and G. Potamianos, Robust recognition of faces and facial features with a multi-modal system, Proc. Int. Conf. Systems Man Cybern. (SMC), pp. 2034-2039, Orlando, FL, 1997.
  103. C. EARLIER WORK

    Statistical Language Modeling:

  104. G. Potamianos and F. Jelinek, A study of n-gram and decision tree letter language modeling methods, Speech Communication, vol. 24, no. 3, pp. 171-192, 1998.
  105. Statistical Image Analysis - Markov Random Fields:

  106. G. Potamianos and J. Goutsias, Stochastic approximation algorithms for partition function estimation of Gibbs random fields, IEEE Transactions on Information Theory, vol. 43, no. 6, pp. 1948-1965, 1997.
  107. G. Potamianos, Efficient Monte Carlo estimation of partition function ratios of Markov random fields, Proc. Conf. Inform. Sci. Systems (CISS), vol. II, pp. 1212-1215, Princeton, NJ, 1996.
  108. G. Potamianos and J. Goutsias, A unified approach to Monte Carlo likelihood estimation of Gibbs random field images, Proc. Conf. Inform. Sci. Systems (CISS), vol. I, pp. 84-90, Princeton, NJ, 1994.
  109. G. Potamianos, Stochastic Simulation Algorithms for Partition Function Estimation of Markov Random Field Images, Ph.D. Thesis, Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, 1994.
  110. G. Potamianos and J. Goutsias, Partition function estimation of Gibbs random field images using Monte Carlo simulations, IEEE Transactions on Information Theory, vol. 39, no. 4, pp. 1322-1332, 1993.
  111. G. Potamianos and J. Goutsias, An analysis of Monte Carlo methods for likelihood estimation of Gibbsian images, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. V, pp. 519-522, Minneapolis, MN, 1993.
  112. G. Potamianos and J. Goutsias, On computing the likelihood function of partially observed Markov random field images using Monte Carlo simulations, Proc. Conf. Inform. Sci. Systems (CISS), vol. I, pp. 357-362, Princeton, NJ, 1992.
  113. G. Potamianos and J. Goutsias, A novel method for computing the partition function of Markov random field images using Monte Carlo simulations, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 4, pp. 2325-2328, Toronto, Canada, 1991.
  114. Signal Processing - Filter Design:

  115. G. Potamianos and J. Diamessis, Frequency sampling design of 2-D IIR filters using continued fractions, Proc. Int. Symp. Circuits Systems (ISCAS), vol. 3, pp. 2454-2457, New Orleans, LA, 1990.
  116. J. Diamessis and G. Potamianos, A novel method for designing IIR filters with nonuniform samples, Proc. Conf. Inform. Sci. Systems (CISS), vol. 1, pp. 192-195, Princeton, NJ, 1990.
  117. J. Diamessis and G. Potamianos, Modeling unequally spaced 2-D discrete signals by rational functions, Proc. Int. Symp. Circuits Systems (ISCAS), vol. 2, pp. 1508-1511, Portland, OR, 1989.
  118. G. Potamianos, Design of 2-D Digital Filters Using Continued Fractions, Diploma Thesis, Department of Electrical and Computer Engineering, National Technical University of Athens, Athens, Greece, 1988.

Last Updated on June 19th, 2009


Back to Makis' Home Page