Michael Picheny is the Senior Manager of the Watson Multimodal Team located at the IBM TJ Watson Research Center in Yorktown Heights. Michael has worked in the Speech Recognition area since 1981, joining IBM after finishing his doctorate at MIT. He has been heavily involved in the development of almost all of IBM's recognition systems, ranging from the world's first real-time large vocabulary discrete system through IBM's product lines for telephony and embedded systems. He has published numerous papers in both journals and conferences on almost all aspects of speech recognition. He has received several awards from IBM for his work, including a corporate award, three outstanding Technical Achievement Awards and two Research Division Awards. He is the co-holder of over 30 patents and was named a Master Inventor by IBM in 1995 and again in 2000. Michael served as an Associate Editor of the IEEE Transactions on Acoustics, Speech, and Signal Processing from 1986 to 1989 and was the chairman of the Speech Technical Committee of the IEEE Signal Processing Society from 2002 to 2004. He is a Fellow of the IEEE and of ISCA (International Speech Communication Association) and served on the ISCA Board from 2005 to 2013. He was the co-general chair of the IEEE ASRU 2011 Workshop in Hawaii. He served as an Adjunct Professor in the Electrical Engineering Department of Columbia University in 2012 and co-taught a course in speech recognition.
Activities in Michael's group currently cover a multitude of interests in the area of speech and language processing. Work in large vocabulary speech recognition includes transcription and keyword search from broadcast news and conversations across multiple languages. The group also develops new speech algorithms and engines for mobile applications. Its Conversational Systems work covers speech recognition and speech synthesis. Please contact Michael for more information.
Watson has taken a small but significant step in making computer conversations more human-like by releasing its first Text-to-Speech Voice with expressive capabilities. Today's text-to-speech...
We continue to enhance our IBM Watson Developer Cloud speech services to give developers additional language support, keyword search capabilities, and control over pronunciations.
IBM Watson is proud to announce a major advance in the transcription of conversational speech.
We are pleased to take our first steps in bringing the ability to recognize speech ("Speech to Text") and produce speech ("Text to Speech") to...