Dmitry Zotkin

Associate Research Scientist
3363 A.V. Williams Building
(301) 405-1049
Education: 
Ph.D., University of Maryland (Computer Science)
Biography: 

Dmitry N. Zotkin is an associate research scientist in UMIACS and a member of the Perceptual Interfaces and Reality Laboratory, the Center for Automation Research, and the Computer Vision Laboratory.

Zotkin works in audio and acoustic signal processing. His main research interests are spatial audio capture and reproduction. He also works in related areas, such as microphone arrays, auditory scene analysis, and fast numerical methods for the acoustic wave equation.

He is the author or co-author of two book chapters, 12 journal papers, and more than 40 refereed conference publications. Zotkin was the main author of a 2006 paper describing a novel, fast personalization method for a personal 3-D audio system. The University of Maryland has obtained a patent on the relevant technology and has licensed it to companies aiming to bring personalized spatial audio to the consumer market.

Zotkin is a regular reviewer for several audio-related IEEE Transactions and for the Journal of the Acoustical Society of America. He has served on the program committee or as a reviewer for many of the major conferences in his research area. He is also a member of the Acoustical Society of America.

Zotkin received a combined B.S./M.S. degree in applied mathematics and physics from the Moscow Institute of Physics and Technology in Dolgoprudny, Moscow region, Russia, in 1996 and a doctorate in computer science from the University of Maryland in 2002.

Publications

2011


Srinivasan BV, Garcia-Romero D, Zotkin DN, Duraiswami R.  2011.  Kernel partial least squares for speaker recognition. Twelfth Annual Conference of the International Speech Communication Association.

Srinivasan BV, Zotkin DN, Duraiswami R.  2011.  A partial least squares framework for speaker recognition. Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. :5276-5279.

2010


Srinivasan BV, Duraiswami R, Zotkin DN.  2010.  Kernelized Rényi distance for speaker recognition. Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. :4506-4509.

O'Donovan AE, Duraiswami R, Zotkin DN.  2010.  Automatic matched filter recovery via the audio camera. Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. :2826-2829.

Zotkin DN, Duraiswami R.  2010.  Signal Processing for Audio HCI. Handbook of Signal Processing Systems. :243-265.

O'Donovan A, Duraiswami R, Zotkin DN, Gumerov NA.  2010.  Audio visual scene analysis using spherical arrays and cameras. The Journal of the Acoustical Society of America. 127(3):1979-1979.

2009


Zotkin DN, Duraiswami R, Gumerov NA.  2009.  Regularized HRTF fitting using spherical harmonics. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. :257-260.

Zotkin DN, Duraiswami R.  2009.  Plane-wave decomposition of a sound scene using a cylindrical microphone array. Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. :85-88.

O'Donovan A, Duraiswami R, Gumerov NA, Zotkin DN.  2009.  Imaging room acoustics with the audio camera. The Journal of the Acoustical Society of America. 125(4):2544-2544.

2008


Zotkin DN, Duraiswami R, Gumerov NA.  2008.  Sound field decomposition using spherical microphone arrays. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. :277-280.

O'Donovan A, Duraiswami R, Zotkin DN.  2008.  Imaging concert hall acoustics using visual and audio cameras. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. :5284-5287.

2007


Zotkin DN, Duraiswami R, Gumerov NA.  2007.  Efficient Conversion of X.Y Surround Sound Content to Binaural Head-Tracked Form for HRTF-Enabled Playback. IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. 1:I-21-I-24.

Zotkin DN, Raykar VC, Duraiswami R, Davis LS.  2007.  Multimodal Tracking for Smart Videoconferencing and Video Surveillance. Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on. :1-2.

Duraiswami R, Zotkin DN, Gumerov NA.  2007.  Fast Evaluation of the Room Transfer Function Using Multipole Expansion. Audio, Speech, and Language Processing, IEEE Transactions on. 15(2):565-576.

Gumerov NA, Duraiswami R, Zotkin DN.  2007.  Fast Multipole Accelerated Boundary Elements for Numerical Computation of the Head Related Transfer Function. IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. 1:I-165-I-168.

2006


Duraiswami R, Li Z, Zotkin DN, Grassi E.  2006.  Spherical and hemispherical microphone arrays for capture and analysis of sound fields. The Journal of the Acoustical Society of America. 120(5):3225-3225.

Yerukhimovich A, Duraiswami R, Gumerov NA, Zotkin DN.  2006.  Frequency Independent Flexible Spherical Beamforming via RBF Fitting. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 5:V-V.

Zotkin DN, Duraiswami R, Grassi E, Gumerov NA.  2006.  Fast head-related transfer function measurement via reciprocity. The Journal of the Acoustical Society of America. 120(4):2202-2215.

Duraiswami R, Zotkin DN, O'Donovan A.  2006.  Capture and rendering of spatial sound over headphones. The Journal of the Acoustical Society of America. 120(5):3094-3094.

2005


Yegnanarayana B, Prasanna SRM, Duraiswami R, Zotkin DN.  2005.  Processing of reverberant speech for time-delay estimation. IEEE Transactions on Speech and Audio Processing. 13(6):1110-1118.

Zotkin DN, Chi T, Shamma SA, Duraiswami R.  2005.  Neuromimetic sound representation for percept detection and manipulation. EURASIP Journal on Applied Signal Processing. 9:1350-1350.

Duraiswami R, Li Z, Zotkin DN, Grassi E, Gumerov NA.  2005.  Plane-wave decomposition analysis for spherical microphone arrays. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005. :150-153.

2004


Duraiswami R, Zotkin DN, Gumerov NA.  2004.  Interpolation and range extrapolation of head related transfer functions. IEEE International Conference on Acoustics, Speech and Signal Processing. 4

2003


Zotkin DN, Shamma SA, Ru P, Duraiswami R, Davis LS.  2003.  Pitch and timbre manipulations using cortical representation of sound. Multimedia and Expo, IEEE International Conference on. 3:381-384.

Mohan A, Duraiswami R, Zotkin DN, DeMenthon D, Davis LS.  2003.  Using computer vision to generate customized spatial audio. Multimedia and Expo, IEEE International Conference on. 3:57-60.

Zotkin DN, Hwang J, Duraiswami R, Davis LS.  2003.  HRTF personalization using anthropometric measurements. Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on. :157-160.

Zotkin DN, Shamma SA, Ru P, Duraiswami R, Davis LS.  2003.  Pitch and timbre manipulations using cortical representation of sound. IEEE International Conference on Acoustics, Speech and Signal Processing. 5

2002


Zotkin DN, Duraiswami R, Davis LS.  2002.  Joint audio-visual tracking using particle filters. EURASIP J. Appl. Signal Process. 2002(1):1154-1164.

Zotkin DN, Duraiswami R, Davis LS, Mohan A, Raykar V.  2002.  Virtual audio system customization using visual matching of ear parameters. 16th International Conference on Pattern Recognition, 2002. Proceedings. 3:1003-1006.

Zotkin DN, Duraiswami R, Davis LS.  2002.  Creation of virtual auditory spaces. 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2

Zotkin DN, Duraiswami R, Davis LS.  2002.  Customizable auditory displays. Proceedings of the International Conference on Auditory Display. :167-176.

2001


Duraiswami R, Gumerov NA, Zotkin DN, Davis LS.  2001.  Efficient evaluation of reverberant sound fields. Applications of Signal Processing to Audio and Acoustics, 2001 IEEE Workshop on the. :203-206.

Zotkin DN, Duraiswami R, Nanda H, Davis LS.  2001.  Multimodal tracking for smart videoconferencing. Second International Conference on Multimedia and Expo, Tokyo, Japan.

Ghose K, Zotkin DN, Duraiswami R, Moss CF.  2001.  Multimodal localization of a flying bat. Acoustics, Speech, and Signal Processing, IEEE International Conference on. 5:3057-3060.

Duraiswami R, Zotkin DN, Davis LS.  2001.  Active speech source localization by a dual coarse-to-fine search. 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 5:3309-3312.

Zotkin DN, Duraiswami R, Davis LS.  2001.  Multimodal 3-D tracking and event detection via the particle filter. IEEE Workshop on Detection and Recognition of Events in Video, 2001. Proceedings. :20-27.

Haritaoglu I, Cozzi A, Koons D, Flickner M, Zotkin DN, Yacoob Y.  2001.  Attentive toys. International Conference on Multimedia and Expo. 22:25-25.

2000


Zotkin DN, Duraiswami R, Davis LS, Haritaoglu I.  2000.  An audio-video front-end for multimedia applications. 2000 IEEE International Conference on Systems, Man, and Cybernetics. 2:786-791.

Zotkin DN, Duraiswami R, Philomin V, Davis LS.  2000.  Smart videoconferencing. 2000 IEEE International Conference on Multimedia and Expo, 2000. ICME 2000. 3:1597-1600.

Duraiswami R, Zotkin DN, Borovikov EA, Davis LS.  2000.  Active source location and beamforming. The Journal of the Acoustical Society of America. 107:2790-2790.

Zotkin DN, Keleher PJ, Perkovic D.  2000.  Attacking the bottlenecks of backfilling schedulers. Cluster Computing.

1999


Zotkin DN, Duraiswami R, Haritaoglu I, Davis LS, Otsuka T.  1999.  A real-time audio–video front-end for multimedia applications. The Journal of the Acoustical Society of America. 106:2271-2271.

1998


Soffer A, Samet H, Zotkin DN.  1998.  Pictorial query trees for query specification in image databases. Fourteenth International Conference on Pattern Recognition, 1998. Proceedings. 1:919-921.