Georgiou, J., Pouliquen, P., Cassidy, A., Garreau, G., Andreou, C., Stuarts, G., d'Urbal, C., Andreou, A. G., Denham, S. L., Wennekers, T., Mill, R., Winkler, I., Bohm, T. M., Szalardy, O., Klump, G. M., Jones, S. J., & Bendixen, A. (2011). A multimodal-corpus data collection system for cognitive acoustic scene analysis. In 45th Annual Conference on Information Sciences and Systems (CISS) (pp. 37-42). Baltimore, USA: IEEE.

A multimodal-corpus data collection system for cognitive acoustic scene analysis

We report on the design and the collection of a multi-modal data corpus for cognitive acoustic scene analysis. Sounds are generated by stationary and moving sources (people), that is by omni-directional speakers mounted on people's heads. One or two subjects walk along predetermined systematic and random paths, in synchrony and out of sync. Sound is captured in multiple microphone systems, including a four MEMS microphone directional array, two electret microphones situated in the ears of a stuffed gerbil head, and a Head Acoustics, head-shoulder unit with ICP microphones. Three micro-Doppler units operating at different frequencies were employed to capture gait and the articulatory signatures as well as location of the people in the scene. Three ground vibration sensors were recording the footsteps of the walking people. A 3D MESA camera as well as a web-cam provided 2D and 3D visual data for system calibration and ground truth. Data were collected in three environments ranging from a well controlled environment (anechoic chamber), an indoor environment (large classroom) and the natural environment of an outside courtyard. A software tool has been developed for the browsing and visualization of the data.


Cognitive and Biological Psychology

University of Leipzig
Faculty of Life Sciences
Institute of Psychology
Neumarkt 9-19
D-04109 Leipzig


Dagmar Schrödl
Phone: +49 341 97-39570
Email: dagmar dot schroedl at uni-leipzig dot de

Fax: +49 341 97-39271