SCORE FUSION AND CALIBRATION IN MULTIPLE LANGUAGE DETECTORS WITH LARGE PERFORMANCE VARIATION
Presented by: Raymond W. M. Ng, Author(s): Raymond W. M. Ng, The Chinese University of Hong Kong, Hong Kong SAR of China; Cheung-Chi Leung, Institute for Infocomm Research, Singapore; Tan Lee, The Chinese University of Hong Kong, Hong Kong SAR of China; Bin Ma, Haizhou Li, Institute for Infocomm Research, Singapore
In a large-scale language detection task, performance variation found between different component systems and different target languages has an adverse effect to the pooled error statistics. Special care has to be taken in score fusion and calibration. In this paper, we use a prosodic LID system to fuse with a phonotactic LID system using NIST Language Recognition Evaluation 2009 experimental data. Among four logistic regression models, the one which gives the lowest Cavg is chosen. We further explore our previously proposed calibration algorithm based on the minimum erroneous deviation criterion. The algorithm is made more robust by removing the predetermined list of target languages to be calibrated, as well as by adding an optimization constraint which enforces calibration in the data portion with a large performance variation. The fusion and calibration operations together bring a 33.9% relative Cavg reduction compared with the original result from a phonotactic LID system.