130 likes | 237 Vues
Summary poster from North American Symposium on Bat Research discussing problems with current technology, machine learning methods, experiments, results, and future work in improving automated analysis of Microchiropteran echolocation calls. Collaboration opportunities also highlighted.
E N D
Automatic detection and classification of Microchiropteran echolocation calls: Why the current technology is wrong and what can be done about it A summary of the poster presented at the North American Symposium on Bat Research, Sacramento, CA, Oct. 19-22, 2005 Mark D. Skowronski and John G. Harris Computational Neuro-Engineering Lab Electrical and Computer Engineering University of Florida, Gainesville, FL, USA October 25, 2005
Overview • Review of bat call analysis • Machine learning methods • Latest results (JASA manuscript) • Future work • Conclusions
Bat call analysis • Most bats constantly emit acoustic chirps during flight for navigation/hunting: echolocation • Echolocation calls provide useful information: • Presence • Family/genus/species • Flight characteristics • Hunting strategies, social interactions • Automated analysis desired: • Volumous data: 8 hrs of recordings, 16 bit, 200 kHz produces 11.5 GBytes data • Repeatable, objective, accurate, robust, fast
Conventional automated methods • For detection: • Frame-based, ~1ms duration • Frame energy compared to threshold • No frequency/temporal information • For classification: • Global call features from frames: min/max frequency, duration, frequency at peak energy • Discriminant function analysis on global features • No amplitude information, limited power in DFA model
Machine learning methods • For detection: • Frame-based features: log spectral peak, frequency at spectral peak, first- and second-order temporal derivatives, spectral mean subtraction • Gaussian mixture model (GMM) for features of hand-labeled calls AND background noise • Log likelihood difference, between call and background GMMs, compared to threshold • For classification: • Same frame-based features as for detection • GMM or hidden Markov model (HMM) trained from hand-labeled calls, one model for each species • Classifier output: label from model with maximum log likelihood
Detection experiment • Database of bat calls • 5 species, 5 recording locations, 3 systems • Pipistrellus bodenheimeri, Dead Sea, Pettersson • Molossus molossus, West Indies, Pettersson • Lasiurus borealis, Ontario, Avisoft • Lasiurus cinereus semotus, Hawaii, Avisoft • Tadarida brasiliensis, Gainesville, Custom • 2941 hand-labeled calls • Detection experiment design • Discrete events: 20-ms bins • Discrete outcomes: Yes or No: does a bin contain any part of a bat call?
Detector examples Each gray column is a hand-labeled call from a pass of 25 calls from L. borealis. The black horizontal lines represent the thresholds for equal sensitivity/specificity.
Classification experiment • Database of bat calls, same as for detection experiment • Cross-validation design • 50% test/train sets from hand-labeled calls • 20 trials
Classification results Results over 20 trials of 50% random test/train. GMM and HMM results statistically insignificant (t-test, p>0.9).
Future work • Repeat with zero-crossing data. • More species, more locations. • Optimize experiment parameters: # Gaussians, # states, frame size/ rate, derivative size, … • Detection/classification of pass of calls. • Microphone array data for direction of arrival. • Speaker identification, habitat variations, regional variations. • Collaborate with more bat researchers to get data.
Conclusions • GMM detection error 8x lower than broadband energy detector. • GMM/HMM classification error 28x lower than DFA baseline. • Machine learning methods superior to conventional methods because: • More information used • More powerful models • Cross-discipline work ripe for methods developed for human speech.
Further information • http://www.cnel.ufl.edu/~markskow • markskow@cnel.ufl.edu Acknowledgements Bat data kindly provided by: Brock Fenton, U. of Western Ontario, Canada