1 / 21

Who Are We?

Real-time Recognition of Whale Calls using SoundID Neil J Boucher, SoundID . Australia Michihiro Jinnai, Nagoya Women’s University. Japan. Who Are We?. SoundID www.soundid.net Founded 2002 Engineers specialising in sound recognition

neena
Télécharger la présentation

Who Are We?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Real-time Recognition of Whale Calls using SoundID Neil J Boucher, SoundID. AustraliaMichihiro Jinnai, Nagoya Women’s University. Japan

  2. Who Are We? SoundID www.soundid.net Founded 2002 Engineers specialising in sound recognition Aim to produce human expert and better quality recognition Designed to run in real-time Designed to handle terabytes of data with ease Have our own patented methods

  3. SoundID Mk I in glorious 1-d!

  4. 1-d Recognition Results monitoring a Dawn Chorus over 70 minutes

  5. Birds are Easy, Whales are Hard The 1-d method works well for most bird calls as just shown. Frogs and bats also easy. Whales are harder because the 1/f noise and other background noises “smear” the 1-d image. Worse still, most of the whale noises are in-band.

  6. 2-d Depiction of Same Call

  7. Two Methods Compared

  8. The “Bottom of the Coke Bottle” or “FFT-like” View of the Same Call at Frame-Width 513 Points

  9. Geometric Distance Euclidean Distance is the simple straight line (first order) distance between two points. Geometric Distance (measured in angular degrees) is the angle between two vectors that we use to describe the difference in shape. It is a fourth order measure, vaguely related to the skew of a standard distribution.

  10. The Measure of Geometric Distance As a measure of similarity we can use the GD between two n-dimensional shapes to measure their similarity. Two identical shapes have a GD = zero Two totally dissimilar shapes have a GD of 90 degrees. The spectra of two sounds that sound “similar” to the human ear can have a GD in the range 0 to about 6 degrees.

  11. Value of Geometric Distance Once we have a library of reference sounds we can compare that library with the detected target sound. Each sound in the library is compared in turn with the target and the one with the lowest GD is declared the closest match. Then, if the lowest GD is small enough to be declared a match (typically GD<6 degrees) the target has been identified!

  12. More is Better Notice now that the bigger the reference library the better the chance of having a matching reference and the greater the accuracy. Notice also that the reference library is not restricted to sounds that are in some way related and any group of sounds can comprise part of the library.

  13. Comparison of Two Adjacent Calls. Note no Filter used or Needed!

  14. Same Calls Zoomed In for a Closer Look. Note the smaller GD Freq x 3.3

  15. A Cluster Analysis Based on GD = 6 degrees. Note 6 clusters!

  16. Two Sweeps that are not Clustered (672 and 438 seconds) 672 x 3.3f • 438 x 3.3f

  17. Analysis of 524 Gunshot Sounds from File 20080126 Running a cluster of 526 Gunshots each with each other results in 275,625 comparisons being made. Sorting these into GD = 6 Clusters, we find they fall into 64 groups. A few are unique with only one member while large clusters of 120 or more are also found.

  18. The Cluster Analysis is Automated

  19. An Example of Two Very Different Gunshots Identified with the Cluster Analysis 161500-58796 23450-863375

  20. Progress to Date The 2-d Spectrum recognition module is complete and tested. It works well and can match a human expert for accuracy. The speed of recognition has been found to be fast enough for real-time recognition. The detection module is under development. Release date is late 2013.

  21. Find Us At www.soundid.net

More Related