1 / 6

Course Projects 40967 Speech Processing 1392-2

Course Projects 40967 Speech Processing 1392-2. Project Reports. Oral Presentation: 15 Minutes Date: Written report: 10 to 20 pages References are to be handed-in. Project Titles. Voice Conversion, Parrot Voice Generation Kaldi ASR toolkit, preparing windows version

leland
Télécharger la présentation

Course Projects 40967 Speech Processing 1392-2

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Course Projects40967Speech Processing1392-2

  2. Project Reports • Oral Presentation: 15 Minutes • Date: • Written report: 10 to 20 pages • References are to be handed-in

  3. Project Titles Voice Conversion, Parrot Voice Generation Kaldi ASR toolkit, preparing windows version Persian CSR using Kaldi Spoken Term Detection using Kaldi

  4. Project Titles Automatic language identification SNR estimation of a given speech file Limited Domain TTS Automatic Punctuation insertion in Persian text

  5. Project Titles • Confidence measure for words in CSR • Unsupervised speaker adaptation for CSR systems • 4-gram language model extraction for Persian CSR • Persian Spontaneous speech recognition

  6. Project Titles • Homograph pronunciation detection in Persian • OOV in CSR systems • Methods for recognizing large number of names over the phone (telephone directory) • Homograph detection using graphical models

More Related