1 / 16

Goals : Down sample 20 khz TIDigits data to 16 khz .

Goals : Down sample 20 khz TIDigits data to 16 khz . 2. Use Down sample data run regression test and Compare results posted in Sphinx-4 website 3. Based on the results, make decisions (issue with microprocessor, floating point etc .) By Jaykrishna shukla , Mubin Ahmed and Cara Santin.

mason
Télécharger la présentation

Goals : Down sample 20 khz TIDigits data to 16 khz .

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Goals : • Down sample 20 khzTIDigits data to 16 khz. • 2. Use Down sample data run regression test and • Compare results posted in Sphinx-4 website • 3. Based on the results, make decisions • (issue with microprocessor, floating point etc.) • By Jaykrishnashukla, Mubin Ahmed and Cara Santin

  2. Learned : • Cygwin not effective to run Sox • effective to run linux command line interface to build application • Easy to install

  3. Training Acoustic model using Sphinx Train URL:

  4. Introduction to Training Q1.What is acoustic model ? A1. model used by a speech recognizer for decoding language spoken by a person and modeling numerically how the language sounds when spoken in a form that can be stored on a computer. Q2. what is training A2. process that wants to converge on a solution yielding the most likely sequence of vectors for a given acoustic unit. Q3. why is training required? A3. In order to generate a set of acoustic model for any audio data, one needs to follow a particular set of steps which is named as training, hence to generate acoustic model, training is required.

  5. Training acoustic model using SphinxTrain 1.0 overview • The Flow chart for the Training Procedure

  6. SphinxTrain 1.0 & auto generation • The new version of sphinx train has a build all option, that generates all the required files that were shown in the flow chart from previous slide. However, in order to do object specific function, one needs to modify the config file according to the purpose of the task.

  7. This week’s accomplishment • The two major goals that I achieved this week were: • Finished the complete training process for the an4 demo. • Worked on generating the feature model for the TI Digit short test data. • Sample output of a training process (it took more than 20 min to compile this code)

  8. Generating the feature vectors • There two main step in generating the feature vector: • 1. Generate the .Fileids file (it is just the path list of all the data file) • 2. Modify the Make_feats (perl script) to in order to read the correct data in and change the default settings that the SphinxTrain comes with.

  9. Conclusion and Future • The main problem in feature generation is that the Make_feats file has default settings for the an4 tutorial, hence to getting it working we have to change the configuration for both the make_feats file and the SphinxTrain connfig file (because the config file determines what goes in to the make_feats file. Follow the below Example )

  10. Training Acoustic model using Sphinx Train Jaykrishnashukla,MubinAmehed& caraSantin Department of Electrical and Computer Engineering Temple University URL:

  11. Introduction to Training Q1.What is acoustic model ? A1. model used by a speech recognizer for decoding language spoken by a person and modeling numerically how the language sounds when spoken in a form that can be stored on a computer. Q2. what is training A2. process that wants to converge on a solution yielding the most likely sequence of vectors for a given acoustic unit. Q3. why is training required? A3. In order to generate a set of acoustic model for any audio data, one needs to follow a particular set of steps which is named as training, hence to generate acoustic model, training is required.

  12. Training acoustic model using SphinxTrain 1.0 overview • The Flow chart for the Training Procedure

  13. SphinxTrain 1.0 & auto generation • The new version of sphinx train has a build all option, that generates all the required files that were shown in the flow chart from previous slide. However, in order to do object specific function, one needs to modify the config file according to the purpose of the task.

  14. This week’s accomplishment • The two major goals that I achieved this week were: • Finished the complete training process for the an4 demo. • Worked on generating the feature model for the TI Digit short test data. • Sample output of a training process (it took more than 20 min to compile this code)

  15. Generating the feature vectors • There two main step in generating the feature vector: • 1. Generate the .Fileids file (it is just the path list of all the data file) • 2. Modify the Make_feats (perl script) to in order to read the correct data in and change the default settings that the SphinxTrain comes with.

  16. Conclusion and Future • The main problem in feature generation is that the Make_feats file has default settings for the an4 tutorial, hence to getting it working we have to change the configuration for both the make_feats file and the SphinxTrainconnfig file (because the config file determines what goes in to the make_feats file. Follow the below Example )

More Related