Speech recognition model training method, system, mobile terminal and storage medium
A speech recognition model and training method technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of long time and low training efficiency, and achieve the effect of improving efficiency, reducing model training time, and reducing labor costs.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0050] see figure 1 , is a flow chart of the speech recognition model training method provided in the first embodiment of the present invention, including steps:
[0051] Step S10, acquiring sample speech, sample text corresponding to the sample speech, and text corpus, and constructing a text dictionary based on the sample text and the text corpus;
[0052] Wherein, the sample speech is a language to be recognized by the speech recognition model, such as Cantonese or Hokkien, and the sample text is expressed in Mandarin, and there is a one-to-one correspondence between the sample speech and the sample text;
[0053] Specifically, through the acquisition of the sample speech and sample text, a corresponding data set is constructed, and 20% of the data in the data set are randomly selected as the test set;
[0054] In this step, before the step of constructing a text dictionary according to the sample text and the text corpus, the method includes:
[0055] Deleting the special ...
Embodiment 2
[0070] see figure 2 , is a flow chart of the speech recognition model training method provided by the second embodiment of the present invention, including steps:
[0071] Step S11, acquiring sample speech, sample text and text corpus corresponding to the sample speech;
[0072] Step S21, traversing the local pre-stored training text, adding all non-repetitive characters to the text dictionary to build a character set;
[0073] Among them, each character is represented by a corresponding unique ID;
[0074] Step S31, the characters in the sample text and the text corpus are replaced with corresponding IDs according to the character set, and the characters in the text corpus that are not in the character set are represented by a first identifier;
[0075] Among them, the first identification can adopt expressed in a manner;
[0076] Step S41, adding the first identification to the character set, and using the current maximum ID of the character set plus 1 to represent; ...
Embodiment 3
[0113] see Figure 5 , is a schematic structural diagram of the speech recognition model training system 100 provided by the third embodiment of the present invention, including: a dictionary construction module 10, a vector calculation module 11, a model training module 12 and a model integration module 13, wherein:
[0114] Dictionary construction module 10 is used to obtain sample speech, sample text and text corpus corresponding to the sample speech, and construct a text dictionary according to the sample text and the text corpus.
[0115] Wherein, the dictionary construction module 10 is also used for:
[0116] Traversing the local pre-stored training text, adding all non-repetitive characters to the text dictionary to build a character set, and each character is represented by a corresponding unique ID;
[0117] replacing the characters in the sample text and the text corpus with corresponding IDs according to the character set;
[0118] Representing charac...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


