Polyphone processing method and device and model training method and device

A processing method and model training technology, applied in the computer field, can solve problems such as excessive feature vocabulary, increased training and reasoning costs, and complex feature forms

Pending Publication Date: 2020-11-20
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the rule-based disambiguation system for Chinese polyphonic characters has the problem that a polyphonic character may match multiple rules or a rule may match multiple correct pronunciations of a certain polyphonic character, resulting in rule conflicts. The neural network Chinese of the statistical model In the polyphonic character disambiguation system, when the neural network is used as the feature extraction module, there will be a problem of complex feature forms, and the extracted feature dimension is too high, which increases the cost of training and reasoning, and reduces the prediction efficiency. As a feature extraction module, the analysis process will have the problem that the required feature vocabulary is too large and the encoding is not accurate enough

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Polyphone processing method and device and model training method and device
  • Polyphone processing method and device and model training method and device
  • Polyphone processing method and device and model training method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0104] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementation disclosed below.

[0105] Terms used in one or more embodiments of the present application are for the purpose of describing specific embodiments only, and are not intended to limit the one or more embodiments of the present application. As used in one or more embodiments of this application and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a polyphone processing method and device and a model training method and device. The polyphone processing method comprises the steps: receiving a Chinese statement containing polyphones, and determining first position features of the polyphones in the Chinese statement; performing word segmentation on the Chinese statement, and determining part-of-speech features of the polyphone based on words obtained by word segmentation and part-of-speech of the words; obtaining a word containing a polyphone, and determining a second position feature of the polyphone in the word containing the polyphone; inputting the first position feature, the part-of-speech feature and the second position feature into a polyphone disambiguation model to obtain pronunciation corresponding to the polyphone.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a method and device for processing polyphonic characters, a method and device for model training, a computing device, and a computer-readable storage medium. Background technique [0002] The earliest Chinese polyphone ambiguity disambiguation system was based on rules, which were summarized by linguists and encoded into computers in a computer-understandable manner; however, with the increase of data scale, data-based statistical models have gradually become the The preferred solution for disambiguating phonetic characters, that is, traditional machine learning models (such as decision tree models, maximum entropy models) and deep neural networks have begun to be applied to disambiguating Chinese polyphonic characters. [0003] However, the rule-based disambiguation system for Chinese polyphonic characters has the problem that a polyphonic character may match multipl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/289G06F40/30G06N3/04G06N3/08
CPCG06F40/289G06F40/30G06N3/049G06N3/08G06N3/045
Inventor 张文博李长亮
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products