Chinese phonetic alphabet splitting method and device

A Chinese pinyin splitting technology, applied in the field of information retrieval, that addresses problems such as error-prone pinyin input and the inability to split pinyin that contains input errors.

Pending Publication Date: 2019-05-10
MIGU CO LTD +1

AI Technical Summary

Problems solved by technology

Pinyin input is more prone to input errors than Chinese character input, and polyphonic characters add further ambiguity. However, current technology splits pinyin correctly only when the user's input is completely correct; if there is an input error, the input cannot be split. In practice, pinyin input errors are common. In addition, existing technology splits the pinyin data directly, resulting in low splitting efficiency.
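The limitation described above can be illustrated with a minimal sketch of exact-match splitting. This is not code from the patent: the lexicon `ENTITIES` and the function `split_exact` are hypothetical names, and dictionary-based word-break is only one plausible form that "direct splitting" might take.

```python
from typing import List, Optional, Set

# Hypothetical entity lexicon (assumption; the source does not list one).
ENTITIES: Set[str] = {"liudehua", "wangqingshui"}

def split_exact(query: str, lexicon: Set[str] = ENTITIES) -> Optional[List[str]]:
    """Segment query into lexicon entries by dynamic programming.

    Returns None when no exact segmentation exists, which is what
    happens as soon as the input contains a typo.
    """
    n = len(query)
    # best[i] holds one segmentation of query[:i], or None if unreachable.
    best: List[Optional[List[str]]] = [None] * (n + 1)
    best[0] = []
    for i in range(1, n + 1):
        for j in range(i):
            prefix = best[j]
            if prefix is not None and query[j:i] in lexicon:
                best[i] = prefix + [query[j:i]]
                break
    return best[n]

correct = split_exact("liudehuawangqingshui")   # both entities recovered
with_typo = split_exact("liudehuawanqingshui")  # missing "g": no split found
```

Under these assumptions, a single missing letter is enough to make the exact approach return nothing, which is the failure mode the invention targets.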

Examples

Embodiment Construction

[0060] In order to understand the characteristics and technical contents of the embodiments of the present invention in more detail, the implementation of the embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. The attached drawings are only for reference and description, and are not intended to limit the embodiments of the present invention.

[0061] Figure 1 is a schematic flow chart of a method for splitting Chinese phonetic alphabet (pinyin) according to an embodiment of the present invention. As shown in Figure 1, the method comprises the following steps:

[0062] Step 101: Obtain the pinyin data to be split.

[0063] In a retrieval scenario, the pinyin data to be split may be a pinyin search term entered by the user. For example, in audio and video app scenarios, users usually enter a pinyin search term in the search box to search for audio and v...

Abstract

The invention discloses a Chinese phonetic alphabet splitting method and device. The method comprises: obtaining pinyin data to be split; normalizing the pinyin data to be split to obtain normalized pinyin data; performing similarity matching between the normalized pinyin data and standard pinyin data, and determining the target standard pinyin data with the highest similarity to the normalized pinyin data; and splitting the target standard pinyin data, and taking the resulting split of the target standard pinyin data as the splitting result of the pinyin data to be split.
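The four steps in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patent's implementation: the visible text does not say how normalization works or which similarity measure is used, so this sketch assumes lowercasing plus removal of non-letters, and uses `difflib.SequenceMatcher` from the Python standard library as a stand-in similarity score. The table `STANDARD` and the function names are hypothetical, and the sketch assumes each standard entry is stored together with its known split.

```python
import difflib
import re
from typing import Dict, List, Tuple

# Hypothetical standard pinyin data: each standard string maps to its split.
STANDARD: Dict[str, List[str]] = {
    "liudehuawangqingshui": ["liudehua", "wangqingshui"],
    "zhoujielunqingtian": ["zhoujielun", "qingtian"],
}

def normalize(raw: str) -> str:
    """Assumed normalization: lowercase, then keep only the letters a-z."""
    return re.sub(r"[^a-z]", "", raw.lower())

def best_match(normalized: str, standard: Dict[str, List[str]]) -> Tuple[str, float]:
    """Return the standard entry most similar to the input, with its score."""
    scored = [
        (difflib.SequenceMatcher(None, normalized, candidate).ratio(), candidate)
        for candidate in standard
    ]
    score, target = max(scored)
    return target, score

def split_pinyin(raw: str, standard: Dict[str, List[str]] = STANDARD) -> List[str]:
    """Normalize, match against standard data, and reuse the target's split."""
    target, _ = best_match(normalize(raw), standard)
    return list(standard[target])

# A query with mixed case, a space, and a missing "g" still yields a split.
result = split_pinyin("LiuDeHua WanQingShui")
```

Because the split is taken from the matched standard entry rather than from the raw input, a typo affects only the similarity score. A production version would likely also need a minimum-similarity threshold to reject queries that match nothing well; that threshold is not described in the visible text.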

Description

Technical field

[0001] The invention relates to information retrieval technology, in particular to a method and device for splitting Chinese phonetic alphabet.

Background

[0002] In application search, Chinese pinyin search is a common search method. For example, a user enters liudehuawangqingshui to search for the song Wangqingshui sung by Andy Lau (Liu Dehua). The search must then be able to split the input into the two entities liudehua and wangqingshui. Pinyin input is also more prone to input errors than Chinese character input, and polyphonic characters add further ambiguity. However, current technology splits pinyin correctly only when the user's input is completely correct; if there is an input error, the input cannot be split. In practice, pinyin input errors are ubiquitous. In addition, the prior art splits the pinyin data directly, resulting in low splitting efficiency.

Contents of the invention

[0003] In order to solve the above technical problems, the...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F3/023
Inventor: 王昌
Owner: MIGU CO LTD