Data processing method and device, identification method and device and computing equipment

A technology of data processing and text recognition, applied in digital data processing, natural language data processing, computing, etc., can solve the problems of low text recognition accuracy and low model accuracy, so as to improve text recognition accuracy and training accuracy The effect of improving the accuracy of dictionary fusion

Pending Publication Date: 2021-05-25
ALIBABA GRP HLDG LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present application provides a data processing method, device and computing equipment to solve the technical problems of low model accuracy and low text recognition accuracy in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and device, identification method and device and computing equipment
  • Data processing method and device, identification method and device and computing equipment
  • Data processing method and device, identification method and device and computing equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067]In order to enable those skilled in the art to better understand the solutions of the present application, the following will clearly and completely describe the technical solutions in the embodiments of the present application in conjunction with the drawings in the embodiments of the present application.

[0068] In some processes described in the specification and claims of the present application and the description in the above-mentioned drawings, multiple operations appearing in a specific order are included, but it should be clearly understood that these operations may not be performed in the order in which they appear herein Execution or parallel execution, the serial numbers of the operations, such as 101, 102, etc., are only used to distinguish different operations, and the serial numbers themselves do not represent any execution order. Additionally, these processes can include more or fewer operations, and these operations can be performed sequentially or in pa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a data processing method and device, an identification method and device and computing equipment. A plurality of elements obtained by segmenting a training text are respectively expressed as nodes; wherein the element is composed of a single character or multiple characters; the plurality of different types of dictionaries are represented as nodes respectively; edges between the nodes are represented by using an incidence relation between the nodes, and a first graph is constructed; a text recognition model is trained by utilizing the first graph and training labels respectively marked for the plurality of elements; feature words in the text to be processed can be recognized and obtained by utilizing the text recognition model. According to the technical scheme provided by the embodiment of the invention, the text expression accuracy is improved, the model training accuracy is improved, and the text recognition accuracy is improved.

Description

technical field [0001] The embodiments of the present application relate to the field of computer application technologies, and in particular, to a data processing method, device, and mobile terminal. Background technique [0002] Sequence tagging is a common problem in natural language processing. Sequence tagging can solve problems such as word segmentation, named entity recognition, and keyword extraction. [0003] The so-called sequence labeling refers to labeling each element in the sequence with a certain type of label in the label set, and performing model training, so that the model can realize the identification of element labels in the sequence to be processed. In natural language processing, a sequence can refer to the composition of multiple elements formed by word segmentation or segmentation of text, and the sequence labeling problem is essentially a text recognition problem. Taking named entity recognition as an example, it can realize the recognition of name...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/242G06F40/295G06F40/289
Inventor 丁瑞雪谢朋峻马春平黄非司罗
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products