Data processing method and device and computer readable storage medium

A data processing method and device, applied in the field of data processing, that address problems such as low accuracy, difficulty in obtaining labeled data, and high labeling costs.

Active Publication Date: 2020-04-21
TENCENT TECH (SHENZHEN) CO LTD
6 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0004] During research and practice on the related technologies, the inventors of this application found that labeling is very expensive and that a large amount of labeled data is difficult to obtain. In addition, identification relies on the shared text in which the hypernym and hyponym appear, so data processing efficiency is poor and the accuracy of hyponymy relationship judgments is low.

Examples

Embodiment 1

[0065] In this embodiment, the description is given from the perspective of a data processing device. Specifically, the data processing device may be integrated in an electronic device that has a storage unit and a microprocessor, such as a server or a terminal.

[0066] A data processing method, comprising: collecting positive word pair sample data and negative word pair sample data; training an autoencoder according to the positive word pair sample data and the negative word pair sample data to obtain a trained autoencoder; extracting, through the trained autoencoder, the feature information corresponding to the positive word pair sample data and the negative word pair sample data; inputting the feature information into a binary classifier for training to obtain a trained binary classifier; and combining the trained autoencoder and the trained binary classifier to identify the hyponymy relationship of the word pair data to be identified.
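The five steps in [0066] can be sketched end to end as below. This is a minimal illustration under stated assumptions, not the patented implementation: the network shape (one tanh hidden layer with a linear decoder), the logistic-regression classifier, the synthetic data generator `make_toy_pairs`, and all names and hyperparameters are hypothetical choices that the source does not specify.

```python
import math
import random

def matvec(W, x):
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

class TinyAutoencoder:
    """Assumed architecture: tanh encoder, linear decoder, squared reconstruction error."""
    def __init__(self, dim, hidden, rng):
        self.We = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(hidden)]
        self.Wd = [[rng.uniform(-0.5, 0.5) for _ in range(hidden)] for _ in range(dim)]

    def encode(self, x):
        return [math.tanh(a) for a in matvec(self.We, x)]

    def train(self, samples, epochs=50, lr=0.05):
        for _ in range(epochs):
            for x in samples:
                h = self.encode(x)
                err = [r - v for r, v in zip(matvec(self.Wd, h), x)]
                # Gradient w.r.t. hidden pre-activations, computed before Wd changes.
                gh = [sum(self.Wd[i][j] * err[i] for i in range(len(x))) * (1 - h[j] ** 2)
                      for j in range(len(h))]
                for i in range(len(x)):
                    for j in range(len(h)):
                        self.Wd[i][j] -= lr * err[i] * h[j]
                for j in range(len(h)):
                    for k in range(len(x)):
                        self.We[j][k] -= lr * gh[j] * x[k]

class LogisticClassifier:
    """Assumed binary classifier: logistic regression trained by SGD."""
    def __init__(self, dim):
        self.w = [0.0] * dim
        self.b = 0.0

    def prob(self, f):
        return sigmoid(sum(w * v for w, v in zip(self.w, f)) + self.b)

    def train(self, feats, labels, epochs=200, lr=0.1):
        for _ in range(epochs):
            for f, y in zip(feats, labels):
                g = self.prob(f) - y
                self.w = [w - lr * g * v for w, v in zip(self.w, f)]
                self.b -= lr * g

def make_toy_pairs(rng, n=20, dim=3):
    """Synthetic stand-in for collected samples (step 1); not real word vectors."""
    positives, negatives = [], []
    for _ in range(n):
        hypo = [rng.uniform(-1, 1) for _ in range(dim)]
        positives.append(hypo + [0.5 * v for v in hypo])  # correlated "hypernym"
        negatives.append(hypo + [rng.uniform(-1, 1) for _ in range(dim)])  # unrelated
    return positives, negatives

def build_identifier(positives, negatives, hidden=4, seed=0):
    rng = random.Random(seed)
    samples = positives + negatives
    labels = [1] * len(positives) + [0] * len(negatives)
    ae = TinyAutoencoder(len(samples[0]), hidden, rng)
    ae.train(samples)                           # step 2: train autoencoder on both sample sets
    feats = [ae.encode(x) for x in samples]     # step 3: extract feature information
    clf = LogisticClassifier(hidden)
    clf.train(feats, labels)                    # step 4: train binary classifier
    return lambda pair: clf.prob(ae.encode(pair))  # step 5: combined identifier

rng = random.Random(0)
positives, negatives = make_toy_pairs(rng)
identify = build_identifier(positives, negatives)
score = identify(positives[0])  # estimated probability that the pair is hypernym/hyponym
```

The autoencoder is trained without labels; only the classifier sees them. The sketch makes no claim about accuracy on the toy data.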

[0067] see figu...

Embodiment 2

[0137] According to the method described in Embodiment 1, an example will be given below for further detailed description.

[0138] In this embodiment, the data processing method is described by taking the executing subject as a server as an example.

[0139] See Figure 3, which is another schematic flowchart of the data processing method provided in the embodiment of the present application.

[0140] The method flow may include:

[0141] In step 201, the server collects positive word pair sample data.

[0142] Specifically, the server collects a plurality of positive word pair sample data. Each positive word pair sample includes a correct hypernym vector and hyponym vector, for example, the hyponym vector corresponding to the hyponym "Western World" and the hypernym vector corresponding to the hypernym "tv series".
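One possible in-memory representation of such a positive word pair sample is sketched below. The class name `WordPairSample`, the helper `make_positive_sample`, the three-dimensional vectors in `TOY_EMBEDDINGS`, and the choice to concatenate the two vectors into a single model input are all assumptions made for illustration; the source only states that a sample contains a hyponym vector and a hypernym vector.

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass(frozen=True)
class WordPairSample:
    hyponym: str
    hypernym: str
    hyponym_vec: List[float]
    hypernym_vec: List[float]
    label: int  # 1 = positive (correct hypernym/hyponym pair), 0 = negative

    def as_input(self) -> List[float]:
        # Assumed input layout: hyponym vector followed by hypernym vector.
        return list(self.hyponym_vec) + list(self.hypernym_vec)

# Made-up vectors for illustration only; real vectors would come from a word embedding model.
TOY_EMBEDDINGS: Dict[str, List[float]] = {
    "Western World": [0.2, -0.1, 0.7],
    "tv series": [0.3, 0.0, 0.6],
}

def make_positive_sample(hyponym: str, hypernym: str,
                         embeddings: Dict[str, List[float]]) -> WordPairSample:
    return WordPairSample(hyponym, hypernym,
                          embeddings[hyponym], embeddings[hypernym], label=1)

sample = make_positive_sample("Western World", "tv series", TOY_EMBEDDINGS)
```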

[0143] In step 202, the server collects the preset initial negative word pair sample data, and inputs the initial negative word pair sample ...

Embodiment 3

[0185] In order to better implement the data processing method provided in the embodiment of the present application, the embodiment of the present application further provides a device based on the above data processing method. The meanings of the nouns are the same as those in the above data processing method, and for specific implementation details, please refer to the description in the method embodiments.

[0186] See Figure 5a, which is a schematic structural diagram of a data processing device provided in the embodiment of the present application. The data processing device may include a collection unit 301, a first training unit 302, an extraction unit 303, a second training unit 304, an identification unit 305, and the like.

[0187] The collection unit 301 is configured to collect sample data of positive word pairs and sample data of negative word pairs.

[0188] Wherein, the collection unit 301 simultaneously collects positive word pair sample ...
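The division into units 301 to 305 can be read as a simple composition of pluggable components. The sketch below is one hypothetical way to wire them together: the class `DataProcessingDevice`, its constructor parameters, and the 0.5 decision threshold are assumptions, and the stand-in callables at the bottom exist only to show the data flow, not to model real training.

```python
from typing import Callable, List, Tuple

Vector = List[float]

class DataProcessingDevice:
    def __init__(self,
                 collect: Callable[[], Tuple[List[Vector], List[Vector]]],
                 train_autoencoder: Callable[[List[Vector]], Callable[[Vector], Vector]],
                 train_classifier: Callable[[List[Vector], List[int]], Callable[[Vector], float]]):
        self.collect = collect
        self.train_autoencoder = train_autoencoder
        self.train_classifier = train_classifier
        self._encode = None
        self._score = None

    def build(self) -> None:
        positives, negatives = self.collect()                  # collection unit 301
        samples = positives + negatives
        labels = [1] * len(positives) + [0] * len(negatives)
        self._encode = self.train_autoencoder(samples)         # first training unit 302
        features = [self._encode(x) for x in samples]          # extraction unit 303
        self._score = self.train_classifier(features, labels)  # second training unit 304

    def identify(self, pair: Vector) -> bool:                  # identification unit 305
        # Assumed decision rule: score of at least 0.5 means "hyponymy relationship holds".
        return self._score(self._encode(pair)) >= 0.5

# Stand-in components (hypothetical): an identity "encoder" and a fixed rule as "classifier".
device = DataProcessingDevice(
    collect=lambda: ([[1.0, 1.0]], [[1.0, -1.0]]),
    train_autoencoder=lambda samples: (lambda x: x),
    train_classifier=lambda feats, labels: (lambda f: 1.0 if f[0] * f[1] > 0 else 0.0),
)
device.build()
```

Passing the training steps in as callables keeps each unit replaceable, which mirrors the way the embodiment describes the units separately.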

Abstract

The embodiment of the invention discloses a data processing method and device and a computer readable storage medium. According to the embodiment of the invention, the method comprises: collecting positive word pair sample data and negative word pair sample data; training an autoencoder according to the positive word pair sample data and the negative word pair sample data to obtain a trained autoencoder; extracting feature information corresponding to the positive word pair sample data and the negative word pair sample data through the trained autoencoder; inputting the feature information into a binary classifier for training to obtain a trained binary classifier; and combining the trained autoencoder and the trained binary classifier to identify the hyponymy relationship of the to-be-identified word pair data. In this way, the autoencoder is trained on the positive and negative word pair sample data together, and the feature information that the trained autoencoder extracts from those samples is used to train the binary classifier, thereby realizing accurate identification of the hyponymy relationship and greatly improving the data processing efficiency and the judgment accuracy of the hyponymy relationship.

Description

Technical field

[0001] The present application relates to the technical field of data processing, and in particular to a data processing method, device, and computer-readable storage medium.

Background

[0002] With the development of networks and the wide application of computers, data processing technology is becoming more and more important. For example, hypernym mining has always been an important research topic in the field of natural language processing. It is a basic capability of natural language understanding, and it plays a very important role in intent recognition and in user interest point mining in recommendation systems.

[0003] In related technologies, hyponymy discrimination is generally treated as a sequence labeling problem: the two tasks of hypernym relationship extraction and discrimination are combined into one task, and a single model is jointly trained, from the text where hyponym and hyp...

Application Information

IPC(8): G06F40/279, G06K9/62, G06N3/08
CPC: G06N3/088, G06F18/2433, Y02D10/00
Inventor: 林振斌, 王晓利
Owner: TENCENT TECH (SHENZHEN) CO LTD