
Text processing method and device, electronic equipment and storage medium

A text processing technology, applied to text processing methods, devices, electronic equipment and storage media. It addresses the manpower and time consumed by manual labeling, the insufficient applicability and scalability of text processing, and the resulting inability to support knowledge graph construction. Stated effects include improved accuracy, an increased receptive field, and improved precision.

Pending Publication Date: 2022-07-12
PINGAN INT SMART CITY TECH CO LTD

AI Technical Summary

Problems solved by technology

[0002] The method currently used to acquire training data when constructing a domain knowledge graph requires extensive manual labeling of entities, which consumes a great deal of manpower and time. In addition, open-source Chinese knowledge graphs are generally built on data extracted and fused from the major encyclopedias. As a result, text processing for specialized fields lacks applicability and scalability and cannot support the construction of knowledge graphs in those fields.

Examples

Embodiment 1

[0039] Figure 1 is a flowchart of the text processing method provided in Embodiment 1 of the present invention. The text processing method specifically includes the following steps. Depending on requirements, the order of the steps in the flowchart can be changed, and some steps can be omitted.

[0040] S11, obtaining historical text, and using a remote supervised learning (distant supervision) technology to obtain training data based on the historical text.

[0041] In an optional embodiment, the solution provided in this application can be applied to the field of vocational education knowledge graph construction, and the historical text includes vocational training course text. The electronic device may acquire the historical text in response to user input, and may also pre-store the historical text in the memory of the electronic device, or pre-store the historical text in other devices communicatively connected to the electronic device. In addition, the electronic device can also ...
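As a rough illustration of how step S11 could produce entity training data without manual labeling, the sketch below tags tokens by matching them against a lexicon of already-known entities. This is a minimal sketch under stated assumptions, not the method disclosed in the application: the function name `distant_label`, the BIO tagging scheme, the longest-match strategy, and the sample lexicon and sentence are all hypothetical.

```python
from typing import Dict, List, Tuple


def distant_label(tokens: List[str], lexicon: Dict[Tuple[str, ...], str]) -> List[str]:
    """Assign BIO tags by longest-match lookup against a known-entity lexicon.

    Hypothetical helper: the source does not specify how remote supervision
    is implemented; lexicon matching is one common way to do it.
    """
    tags = ["O"] * len(tokens)
    max_len = max((len(key) for key in lexicon), default=0)
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest span first so a multi-token entity wins over its prefix.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            label = lexicon.get(tuple(tokens[i:i + n]))
            if label is not None:
                tags[i] = f"B-{label}"
                for j in range(i + 1, i + n):
                    tags[j] = f"I-{label}"
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return tags


# Illustrative data only; not taken from the application.
lexicon = {("data", "analysis"): "SKILL", ("python",): "TOOL"}
tokens = "this course teaches data analysis with python".split()
tags = distant_label(tokens, lexicon)
for token, tag in zip(tokens, tags):
    print(token, tag)
```

Labels produced this way are noisy, since any lexicon entry that happens to appear in the text is tagged regardless of context, so a real pipeline would typically filter or reweight them before training.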

Embodiment 2

[0098] Figure 2 is a structural diagram of the text processing apparatus provided in Embodiment 2 of the present invention.

[0099] In some embodiments, the text processing apparatus 20 may include a plurality of functional modules composed of computer program segments. The computer program of each program segment in the text processing apparatus 20 can be stored in the memory of the electronic device and executed by at least one processor to perform the text processing functions (see the description of Figure 1 for details).

[0100] In this embodiment, the text processing apparatus 20 may be divided into a plurality of functional modules according to the functions performed by the text processing apparatus 20. The functional modules may include: an acquisition module 201, a training module 202, a generation module 203, an identification module 204 and an extraction module 205. The modules referred to in the present invention refer to a series of compu...

Embodiment 3

[0159] This embodiment provides a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, the steps in the foregoing text processing method embodiments are implemented, for example, steps S11-S15 shown in Figure 1:

[0160] S11, obtaining historical text, and using remote supervised learning technology to obtain training data based on the historical text;

[0161] S12, using the training data to train a convolutional neural network to obtain an entity recognition model;

[0162] S13, training a language model based on the training data to obtain a relationship generation model;

[0163] S14, obtaining the text to be processed, and identifying entities in the text to be processed based on the entity recognition model and the new word discovery technology;

[0164] S15, based on the relationship generation model and the preset information entropy threshold and the dependency syntax analysis...
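Step S15 refers to a preset information entropy threshold, but the visible text does not say how the threshold is applied. One plausible reading, sketched below purely as an assumption, is that the relationship generation model yields a probability distribution over candidate relations for an entity pair, and the top relation is accepted only when the Shannon entropy of that distribution falls at or below the threshold. The names `shannon_entropy` and `pick_relation`, the relation labels, the probabilities, and the threshold value of 1.0 bit are all hypothetical.

```python
import math
from typing import Dict, Optional


def shannon_entropy(probs: Dict[str, float]) -> float:
    """Entropy in bits of a discrete distribution; zero-probability terms contribute 0."""
    return -sum(p * math.log2(p) for p in probs.values() if p > 0)


def pick_relation(probs: Dict[str, float], threshold: float) -> Optional[str]:
    """Return the most probable relation, or None when the distribution is too uncertain.

    Assumption: a low-entropy (peaked) distribution signals a reliable prediction.
    """
    if shannon_entropy(probs) > threshold:
        return None
    return max(probs, key=probs.get)


# Illustrative distributions only; not taken from the application.
confident = {"teaches": 0.9, "requires": 0.05, "part_of": 0.05}
uncertain = {"teaches": 0.34, "requires": 0.33, "part_of": 0.33}

print(pick_relation(confident, threshold=1.0))
print(pick_relation(uncertain, threshold=1.0))
```

Under this reading, a lower threshold trades recall for precision: fewer entity pairs receive a relation, but the accepted ones come from more sharply peaked predictions.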

Abstract

The invention relates to the technical field of natural language processing and provides a text processing method and device, electronic equipment and a storage medium. The method comprises: obtaining training data based on historical text by using a remote supervised learning technology; training a convolutional neural network with the training data to obtain an entity recognition model; training a language model based on the training data to obtain a relationship generation model; obtaining a to-be-processed text and identifying entities in it based on the entity recognition model and a new word discovery technology; and finally, based on the relationship generation model, a preset information entropy threshold and a dependency syntax analysis technology, performing relationship extraction on the entities in the to-be-processed text to obtain a relationship extraction result. This reduces the labor cost of obtaining entity training data, improves the user experience, and improves the accuracy and applicability of relationship extraction for text processing in the vocational education field.

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, in particular to a text processing method, device, electronic device and storage medium.

Background

[0002] The acquisition method of training data used in the current domain knowledge graph construction process requires a lot of manual labeling of entities, which consumes a lot of manpower and time. In addition, the data basis of the open source Chinese knowledge graph is generally derived from the results of the extraction and fusion of major encyclopedias. The applicability and scalability of text processing in special fields are insufficient, and it cannot meet the construction of knowledge graphs in special fields.

Summary of the invention

[0003] In view of the above, it is necessary to propose a text processing method, device, electronic device and storage medium, which can obtain entity training data through remote supervision, reduce labor costs,...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/295, G06F40/211, G06F40/216, G06F16/36, G06N3/04, G06N3/08
CPC: G06F40/295, G06F40/211, G06F40/216, G06F16/367, G06N3/08, G06N3/045
Inventor: 刘静
Owner: PINGAN INT SMART CITY TECH CO LTD