Natural language information data processing method and device

A natural language and data processing technology, applied in the field of information processing, can solve problems such as lack of creativity, over-fitting of generated content, and single content, and achieve the effect of improving sentence creativity, eliminating over-fitting phenomenon, and reasonable word collocation

Inactive Publication Date: 2017-03-15
HISENSE
View PDF2 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the process of generating natural language information, the computer's creative ability depends on the training corpus. When the corpus is not large enough, it will easily l

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Natural language information data processing method and device
  • Natural language information data processing method and device
  • Natural language information data processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] In order to make the above objects, features and advantages of the embodiments of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0069] One of the core concepts of the embodiments of the present invention is to first use a corpus of a specific style to generate natural language information, but because there is an overfitting phenomenon in the generated natural language information, the content in the natural language information will be different from that in the corpus of a specific style. The content is consistent or partially consistent; it affects the quality of the generated natural language information, so replace the content in the natural language information with the content in the natural language information that is consistent or partially consistent with the specific style of the corpus to eliminate the phenomenon of overfittin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a natural language information data processing method and device. The natural language information data processing method includes: acquiring a first corpus and a second corpus; generating natural language information according to the second corpus; determining a second characteristic word in the natural language information; extracting a first characteristic word from the first corpus; and using the first characteristic word to replace the second characteristic word meeting a preset rule. According to the embodiment of the invention, more reasonable and effective first characteristic words can be provided through determination of the rationality of the replaced natural language information, and the second characteristic word is replaced; the natural language information can achieve reasonable matching and fluent statements; overfitting phenomenon can be accurately and reasonably eliminated in a generation process of a natural language; high quality natural language information can be generated; the matching rationality and the statement fluency of the generated natural language information can be ensured; and the statement creativity of the natural language information can be improved.

Description

technical field [0001] The present invention relates to the technical field of information processing, in particular to a data processing method for natural language information and a data processing device for natural language information. Background technique [0002] In the prior art, deep learning is a new field in machine learning research. Its motivation is to establish and simulate the neural network of human brain for analysis and learning. It imitates the mechanism of human brain to explain data, such as image information, text information and natural language information. [0003] With the application of deep learning in natural language information processing, rule-based natural language information processing is gradually being eliminated. Deep learning-based natural language information processing no longer requires cumbersome rule configuration, continuous addition and maintenance of rules, and syntactic analysis , Analyzing semantics, computers can create and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27
CPCG06F40/242G06F40/30
Inventor 袁丽甘信军
Owner HISENSE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products