Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Natural language information processing method

A natural language and processing method technology, applied in the field of natural language processing, can solve the problems of not taking into account the relationship between words, low accuracy, and labor-intensive work

Inactive Publication Date: 2020-10-09
汪秀英
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the jieba word segmentation technology is based on the Trie tree structure to scan word graphs, it generates a directed acyclic graph composed of all possible word formations of Chinese characters in a sentence. Dynamic programming is used to find the maximum probability path and find the largest segmentation combination based on word frequency. , does not take into account the interrelationship between words, so word segmentation results do not contain the intrinsic information in words; the word sense disambiguation method based on probability statistics has good flexibility and high disambiguation efficiency, but the disambiguation task is Constrained by the size and category of the corpus, its accuracy is low, while the method based on the semantic dictionary has a high accuracy rate, but it takes a lot of work to build a dictionary, the efficiency is difficult to improve, and the disambiguation method is single; the existing keyword extraction The algorithm is mainly the TextRank algorithm, which is a graph-based sorting algorithm. It uses the co-occurrence window to realize the relationship between some words, sorts the subsequent keywords, and directly extracts keywords from the text itself. However, this method does not analyze words. Whether the difference in importance will affect the weight transfer of adjacent nodes, and the overall information of the document corpus is not used, the weight information of words has no practical meaning, and the strength of the connection cannot be distinguished

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Natural language information processing method
  • Natural language information processing method
  • Natural language information processing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0103] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0104]On the basis of realizing the word segmentation of natural language information based on the relationship between words and accurately eliminating the ambiguity, the keyword information in the natural language information is extracted, and the extracted keywords are used to analyze the natural language information. Classification. refer to figure 1 As shown, it is a schematic diagram of a method for processing natural language information provided by an embodiment of the present invention.

[0105] In this embodiment, the processing method of natural language information includes:

[0106] S1. Use the Huffman tree to store natural language information, and convert the natural language information into natural language vectors.

[0107] First, the present invention takes a window of an appropriate size as the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of natural language processing, and discloses a natural language information processing method. The method comprises the following steps: storing natural language information by utilizing a Huffman tree, and converting the natural language information into a natural language vector; carrying out word segmentation processing on the natural language vector by utilizing a pre-trained LC-CRF model; extracting vocabulary unit vectors of ambiguous words, and forming an ambiguous word feature matrix; inputting the ambiguous word feature matrix into a pre-constructed word sense disambiguation model, and identifying semantics of ambiguous words by utilizing the word sense disambiguation model; iteratively updating the weight of each word based on the weighted node initial value of the word comprehensive weight and the node probability transfer matrix, and selecting the first N words with relatively large weights as keywords; and taking a keyword vector in the natural language information as an input vector, and identifying emotion in the natural language information by utilizing the neural network based on the segmented pooling layer. The naturallanguage information is processed.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, in particular to a method for processing natural language information. Background technique [0002] At this stage, the degree of informatization of the entire world has reached a new height with the development of Internet technology, and at the same time it has brought an unimaginable growth rate to the amount of information in human society. In daily life, massive natural language information brings convenience to human beings, but also brings great troubles, that is, how to efficiently obtain the required content from large-scale information. [0003] Due to the great complexity of natural language, on the one hand, natural language has no fixed pattern, so it has rich ways of expression. Humans have their own habits when expressing their thoughts, so different people often have different expressions when describing the same thing ; On the other hand, natural lang...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F40/289G06F40/30G06N3/04
CPCG06F16/35G06F40/289G06F40/30G06N3/044G06N3/045
Inventor 汪秀英
Owner 汪秀英
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products