
Corpus generalization method and man-machine conversation sentiment analysis method used in industrial field

A technology of man-machine dialogue and corpus, applied in the field of data processing

Pending Publication Date: 2021-05-28
SANY HEAVY IND CO LTD (CN)

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a corpus generalization method and a human-computer dialogue sentiment analysis method for the industrial field. It addresses a shortcoming of the prior art, in which generalizing a corpus by manually defining sentence templates is severely limited, and realizes corpus generalization for human-computer dialogue in the industrial field.



Examples


Embodiment Construction

[0050] To make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0051] Because of their limitations, current corpus generalization methods are not suited to rapid application and deployment. To solve this technical problem, an embodiment of the present invention provides a corpus generalization method, which includes:

[0052] S1. Obtain an initial text corpus in the industrial field, and replace entity words in the initial text c...
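The entity-replacement idea in step S1 can be illustrated with a minimal sketch. The entity dictionary, the example sentence, and the function name `replace_entities` are hypothetical and are not taken from the patent; plain substring matching stands in for whatever entity recognition the actual method uses.

```python
from typing import Dict, List

# Hypothetical entity dictionary: entity type -> interchangeable entity words.
ENTITY_DICT: Dict[str, List[str]] = {
    "equipment": ["excavator", "crane", "pump truck"],
    "component": ["hydraulic pump", "engine"],
}


def replace_entities(sentence: str, entity_dict: Dict[str, List[str]]) -> List[str]:
    """Return variants of `sentence` in which one entity word found in it is
    swapped for another entity word of the same type."""
    variants: List[str] = []
    for words in entity_dict.values():
        for found in words:
            # Assumption: simple substring matching instead of real entity recognition.
            if found not in sentence:
                continue
            for other in words:
                if other != found:
                    variants.append(sentence.replace(found, other))
    return variants


# Hypothetical initial text corpus from the industrial field.
initial_corpus = ["the excavator engine will not start"]

# "First type of text corpus": the entity-replaced variants.
first_type = [v for s in initial_corpus for v in replace_entities(s, ENTITY_DICT)]
```

Here `first_type` holds one variant per alternative entity word, so a single seed sentence about an excavator also yields sentences about a crane and a pump truck.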



Abstract

The invention provides a corpus generalization method and a man-machine conversation sentiment analysis method used in the industrial field. The corpus generalization method comprises the following steps: acquiring an initial text corpus in the industrial field, and replacing entity words in the initial text corpus to obtain a first type of text corpus; performing word segmentation processing on the initial text corpus and/or the first type of text corpus, and based on synonyms of words obtained by the word segmentation processing, replacing the words obtained by the word segmentation processing to obtain a second type of text corpus; performing dependency syntactic analysis on at least one of the initial text corpus, the first type of text corpus and the second type of text corpus, and performing sentence pattern transformation on the at least one based on an analysis result to obtain a third type of text corpus; and generalizing the initial text corpus based on at least two of the first type of text corpus, the second type of text corpus and the third type of text corpus. By means of the method, expansion of text corpora needed by functions such as man-machine conversation in the industrial field can be completed.
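The synonym-replacement and merging steps described in the abstract might look roughly like the following sketch. Everything named here is an assumption for illustration: the synonym table, the whitespace-based `segment` function standing in for real word segmentation, and the example sentences. The dependency-analysis step that produces the third type of corpus is not sketched, since it would need a syntactic parser.

```python
from itertools import product
from typing import Dict, List

# Hypothetical synonym table; a real system would use a domain thesaurus.
SYNONYMS: Dict[str, List[str]] = {
    "broken": ["faulty", "damaged"],
    "fix": ["repair"],
}


def segment(sentence: str) -> List[str]:
    """Stand-in for word segmentation: split on whitespace."""
    return sentence.split()


def synonym_variants(sentence: str, synonyms: Dict[str, List[str]]) -> List[str]:
    """Replace segmented words with their synonyms in every combination,
    excluding the unchanged sentence."""
    options = [[w] + synonyms.get(w, []) for w in segment(sentence)]
    variants = [" ".join(choice) for choice in product(*options)]
    return [v for v in variants if v != sentence]


def generalize(initial: List[str], *corpus_types: List[str]) -> List[str]:
    """Merge the initial corpus with the derived corpora, dropping duplicates
    while preserving first-seen order."""
    merged = list(initial)
    for corpus in corpus_types:
        merged.extend(corpus)
    return list(dict.fromkeys(merged))


initial = ["how to fix the broken pump"]
# Assumed output of the entity-replacement step (first type of text corpus).
first_type = ["how to fix the broken engine"]
# Second type: synonym replacement applied to the initial and first-type corpora.
second_type = [v for s in initial + first_type for v in synonym_variants(s, SYNONYMS)]
expanded = generalize(initial, first_type, second_type)
```

Because the second type is derived from both the initial and the first-type sentences, the replacements compound: `expanded` contains sentences such as "how to repair the faulty engine" that combine an entity swap with synonym swaps.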

Description

Technical field

[0001] The invention relates to the technical field of data processing, in particular to a corpus generalization method and a man-machine dialogue emotion analysis method used in the industrial field.

Background technique

[0002] The realization of human-computer interaction, chat dialogue and other functions in related professional fields such as industry requires a large amount of corpus data as support for model training and effect evaluation, and it is often difficult to accumulate relevant corpus in these scenarios. Therefore, corpus generalization is needed to increase the corpus for model training and effect evaluation.

[0003] Corpus generalization refers to expanding a specific sentence into a type of sentence with the same meaning or in similar scenarios. At present, corpus generalization is usually performed by manually defining sentence templates for fixed application scenarios. This method of manually defining sentence templates has great limi...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/35, G06F40/247, G06F40/289, G06F40/211, G06F40/279, G06F40/242
CPC: G06F16/353, G06F40/247, G06F40/289, G06F40/211, G06F40/279, G06F40/242
Inventor: 王健健, 蒋华晨, 刘扬
Owner: SANY HEAVY IND CO LTD (CN)