Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text intention matching method oriented to intelligent questions and answers and based on internal correlation coding

A technology of intelligent question answering and matching methods, applied in text database query, unstructured text data retrieval, computer parts, etc., can solve problems such as troubles, difficulty in fully capturing complex deep semantic features, and lack of features

Active Publication Date: 2021-03-09
南方电网互联网服务有限公司
View PDF9 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Some works try to extract deep semantic features through the attention mechanism. Although these works have obtained some deep semantic features from different angles, they are generally troubled by the lack of features in the deep encoding process.
In particular, in the face of the diversity and complexity of Chinese semantics, it is difficult for existing methods to fully capture the complex and deep semantic features contained in the text.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text intention matching method oriented to intelligent questions and answers and based on internal correlation coding
  • Text intention matching method oriented to intelligent questions and answers and based on internal correlation coding
  • Text intention matching method oriented to intelligent questions and answers and based on internal correlation coding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0106] as attached Figure 9 As shown, the main framework structure of the present invention includes a multi-granularity embedding module, an internal correlation encoding module, a global reasoning module and a label prediction module. Among them, the multi-granularity embedding module performs embedding operations on the input text according to word granularity and word granularity, and transfers the result after embedding encoding to the internal correlation encoding module. The internal correlation encoding module consists of a base encoding layer and a soft-aligned encoding layer, such as Figure 8 shown. Among them, the structure of the basic coding layer is as follows Figure 7 As shown, it encodes the word / word embedding representation imported from the multi-granularity embedding module using the long-term short-term memory network LSTM, and then concatenates the word / word embedding representation and the encoding result, and passes it into the bidirectional long-t...

Embodiment 2

[0112] as attached figure 1 As shown, the text intent matching method based on internal correlation coding for intelligent question answering of the present invention, the specific steps are as follows:

[0113] S1. Construct the text intent matching knowledge base, as attached figure 2 As shown, the specific steps are as follows:

[0114] S101. Download a published text intent matching dataset from the Internet, or manually construct a dataset that meets requirements, and use it as raw data for constructing a text intent matching knowledge base.

[0115] Example: There are many public text intent matching data sets for intelligent question answering systems on the Internet, such as the BQ data set [Jing Chen, Qingcai Chen, Xin Liu, Haijun Yang, Daohe Lu, and BuzhouTang.2018.The BQ corpus: A large -scale domain-specific Chinese corpus forsentence semantic equivalence identification. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages...

Embodiment 3

[0250] as attached Figure 6 As shown, the text intent matching device based on the intelligent question answering based on internal correlation coding of embodiment 2, the device includes,

[0251] The text intent matching knowledge base construction unit is used to obtain a large amount of text intent matching data from the Internet, and then perform hyphenation and word segmentation operations on it to form a text intent matching knowledge base; the text intent matching knowledge base construction unit includes,

[0252] The original data acquisition unit is responsible for downloading the automatic question-and-answer text intent matching data set that has been published on the Internet, or manually constructing a data set that meets the requirements, and using it as the original data for constructing the text intent matching knowledge base;

[0253] The data preprocessing unit is responsible for performing hyphenation and word segmentation operations on the original data ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text intention matching method oriented to intelligent questions and answers and based on internal correlation coding, and belongs to the field of artificial intelligence. Inorder to solve the technical problem of how to accurately judge whether a text intention is matched or not, the adopted technical scheme is as follows: a text intention matching model consisting of amulti-granularity embedding module, an internal correlation encoding module, a global reasoning module and a label prediction module is constructed and trained to realize deep encoding of informationof different granularities of a text, and meanwhile, a soft alignment attention mechanism is used for obtaining internal correlation information between different granularities; a representation of the text and a multi-granularity representation between the texts are generated through global maximum pooling and global average pooling; similarity calculation is performed on the representations ofthe two texts, and a similarity calculation result is combined with the multi-granularity representation between the texts to obtain a final interaction information representation of the text pair; and the text pair intention matching degree is calculated to achieve the purpose of judging whether the text pair intention is matched or not.

Description

technical field [0001] The invention relates to the technical fields of artificial intelligence and natural language processing, in particular to a text intent matching method based on internal correlation coding for intelligent question answering. Background technique [0002] As a branch of text semantic matching, text intent matching is crucial for intelligent question answering systems. In an intelligent question answering system, text intent matching is used to judge whether two texts have similar intents and whether they can be answered with the same answer. Textual intent matching is a very challenging task, which has not been fully addressed by existing methods. [0003] Existing methods usually simply consider the literal features of the text at the sentence level, while ignoring the deep semantic features. Some works try to extract deep semantic features through the attention mechanism. Although these works have obtained some deep semantic features from different...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/332G06F40/289G06F40/30G06K9/62
CPCG06F16/3344G06F16/3329G06F40/289G06F40/30G06F18/214
Inventor 鹿文鹏赵鹏宇张旭
Owner 南方电网互联网服务有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products