Unlock instant, AI-driven research and patent intelligence for your innovation.

A text data retrieval method and device

A text data and data technology, applied in the direction of unstructured text data retrieval, text database query, special data processing applications, etc., can solve the problem of poor data retrieval effect, messy text data across systems and fields, and cannot efficiently meet practical applications Requirements and other issues to achieve the effect of improving retrieval efficiency and facilitating calculation

Active Publication Date: 2020-09-01
GUANGDONG POWER GRID CO LTD
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] This application provides a text data retrieval method and device, which are used to solve the technical problems that text data is disorderly, cross-system and cross-field, and the sharp increase in data volume leads to poor retrieval effect and cannot efficiently meet the needs of practical applications

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A text data retrieval method and device
  • A text data retrieval method and device
  • A text data retrieval method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0057] For ease of understanding, see figure 1 , Embodiment 1 of a text data retrieval method provided by the present application, comprising:

[0058] Step 101. Construct feature vectors extracted from preset text data into a vector set, where the feature vectors include first keywords and first feature weights.

[0059] It should be noted that the preset text data is collected and processed cross-domain and cross-system text data at different levels, such as log information, business financial sales management software text records, customer service complaints and suggestions, email comments, etc.; in the abstract It is difficult to find the internal connection of these text data at the level, and it needs to be transformed into a mathematical model that is convenient for research, that is, feature vectors; the extraction of feature vectors is the process of feature extraction, and the selected feature items are very important. The features in this embodiment Vectors are di...

Embodiment 2

[0069] For ease of understanding, see figure 2 , the embodiment of the present application provides a second embodiment of a text data retrieval method, including:

[0070] Step 201, collecting messy original text data.

[0071] Step 202, performing a data cleaning operation on the original text data to obtain preset text data.

[0072] It should be noted that the original text data includes structured and unstructured. In the enterprise IT system, such as log information, business financial sales management and other software text records, customer service complaints and email comments, etc., contain a large amount of text data; The original text data involves a messy system or field, and the data levels are not the same, and there is no correlation. The purpose of retrieval is to retrieve the most similar text information among these messy data based on the existing text information; the specific method of collection can be Install Agent on site to collect, analyze and pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a text data retrieval method and device. The method includes: firstly, constructing feature vectors extracted from preset text data into a vector set, and the feature vectors include first keywords and first feature weights; then, Classify the set of vectors according to the first similarity between the preset hotspot vectors and the feature vectors to obtain the feature vector category library; secondly, construct the search vector according to the preset search hotspots, and the search vector includes the second keyword and the second feature weight ; Then, randomly select a category in the feature vector category library, calculate the second similarity between each feature vector in the category and the retrieval vector, and obtain the maximum similarity; finally, use the first feature weight instead of the second one according to the preset conditions Two feature weights, and iterative retrieval, get the retrieval feature vector. It solves the technical problem that the retrieval effect is poor and the actual application requirements cannot be efficiently met.

Description

technical field [0001] The present application relates to the technical field of text retrieval, in particular to a text data retrieval method and device. Background technique [0002] In recent years, the rapid development of the Internet has ushered in an era of explosive growth of information. With the gradual transfer of daily life to the Internet, the era of big data has become inevitable. As a frontier concept of the global Internet, big data mainly includes two characteristics: one is the sharp increase in the amount of information; the other is the exponential growth in the amount of information available to individuals. [0003] Artificial intelligence is the study of how computers simulate or implement human learning behaviors to acquire new knowledge or skills, and reorganize existing knowledge structures to continuously improve their own performance. With the development of artificial intelligence, artificial intelligence has also been applied to various fields...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33
CPCG06F16/3334G06F16/3347
Inventor 侯凯李耀东金波
Owner GUANGDONG POWER GRID CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More