Method and system for Chinese full-text search in database
The invention relates to database and full-text search technology, applied in the fields of text database indexing, digital data information retrieval, and unstructured text data retrieval. It addresses the problems of heavy computation, large data volume, and low efficiency, and achieves good recognition and retrieval, faster reading and writing, and the ability to store a large amount of data.
Examples
Embodiment 1
[0061] This embodiment discloses a method for Chinese full-text retrieval in a database which, as shown in Figure 1, includes the following steps:
[0062] Receive text data to be inserted into the database;
[0063] Perform binary (bigram) word segmentation on the text data, treating every two adjacent Chinese characters as a group, and create an inverted index for the text data at the same time;
[0064] During word segmentation, for each binary phrase obtained, write the binary phrase and its position information in the text data into the inverted index;
[0065] Receive the text to be retrieved, and perform binary word segmentation processing to obtain multiple binary phrases to be retrieved;
[0066] In the database, a full-text search is performed based on the inverted index and the plurality of binary phrases to be searched.
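The steps above can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names (`bigrams`, `index_document`, `search`), the in-memory dictionary used as the inverted index, and the restriction to the basic CJK Unified Ideographs range are all assumptions made for the example. Here the search simply intersects the sets of documents that contain every binary phrase of the query.

```python
from collections import defaultdict


def is_cjk(ch):
    # Assumption: only the basic CJK Unified Ideographs block counts as Chinese.
    return "\u4e00" <= ch <= "\u9fff"


def bigrams(text):
    """Yield (binary phrase, position) for each pair of adjacent Chinese characters."""
    for i in range(len(text) - 1):
        if is_cjk(text[i]) and is_cjk(text[i + 1]):
            yield text[i:i + 2], i


def index_document(index, doc_id, text):
    """Write every binary phrase and its position into the inverted index."""
    for gram, pos in bigrams(text):
        index[gram][doc_id].append(pos)


def search(index, query):
    """Return the ids of documents that contain every binary phrase of the query."""
    grams = [gram for gram, _ in bigrams(query)]
    if not grams:
        return set()
    result = set(index.get(grams[0], {}))
    for gram in grams[1:]:
        result &= set(index.get(gram, {}))
    return result


# Hypothetical in-memory index: binary phrase -> {doc_id: [positions]}
index = defaultdict(lambda: defaultdict(list))
index_document(index, 1, "数据库全文检索")
index_document(index, 2, "全文索引技术")
print(search(index, "全文检索"))  # {1}
```

Both documents contain the phrase 全文, but only document 1 also contains 文检 and 检索, so only it is returned.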
[0067] Specifically, as shown in Figure 2, the method includes process 1 of inserting new text d...
Embodiment 2
[0098] As a modification of Embodiment 1, this embodiment provides a method for Chinese full-text search in a database, as shown in Figure 5.
[0099] The method for Chinese full-text retrieval in a database comprises the following steps:
[0100] Pre-create the inverted index structure;
[0101] Receive text data to be inserted into the database;
[0102] Perform binary word segmentation on the text data, treating every two adjacent Chinese characters as a group;
[0103] During word segmentation, for each binary phrase obtained, write the binary phrase and its position information in the text data into the inverted index;
[0104] Receive the text to be retrieved, and perform binary word segmentation processing to obtain multiple binary phrases to be retrieved;
[0105] In the database, a full-text search is performed based on the inverted index and the pl...
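As a rough sketch of this variant, the index structure can be created up front and filled as text arrives. The class name `BigramIndex`, its methods, and the way the stored positions are used to confirm that the binary phrases occur consecutively are assumptions for illustration, since the source does not spell out the matching rule at this point. The sketch also assumes the query is a contiguous run of Chinese characters.

```python
class BigramIndex:
    """Hypothetical pre-created inverted index: binary phrase -> {doc_id: [positions]}."""

    def __init__(self):
        # The structure exists before any text data is received.
        self.postings = {}

    def insert(self, doc_id, text):
        """Segment text into binary phrases and record each phrase with its position."""
        for i in range(len(text) - 1):
            gram = text[i:i + 2]
            # Assumption: only the basic CJK Unified Ideographs block is indexed.
            if all("\u4e00" <= ch <= "\u9fff" for ch in gram):
                self.postings.setdefault(gram, {}).setdefault(doc_id, []).append(i)

    def phrase_search(self, query):
        """Return (doc_id, start) pairs where the query's binary phrases occur consecutively."""
        grams = [query[i:i + 2] for i in range(len(query) - 1)]
        if not grams:
            return []
        hits = []
        for doc_id, starts in self.postings.get(grams[0], {}).items():
            for start in starts:
                # The k-th binary phrase must appear k characters after the first one.
                if all(
                    start + k in self.postings.get(gram, {}).get(doc_id, [])
                    for k, gram in enumerate(grams)
                ):
                    hits.append((doc_id, start))
        return hits


idx = BigramIndex()                  # index structure created in advance
idx.insert(1, "检索全文，全文检索")
idx.insert(2, "全文和检索")
print(idx.phrase_search("全文检索"))  # [(1, 5)]
```

Document 2 contains both 全文 and 检索 but not next to each other, so the position check excludes it.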
Embodiment 3
[0120] Based on the retrieval method in Embodiment 1, this embodiment provides a database Chinese full-text retrieval system.
[0121] A database Chinese full-text retrieval system, as shown in Figure 7, includes a client, a database system, and a server, wherein:
[0122] The client receives the text to be retrieved input by the user, generates a retrieval request and sends it to the server;
[0123] The server, connected to the database system, is configured to: receive text data and insert it into the database, and generate a corresponding inverted index of the text data, specifically including:
[0124] Step 101: receive text data to be inserted into the database;
[0125] Step 102: preprocess the text data;
[0126] Step 103: perform binary segmentation on the preprocessed text data, treating every two adjacent Chinese characters as a group, and create an inverted index for the text data at the same time; the inverted index structure includes a three-level index, wherein the p...