Method and device for establishing example sentence index and method and device for indexing example sentences

A kind of technology of example sentence and index, applied in example sentence retrieval method and device, example sentence index creation method and device field, can solve the problems such as inability to realize

Inactive Publication Date: 2012-09-05
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF3 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For some advanced syntax-based searches, it cannot be implemented
For example, if a user wants to search for example sentences for "diffic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for establishing example sentence index and method and device for indexing example sentences
  • Method and device for establishing example sentence index and method and device for indexing example sentences
  • Method and device for establishing example sentence index and method and device for indexing example sentences

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0131] figure 1 The flow chart of the example sentence index creation method provided by Embodiment 1 of the present invention, such as figure 1 As shown, perform the following steps for each example sentence in the example sentence library:

[0132] Step 101: Perform word segmentation processing on the example sentence.

[0133] Here, the word segmentation processing technology is a relatively mature technology in the field. For the English example sentence, since English itself is based on words, and words are separated by spaces, word segmentation can be easily realized. Chinese is based on characters. Existing word segmentation methods based on string matching, understanding-based word segmentation methods, or statistics-based word segmentation methods can be used to process Chinese word segmentation. The more commonly used ones are based on string matching. The maximum forward matching algorithm in the word segmentation method. The method of word segmentation processin...

Embodiment 2

[0184] Figure 4 The example sentence retrieval method flowchart that provides for the embodiment of the present invention two, as Figure 4 As shown, the method includes the following steps:

[0185] Step 401: Receive user's query.

[0186] In the embodiment of the present invention, the grammatical rules of the query can be defined in advance, and the user can input the query based on the grammatical rules.

[0187] The input query needs to contain query items. If there are multiple query items, it can further include the logical relationship between query items. Wherein, the query item is at least one of the following: the combination of words and the part of speech corresponding to the words, the combination of words and the NE types corresponding to the words, the combination of words and the syntactic roles corresponding to the words, and the combination of words and words The combination between; the logical relationship is: intersection or difference.

[0188] Pref...

Embodiment 3

[0222] Figure 5 The structural diagram of the device for creating an example sentence index provided by Embodiment 3 of the present invention, such as Figure 5As shown, the device may include: a text analysis unit 500 and an index establishment unit 510 .

[0223] The text analysis unit 500 is configured to perform text analysis on each example sentence in the example sentence database.

[0224] The index building unit 510 is used to create an index corresponding to each example sentence according to the analysis result of the text analysis unit 500; wherein the index includes at least one of the following: the combination of the words in the example sentence and the part of speech corresponding to the word, the example sentence The combination of the word in and the corresponding named entity type of the word, the combination of the word in the example sentence and the syntactic role corresponding to the word, and the combination of words and words in the example sentence....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a device for establishing an example sentence index and a method and a device for indexing example sentences. A special index is established for the example sentences by performing text analysis on the example sentences in an example sentence library; when a user inputs a grammar-based advanced search requirement, the search requirement input by the user is resolved; search results of respective inquiry items are acquired according to resolved inquiry items; and the search results of the respective inquiry items are integrated and processed according to a logic relation of the resolved inquiry items. The established index and the inquiry items are at least one of the following combinations: a combination of terms in the example sentences and parts of speech corresponding to the terms, a combination of terms in the example sentences and types of named entities corresponding to the terms, a combination of terms in the example sentences and syntactic roles corresponding to the terms, and a combination of terms in the example sentences. According to the methods and the devices, the grammar-based advanced search can be realized, so that the search effect can be improved.

Description

【Technical field】 [0001] The invention relates to the field of computer technology, in particular to a method and device for creating an example sentence index and a method and device for example sentence retrieval. 【Background technique】 [0002] Information retrieval refers to the process and technology of organizing information in a certain way and finding relevant information according to the needs of information users. Information retrieval has been widely used in literature, multimedia and translation fields. [0003] In the existing information retrieval technology, there is a special information retrieval: example sentence retrieval, which is used to retrieve example sentences containing certain keywords. Example sentence retrieval is usually used to display example sentences in monolingual dictionaries or in translation technology. However, the existing example sentence retrieval is usually based solely on keyword matching. For example, when it is applied to the ex...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
Inventor 赵世奇吴甜王海峰吴华
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products