Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Rule-based method for patent abstract automatic extraction and keyword indexing

An automatic extraction and keyword technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as high labor costs

Inactive Publication Date: 2010-04-07
北京中献电子技术开发有限公司
View PDF5 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The indexing method of patent documents disclosed in Chinese patent application 200610024618.7 only solves the problems of quick reading and understanding of patent documents, but cannot fundamentally solve the problem of patent retrieval
[0010] The present invention aims to adopt a rule-based automatic patent abstract extraction and keyword indexing method, thereby significantly improving the efficiency of deep processing of patent documents and solving the current situation of high labor costs in deep processing of patent documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Rule-based method for patent abstract automatic extraction and keyword indexing
  • Rule-based method for patent abstract automatic extraction and keyword indexing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0042] The following are the process and results of automatic abstract extraction and keyword automatic indexing of Chinese patent application 00100617.7 using the method of the present invention. The original text of the patent is as follows:

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention a rule-based method for patent abstract automatic extraction and keyword indexing, which mainly comprises the steps: automatically marking key words such as characteristic and technique words and phrases in the full text of the patent literature according to a background knowledge base; determining the functions and mutual relationships of paragraphs in the article according to the types, times, position relations and the like of the occurrence of the characteristic words and phrases in the paragraphs; extracting key paragraphs of the paragraphs to form the extract; and finally, extracting key works from the extract to form the index items of the literature. The method for patent abstract automatic extraction and keyword indexing of the invention consists of five modules: a knowledge base module, a characteristic work marking module, a paragraph analysis and evaluation module, an extract automatic writing module and an indexing module. The method of the invention can obviously improve the efficiency of the deep processing of patent data and reduce the cost of the data processing. And the indexing result has a high retrieval value.

Description

technical field [0001] The invention belongs to the field of natural language computer processing, and in particular relates to a method for automatically extracting patent abstracts and keyword indexing based on rules. Background technique [0002] With the rapid growth of the number of patent documents, realizing the recall rate and precision rate of patent document data has increasingly become the focus and difficulty of patent information retrieval. For a long time, the retrieval of patent information using original patent data often has a relatively serious problem of contradictory recall and precision. Since the original information of patent documents comes from the applicant's original submitted materials, in order to realize the description and protection of the patented technology, a large number of directly related and indirectly related technical materials are often cited to describe the patented technology. Therefore, in patent retrieval, in order to ensure the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 王维王进胡先勇王海虹李红梅崔征
Owner 北京中献电子技术开发有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products