Patent information analysis method and device

A technology of patent information and analysis method, applied in the field of computer information, can solve problems such as waste of manpower, low efficiency, invalid regular expression rules, etc., and achieve the effect of saving manpower and improving efficiency

Active Publication Date: 2014-07-16
北京彼速信息技术有限公司
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Although this method of parsing patent information has high parsing efficiency, website owners on the Internet often adjust the HTML format to display different effects on the webpage. This adjustment will inevitably lead to invalidation of regular expression rules set by users, thus The data parsed by the above parsing method is wrong or cannot be parsed
Unless the user reanalyzes the HTML format, rewrites the regular expression rules that can accurately locate each data item, and updates it into the computer program, this obviously brings a huge workload to the user, wastes manpower, and is inefficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Patent information analysis method and device
  • Patent information analysis method and device
  • Patent information analysis method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0048] figure 1 The flow chart of the patent information analysis method provided by Embodiment 1 of the present invention, as shown in figure 1 As shown, the method may include the following steps:

[0049] Step 101: Select the parsed patent information from the database as the basic data, and obtain the HTML format webpage of the patent information from the website.

[0050] Since the method provided by the embodiment of the present invention actually uses the patent information that has been parsed to perform reverse analysis to obtain a regular expression when the HTML format of the page changes, that is to say, this process can be started when the HTML format changes Because once the regular expression is parsed, as long as the HTML format does not change, the regular expression can be used to analyze the patent information. Therefore, it is possible to regularly detect whether the HTML format of the website changes, and once the HTML format changes are detected, the ex...

Embodiment 2

[0095] image 3 The structural diagram of the patent information analysis device provided for Embodiment 2 of the present invention, as shown in image 3 As shown, the device may include: a basic data acquisition unit 310 , a web page acquisition unit 320 , a rule formatting unit 330 and an information parsing unit 340 .

[0096] The basic data acquisition unit 310 selects the patent information that has been parsed from the database as the basic data.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a patent information analysis method and device, wherein the method comprises the steps of: selecting analyzed patent information from a database to serve as basic data, and acquiring HTML (Hypertext Markup Language)-format webpages of the patent information from a website; specific to each data item in the basic data, respectively acquiring character strings capable of uniquely locating all the data items from the acquired HTML-format webpages, and respectively formatting the character strings into regular expressions for analyzing all data items; and analyzing the patent information from the unanalyzed HTML-format webpage of the website by using the regular expressions for analyzing all the data items, and saving the patent information obtained through analysis into the database. The method and the device disclosed by the invention can be used for adaptively establishing an analysis rule of the patent information so as to ensure that the analysis rule of the patent information can be updated automatically even though the HTML formats of the webpages change, so that the patent information is analyzed correctly, manpower is saved and the efficiency is increased.

Description

【Technical field】 [0001] The invention relates to the field of computer information technology, in particular to a patent information analysis method and device. 【Background technique】 [0002] With the rapid development of Internet technology, the Internet has become the main means for people to obtain information, and so is patent information. Almost all patent information in the world is released through the Internet, making it easier for people to obtain patent information, thereby promoting technological innovation and development. Now more and more enterprise users search patent information on the Internet and parse it into accurate data and save it in the local database, thus forming their own patent information library for in-depth use. [0003] When analyzing patent data published in Hypertext Markup Language (HTML) format, the user usually analyzes the patent information in HTML format, writes regular expressions that can accurately locate each data item (such as ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 谢国利
Owner 北京彼速信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products