Web crawler method for financial warehouse warrant risk control

A web crawler and financial warehouse technology, which is applied in the web crawler field of financial warehouse receipt risk control, can solve the problems of difficult goods market price statistics, early financial transaction risks, and valuation of goods that cannot be mortgaged, so as to ensure controllability and Efficiency, improve processing efficiency and classification accuracy, and ensure maximum effect

Active Publication Date: 2016-11-09
BEIJING UNIV OF TECH
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Banks are limited by technical limitations, making it difficult to make statistics on the market prices of all goods, and unable to make reasonable valuations of mortgaged goods, which has early become a potential financial transaction risk
[0004] To solve the commodity valuation problem of bulk commodities, it is first necessary to obtain the price information of the commodity in the market. However, due to the limitations of factors such as massive data and accurate information extraction, the network currently used for risk control of financial warehouse receipts, that is, the valuation of commodity prices Crawler technology is in a blank state

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Web crawler method for financial warehouse warrant risk control
  • Web crawler method for financial warehouse warrant risk control

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0032] like figure 1 As shown, the embodiment of the present invention provides a web crawler method for financial warehouse receipt risk control, including the following steps:

[0033] Step 1, establish keyword database and abstract database.

[0034] A certain amount of sample data is required at the initial stage of establishing the keyword database and the abstract database. The sample data needs to be obtained in advance, and the amount of data is small, but the category of each record has been determined.

[0035] Chinese word segmentation methods such as Lucene are used to extract the keywords of each record of the sample data, and at the same time filter out non-related words such as symbols, stop words, people, and place names. The extracted keywords form a keyword library.

[0036] The feature vector is calculated for each record...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a web crawler method for financial warehouse warrant risk control. Double bloom filter keyword matching is adopted, and the web crawler information comprising the cargo information result is screened rapidly; different classes of cargo are sorted accurately on the basis of a classification matching manner, the regulation is compared by the combination of the threshold value, and new cargo classes are added automatically; on the basis of an information mechanism, load balancing of front and rear end tasks of the whole processing process is realized, the controllability and the efficiency of the processing process are ensured to be maximum, and local hot spots are prevented. With the adoption of the technical scheme, efficient crawling and accurate screening of financial warehouse warrant mortgage cargo information can be realized.

Description

technical field [0001] The invention belongs to the related field of web crawler algorithms, and in particular relates to a web crawler method for risk control of financial warehouse receipts. Background technique [0002] As a new type of storage transaction and mortgage method, financial warehouse receipts are widely used by banks and storage companies with the popularization of Internet applications. Small and medium-sized enterprises mortgage the goods to the bank, and the bank evaluates the value of the goods by itself or by entrusting a third-party evaluation company. According to the evaluation results, the bank will issue corresponding loans to SMEs. At the same time, the bank entrusts the logistics and warehousing company to store and supervise the mortgaged goods. [0003] However, in order to avoid the corresponding risks, banks often choose those products with small price changes, strong liquidity and good resilience as financing objects, such as fixed assets a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06Q40/02
CPCG06F16/9535G06Q40/03
Inventor 李浩
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products