Random forest technology-based similar file retrieval method

A random forest and file retrieval technology, applied to network data retrieval, other database retrieval, network data indexing, etc., can solve problems such as ineffective retrieval, improve retrieval accuracy and coverage, and avoid instability

Active Publication Date: 2016-09-07
安徽富驰信息技术有限公司
View PDF8 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patented process helps search or match related documents more accurately without being affected from any issues like stability and overfitness. It uses Random Forest (RF) techniques that use specific technical factors such as industrial properties and expertise. These methods improve upon existing algorithms but still have limitations due to their fixed structure.

Problems solved by technology

The technical problem addressed in this patents relates to improving the efficiency and effectiveness of searching documents related to specific areas like legal case searches or investigative procedures conducted during litanyings. Current methods either require manual input or lack precision due to factors like subjectivity involved while navigating through irrelevant data sources. This can lead to poorly identified scenarios that may result in incorrect conclusions being drawn up later when reviewing them again afterward. There exists a challenge where current systems only provide basic level recommendations without providing detailed analysis capabilities.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Random forest technology-based similar file retrieval method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to have a further understanding and understanding of the structural features of the present invention and the achieved effects, the preferred embodiments and accompanying drawings are used for a detailed description, as follows:

[0039] Such as figure 1 Shown, a kind of similar file retrieval method based on random forest technique described in the present invention, comprises the following steps:

[0040] The first step is the organization of the judgment documents. The judgment documents are organized according to the cause of action. Due to the particularity of this application document, it is proposed to construct the feature tree according to the industry characteristics of different fields and industries. Therefore, for different fields, the industry characteristics they have are not the same, and this is for convenience. The elaboration of the technical solution is based on the characteristics of the judicial case to explain the technical classificatio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a random forest technology-based similar file retrieval method. Compared with the prior art, the method has the advantage that the defect of incapability of performing effective retrieval in a specific field is overcome. The method comprises the following steps of organizing a judgment document; constructing a case characteristic tree; performing weight training on the case characteristic tree, performing training for different targets by adopting a random forest method, and calculating a comprehensive weight of case characteristics; obtaining retrieval information, and inputting filtration conditions and query conditions of the retrieval information in an input mode of conditional selection, a text with conditions or the whole judgment document; calculating a case similarity matrix; and outputting a retrieval result, obtaining similar cases from the case similarity matrix, finding n cases most similar to the query conditions or cases with the similarity greater than s, making statistics on the information, and performing visual display. According to the method, the problems of instability and over-fitting caused by a single decision-making tree are effectively avoided by using the random forest-based method.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner 安徽富驰信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products