Text data classification method and device, equipment and storage medium

A text data classification method in the field of data technology, applicable to instruments, computing, and character and pattern recognition, that addresses the inability of existing approaches to classify data.

Active Publication Date: 2022-06-24
深圳大道云科技有限公司

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the present invention is to solve the technical problem that prior-art data classification, when applied to real estate data, cannot classify data of too many types and in too large a quantity.

Embodiment Construction

[0054] Embodiments of the present invention provide a method, apparatus, device, and storage medium for classifying text data.

[0055] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It is to be understood that data so used may be interchanged under appropriate circumstances so that the embodiments described herein can be practiced in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" or "having" and any variations thereof are intended to cover non-exclusive inclusion, for example, a process, method, system, product or device comprising a series of steps or units is not necessarily limited to those expressly listed steps or units, but may include other steps or units not expressly listed or inherent to these processes, me...

Abstract

The invention relates to the field of data classification and discloses a text data classification method, device, equipment and storage medium. The method comprises the following steps: acquiring file image data of real estate; performing recognition processing on the file image data based on a preset OCR (Optical Character Recognition) algorithm to obtain image feature data; converting the image feature data into an N-dimensional feature vector based on the feature arrangement position of the image feature data, N being a positive integer; reading a preset N-dimensional test node set, and calculating, in the same N-dimensional space, the Euclidean distances between the N-dimensional feature vector and all N-dimensional test nodes in the set to obtain a measurement distance set; screening out the measurement distances in the measurement distance set that are smaller than a preset division threshold to obtain a screened distance set; performing classification regression processing on the screened distance set according to a preset regression algorithm to obtain an image type; and determining the image type as the type of the file image data.
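The distance, screening and classification steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say how test nodes relate to image types, so each test node is assumed here to carry a type label, and a simple majority vote (ties broken by the nearest node) stands in for the unspecified "preset regression algorithm". The function name `classify_feature_vector` and the label strings are hypothetical.

```python
import math
from collections import Counter


def classify_feature_vector(feature, test_nodes, threshold):
    """Return an image type for `feature`, or None if no node is close enough.

    feature    : sequence of N numbers (the N-dimensional feature vector)
    test_nodes : list of (vector, label) pairs; the labels are an assumption,
                 the abstract only mentions a preset N-dimensional test node set
    threshold  : the preset division threshold
    """
    # Measurement distance set: Euclidean distance to every test node.
    # math.dist raises ValueError if the two points differ in dimension.
    distances = [(math.dist(feature, vec), label) for vec, label in test_nodes]

    # Screened distance set: keep only distances below the threshold.
    screened = [(d, label) for d, label in distances if d < threshold]
    if not screened:
        return None

    # Stand-in for the "preset regression algorithm" (assumption):
    # majority vote over the screened nodes, ties broken by nearest node.
    votes = Counter(label for _, label in screened)
    top = max(votes.values())
    tied = {label for label, count in votes.items() if count == top}
    return min((d, label) for d, label in screened if label in tied)[1]
```

With hypothetical nodes labeled "id_card" at (0, 0) and (0, 1) and "marriage_certificate" at (0.5, 0.5), a feature vector of (0, 0.2) and a threshold of 1.0 keeps all three nodes and the vote selects "id_card". A vector far from every node yields `None`, which a caller could treat as an unclassified document.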

Description

Technical field

[0001] The present invention relates to the field of data classification, in particular to a text data classification method, device, equipment and storage medium.

Background

[0002] Real estate financial transactions rely on important identity documents and real estate certification documents, mainly including ID cards, real estate certificates, marriage certificates and personal credit certificates. The materials used in business processing are mostly pictures and images, generally obtained by scanning paper documents or photographing them with a mobile terminal. Some techniques exist for classifying such real estate data by type.

[0003] The prior art includes some techniques for classifying data, but when classifying real estate data they cannot handle data with too many types, so a new technique is required.

SUMMARY OF THE...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06V30/413, G06V10/774, G06K9/62
CPC: G06F18/214
Inventor: 杨志陈耀麟李欢欢曾云奎秦在振
Owner: 深圳大道云科技有限公司