Hash sample balance cancer labeling method for histopathologic image

A technique for pathology and samples, applied in the field of image analysis, which can solve the problems of slow operation, large time overhead and machine cost of integrated methods

Pending Publication Date: 2021-06-04
SOUTH CHINA UNIV OF TECH
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when the imbalance rate is high, the ensemble method needs to train enough base classifiers to achieve better results. At the same time, multiple models also bring greater time overhead and machine cost when deployed.
Another reason for the slowness of ensemble methods is the use of distance-based resampling methods in each iteration to obtain balanced data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hash sample balance cancer labeling method for histopathologic image
  • Hash sample balance cancer labeling method for histopathologic image
  • Hash sample balance cancer labeling method for histopathologic image

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0053] The hash-based sampling method (HBU) proposed by the present invention is an under-sampling method that selects representative samples belonging to multiple classes to construct a balanced training set. In the process of undersampling, the features of multi-class images are firstly extracted by convolutional autoencoder, and then the images in high-dimensional feature space are mapped to low-dimensional binary space by hash method to generate hash codes for all multi-class image samples. Each hash code corresponds to a subspace in the original feature space, also known as a hash bucket. Finally, calculate the selection ratio of the samples drawn in each hash bucket, and select a representative sample. figure 1 The algorithm flowchart of HBU is shown.

[0054] In the method of the present invention, we need to perform feature extraction on multi-category images first. If traditional manual methods including local binary patterns or root filter banks are used to extract...

Embodiment 2

[0089] Such as figure 1 As shown, a hash sample balanced cancer labeling method for histopathological images described in the present invention includes the following steps:

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Hash sample balance cancer labeling method for histopathologic images, which comprises the following steps: extracting features of various types of images by using a convolutional auto-encoder, extracting image block features by using an unsupervised convolutional auto-encoder CAE, and using a supervised convolutional neural network (CNN) for a final classification task; mapping an image in a high-dimensional feature space to a low-dimensional binary space by using a Hash method, forming Hash codes for all multi-class image samples, and wherein each Hash code corresponds to a subspace in an original feature space, also called a Hash bucket; and calculating the selection proportion of the extracted samples in each hash bucket, and selecting representative samples. The invention provides an efficient resampling method HBU for the class imbalance problem in a large-scale pathological histological image classification task. The pairwise distance between the samples does not need to be calculated, and the invention has high efficiency and high expansibility.

Description

technical field [0001] The invention relates to the technical field of image analysis, in particular to a hash sample balanced cancer labeling method for histopathological images. Background technique [0002] Early diagnosis of cancer based on ultra-high resolution pathological images of patients plays an important role in medicine. Existing methods mainly divide the original histopathological image into a large number of image blocks, and then judge whether the image blocks are cancer tissue images, so as to realize accurate judgment and location of cancer lesions. However, the number of normal tissue image blocks in reality often far exceeds that of cancer tissue block images, resulting in an imbalance problem in the dataset, making it difficult to effectively train a cancer tissue image classifier. Therefore, this paper proposes a hash sample balanced cancer labeling method for histopathological images. Based on the hash method, a balanced training set is extracted to t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08G06T7/00
CPCG06N3/088G06N3/08G06T7/0012G06T2207/20081G06T2207/20084G06T2207/30096G06V2201/03G06N3/045G06F18/213G06F18/2155G06F18/214
Inventor 吴永贤丘林田星张建军王婷余洪华
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products