Book automatic classification device

An automatic book classification technology, applied in the field of information management, that addresses the low efficiency of manual classification and the inability to fully cover book information, and achieves rich classification types, high classification accuracy and high classification efficiency.

Inactive Publication Date: 2018-04-20
SICHUAN JIUDINGZHIYUAN INTPROP OPERATIONS CO LTD
0 Cites 4 Cited by

AI-Extracted Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to: aim at the above existing problems, to provide a solution to automatically realize the rapid classification of books, t...

Abstract

The invention discloses an automatic book classification device comprising an image collector, a text recognizer, a text matching circuit and an information extraction circuit. The text recognizer comprises an image processing circuit, a feature value extraction circuit and a feature value matching circuit connected in sequence. The feature value matching circuit includes a paragraph dividing module circuit and a library matching module circuit, the latter connected to both the paragraph dividing module circuit and a character feature value library. The feature value extraction circuit includes an image projection module circuit, an image preprocessing module circuit and a feature value extraction module circuit connected in sequence. The device recognizes the characters in a book cover image, matches them to the corresponding book attribute information in a book database, and then extracts the book classification information. It thereby provides a rich basis of classification items, with many classification categories, wide information coverage, high classification accuracy and high classification efficiency.

Application Domain

Relational databases, Character and pattern recognition

Technology Topic

Information extraction, Image preprocessing

Examples

Example Embodiment

[0046] All the features disclosed in this specification, or all disclosed methods or steps in the process, except for mutually exclusive features and/or steps, can be combined in any manner.
[0047] Any feature disclosed in this specification (including any accompanying claims and abstract), unless specifically stated otherwise, can be replaced by other equivalent or alternative features serving a similar purpose. That is, unless otherwise stated, each feature is just one example of a series of equivalent or similar features.
[0048] As shown in Figure 1, this embodiment discloses an automatic book classification device comprising an image collector, a text recognizer, a text matching circuit, and an information extraction circuit connected in sequence, in which:
[0049] The image collector is configured to: collect the image data of the book cover and pass it to the text recognizer;
[0050] The text recognizer is configured to: recognize the book cover text in the image data, and output the cover text information to the text matching circuit;
[0051] The text matching circuit is configured to: receive the cover text information, look up the matching book attribute information in the book database according to the cover text information, and output that book attribute information to the information extraction circuit;
[0052] The information extraction circuit is configured to extract the classification information of the book from the book attribute information.
[0053] Preferably, the classification information includes one or more of the subject classification information of the book, the price classification information of the book, the audience classification information of the book, or the evaluation grade classification information of the book.
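The four-stage flow in [0048] to [0053] can be sketched in software as follows. This is only an illustrative sketch, not the circuit disclosed here: the `BookAttributes` record layout, the price bands, the rating grades, and the function names are all assumptions introduced for the example, and the recognizer and matcher are passed in as plain callables.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class BookAttributes:
    # Hypothetical record layout; the source only says "book attribute information".
    title: str
    subject: str
    price: float
    audience: str
    rating: float


def extract_classification(attrs: BookAttributes) -> dict:
    """Information extraction stage: derive classification fields from attributes."""
    # The price bands and rating grades are illustrative thresholds only.
    if attrs.price < 20:
        price_band = "low"
    elif attrs.price < 50:
        price_band = "medium"
    else:
        price_band = "high"
    if attrs.rating >= 4.0:
        grade = "A"
    elif attrs.rating >= 3.0:
        grade = "B"
    else:
        grade = "C"
    return {
        "subject": attrs.subject,
        "price_band": price_band,
        "audience": attrs.audience,
        "rating_grade": grade,
    }


def classify_book(
    image: bytes,
    recognize: Callable[[bytes], List[str]],
    match: Callable[[List[str]], Optional[BookAttributes]],
) -> Optional[dict]:
    """Chain the stages: collected image -> cover text -> attributes -> classification."""
    cover_text = recognize(image)   # text recognizer
    attrs = match(cover_text)       # text matching circuit
    if attrs is None:
        return None                 # no database entry matched the cover text
    return extract_classification(attrs)  # information extraction circuit
```

Passing the recognizer and matcher as parameters keeps each stage replaceable, which mirrors the sequential connection of independent units described above.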
[0054] As shown in Figure 2, in one embodiment, the above-mentioned text recognizer includes an image processing circuit, a feature value extraction circuit, and a feature value matching circuit that are sequentially connected, wherein:
[0055] The image processing circuit is configured to: preprocess the image data, and output a book cover image to the feature value extraction circuit;
[0056] The feature value extraction circuit is configured to: extract the feature value of the book cover image and output it to the feature value matching circuit;
[0057] The feature value matching circuit is configured to: match the received feature values to the corresponding characters in the character feature value library, and output the cover text information.
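The source does not say which feature values are extracted or how they are compared with the library. As one possible illustration, the sketch below assumes a simple zoning feature (the fraction of ink pixels in each cell of a grid laid over the glyph) and a nearest-neighbour match by squared distance. The names `zone_features` and `match_character` are hypothetical.

```python
from typing import Dict, List, Tuple

Binary = List[List[int]]  # 1 = ink, 0 = background


def zone_features(glyph: Binary, zones: int = 2) -> Tuple[float, ...]:
    """Split the glyph into zones x zones cells and return the ink density of each."""
    h, w = len(glyph), len(glyph[0])
    feats = []
    for zy in range(zones):
        for zx in range(zones):
            ys = range(zy * h // zones, (zy + 1) * h // zones)
            xs = range(zx * w // zones, (zx + 1) * w // zones)
            cells = [glyph[y][x] for y in ys for x in xs]
            feats.append(sum(cells) / len(cells) if cells else 0.0)
    return tuple(feats)


def match_character(features: Tuple[float, ...],
                    library: Dict[str, Tuple[float, ...]]) -> str:
    """Return the library character whose stored features are closest to `features`."""
    def distance(ch: str) -> float:
        return sum((a - b) ** 2 for a, b in zip(features, library[ch]))
    return min(library, key=distance)
```

A nearest-neighbour lookup tolerates small amounts of noise in the glyph, since a few flipped pixels shift the densities only slightly.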
[0058] Preferably, the preprocessing includes cover image positioning, edge extraction, and binarization, optionally followed by morphological processing.
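Two of the preprocessing steps named above, binarization and morphological processing, can be sketched in pure Python. The fixed threshold of 128 and the 3x3 neighbourhood are assumptions chosen for the example; cover image positioning and edge extraction are not shown.

```python
from typing import List

Gray = List[List[int]]    # grayscale pixels, 0..255
Binary = List[List[int]]  # 1 = ink (dark), 0 = background


def binarize(img: Gray, threshold: int = 128) -> Binary:
    """Mark pixels darker than the threshold as ink."""
    return [[1 if px < threshold else 0 for px in row] for row in img]


def _morph(img: Binary, use_max: bool) -> Binary:
    """Apply a 3x3 max (dilation) or min (erosion) filter, clipped at the borders."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [
                img[ny][nx]
                for ny in range(max(0, y - 1), min(h, y + 2))
                for nx in range(max(0, x - 1), min(w, x + 2))
            ]
            out[y][x] = max(window) if use_max else min(window)
    return out


def dilate(img: Binary) -> Binary:
    return _morph(img, use_max=True)


def erode(img: Binary) -> Binary:
    return _morph(img, use_max=False)


def close_gaps(img: Binary) -> Binary:
    """Morphological closing (dilate, then erode): fills small holes in strokes."""
    return erode(dilate(img))
```

Because the window is clipped at the image border, erosion in this sketch does not shrink ink that touches the border; a production implementation would normally pad the image instead.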
[0059] In a specific embodiment, the image collector is a camera device or a scanning device.
[0060] As shown in Figure 3, the feature value matching circuit includes a paragraph dividing module circuit and a library matching module circuit, the latter connected to both the paragraph dividing module circuit and the character feature value library, wherein:
[0061] The library matching module circuit is configured to: sequentially match the received feature values to corresponding characters in the character feature value library, and output the recognized characters to the paragraph dividing module circuit;
[0062] The paragraph dividing module circuit is configured to: receive the text sent by the library matching module circuit, divide the received text into several paragraphs according to the layout of the text in the book cover image, and output the cover text information divided into several paragraphs;
[0063] The text matching circuit is configured to: match the received cover text information against the book database paragraph by paragraph, in order; as soon as a paragraph matches book attribute information in the book database, stop matching subsequent paragraphs and output the book attribute information to the information extraction circuit.
[0064] In one embodiment, the above-mentioned paragraph dividing module circuit is configured to: insert interval identifiers wherever the text layout in the book cover image is discontinuous;
[0065] The text matching circuit is configured to: receive the cover text information and match the text between each pair of consecutive interval identifiers against the book database in turn; as soon as one such text segment matches book attribute information in the book database, stop matching subsequent text and output the book attribute information to the information extraction circuit.
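The interval identifiers and the stop-on-first-match behaviour can be illustrated as follows. This sketch assumes each recognized character arrives with a line index and a horizontal position, and that a gap larger than `max_gap` or a change of line counts as a layout discontinuity. The separator character and all names are hypothetical.

```python
from typing import Callable, List, Optional, Tuple

SEP = "\x1f"  # hypothetical interval identifier (ASCII unit separator)

Glyph = Tuple[str, int, int]  # (character, line index, x position)


def insert_separators(glyphs: List[Glyph], max_gap: int = 2) -> str:
    """Join characters, inserting SEP wherever the layout is discontinuous."""
    out: List[str] = []
    prev: Optional[Glyph] = None
    for ch, line, x in glyphs:
        if prev is not None and (line != prev[1] or x - prev[2] > max_gap):
            out.append(SEP)
        out.append(ch)
        prev = (ch, line, x)
    return "".join(out)


def match_first_paragraph(
    cover_text: str,
    lookup: Callable[[str], Optional[dict]],
) -> Optional[dict]:
    """Try each paragraph in order and return at the first database hit."""
    for paragraph in cover_text.split(SEP):
        record = lookup(paragraph)
        if record is not None:
            return record  # later paragraphs are never looked up
    return None
```

Returning at the first hit is what saves work: on a typical cover the title appears early, so the publisher and author paragraphs often never need to be matched.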
[0066] In a specific embodiment, the above-mentioned book database includes a book name item, a publisher item, and an author item that are related to each other, and the related book name item, publisher item, and author item correspond to the same book attribute information;
[0067] The text matching circuit is configured to: receive the cover text information, and when the text between two consecutive interval identifiers matches a corresponding entry under the book name item, publisher item, or author item of the book database, extract the book attribute information corresponding to the matched entry and output it to the information extraction circuit.
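One way to model a database whose book name, publisher, and author items all point at the same attribute record is a set of per-field indexes over a shared record list. The class below is an assumed in-memory stand-in, not the database described in the source. In particular, the source does not say how a publisher or author shared by several books is disambiguated; this sketch simply returns the first record added.

```python
from typing import Dict, List, Optional

FIELDS = ("title", "publisher", "author")


class BookDatabase:
    """Hypothetical in-memory stand-in for the book database."""

    def __init__(self) -> None:
        self._records: List[dict] = []
        self._index: Dict[str, Dict[str, List[int]]] = {f: {} for f in FIELDS}

    def add(self, title: str, publisher: str, author: str, attributes: dict) -> None:
        """Store one attribute record and link it from all three items."""
        record_id = len(self._records)
        self._records.append(attributes)
        for field, value in zip(FIELDS, (title, publisher, author)):
            self._index[field].setdefault(value, []).append(record_id)

    def lookup(self, text: str) -> Optional[dict]:
        """Return the attributes linked to `text` under any item, or None."""
        for field in FIELDS:
            ids = self._index[field].get(text)
            if ids:
                # Assumption: take the first linked record when several share a value.
                return self._records[ids[0]]
        return None
```

Checking the title index first reflects the fact that a title usually identifies a single book, while a publisher or author rarely does.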
[0068] As shown in Figure 4, in one embodiment, the feature value extraction circuit includes an image projection module circuit, an image preprocessing module circuit, and a feature value extraction module circuit that are sequentially connected, wherein:
[0069] The image projection module circuit is connected to the image processing circuit and is configured to: project the book cover image in the horizontal or vertical direction, and divide it into several image blocks;
[0070] The image preprocessing module circuit is configured to: perform preprocessing on the several image blocks and output several binarized image blocks;
[0071] The feature value extraction module circuit is configured to: sequentially extract the feature values of the several binarized image blocks, and sequentially output the extracted feature values to the feature value matching circuit.
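Projection-based division can be sketched as follows, under the assumption that the image is already binary and that blocks are separated by rows or columns containing no ink. The source projects first and binarizes the blocks afterwards, so this is a simplification, and the function names are illustrative.

```python
from typing import List, Tuple

Binary = List[List[int]]  # 1 = ink, 0 = background


def row_profile(img: Binary) -> List[int]:
    """Horizontal projection: ink count of each row."""
    return [sum(row) for row in img]


def column_profile(img: Binary) -> List[int]:
    """Vertical projection: ink count of each column."""
    return [sum(col) for col in zip(*img)]


def runs(profile: List[int]) -> List[Tuple[int, int]]:
    """Return half-open (start, end) spans where the profile is non-zero."""
    spans: List[Tuple[int, int]] = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i
        elif value == 0 and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def split_rows(img: Binary) -> List[Binary]:
    """Cut the image into blocks separated by empty rows (for example, text lines)."""
    return [img[a:b] for a, b in runs(row_profile(img))]


def split_columns(img: Binary) -> List[Binary]:
    """Cut the image into blocks separated by empty columns (for example, characters)."""
    return [[row[a:b] for row in img] for a, b in runs(column_profile(img))]
```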
[0072] Further, the feature value matching circuit sequentially recognizes the text corresponding to the feature values of each binarized image block output by the feature value extraction module circuit, and outputs it to the text matching circuit as cover text information;
[0073] The text matching circuit is also configured to: sequentially match the text between each pair of interval identifiers in the cover text information sent by the feature value matching circuit against the entries under the book name item, publisher item, or author item of the book database; when a match is found, extract the book attribute information corresponding to the matched entry, output it to the information extraction circuit, and send a processing stop signal to the image preprocessing module circuit, so that the image preprocessing module circuit stops processing subsequent image blocks.
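A software analogue of the processing stop signal is lazy evaluation: if each stage pulls blocks from the previous one only on demand, returning at the first match means later blocks are never preprocessed. The sketch below uses Python generators, with strings standing in for image blocks. It models the effect of the stop signal rather than an explicit signal line, and every name in it is an assumption.

```python
from typing import Callable, Iterable, Iterator, List, Optional


def preprocess_blocks(blocks: Iterable[str], log: List[str]) -> Iterator[str]:
    """Stand-in for the image preprocessing module; `log` records processed blocks."""
    for block in blocks:
        log.append(block)            # this block really was preprocessed
        yield block.strip().lower()  # placeholder for binarization


def recognize_blocks(blocks: Iterable[str]) -> Iterator[str]:
    """Stand-in for feature extraction and matching: passes the text through."""
    for block in blocks:
        yield block


def first_match(
    texts: Iterable[str],
    lookup: Callable[[str], Optional[dict]],
) -> Optional[dict]:
    """Consume recognized text lazily and stop at the first database hit."""
    for text in texts:
        record = lookup(text)
        if record is not None:
            return record  # upstream generators are not advanced any further
    return None
```

With three blocks and a hit on the first, only the first block appears in `log`, which corresponds to the preprocessing module skipping the remaining image blocks.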
[0074] Preferably, the book database is an authorized book publisher database or an authorized book agent database.
[0075] The present invention is not limited to the foregoing specific embodiments. The present invention extends to any new feature or any new combination disclosed in this specification, and any new method or process step or any new combination disclosed.
