Text recognition method and device, electronic equipment and storage medium

A text recognition and text technology, applied in unstructured text data retrieval, text database clustering/classification, electronic digital data processing, etc., can solve the problem that the accuracy of topic keywords is not very high, reduce the accuracy of topic key sentences, etc. question

Active Publication Date: 2019-08-16
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD +1
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, since the accuracy of the topic keywords extracted by the unsupervised keyword screening method is not very high, the accuracy of extracting the topic key sentences of each article is greatly reduced, so that when users look up articles, the topics they view The key sentence is not necessarily the actual topic key sentence of the article

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text recognition method and device, electronic equipment and storage medium
  • Text recognition method and device, electronic equipment and storage medium
  • Text recognition method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0087] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the specification. However, this specification can be implemented in many other ways different from those described here, and those skilled in the art can make similar extensions without violating the connotation of this specification, so this specification is not limited by the specific implementations disclosed below.

[0088] Terms used in one or more embodiments of this specification are for the purpose of describing specific embodiments only, and are not intended to limit one or more embodiments of this specification. As used in one or more embodiments of this specification and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of the present specification refers t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a text recognition method and device, electronic equipment and a storage medium. The text recognition method comprises the steps of acquiring a text set of multiple texts; extracting theme keywords of the text in the text set, and obtaining actual theme keywords extracted from at least one text in the text set; determining first distribution of the theme keywords in each text in the text set and second distribution of the practical theme keywords in each text in the text set; inputting the texts in the text set carrying the first distribution and the second distributioninto a classifier for recognition, and obtaining key sentences and non-key sentences of the texts in the text set. Through the text recognition method, the key sentences and the non-key sentences of the text can be rapidly and accurately obtained according to the technical scheme. The non-key sentences of the text are cleaned, the key sentences of the text are conveniently marked. Construction efficiency of a knowledge graph is improved. The key sentences of the text are reserved, so that a user can rapidly know the main content of the text when looking up the text.

Description

technical field [0001] This specification relates to the technical field of natural language processing, in particular to a text recognition method. This specification also relates to a text recognition device, an electronic device, and a computer-readable storage medium. Background technique [0002] With the development of Internet technology, obtaining the required information through the Internet is a means that everyone often uses. When users query information in the same field through the Internet, for the convenience of users, they can quickly understand the theme of each article when querying information. , by filtering and displaying the topic key sentences of each article to the user, the user can know whether each article contains the required information by viewing the topic key sentences. [0003] In the prior art, when extracting the topic key sentences of each article, there are many methods that can be realized. The topic keywords of each article can be extr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/36G06F17/27
CPCG06F16/35G06F16/367G06F40/216
Inventor 李长亮樊骏锋汪美玲唐剑波
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products