Supercharge Your Innovation With Domain-Expert AI Agents!

Text core word recognition method and device

A recognition method and a technology of a recognition device, which are applied in the Internet field, can solve the problems that the accuracy of core words needs to be further improved, and achieve the effect of improving the recognition accuracy and improving the accuracy

Active Publication Date: 2021-05-14
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Although there are existing solutions for automatically extracting core words in texts, the accuracy of the identified core words needs to be further improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text core word recognition method and device
  • Text core word recognition method and device
  • Text core word recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for ease of description, only parts related to the invention are shown in the drawings.

[0026] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0027] Please refer to figure 1 , which shows an exemplary system architecture 100 to which the embodiments of the present application can be applied.

[0028] Such as figure 1 As shown, the system architecture 100 may include terminal devices 101 , 102 , a network 103 and servers 104 , 105 , 10...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a text core word recognition method and device, wherein the method includes: according to the first-level sample text, the first-level preferred core words of the first-level sample text, and the entity characteristics of the first-level preferred core words , training to obtain a conditional random field CRF model; from the keywords extracted for the secondary sample text by using the CRF model and at least one keyword extraction algorithm, select the secondary preferred core of the secondary sample text words; according to the secondary sample text and the secondary preferred core words, the deep neural network model is trained to obtain a text core word recognition model; and using the text core word recognition model to identify the core words of the target text. By applying this application, the recognition accuracy of the text core word recognition model can be improved through multi-level optimization of training samples, thereby improving the accuracy of the extracted core words.

Description

technical field [0001] The present disclosure generally relates to the field of Internet technology, and specifically relates to a text core word recognition method and device. Background technique [0002] With the development of computer and network technology, the number of digitized files is increasing at an alarming rate. People spend a lot of time and energy reading and searching for documents every day. In order to save time and improve people's work efficiency, various concise representations of original documents (such as abstracts, keywords, core words, etc.) have emerged as the times require. Core words are defined as words that compress important information and core content of the original text. People can use it to quickly understand the general content of the text without reading the full text. In information retrieval, core words are often used by us to find content-related text or pictures and videos carrying text. [0003] For example, through keyword-b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/284G06F40/289
CPCG06F40/284G06F40/289
Inventor 骆彬尹存祥徐国强钟辉强秦首科
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More