Unlock instant, AI-driven research and patent intelligence for your innovation.

Text core word identification method and device

A recognition method and a technology of a recognition device, which are applied in the Internet field, can solve the problems that the accuracy of core words needs to be further improved, and achieve the effect of improving the recognition accuracy and improving the accuracy

Active Publication Date: 2018-07-27
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF4 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Although there are existing solutions for automatically extracting core words in texts, the accuracy of the identified core words needs to be further improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text core word identification method and device
  • Text core word identification method and device
  • Text core word identification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for ease of description, only parts related to the invention are shown in the drawings.

[0026] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0027] Please refer to figure 1 , which shows an exemplary system architecture 100 to which the embodiments of the present application can be applied.

[0028] Such as figure 1 As shown, the system architecture 100 may include terminal devices 101 , 102 , a network 103 and servers 104 , 105 , 10...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text core word identification method and device. The method comprises the steps of training to obtain a condition random field (CRF) model according to a primary sample text,primary preferential core words of the primary sample text and entity features of the primary preferential core words; using the CRF model and at least one keyword extracting algorithm to select secondary preferential core words of a secondary sample text respectively for keywords extracted from the secondary sample text; according to the secondary sample text and the secondary preferential corewords, training a deep neural network model according to the secondary sample text and the secondary preferential core words to obtain a text core word identification model; using the text core word identification model to identify core words of a target text. By means of the text core word identification method and device, through optimization of a training sample from multiple layers, the identification accuracy of the text core word identification model is improved, so that the accuracy of the extracted core words is improved.

Description

technical field [0001] The present disclosure generally relates to the field of Internet technology, and specifically relates to a text core word recognition method and device. Background technique [0002] With the development of computer and network technology, the number of digitized files is increasing at an alarming rate. People spend a lot of time and energy reading and searching for documents every day. In order to save time and improve people's work efficiency, various concise representations of original documents (such as abstracts, keywords, core words, etc.) have emerged as the times require. Core words are defined as words that compress important information and core content of the original text. People can use it to quickly understand the general content of the text without reading the full text. In information retrieval, core words are often used by us to find content-related text or pictures and videos carrying text. [0003] For example, through keyword-b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/284G06F40/289
Inventor 骆彬尹存祥徐国强钟辉强秦首科
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD