Unlock instant, AI-driven research and patent intelligence for your innovation.

Theme extraction method and device, terminal and storage medium

An extraction method and topic technology, applied in the field of terminals and storage media, topic extraction methods, and devices, can solve problems such as the inability to recognize single hashtags, and achieve the effects of improving accuracy, reducing the probability of missing tags and over-extracting

Pending Publication Date: 2021-12-24
NETEASE (HANGZHOU) NETWORK CO LTD
View PDF14 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In some applications that use the double pound sign extraction mode, the content between the two pound signs will be extracted as a label only if it starts with a pound sign and ends with a pound sign. If only the beginning or end contains a pound sign, Tags are not extracted; however, single hashtags cannot be recognized and processed directly as normal text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Theme extraction method and device, terminal and storage medium
  • Theme extraction method and device, terminal and storage medium
  • Theme extraction method and device, terminal and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by those skilled in the art without making creative efforts belong to the scope of protection of this application.

[0062] Embodiments of the present application provide a topic extraction method, device, terminal, and storage medium. Specifically, this embodiment provides a topic extraction method suitable for a topic extraction device, and the topic extraction device can be integrated in a computer device.

[0063] Wherein, the computer equipment may be equipment such as a terminal, such as a smart phone, a tablet computer, a notebook computer, a touch screen, a game console, a personal computer (PC, Perso...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a theme extraction method and device, a terminal and a storage medium, and the method comprises the steps: obtaining a to-be-extracted text, responding to a tag start symbol recognized in the to-be-extracted text, and determining whether a preset tag end symbol exists after the tag start symbol; if a label end symbol exists after the label start symbol, determining the text content between the label start symbol and the label end symbol as a target theme of the to-be-extracted text; if the type of the label ending symbol is the same as that of the label starting symbol, determining whether a new label ending symbol exists after the label ending symbol or not; and if yes, determining the text content between the label ending symbol and the new label ending symbol as the target theme of the to-be-extracted text. According to the scheme, the probability of label omission and excessive extraction can be reduced, and the accuracy of a label extraction result is further improved.

Description

technical field [0001] The present application relates to the field of computer technology, and in particular to a subject extraction method, device, terminal and storage medium. Background technique [0002] In the current mainstream information application programs, there are two ways of double pound sign recognition mode and single pound sign recognition mode. In some applications using the single hash sign extraction mode, each tag starts with a hash sign and ends with a space or punctuation mark. If there are multiple hash signs, each hash sign extracts a tag backwards; but For the double pound tag, it may be over-extracted, and two tags are extracted. The second tag ends with a punctuation, and the normal body content is also wrongly extracted as a long tag. In some applications that use the double pound sign extraction mode, the content between the two pound signs will be extracted as a label only if it starts with a pound sign and ends with a pound sign. If only the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/258G06F40/30G06F40/166
CPCG06F40/258G06F40/30G06F40/166
Inventor 王淏淏朱桂华
Owner NETEASE (HANGZHOU) NETWORK CO LTD