Unlock instant, AI-driven research and patent intelligence for your innovation.

Knowledge graph-based intention labeling method for mass live broadcast bullet screen data

A knowledge map and data technology, applied in the field of knowledge thesaurus construction and data labeling, can solve the problems of high loss rate in spoken language scenes, scattered results, inapplicability, etc., and achieve the effect of good promotion and use, easy conversion, safe and convenient use

Pending Publication Date: 2021-09-17
上海适享文化传播有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, these solutions are currently not applicable to colloquial barrage texts. NLP analysis requires complete grammatical sentences, and the main application scenario is written texts. The word vector method has certain effects, but it can only be divided into positive and negative. The keyword method needs to hit the keyword, the loss rate of the spoken language scene is very high, and the results obtained are scattered

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Knowledge graph-based intention labeling method for mass live broadcast bullet screen data
  • Knowledge graph-based intention labeling method for mass live broadcast bullet screen data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0030] Example: such as Figure 1-2 As shown, the present invention provides a technical solution based on a knowledge map to label a large number of live barrage data intentions, including the following steps:

[0031] S1. Extract keywords according to the barrage information and summarize the dimensions of keywords;

[0032] S2, expand the keyword homonym synonym;

[0033] S3. The combination of multiple dimensions serves as a template for a specific intent;

[0034] S4, barrage data deduplication, invalid data removal;

[0035] S5. Bullet screen data extracts opinion intentions through templates;

[0036] S6. Manual checking and checking to remove wrong data.

[0037] According to the above technical solution, in S1, a database is established for the bullet chat information, and the words in the bullet chat are expanded and marked, and the text sentiment is analyzed through the word vector, and the negative words and positive words are easy to identify from the text, an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a knowledge graph-based intention labeling method for mass live broadcast bullet screen data. The method comprises the following steps: S1, extracting keywords according to bullet screen information and summarizing dimensions of the keywords; S2, expanding homophones and synonyms of the keywords; S3, combining a plurality of dimensions to serve as a template of a specific intention; S4, carrying out deduplication of bullet screen data and removal of invalid data; S5, extracting viewpoint intentions of the bullet screen data through a template; and S6, carrying out manual checking to remove error data. According to the method, the structure is scientific and reasonable, the use is safe and convenient, the defects that the data volume is large, the labor efficiency is low, the effect of a traditional NLP in bullet screen analysis is poor and the like are overcome, and semantics in the data are defined, so that bullet screen data de-duplication is facilitated, the workload is reduced, and the method is suitable for better popularization and application.

Description

technical field [0001] The invention relates to the technical field of building knowledge thesaurus and data labeling, and specifically relates to a method for labeling intents of a large number of live barrage data based on knowledge graphs. Background technique [0002] Bullet chat can give the audience an illusion of "real-time interaction". Although the sending time of different bullet chat is different, it will only appear at a specific time point in the video, so the bullet chat sent at the same time is basically the same. With the same theme, when participating in the comment, there will be an illusion of commenting with other viewers at the same time, while the traditional player comment system is independent of the player, so the content of the comment is mostly around the entire video, and the topic is not. Strong, and there is no sense of "real-time interaction"; [0003] On the e-commerce service platform, there are customer comments on products or services. The...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/169G06F40/186G06F40/247G06F40/289G06F40/30G06F16/215G06F16/74G06F16/75G06F16/78
CPCG06F40/169G06F40/186G06F40/247G06F40/289G06F40/30G06F16/215G06F16/74G06F16/75G06F16/7867
Inventor 李抒雁沙涛
Owner 上海适享文化传播有限公司