Identification method and device for garbage barrages and computer equipment

A spam barrage identification technology, applied in computing, special data processing applications, instruments, etc., that addresses the large volume of spam barrage information, the low accuracy of spam barrage recognition, and the resulting reduction in user participation

Active Publication Date: 2017-12-15
WUHAN DOUYU NETWORK TECH CO LTD
Cites: 4 | Cited by: 21

AI Technical Summary

Problems solved by technology

[0005] To address the problems in the prior art, the embodiments of the present invention provide a spam barrage identification method, device, and computer equipment. They target the following technical problem: during live broadcasts on a live broadcast platform, spam barrage is identified with low accuracy, so a large amount of spam barrage information appears in the live broadcast room, which directly reduces user participation and ultimately reduces the number of users of the platform.

Examples

Embodiment 1

[0056] This embodiment provides a method for identifying spam barrage. As shown in Figure 1, the method includes:

[0057] S101. Based on the preset barrage information feature construction rules, perform feature extraction on the barrage information to obtain first barrage information;

[0058] In this step, before feature extraction is performed on the barrage information, SQL query statements are used to extract barrage information labeled with a barrage type from the relational database HIVE as sample data. The barrage information includes the barrage content and the barrage type.
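The extraction of labeled samples could look roughly like the following sketch. It is only an illustration: the source does not name the table or its columns, so `labeled_barrage`, `content`, and `barrage_type` are hypothetical, and an in-memory `sqlite3` database stands in for HIVE so that the example runs on its own.

```python
import sqlite3

# Hypothetical schema: the source names neither the table nor its columns.
QUERY = """
SELECT content, barrage_type
FROM labeled_barrage
WHERE barrage_type IS NOT NULL
"""


def load_samples(conn):
    """Return (content, type) pairs for every barrage row that carries a label."""
    return conn.execute(QUERY).fetchall()


def demo_connection():
    """Build a tiny in-memory table; sqlite3 is only a stand-in for HIVE."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE labeled_barrage (content TEXT, barrage_type TEXT)")
    conn.executemany(
        "INSERT INTO labeled_barrage VALUES (?, ?)",
        [
            ("nice play", "normal"),
            ("add qq for free gifts", "spam"),
            ("not yet labeled", None),  # skipped: no barrage type
        ],
    )
    return conn


samples = load_samples(demo_connection())
```

Each returned pair carries the two fields the paragraph names, the barrage content and the barrage type; rows without a type are left out of the sample data.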

[0059] Then, based on the preset barrage information feature construction rules, feature extraction is performed on the barrage information to obtain the first barrage information. The feature construction rule includes: using a specific identifier to represent a word conforming to a certain type of feature.
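A feature construction rule of this kind can be sketched as a list of pattern and identifier pairs. The concrete feature types and identifiers below (`<URL>` for links, `<NUM>` for long digit runs) are assumptions made for illustration; the visible text only states that a specific identifier represents words of a given feature type.

```python
import re

# Hypothetical rules: each pair is (pattern for a feature type, identifier).
# URLs are handled first so that digits inside a link are not replaced separately.
FEATURE_RULES = [
    (re.compile(r"https?://\S+|www\.\S+"), " <URL> "),
    (re.compile(r"\d{5,}"), " <NUM> "),
]


def apply_feature_rules(text):
    """Replace every match of each rule with its identifier, then tidy spaces."""
    for pattern, identifier in FEATURE_RULES:
        text = pattern.sub(identifier, text)
    return " ".join(text.split())
```

Under these assumed rules, `apply_feature_rules("add 123456789 now")` yields `"add <NUM> now"`, so differently numbered variants of the same message collapse into one form before segmentation.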

[0060] Specifically, because there are...

Embodiment 2

[0092] Corresponding to Embodiment 1, this embodiment provides a device for identifying spam barrage. As shown in Figure 2, the device includes: an extraction unit 21, a preprocessing unit 22, a word segmentation unit 23, a conversion unit 24, a weighting unit 25, an establishment unit 26, and a judgment unit 27; wherein,

[0093] Before the extraction unit 21 performs feature extraction on the barrage information, SQL query statements are used to extract barrage information labeled with a barrage type from the relational database HIVE as sample data. The barrage information includes the barrage content and the barrage type.

[0094] The extraction unit 21 then performs feature extraction on the barrage information based on the preset barrage information feature construction rules to obtain the first barrage information. The feature construction rule includes: using a specific identifier to represent a word conforming to a certain type of f...

Embodiment 3

[0122] This embodiment also provides computer equipment for spam barrage identification. As shown in Figure 3, the computer device includes components such as a radio frequency (RF) circuit 310, a memory 320, an input unit 330, a display unit 340, an audio circuit 350, a WiFi module 360, a processor 370, and a power supply 380. Those skilled in the art will understand that the structure shown in Figure 3 does not limit the computer device, which may include more or fewer components than illustrated, combine some components, or arrange the components differently.

[0123] Each component of the computer device is described in detail below with reference to Figure 3:

[0124] The RF circuit 310 can be used to receive and send signals. In particular, after receiving downlink information from a base station, it passes the information to the processor 370 for processing. Generally, the RF c...

Abstract

The invention provides a method, a device, and computer equipment for identifying spam barrage. The method comprises: performing feature extraction on barrage information based on preset barrage information feature construction rules to obtain first barrage information; segmenting the first barrage information into words according to the word formation rules in a custom dictionary of the live broadcast platform to form a bag-of-words model; converting the bag-of-words model into a word vector based on preset mapping rules, and applying term frequency-inverse document frequency (TF-IDF) weighting to the words in the word vector to obtain a TF-IDF weight for each word; establishing a naive Bayes model and, based on the TF-IDF weights, using the naive Bayes model to calculate a first probability P1 that the barrage information is spam barrage and a second probability P2 that it is normal barrage, each conditioned on all of the words appearing; and judging whether P1 is greater than P2 and, if so, determining that the barrage information is spam barrage.
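The weighting and decision steps can be illustrated with a small, dependency-free sketch. Several choices here are assumptions rather than details taken from the patent: whitespace splitting stands in for segmentation with the platform's custom dictionary, IDF is smoothed, TF-IDF weights are used as fractional counts in a multinomial naive Bayes model with add-one smoothing, and P1 and P2 are compared in log space. The class name `TfidfNaiveBayes` and the toy training messages are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: whitespace splitting stands in for dictionary-based segmentation.
    return text.lower().split()


def idf_table(docs):
    """Smoothed inverse document frequency for every term in the training docs."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}


def tfidf(doc, idf):
    """Map a token list to {term: tf * idf}; terms unseen in training are dropped."""
    counts = Counter(doc)
    return {t: (c / len(doc)) * idf[t] for t, c in counts.items() if t in idf}


class TfidfNaiveBayes:
    def fit(self, texts, labels):
        docs = [tokenize(t) for t in texts]
        self.idf = idf_table(docs)
        self.vocab_size = len(self.idf)
        self.class_docs = Counter(labels)
        self.weight = defaultdict(Counter)  # class -> term -> summed TF-IDF
        for doc, label in zip(docs, labels):
            self.weight[label].update(tfidf(doc, self.idf))
        return self

    def log_score(self, text, label):
        """log P(label) + sum over terms of tfidf * log P(term | label)."""
        total = sum(self.weight[label].values()) + self.vocab_size
        score = math.log(self.class_docs[label] / sum(self.class_docs.values()))
        for term, w in tfidf(tokenize(text), self.idf).items():
            score += w * math.log((self.weight[label][term] + 1.0) / total)
        return score

    def is_spam(self, text):
        p1 = self.log_score(text, "spam")    # log of the unnormalized P1
        p2 = self.log_score(text, "normal")  # log of the unnormalized P2
        return p1 > p2


model = TfidfNaiveBayes().fit(
    ["add qq free gifts", "free gifts visit site",
     "nice play streamer", "what a great game", "streamer is great"],
    ["spam", "spam", "normal", "normal", "normal"],
)
```

With this toy data, `model.is_spam("free gifts add qq")` is true and `model.is_spam("great play")` is false. Both scores share the same normalizing constant, so comparing them is equivalent to comparing P1 with P2 directly.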

Description

Technical field

[0001] The invention belongs to the technical field of spam barrage processing on live broadcast platforms, and in particular relates to a spam barrage identification method, device, and computer equipment.

Background

[0002] With the rapid development of the live broadcast industry, live broadcast audiences keep growing and the range of live broadcast content keeps widening. Viewers can comment and interact by sending barrage while watching a live broadcast, which greatly improves user participation and enriches the live broadcast content.

[0003] Generally, every time a viewer sends a barrage, it is sent to the server of the live broadcast platform, and the server forwards it to all viewers in the live broadcast room. However, in order to obtain benefits, some abnormal users often burst out ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06F17/30, H04N21/475, H04N21/488
CPC: G06F16/30, G06F40/289, H04N21/4756, H04N21/4884
Inventor: 龚灿, 张文明, 陈少杰
Owner: WUHAN DOUYU NETWORK TECH CO LTD