Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Film semantics personalization tag optimizing method based on big data

An optimization method and big data technology, applied in the field of big data analysis, can solve the problems of lack of rigor, waste of data resources, lack of labels for specific movies, etc., and achieve the effect of good practical experience, accurate retrieval, accurate vocabulary or text description

Active Publication Date: 2017-12-26
SICHUAN CHANGHONG ELECTRIC CO LTD
View PDF7 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Because the movie tags on these websites come from user-defined, anyone can define tags on any resource under any circumstances, so these tags are arbitrary and imprecise, and are likely to cause problems such as contradictions and confusion, lacking correctness and reasonableness. Tags will make users get lost in redundant and complicated search results
[0004] Second, there is no personalized label
At present, the tags of movies are concentrated on the general tag set. A specific movie lacks a unique tag, which cannot accurately describe the movie. At the same time, this will make it impossible to search for a specific movie through a unique tag.
[0005] Finally, the waste of data resources, in the existing label optimization methods, very few methods take into account the introduction of the movie, while ignoring a large number of comment resources on the Internet, these resources are also a description of the movie, which will lead to Serious waste of content resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Film semantics personalization tag optimizing method based on big data
  • Film semantics personalization tag optimizing method based on big data
  • Film semantics personalization tag optimizing method based on big data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0045] Such as figure 1 As shown, a method for optimizing movie semantic personalized tags based on big data, the method steps are as follows:

[0046] A, collect the comment information data of movie i and movie j, described comment information data comprises movie brief introduction, movie long commentary and movie short commentary, adopts open source Chinese word segmentation tool to carry out word segmentation processing to comment information data; Set up stop words database, by stop Use the word database to remove the stop words in the comment information data after word segmentation to obtain valid comment data;

[0047] B, calculate word frequency (TF): word frequency (TF) = the number of times that a certain word appears in the effective comment data after step A is processed in a certain comment article, and word frequency (TF) adopts calculation method to calculate:

[0048] Term frequency (TF) = the number of times a certain word appears in the effective comment d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a film semantics personalization tag optimizing method based on big data. The method comprises the steps that foreignized personalization film tags are obtained by mining film criticism data; meanwhile, texts and words are obtained through vectorization of a neural network model, a machine learning model is built according to the similarity of the film synopsis texts, the similarity of the tag words and deviations between custom tags obtained before optimization and custom tags obtained after optimization, and the machine learning model is initialized through the personalization tags. By means of the optimizing method, the optimization of the existing custom tags of films is achieved, and it is achieved that redundant tags are combined, wrong tags are corrected, missing tags are complemented and the personalization tags are complemented; film resources are scientifically and effectively classified and described, a film information retrieval basis is provided, and a series of problems caused by artificial film tags are solved.

Description

technical field [0001] The invention relates to the technical field of big data analysis, in particular to a method for optimizing movie semantic personalized tags based on big data. Background technique [0002] Stimulated by the development of the film and television industry and people's spiritual needs, the types and numbers of movies are increasing day by day, so the description of movies is becoming more and more important. At the same time, with the rapid development of the Internet, more and more shared information appears on various websites. For movies, there are Douban, Tencent and other websites. These sites allow users to comment and define category tags for different movies, not only as a kind of information sharing, but most importantly, it will optimize the process of searching for a specific video in the massive video library. However, with the rapid increase of Internet data, some problems arise, mainly as follows: [0003] First, there is the issue of th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06K9/62
CPCG06F16/48G06F16/9562G06F18/22G06F18/241G06F18/214
Inventor 阳柯刘楚雄唐军
Owner SICHUAN CHANGHONG ELECTRIC CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products