Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Hot video real-time finding method and device based on user query logs

A user and log technology, applied in the direction of video data query, video data retrieval, special data processing applications, etc., can solve problems such as unable to reflect semantic associations, cannot be segmented, and unsatisfactory results, etc., to achieve simple and efficient engineering implementation, Avoid combination explosion and improve efficiency

Inactive Publication Date: 2017-06-30
ALIBABA (CHINA) CO LTD
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

One of the difficulties encountered when analyzing user logs is that new terms and hotspots will continuously emerge in the daily user query logs, such as "European Cup", "Corridor Faye Wong and Liu Meilin", etc., but the original word segmentation program cannot reflect Semantic associations of these new words, that is, it is possible to split the strings that should be semantically connected together to form a word into multiple words
The word cutting program generally adopts a vocabulary-based method, that is, scans a string according to a predetermined vocabulary, and finds a most suitable word cutting method through a certain matching method (forward maximum, reverse maximum, two-way matching, etc.). The disadvantage of this method is that it is impossible to segment words that are not included in the original vocabulary, that is, new words
This defect may lead to unsatisfactory results of fuzzy matching (that is, only part of the query words are matched when searching)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hot video real-time finding method and device based on user query logs
  • Hot video real-time finding method and device based on user query logs
  • Hot video real-time finding method and device based on user query logs

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to make the above-mentioned purposes, features and advantages of the present invention more obvious and understandable, the present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments:

[0034] Because real-time hot spots have the characteristics of a large number of searches in a relatively short period of time, it is most likely to discover new hot words and hot events by analyzing the latest user query logs, so as to improve the real-time response of search ranking results. figure 1 It is an implementation principle diagram of the method for discovering hotspot videos in real time based on user query logs in the present invention; figure 1 As shown, the present invention inputs user query logs within a period of time into the word segmentation program to obtain the word segmentation results of each user query. The extracted words here are called atomic words. Then, on this basis, count th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Disclosed are a hot video real-time finding method and device based on user query logs. The method comprises the steps of firstly, conducting word segmentation on the user video query logs in a period of time and obtaining atomic words; then, counting times of each atomic word occurs in the user video query logs in the period of time and times of any two of the atomic words occur in the same user query at the same time; according to a obtained time value, adopting a pointwise mutual information (PMI) method to calculate the correlation degree between any two of the atomic words in the user query logs, and merging any two of the atomic words of which the correlation degree is larger than a threshold value into a compound word and putting the compound word into a compound word list; finally, descendingly sorting the compound words, and at last, using the compound words sorted in the front as key words of hot video real-time finding according to a certain proportion.

Description

[0001] This application is a divisional application of Chinese patent application 201210525735.7 with a filing date of December 7, 2012 and an invention title of "A Method and Device for Real-Time Discovery of Hot Videos Based on User Query Logs". technical field [0002] The invention belongs to the technical field of statistical analysis of Internet data, in particular to a method and device for real-time discovery of hot videos based on user query logs. Background technique [0003] With the rapid development of the Internet, users have put forward higher requirements for video search results, which not only need to be relevant, but also have high real-time performance, which makes real-time search more and more important. Real-time video search refers to the instant and fast search of information in the video library to achieve the effect of instant search. Through real-time search, users can obtain first-hand information on hot events in the first time. However, compar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/73G06F16/951
Inventor 李力行姚键潘柏宇卢述奇尹玉宗
Owner ALIBABA (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products