Content-based distributed feature extraction method and device, equipment and medium

A feature extraction and distributed technology, applied in the field of big data, can solve the problem that the recommendation system cannot accurately understand the information data, the unified characteristics and extraction of different types of information data, etc.

Active Publication Date: 2020-07-24
TENCENT TECH (SHENZHEN) CO LTD
View PDF13 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, now the forms of news information are diversified and the content information is very rich.
Using the existing representation extraction methods, it is impossible to perform unified feature extraction for different sources and different types of information data, which leads to the inability of the recommendation system to accurately understand different sources and types of information data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Content-based distributed feature extraction method and device, equipment and medium
  • Content-based distributed feature extraction method and device, equipment and medium
  • Content-based distributed feature extraction method and device, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the relevant disclosure, not to limit the disclosure. It should also be noted that, for the convenience of description, only the parts related to the disclosure are shown in the drawings.

[0028] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0029] Explanation of technical terms

[0030] News data is information that can express a specific event. Information is time-sensitive. Generally speaking, what users want to view is news events that occurred in the latest time period. Its types can include articles, graphics, sho...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a content-based distributed feature extraction method and device, equipment and a medium. The method comprises the steps of obtaining multiple different dimensions of content information contained in to-be-processed data, wherein each piece of content information corresponds to one feature dimension; preprocessing each of multiple pieces of content information to obtain a plurality of original feature vectors in one-to-one correspondence with each piece of content information; and calling a distributed feature extraction model to perform feature extraction on the plurality of original feature vectors to obtain distributed features corresponding to the to-be-processed data, wherein the distributed features are results of characterizing the to-be-processed data according to a plurality of feature dimensions defined by a standard feature dimension template, and the standard feature dimension template defines the range of content extraction of the to-be-processed data of different sources and different types. According to the embodiment of the invention, the multiple pieces of content information are mapped to a same vector space by utilizing the distributed feature extraction model, so that an accurate recommendation effect is obtained.

Description

technical field [0001] The present application generally relates to the technical field of big data, and in particular relates to content-based distributed feature extraction methods, devices, equipment and media. Background technique [0002] With the development of electronic devices, more and more people choose to read news information on electronic devices. A personalized news recommendation system based on artificial intelligence usually uses machine learning algorithms, especially neural networks, to extract features from news content. For example, representation extraction based on collaborative information, or representation extraction based on content information, or representation extraction based on a combination of the former two. The former relies on the user's interaction information (such as click, favorite, etc.), which relies on the user's interactive operation. The latter relies on the information of the news content (including title, author, text, body, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/335G06F16/35G06F16/9536G06K9/62G06N20/00
CPCG06F16/335G06F16/35G06F16/9536G06N20/00G06F18/2411
Inventor 白冰张峻旗林也白琨
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products