Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Quick vertical target recognition and classification method, classification system and classification device

A classification method and classification system technology, which is applied in the field of Internet content identification and classification, can solve the problems of slow speed, artificial inability to achieve comprehensive coverage, narrow coverage, etc.

Pending Publication Date: 2020-10-27
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] According to the above, the main disadvantages of the prior art solutions are the high labor cost, slow speed and narrow coverage of vertical resource identification:
[0010] (1) High labor cost: first screen the sites, and then label the links by the labeling team
Especially for the labeling process of a single link, a lot of manpower is required for multi-day labeling; in the process of delineating the site and expanding, it still needs a lot of manpower consumption; making a recognition model from scratch also requires a lot of large labeling samples, the effect A better deep learning model requires about 100,000 samples;
[0011] (2) Slow speed: In the labeling process, there are generally about a thousand links that can be marked at the day level, and it takes weeks or even months to realize the labeling only to identify vertical resources in a small number of delineated sites; every time the site needs to be expanded, it takes It takes a long period of time to complete the identification of vertical resources in the newly added site; training the model from scratch also takes a long time for labeling and training because of the large amount of data and many model parameters;
[0012] (3) Narrow coverage: firstly, manual delineation of sites can only obtain a small number of well-known large sites. Faced with such a wealth of Internet resources, manual coverage cannot be achieved in all directions; secondly, only part of the resources under many sites belong to the target vertical category , while manual screening and sampling tend to ignore these resources, training the model from scratch can theoretically improve coverage, but due to the large number of samples required and the long cycle, the accuracy of the initial model is low, and the coverage of vertical resources The situation is not ideal, and the time cost directly leads to the inability of the deep learning solution to classify and collect different types of vertical resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Quick vertical target recognition and classification method, classification system and classification device
  • Quick vertical target recognition and classification method, classification system and classification device
  • Quick vertical target recognition and classification method, classification system and classification device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0106] An embodiment of the present invention provides a method for classifying identifiers, the method comprising:

[0107] Firstly, obtain a labeled sample set, wherein the labeled sample set has a vector of known class digital content of known class objects and a vector of known class identifiers of different levels of the known class digital content; secondly, obtain The target class sample set and the target set with the identifier to be classified are obtained, and the identifier to be classified in the target set is identified as the same target class as the target class sample set according to the transfer learning method in combination with the labeled sample set.

[0108] The method also specifically includes:

[0109] S1) Obtain a labeled sample set, wherein the labeled sample set has a vector of known digital content of known type objects and a vector of known class identifiers of different levels of the known digital content;

[0110] An object can have many digi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an identifier classification method, and belongs to the technical field of internet content recognition and classification. The method comprises the steps of obtaining a labeledsample set, the labeled sample set having vectors of known class digital content of known class objects and vectors of known class identifiers of different hierarchies of the known class digital content; and obtaining a target class sample set and a target set with identifiers to be classified, and identifying the identifiers to be classified in the target set as target classes the same as the target class sample set according to a transfer learning method in combination with the labeled sample set.

Description

technical field [0001] The present invention relates to the technical field of Internet content identification and classification, in particular to an identifier classification method, an identifier application method, an identifier classification system, a classification device and a computer-readable storage medium. Background technique [0002] With the popularization and rapid development of the Internet, users are constantly looking for practical, practical and affordable information on the Internet. Users have more and more extensive demands for Internet information and hope that the information will become more and more abundant. Traditional web search is being upgraded to content search. The first generation of large and comprehensive horizontal websites (also known as comprehensive websites) can no longer fully meet the needs of users, and focus on specific areas or vertical resources that provide specific services. As a new bright spot of the Internet, it is attrac...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/958G06K9/62
CPCG06F16/958G06F18/241
Inventor 邢智慧胡元元王海威张博
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products