Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for automatically extracting kernel keyword based on B2B platform

A technology for automatic extraction and keyword extraction, which can be used in other database retrieval, network data retrieval, instruments, etc., and can solve problems such as low effect.

Active Publication Date: 2015-03-11
FOCUS TECH
View PDF6 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method selects the field in the database as a feature item for word segmentation, and utilizes the relationship between the database feature item and the text in the database to effectively improve the word segmentation accuracy of traditional word segmentation methods such as unary, binary, and preset vocabulary; but this method The method is based on the relevant thesaurus of the preset vocabulary for word segmentation, and this method is ineffective when dealing with English words, and does not involve the extraction of word stems (word prototypes) and

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for automatically extracting kernel keyword based on B2B platform
  • Method for automatically extracting kernel keyword based on B2B platform
  • Method for automatically extracting kernel keyword based on B2B platform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] A method for automatically extracting core keywords based on a B2B platform, comprising the following steps:

[0048] (1) Use the user-set product names, search terms, and industry hot words in the B2B platform as the source of the thesaurus, preprocess the source of the thesaurus and save it in the data mart to form a core thesaurus of product names; the source of the thesaurus The method for preprocessing is:

[0049] For user-set product names, first adopt the principle of high-frequency use of user-set product names, and eliminate user-set product names that are used less frequently; then save the user-set keywords corresponding to user-set product names in the user-set keyword library ;

[0050] For search words, first filter out non-used words including punctuation and special symbols; then use the principle of high-frequency use of search words to eliminate search words that have been used less frequently in the past six months; then preprocess through the core ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for automatically extracting a kernel keyword based on a B2B platform. The method is used for extracting the kernel keyword based on English grammar and semanteme according to the name of an English product. The method for automatically extracting the kernel keyword based on the B2B platform, disclosed by the invention, has obvious advantages in word treatment and self-learning according to a group of rules when various tenses of English words are changed into the original modes during big data concurrence compution.

Description

technical field [0001] The invention relates to a method for automatically extracting core keywords based on a B2B platform. Background technique [0002] Since the development of e-commerce, a large amount of information has been accumulated, as well as a large number of users, including visitors, traders, information providers, etc.; and the high degree of repetition of information occupies a large amount of server resources. [0003] When using a search engine to search for keywords, it is necessary to submit the keywords to the server, and the server searches the massive data according to the keywords, finds a set of relevant information and returns the search results; if it is a concurrent search, it will Great impact on the server. The quality of keywords has a great impact on the efficiency (search speed) and quality (relevance of search results) of search, so it is necessary to establish a method for automatic extraction of core keywords, which combines keywords (co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/27G06Q30/00
CPCG06F16/951
Inventor 徐飞
Owner FOCUS TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products