Unlock instant, AI-driven research and patent intelligence for your innovation.

Keyword extraction method and device, computer equipment and storage medium

An extraction method and keyword technology, applied in the field of information processing, can solve the problems of weakening the Matthew effect of keywords, cold start of keywords, etc.

Pending Publication Date: 2021-11-16
GUANGZHOU LIZHI NETWORK TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present invention proposes a keyword extraction method, device, computer equipment, and storage medium to solve the problem of improving the user's retrieval accuracy in the voice retrieval scenario, while avoiding keyword cold start and weakening the keyword Matthew effect The problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Keyword extraction method and device, computer equipment and storage medium
  • Keyword extraction method and device, computer equipment and storage medium
  • Keyword extraction method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] figure 1 It is a flow chart of a keyword extraction method provided in Embodiment 1 of the present invention, the method can be executed by a keyword extraction device, the keyword extraction device can be implemented by software and / or hardware, and can be configured in a computer device, For example, servers, workstations, personal computers, etc., specifically include the following steps:

[0056] Step 101, obtaining the keyword thesaurus of the voice object to be queried;

[0057] In the embodiment of the present invention, in the audio data retrieval scenario, when the user enters the search content in the search bar and clicks the search button, it is equivalent to initiating a search request for the sound data, and the background extracts the search content according to the search content input by the user. The keyword is matched according to the mark or subject text of the voice object to be queried, and the user's accurate query result is returned. The establi...

Embodiment 2

[0128] Figure 4 A structural block diagram of a keyword extraction device provided in Embodiment 2 of the present invention may specifically include the following modules:

[0129] Keyword thesaurus acquisition module 201, used to obtain the keyword thesaurus of the voice object to be queried;

[0130] A graphical model building module 202, configured to construct a graphical model with a first preset number of keywords as nodes according to the distance between keywords in the keyword thesaurus;

[0131] The weight calculation module 203 is used to obtain the weight value of each node according to the iterative algorithm of the edge length between each node in the graph model;

[0132] A sorting module 204, configured to sort the keywords corresponding to the nodes according to the weight value;

[0133] Candidate keyword determination module 205, for selecting the second preset number of keywords in the sorting results as candidate keywords;

[0134] The matching module ...

Embodiment 3

[0158] Figure 5 It is a schematic structural diagram of a computer device provided by Embodiment 3 of the present invention. Figure 5 A block diagram of an exemplary computer device 12 suitable for implementing embodiments of the invention is shown. Figure 5 The computer device 12 shown is only an example, and should not impose any limitation on the functions and scope of use of the embodiments of the present invention.

[0159] Such as Figure 5 As shown, computer device 12 takes the form of a general-purpose computing device. Components of computer device 12 may include, but are not limited to: one or more processors or processing units 16 , system memory 28 , bus 18 connecting various system components including system memory 28 and processing unit 16 .

[0160] Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or a local bus using any of a variety...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a keyword extraction method and device, computer equipment and a storage medium. The method comprises the steps of obtaining a keyword library of a to-be-queried sound object; constructing a graph model by taking a plurality of keywords as nodes according to the distance of the keywords, iterating the side length to obtain the weight value of each node, and sorting; selecting a plurality of keywords in the sorting result as to-be-selected keywords; taking the keyword, matched with the knowledge base, of the to-be-selected keyword as a first candidate keyword; extracting a user tag in the to-be-queried sound object as a second candidate keyword; converting the first candidate keyword and the second candidate keyword into keyword vectors, calculating a weighted average value of the keyword vectors, and then respectively calculating cosine similarity between each keyword vector and the vector weighted average value; and selecting a plurality of first and second candidate keywords of which the cosine similarity is greater than a similarity threshold as target keywords of the to-be-queried sound object. The purposes that a large amount of manual annotation is not needed, and the labor cost is reduced are achieved.

Description

technical field [0001] The embodiments of the present invention relate to the technical field of information processing, and in particular to a keyword extraction method, device, computer equipment and storage medium. Background technique [0002] In the scenario where the user enters search terms to search related content, the search application background will extract the user's search keywords from the search words entered by the user, and return the search content to the user according to the keyword matching. Application is a critical step that directly determines the accuracy of retrieval results. [0003] In the prior art, the extraction of search words is usually based on the following common methods: 1. Based on the TFIDF method, TFIDF is a statistical method for evaluating the importance of keywords to a document in the corpus, and then sorting and selecting important The words with the highest sex are used as keywords. The importance of a TFIDF word increases pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/242G06F16/245
CPCG06F16/243G06F16/245
Inventor 谭又伟李泽隆
Owner GUANGZHOU LIZHI NETWORK TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More