Supercharge Your Innovation With Domain-Expert AI Agents!

A new word discovery method, system, device and medium based on graph embedding

A new word discovery and graph embedding technology, which is applied to instruments, network data indexing, and other database retrievals, can solve problems such as low-quality new words, and achieve the effect of ensuring accuracy, ensuring accuracy, and stabilizing calculation results

Active Publication Date: 2021-10-29
WORKWAY SHENZHENINFORMATION TECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the field of natural language processing, in the new word discovery task, the existing methods usually use statistical learning methods to construct new words. The basic idea is the method of information entropy, but this simple method only uses the shallow semantics in the corpus information, often introducing many low-quality new words

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A new word discovery method, system, device and medium based on graph embedding
  • A new word discovery method, system, device and medium based on graph embedding
  • A new word discovery method, system, device and medium based on graph embedding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0038] It should be noted that, in the case of no conflict, the following embodiments and the features in the embodiments can be combined with each other; and, based on the embodiments in the present disclosure, those of ordinary skill in the art obtained without creative work All other embodiments belong to the protection scope of the present disclosure.

[0039] It is noted that the following describes various aspects of the embodiments that are within the scope of the appended claims. It should be apparent that the aspects described herein may be embodied in a wide variety of forms and that any specific structure and / or function described herein is illustrative only. Based on the present disclosure one skilled in the art should appreciate that an aspect described herein may be implemented independently of any other aspects and that two or more of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to a new word discovery method, system, device and medium based on graph embedding, comprising: using a sliding window to cut N-GRAM character strings of the corpus to be calculated, and calculating the statistics of each character string, according to the statistics: Each character string is scored, and the character string whose score meets the requirements is selected and written into the new word candidate set; the word segmentation is performed on the corpus to be calculated, and a graph network is constructed based on the word segmentation result; the graph network is calculated based on the graph attention network , to obtain the graph embedding of the words of the corpus to be calculated; based on the graph embedding of the words in the general dictionary, the graph embedding of the words in the new word candidate set is screened, and the words corresponding to the screened graph embedding are used as candidate new words . Based on the graph embedding technology, the present invention can effectively filter low-quality candidate new words during the new word discovery process, thereby obtaining higher-quality, more reliable general new words or domain new words.

Description

technical field [0001] The present invention relates to the field of natural language processing, in particular to a new word discovery method, system, device and medium based on graph embedding. Background technique [0002] Graph Embedding (Graph Embedding, also called Network Embedding) is a process of mapping graph data (usually a high-dimensional dense matrix) into a low-density vector. Graph widely exists in various scenarios in the real world, that is, a collection of nodes and edges. For example, the connection between people in social networks, the interaction of proteins in biology, and the communication between IP addresses in communication networks, etc. In addition, our most common picture and sentence can also be abstractly regarded as the structure of a graph model, and the graph structure can be said to be ubiquitous. By analyzing them we can gain insight into social structures, languages ​​and different modes of communication, so Figure 1 It has been a ho...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/289G06F40/284G06F40/242G06F40/216G06F16/951
CPCG06F16/951G06F40/216G06F40/242G06F40/284G06F40/289
Inventor 莫永卓赵顺峰练睿肖杰
Owner WORKWAY SHENZHENINFORMATION TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More