Supercharge Your Innovation With Domain-Expert AI Agents!

Entity matching method and device thereof

A technology of entities and entity words, applied in the field of data analysis, can solve problems such as difficult to analyze data information, information in different formats cannot be matched, matching resource waste, etc.

Pending Publication Date: 2020-05-22
北京秒针人工智能科技有限公司
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In this method of entity matching, due to the different storage formats of user information in different social media platforms, it is difficult to analyze the data information obtained across platforms during the matching process. For example, information in different formats cannot be matched, resulting in a waste of matching resources. Or matching errors, making the matching results less reliable, resulting in lower efficiency of entity matching

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity matching method and device thereof
  • Entity matching method and device thereof
  • Entity matching method and device thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0051] figure 1 It shows a schematic flowchart of the method for entity matching provided by the embodiment of the present invention, the method includes steps S101-S106; specifically:

[0052] S101. Acquire training text information, perform word segmentation on the training text information, and obtain an entity thesaurus.

[0053] In the embodiment of the present application, as an optional embodiment, the acquiring training text information, performing word segmentation on the training text information, and obtaining entity thesaurus include:

[0054] Crawling text information from social media platforms to obtain the training text information;

[0055] Perform word segmentation on the training text information, and merge repeated words in the word segmentation result based on the word segmentation result to obtain the entity thesaurus.

[0056] Exemplary illustrations, for example, crawl the text content of celebrity-related discussion posts in the entertainment section...

Embodiment 2

[0153] image 3 A schematic structural diagram of an entity matching device provided by an embodiment of the present invention is shown, and the device includes:

[0154] Thesaurus construction module 301, is used for obtaining training text information, carries out word segmentation to described training text information, obtains entity thesaurus;

[0155] Matrix construction module 302, for constructing entity word vector matrix according to the frequency that two entity words in the entity lexicon appear simultaneously in the training text information;

[0156] In the embodiment of the present application, as an optional embodiment, constructing an entity word vector matrix according to the frequency of two entity words in the entity lexicon appearing simultaneously in the training text information includes:

[0157] According to the entity words contained in the entity lexicon, construct entity word vectors, each entity word corresponds to an entity word vector, and the n...

Embodiment 3

[0173] Such as Figure 4 As shown, an embodiment of the present application provides a computer device 400 for executing the method for entity matching in the present application, the device includes a memory 401, a processor 402 and a A computer program running on 402, wherein the processor 402 implements the steps of the entity matching method when executing the computer program.

[0174] Specifically, the above-mentioned memory 401 and processor 402 can be general-purpose memory and processor, which are not specifically limited here. When the processor 402 runs the computer program stored in the memory 401, the above-mentioned entity matching method can be executed.

[0175] Corresponding to the entity matching method in this application, the embodiment of this application also provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is run by a processor, the above-mentioned entity matching is performed. steps of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an entity matching method and a device thereof. The entity matching method comprises the steps as follows: obtaining training text information, performing word segmentation on the training text information to obtain an entity lexicon, constructing an entity word vector matrix according to frequency that every two entity words in the entity lexicon appear simultaneously in training text information, obtaining a target entity word mapped by a to-be-matched entity from the entity lexicon, obtaining a row vector corresponding to the target entity word from the word vector matrix, obtaining candidate column vectors corresponding to other word vectors except the column vector corresponding to the target entity word in the word vector matrix, calculating cosine similarity between a row vector corresponding to the target entity word and the candidate column vector, and determining an entity matched with the to-be-matched entity according to the calculated cosine similarity. Therefore, the entity matching efficiency can be improved.

Description

technical field [0001] The present invention relates to the technical field of data analysis, in particular to a method and device for entity matching. Background technique [0002] With the continuous development of social media, social media has gradually become the main way for people to obtain information. More and more people choose to refer to the information in social media platforms to formulate target plans. Based on the influence of various factors, the target plan is not Only, when the first target solution cannot be implemented, how to efficiently seek similar alternatives has become an urgent problem to be solved. For example, content related to celebrities and other entities has a high degree of discussion on social media platforms. When selecting actors for film and television, variety shows, or brand spokesperson promotion, while determining the number one candidate, it is also necessary to determine some Stars that are similar to the first-ranked candidate ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06F16/33
CPCG06F16/3347
Inventor 张梦醒
Owner 北京秒针人工智能科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More