Entity matching method and device thereof
A technology of entities and entity words, applied in the field of data analysis, can solve problems such as difficult to analyze data information, information in different formats cannot be matched, matching resource waste, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0051] figure 1 It shows a schematic flowchart of the method for entity matching provided by the embodiment of the present invention, the method includes steps S101-S106; specifically:
[0052] S101. Acquire training text information, perform word segmentation on the training text information, and obtain an entity thesaurus.
[0053] In the embodiment of the present application, as an optional embodiment, the acquiring training text information, performing word segmentation on the training text information, and obtaining entity thesaurus include:
[0054] Crawling text information from social media platforms to obtain the training text information;
[0055] Perform word segmentation on the training text information, and merge repeated words in the word segmentation result based on the word segmentation result to obtain the entity thesaurus.
[0056] Exemplary illustrations, for example, crawl the text content of celebrity-related discussion posts in the entertainment section...
Embodiment 2
[0153] image 3 A schematic structural diagram of an entity matching device provided by an embodiment of the present invention is shown, and the device includes:
[0154] Thesaurus construction module 301, is used for obtaining training text information, carries out word segmentation to described training text information, obtains entity thesaurus;
[0155] Matrix construction module 302, for constructing entity word vector matrix according to the frequency that two entity words in the entity lexicon appear simultaneously in the training text information;
[0156] In the embodiment of the present application, as an optional embodiment, constructing an entity word vector matrix according to the frequency of two entity words in the entity lexicon appearing simultaneously in the training text information includes:
[0157] According to the entity words contained in the entity lexicon, construct entity word vectors, each entity word corresponds to an entity word vector, and the n...
Embodiment 3
[0173] Such as Figure 4 As shown, an embodiment of the present application provides a computer device 400 for executing the method for entity matching in the present application, the device includes a memory 401, a processor 402 and a A computer program running on 402, wherein the processor 402 implements the steps of the entity matching method when executing the computer program.
[0174] Specifically, the above-mentioned memory 401 and processor 402 can be general-purpose memory and processor, which are not specifically limited here. When the processor 402 runs the computer program stored in the memory 401, the above-mentioned entity matching method can be executed.
[0175] Corresponding to the entity matching method in this application, the embodiment of this application also provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is run by a processor, the above-mentioned entity matching is performed. steps of th...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com