A concept-based semantic recognition method and device

A semantic recognition and language recognition technology, applied in semantic analysis, natural language data processing, and special data processing applications. It addresses problems such as the time lag in dictionary updates, the inability to cover proper nouns, and the lack of any guarantee of dictionary completeness, with the effect of improving accuracy and calculation speed.

Status: Inactive
Publication Date: 2018-12-11
广州极天信息技术股份有限公司

AI Technical Summary

Problems solved by technology

[0006] 1. The biggest disadvantage of mechanical word segmentation is that the completeness of the dictionary cannot be guaranteed. First, there is a time lag before new words are entered in the dictionary. Second, the words entered in the dictionary cannot cover all proper nouns, such as the names of people, places, and institutions. According to statistics, most of the words missing from the dictionary are proper nouns of this kind.
[0007] 2. Word segmentation based on grammar and rules: because the existing grammatical knowledge and syntactic rules are very general and complex, the accuracy achieved by word segmentation based on grammar and rules is far from satisfactory. Therefore, this kind of word segmentation system is still in the experimental stage.
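
The dictionary-coverage problem described in item 1 can be illustrated with a small sketch. The code below assumes forward maximum matching, one common variant of dictionary-based segmentation; the source does not say which matching strategy it has in mind. The function name `forward_max_match`, the toy `DICTIONARY`, and the sample sentences are all hypothetical.

```python
# Hypothetical toy dictionary; a real system would load a large lexicon.
DICTIONARY = {"自然", "语言", "自然语言", "处理", "技术", "研究"}


def forward_max_match(text, dictionary, max_len=4):
    """Segment text by greedy longest-prefix lookup in a dictionary."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a dictionary hit.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Every word is in the dictionary, so segmentation succeeds.
print(forward_max_match("自然语言处理技术", DICTIONARY))
# ['自然语言', '处理', '技术']

# The personal name is not in the dictionary and falls apart into characters.
print(forward_max_match("王小明研究自然语言", DICTIONARY))
# ['王', '小', '明', '研究', '自然语言']
```

The second call shows the failure mode: a proper noun that was never entered in the dictionary cannot be recovered as one unit, however good the matching strategy is.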


Embodiment Construction

[0033] The implementation of the present invention is described below through specific examples and in conjunction with the accompanying drawings, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific examples, and various modifications and changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention.

[0034] Before the present invention is introduced, the concepts it involves are explained first:

[0035] An ontology is also known as a semantic web or semantic dictionary. The concept of ontology comes from the field of philosophy, where ontology studies the existence and essential characteristics of everything. In China, the term "Ontology" has two common translations, both of which render back into English as "ontology". An ontology i...


Abstract

The invention discloses a concept-based semantic recognition method and device. The method comprises the following steps: step S1, performing word segmentation on a text to be segmented to obtain a plurality of segmented strings; step S2, matching the segmented strings against all nodes of the semantic web; step S3, performing word sense disambiguation on each string that successfully matches a node of the semantic web to obtain the concept path of the disambiguated string, and storing the concept path in a word concept path repository; step S4, outputting the concept paths of the related words of the text to be segmented from the concept path repository. The invention can effectively improve the accuracy and calculation speed of text semantic concept recognition.
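
Steps S1 to S4 can be sketched as a small pipeline. This is only an illustration under stated assumptions: the toy `SEMANTIC_WEB`, the whitespace-based `segment` stand-in, and the overlap-based disambiguation rule are all hypothetical, since the abstract does not specify how segmentation or word sense disambiguation is actually performed.

```python
from collections import Counter

# Hypothetical toy semantic web: each node (word) maps to one or more
# concept paths, written from the most general concept down to the sense.
SEMANTIC_WEB = {
    "apple": [
        ("entity", "food", "fruit", "apple"),
        ("entity", "organization", "company", "apple"),
    ],
    "banana": [("entity", "food", "fruit", "banana")],
    "eat": [("event", "action", "consume", "eat")],
}


def segment(text):
    """S1: split the text into strings (whitespace split as a stand-in)."""
    return text.lower().split()


def match_nodes(strings, web):
    """S2: keep only the strings that match a node of the semantic web."""
    return [s for s in strings if s in web]


def disambiguate(matched, web):
    """S3: pick one concept path per matched string and store it."""
    store = {}
    for word in matched:
        # Assumption: concepts on the candidate paths of the other matched
        # words act as context for choosing this word's sense.
        context = Counter(
            concept
            for other in matched if other != word
            for path in web[other]
            for concept in path
        )
        # Keep the candidate path that shares the most concepts with context.
        store[word] = max(web[word], key=lambda p: sum(context[c] for c in p))
    return store


def recognize(text, web=SEMANTIC_WEB):
    """S4: output the stored concept paths for the words of the text."""
    store = disambiguate(match_nodes(segment(text), web), web)
    return {word: "/".join(path) for word, path in store.items()}


print(recognize("I eat apple and banana"))
# {'eat': 'event/action/consume/eat', 'apple': 'entity/food/fruit/apple', 'banana': 'entity/food/fruit/banana'}
```

Here "apple" resolves to the fruit sense because its fruit path shares more concepts with the neighbouring word "banana" than its company path does. Words with no matching node ("I", "and") are dropped at S2.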

Description

Technical field

[0001] The invention relates to the technical field of natural language processing, in particular to a concept-based semantic recognition method and device.

Background art

[0002] At present, mainstream Chinese word segmentation technology falls into the following two types:

[0003] The first is the mechanical word segmentation method (based on a dictionary). Its principle is to match the strings in the document against the entries in the dictionary one by one. If a string is found in the dictionary, the match succeeds and the string is segmented; otherwise, it is not segmented. The dictionary-based mechanical word segmentation method is simple and practical;

[0004] The second is the word segmentation method based on grammar and rules. Its basic idea is to perform syntactic and semantic analysis at the same time as word segmentation, and use syntactic information and semantic information to perform part-of...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27
CPC: G06F40/289, G06F40/30
Inventor: 董文平
Owner: 广州极天信息技术股份有限公司