User search string organization name recognition method based on semantic feature model

A method for recognizing organization names in user search strings based on a semantic feature model. It addresses the low recognition accuracy and the lack of semantic context in user search strings.

Active Publication Date: 2015-06-03
BEIJING INSTITUTE OF TECHNOLOGY

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to solve the problem of low accuracy when the existing long text organization name recognizer is used for the organization name recognition of th



Examples


Embodiment Construction

[0079] The method provided by the present invention for recognizing organization names in user search strings based on a semantic feature model is described in detail below with reference to the accompanying drawings and embodiments.

[0080] In this embodiment of the present invention, the operation flow of the user search string organization name recognition method based on the semantic feature model is shown in Figure 1. The specific implementation steps are as follows:

[0081] Step 1. Train the organization name recognition semantic model by machine learning.

[0082] Step 1.1: Choose the model used to recognize organization names in user search strings.

[0083] In this embodiment, a conditional random field (CRF) model is used to recognize organization names in user search strings, and the model is implemented with the Windows version of CRF++ 0.54.
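As a rough illustration of how a CRF++-based recognizer is usually fed, the sketch below serializes tagged sequences into the column format CRF++ reads: one token per line, columns separated by whitespace, the label in the last column, and an empty line between sequences. The B-ORG / I-ORG / O label scheme, the part-of-speech values, and the function name `to_crfpp_rows` are assumptions made for illustration; the excerpt above does not specify them.

```python
from typing import List, Tuple

# Each token is (word, part-of-speech tag, label). The label scheme
# B-ORG / I-ORG / O is an assumption, not taken from the patent text.
Token = Tuple[str, str, str]


def to_crfpp_rows(sequences: List[List[Token]]) -> str:
    """Serialize tagged sequences into CRF++ column format."""
    blocks = []
    for seq in sequences:
        # One token per line, columns separated by a tab.
        lines = ["\t".join(tok) for tok in seq]
        blocks.append("\n".join(lines))
    # CRF++ expects an empty line between sequences.
    return "\n\n".join(blocks) + "\n"


# Hypothetical search strings, already segmented and tagged.
example = [
    [("Beijing", "ns", "B-ORG"), ("Institute", "n", "I-ORG"),
     ("address", "n", "O")],
    [("weather", "n", "O")],
]
training_text = to_crfpp_rows(example)
```

A file in this format would typically be passed to CRF++ together with a feature template, along the lines of `crf_learn template_file train_file model_file`. The template and features actually used in the patent are not shown in this excerpt.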

[0084] Step 1.2: Determine the training corpus.

[0085] Step 1.2.1: Select the c...


Abstract

The invention belongs to the field of natural language processing, and in particular relates to a method for recognizing organization names in user search strings based on a semantic feature model. The method comprises a model establishment stage and a recognition stage. In the model establishment stage, an existing annotated long-text corpus is used to build a training corpus that conforms to the distribution of user search strings. The semantic features comprise the traditional word segmentation and part-of-speech tagging features, supplemented by semantic environment features related to the context features and cohesion features within the search string. A conditional random field model is built from these composite semantic features and used as the organization name recognition model. In the recognition stage, the semantic environment features of a user search string are computed to obtain a model sequence for the string, the parts of the model sequence that conform to an organization name are extracted, and the organization name in the user search string is obtained. The method comprehensively improves both the accuracy and the recall of organization name recognition in user search strings.
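The final extraction step of the recognition stage can be sketched as follows, assuming the model sequence is a per-token label sequence in a B-ORG / I-ORG / O scheme. That scheme, the function name `extract_org_names`, and the `sep` parameter are hypothetical choices for illustration, not details confirmed by the abstract.

```python
from typing import List


def extract_org_names(tokens: List[str], labels: List[str],
                      sep: str = " ") -> List[str]:
    """Collect spans labeled B-ORG followed by zero or more I-ORG."""
    names: List[str] = []
    current: List[str] = []
    for tok, lab in zip(tokens, labels):
        if lab == "B-ORG":
            # A new name starts; close any name still open.
            if current:
                names.append(sep.join(current))
            current = [tok]
        elif lab == "I-ORG" and current:
            # Continue the name that is currently open.
            current.append(tok)
        else:
            # Any other label (or a stray I-ORG) ends the open name.
            if current:
                names.append(sep.join(current))
            current = []
    if current:
        names.append(sep.join(current))
    return names


tokens = ["Beijing", "Institute", "of", "Technology", "admission", "score"]
labels = ["B-ORG", "I-ORG", "I-ORG", "I-ORG", "O", "O"]
orgs = extract_org_names(tokens, labels)
```

For Chinese search strings, where segmented words are not separated by spaces, passing `sep=""` would rejoin the tokens without whitespace.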

Description

Technical field

[0001] The invention belongs to the field of natural language processing, and in particular relates to a method for recognizing organization names in user search strings based on a semantic feature model.

Background

[0002] We live in an era of information explosion. The rapid development of the Internet has given China more than 600 million Internet users and zettabyte (ZB) scale data accumulation. Search engines greatly help people obtain the information they need for daily life, work, and study, which makes information filtering and ranking particularly important. After a user enters a question into a search engine, the engine runs a series of preprocessing steps on the user search string, such as word segmentation, stop word removal, error correction, and entity recognition. Each of these preprocessing steps is important and indispensable. The qualit...


Application Information

IPC(8): G06F17/30, G06F17/27
Inventors: 牛振东, 陆浩
Owner: BEIJING INSTITUTE OF TECHNOLOGY