Lexicon construction method and computing device

A technology of computing equipment and construction methods, applied in the field of information processing, can solve the problems of difficulty in feature extraction and knowledge discovery, inability to accurately and completely describe the vehicle field, lack of attention to emerging Internet vocabulary, etc., to improve the scope of application and ensure authority. and professional effect

Pending Publication Date: 2020-01-17
CHEZHI HULIAN BEIJING SCI & TECH CO LTD
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Taking the vehicle field as an example, due to the complex system structure and various functional features of the vehicle, and the vehicle field involves many upstream and downstream industries such as OEMs, spare parts suppliers, dealers, after-sales and maintenance, etc. , the constructed lexicon cannot accurately and completely describe the field of vehicles. At the same time, with the development of network technology and the increasing number of car users, more and more nicknames, abbreviations, joking words
The traditional professional thesaurus built based on the structure and function of the car often lacks attention to the emerging vocabulary of the Internet, which brings great difficulties to high-level requirements such as feature extraction and knowledge discovery.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Lexicon construction method and computing device
  • Lexicon construction method and computing device
  • Lexicon construction method and computing device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0028] According to the embodiment of the present invention, a new thesaurus construction scheme is provided, based on the professional words related to the field of the constructed thesaurus, and the network vocabulary similar to the professional vocabulary is supplemented by an algorithm, and the expansion of the thesaurus is beneficial to the Internet. The ability to adapt text to meet the expression habits and language character...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a lexicon construction method which comprises the following steps: acquiring professional words related to the field of constructed lexicons, and generating an initial lexicon;utilizing the word vector model to process each word in the initial word bank to generate a word to be added, the word to be added being a word similar to each word in the initial word bank; adding words to be added into an initial word bank to generate a new word bank; and for each word in the new word bank, repeating the step of generating the word to be added and the step of generating the newword bank until the number of repetitions is reached, and taking the generated new word bank as the constructed word bank. The invention also discloses a corresponding computing device.

Description

technical field [0001] The invention relates to the technical field of information processing, in particular to a method for constructing a thesaurus and a computing device. Background technique [0002] Domain professional thesaurus refers to a collection of specialized vocabulary reflecting knowledge, terminology, and institutions in a specific field. It is the basis and premise for domain knowledge discovery, semantic analysis, and feature extraction. Therefore, more and more researches have begun to pay attention to the construction of domain-specific thesaurus. [0003] A commonly used solution for constructing a domain thesaurus is to segment sentences based on domain text, and filter through word segmentation and part-of-speech tagging, and use keywords with specified parts of speech as candidate keywords. Then, weight calculation is performed on each candidate keyword through word co-occurrence or TF-IDF (term frequency-inverse text frequency index) algorithm. Fina...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/20G06F40/247
CPCG06F16/20
Inventor 邱泽成刘标陈安琪林泽中
Owner CHEZHI HULIAN BEIJING SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products