Special word list dynamic generation system and method

A dynamically generated and professional technology, applied in the field of network communication, can solve the problems of high cost, waste of manpower and material resources, etc., and achieve the effect of high accuracy and cost saving

Active Publication Date: 2007-11-28
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Obviously, the method of manually building a professional vocabulary is a waste of manpower and material resources. In addition, manual construction of the vocabulary depends on the knowledge of the input pe

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Special word list dynamic generation system and method
  • Special word list dynamic generation system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The present invention will be further elaborated below according to the drawings and specific embodiments.

[0018] As shown in Figure 1, a kind of professional vocabulary dynamic generation system of the present invention mainly comprises the document preprocessing module 1 that is connected in sequence, word segmentation module 2, word segmentation postprocessing module 3, subject semantic vector calculation module 4, document semantic vector management module 5 , a document classification module 7, a category document library 9, a vocabulary weight calculation module 10, a category identification and keyword extraction module 11, a category vocabulary management module 12 and a professional category vocabulary library 13. It also includes a document topic semantic vector library 6, which is connected to the document semantic vector management module 5. According to needs, a category semantic seed vector library 8 may also be included.

[0019] Among them, the docume...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a specialized vocabulary meter dynamic generating system, which is characterized by the following: basing on theme semantic vector; proceeding professional sort or cluster document sort module and specialized vocabulary meter generating module for correspond document; extracting finite quantity and a professional sort document text with the specialized vocabulary meter; calculating weight for all vocabulary in the text; ordering with the size of the weight value; choosing the front N vocabulary as specialized vocabulary meter of relative professional sort. This invention also disclose a specialized vocabulary meter dynamic generating method. This invention possesses high accuracy and low cost, which can proceed dynamic updating maintenance for specialized vocabulary meter.

Description

technical field [0001] The present invention relates to network communication technology, and more specifically, relates to a system and method for dynamically generating professional vocabulary. Background technique [0002] Professional domain vocabulary refers to the collection of vocabulary in a certain professional category. In the field of natural language processing, this information is very helpful for such as search and semantic related calculations. At this stage, it is generally collected manually by a dedicated person. Obviously, the method of manually constructing a professional vocabulary is a waste of manpower and material resources. In addition, manual construction of the vocabulary depends on the knowledge of the input person, and there may be many words that have not been recalled. In addition, vocabulary is constantly changing, and manual entry requires a continuous investment in newly created vocabulary, which is too costly. Contents of the invention ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/30
Inventor 丁江伟
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products