Dictionary updating method and apparatus

A dictionary updating method and technology, applied in the field of intelligent interaction, addresses problems such as the inability to add new words in time, the high cost of dictionary maintenance, and frequent omissions, thereby improving update efficiency and reducing the amount of calculation.

Active Publication Date: 2016-03-09
SHANGHAI XIAOI ROBOT TECH CO LTD
Cites: 5 | Cited by: 24

AI Technical Summary

Problems solved by technology

[0009] However, the above-mentioned manual working method makes dictionary maintenance costly, inefficient, and prone to omissions, which ultimately prevents new words from being added to the dictionary in time.

Examples

Embodiment Construction

[0075] As mentioned above, in the prior art, new words are added to the dictionary manually. Manual addition is prone to omissions, is inefficient because manual processing speed is limited, and raises the maintenance cost of the dictionary through labor costs.

[0076] In the embodiment of the present invention, a computer processes the corpus, converts it into a uniform format suitable for automated new word discovery, generates candidate data strings, and screens those candidate data strings against set conditions to discover new words. Computer-based new word discovery can improve the efficiency of dictionary updating, avoid omissions, and ensure the accuracy of dictionary updating.
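The "uniform format" step in [0076] can be illustrated with a minimal Python sketch. The text does not say which cleaning rules or sentence delimiters are used, so everything here is an assumption made for illustration: the function names `preprocess` and `split_sentences`, the removal of HTML-like tags, the NFKC normalization, and the choice of sentence-ending punctuation.

```python
import re
import unicodedata


def preprocess(raw: str) -> str:
    """Turn a raw corpus into plain text (assumed, illustrative cleaning rules)."""
    text = re.sub(r"<[^>]+>", " ", raw)         # drop HTML-like tags
    text = unicodedata.normalize("NFKC", text)  # fold full-width forms to half-width
    text = re.sub(r"[ \t]+", " ", text)         # collapse runs of spaces and tabs
    return text.strip()


def split_sentences(text: str) -> list[str]:
    """Split text into sentences on line breaks and end-of-sentence punctuation."""
    parts = re.split(r"[。！？!?；;\n]+", text)
    return [p.strip() for p in parts if p.strip()]


raw = "<p>今天天气好！</p>\n<p>ＡＢＣ 公司发布新品。</p>"
sentences = split_sentences(preprocess(raw))
```

For the sample input, `sentences` holds two entries, one per original paragraph, with the markup removed and the full-width Latin letters folded to `ABC`.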

[0077] In order to make the above objects, features and beneficial effects of the present invention more comprehensible, specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawi...

Abstract

Provided are a dictionary updating method and apparatus. The dictionary updating method comprises: preprocessing a received corpus to obtain text data; performing line segmentation on the text data to obtain sentence data; performing word segmentation on the sentence data according to the individual words contained in a basic dictionary to obtain segmented word data; combining adjacent segmented word data to generate candidate data strings; performing determination processing on the candidate data strings to discover new words; and, if a new word is discovered, adding it to the basic dictionary to update the basic dictionary. The method and apparatus can reduce dictionary maintenance costs and improve dictionary update efficiency.
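The segmentation, combination, determination, and update steps of the abstract can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented procedure: the abstract does not name a segmentation algorithm or a determination criterion, so forward maximum matching, pairwise combination of adjacent words, and a simple frequency threshold (`min_count`) are stand-ins chosen here, and all function names are hypothetical.

```python
from collections import Counter


def segment(sentence, dictionary, max_len=4):
    """Forward maximum matching (assumed); unknown text falls out as single characters."""
    words, i = [], 0
    while i < len(sentence):
        for size in range(min(max_len, len(sentence) - i), 0, -1):
            piece = sentence[i:i + size]
            if size == 1 or piece in dictionary:
                words.append(piece)
                i += size
                break
    return words


def candidate_strings(words):
    """Join each pair of adjacent words into one candidate data string."""
    return [a + b for a, b in zip(words, words[1:])]


def update_dictionary(sentences, dictionary, min_count=2):
    """Add candidates seen at least min_count times (assumed criterion) to the dictionary."""
    counts = Counter()
    for sentence in sentences:
        counts.update(candidate_strings(segment(sentence, dictionary)))
    new_words = {c for c, n in counts.items()
                 if n >= min_count and c not in dictionary}
    dictionary |= new_words  # update the basic dictionary in place
    return new_words


basic_dictionary = {"我", "喜欢", "他", "用"}
found = update_dictionary(["我喜欢微信", "他用微信"], basic_dictionary)
```

Here "微信" is absent from the basic dictionary, so it is segmented into two single characters; the recombined pair occurs in both sentences and passes the threshold, while candidates such as "用微" occur only once and are rejected. A frequency threshold alone admits false positives on larger corpora, which is why stricter screening conditions would be needed in practice.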

Description

Technical field

[0001] The invention relates to the field of intelligent interaction, and in particular to a method and device for updating a dictionary.

Background

[0002] Many fields of Chinese information processing rely on dictionaries to perform their functions. For example, in an intelligent retrieval system or an intelligent dialogue system, each process (word segmentation, question retrieval, similarity matching, and determination of retrieval results or intelligent dialogue answers) is computed with the word as the smallest unit, and the basis of that computation is the word dictionary. The word dictionary therefore has a great impact on the performance of the entire system.

[0003] Social and cultural change and the rapid development of the economy and commerce often drive changes in language, and the most immediate manifestation of language change is the emergence of new words. Especially in a specific field, whether ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/374
Inventors: 张昊 (Zhang Hao), 朱频频 (Zhu Pinpin)
Owner: SHANGHAI XIAOI ROBOT TECH CO LTD