Recognition method for new words of scientific and technical terminology

A new word recognition and new word technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as poor accuracy, exclusion of existing words, and lack of universality, so as to improve accuracy. and comprehensive effect

Active Publication Date: 2012-10-03
北京新发智信科技有限责任公司
View PDF3 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First of all, this method does not exclude existing vocabulary, and it is easy to mix existing vocabulary and new words, and the accurac...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recognition method for new words of scientific and technical terminology
  • Recognition method for new words of scientific and technical terminology
  • Recognition method for new words of scientific and technical terminology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0012] In the following, a method for identifying new words of scientific and technological terms provided by the present invention will be described in detail with reference to the drawings and specific embodiments.

[0013] In the following description, a number of different aspects of the present invention will be described. However, for those of ordinary skill in the art, only some or all of the structures or processes of the present invention can be used to implement the present invention. For clarity of explanation, specific numbers, configurations, and sequences are illustrated, but it is obvious that the present invention can also be implemented without these specific details. In other cases, in order not to obscure the present invention, some well-known features will not be described in detail.

[0014] It can be understood that the Chinese new word recognition method of the present invention can be applied to a variety of terminal devices, such as personal computers, pers...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a recognition method for new words, which comprises the following steps: segmenting a scientific and technical text into sentences, and establishing a mechanized dictionary; matching and segmenting the short sentences segmented from the text in vocabularies, then, atomically segmenting the remaining sentence strings, and automatically extracting the two-character words, three-character words and multi-character words; and sequencing the extracted words with the statistical method, and evaluating the sequenced words to obtain new words. The method can greatly increase the recognition accuracy and the comprehensiveness of the new words.

Description

Technical field [0001] The present invention relates to computer Chinese information processing technology, and more specifically, to a new word recognition method of scientific and technological terms. Background technique [0002] Chinese information processing technology has been widely used in technical fields such as computer networks, database technology, software engineering, and document retrieval and identification. Chinese automatic word segmentation is a basic task of Chinese information processing. Many Chinese information processing projects involve word segmentation, such as machine translation, automatic abstracting, automatic classification, and Chinese document database retrieval. Since Chinese text is written continuously, there are no spaces between words and words, so the primary problem in Chinese text processing is word segmentation. The accurate distinction of words is the basis for Chinese text processing. [0003] However, the distinction of vocabulary is ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 曲晓光雷静丰瑾侯晓艳徐锡涛
Owner 北京新发智信科技有限责任公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products