Construction and utilization method for context-aware dynamic word or character vector on the basis of deep learning

A technology of deep learning and construction methods, applied in neural architecture, electrical digital data processing, special data processing applications, etc., can solve problems such as different meanings of words or words

Active Publication Date: 2017-03-29
FUDAN UNIV
View PDF6 Cites 55 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a method for constructing and using a context-aware dynamic word or word vector based on deep learning. such as words in English and characters in Chinese)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Construction and utilization method for context-aware dynamic word or character vector on the basis of deep learning
  • Construction and utilization method for context-aware dynamic word or character vector on the basis of deep learning
  • Construction and utilization method for context-aware dynamic word or character vector on the basis of deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The invention discloses a feature vector representation of a word or character dynamically constructed by a computer according to the context and its use method, which is mainly used to solve the problem that a word or character expresses different meanings in different contexts, that is, a word or a word with multiple meanings Questions (such as words in English and characters in Chinese). The dynamic word or word vector method can be used for natural language processing systems that need to convert words or characters into corresponding vector representations, and use these vector representations as input, especially for natural language processing systems developed based on deep learning technology. The specific implementation steps are as follows:

[0055] (1) Collect a large amount of text corpus in the target language (eg: Wikipedia).

[0056] (2) Extract vocabulary or vocabulary from the corpus. Uncommon words or characters that occur less frequently can be rep...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of the natural language processing of computers, in particular to a construction and utilization method for a context-aware dynamic word or character vector on the basis of deep learning. The dynamic construction method for the context-aware dynamic word or character vector on the basis of the deep learning comprises the following steps of: in massive texts, through an unsupervised learning method, simultaneously learning a global feature vector of a word or character and the feature vector representation of the global feature vector when a specific context appears, and combining the global feature vector with the context feature vector, and dynamically generating word or character vector representation. By use of the method, the word or character vector dynamically constructed on the basis of the context can be applied to a natural language processing system. The method is mainly used for solving a problem that the word or character vector expresses different meanings in different contexts, i.e. the problem that one word or one character has multiple meanings can be solved. The dynamic word or character vector can be used for obviously improving the performance of various natural language processing tasks of different languages, wherein the tasks comprise Chinese word segmentation, part-of-speech tagging, naming recognition, grammatical analysis, semantic role tagging, sentiment analysis, text classification, machine translation and the like.

Description

technical field [0001] The invention belongs to the technical field of computer natural language processing, and in particular relates to methods for constructing and using dynamic words or word vectors. Background technique [0002] In recent years, deep learning has made breakthroughs in recent artificial intelligence research. It has ended the situation that artificial intelligence has not made breakthroughs for more than ten years, and has rapidly exerted influence in the industry. Deep learning is different from narrow artificial intelligence systems that can only complete specific tasks (task-oriented functional simulation). As a general artificial intelligence technology, it can deal with various situations and problems. It has been obtained in the fields of image recognition, speech recognition, etc. Extremely successful application, also achieved results in the field of natural language processing (mainly English). Deep learning is currently the most effective and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06N3/04
CPCG06F40/30G06N3/048
Inventor 郑骁庆封江涛
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products