Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

431 results about "Character vector" patented technology

A text implication relation recognition method based on multi-granularity information fusion

The present invention provides a text implication relation recognition method which fuses multi-granularity information, and proposes a modeling method which fuses multi-granularity information fusionand interaction between words and words, words and words, words and sentences. The invention firstly uses convolution neural network and Highway network layer in character vector layer to establish word vector model based on character level, and splices with word vector pre-trained by GloVe; Then the sentence modeling layer uses two-way long-short time memory network to model the word vector of fused word granularity, and then interacts and matches the text pairs through the sentence matching layer to fuse the attention mechanism, finally obtains the category through the integration classification layer; After the model is established, the model is trained and tested to obtain the text implication recognition and classification results of the test samples. This hierarchical structure method which combines the multi-granularity information of words, words and sentences combines the advantages of shallow feature location and deep feature learning in the model, so as to further improve the accuracy of text implication relationship recognition.
Owner:SUN YAT SEN UNIV +1

Named entities recognition method based on bidirectional LSTM and CRF

The invention discloses a named entities recognition method based on bidirectional LSTM and CRF. The named entities recognition method based on the bidirectional LSTM and CRF is improved and optimizedbased on the traditional named entities recognition algorithm in the prior art. The named entities recognition method based on the bidirectional LSTM and CRF comprises the following steps: (1) preprocessing a text, extracting phrase information and character information of the text; (2) coding the text character information by means of the bidirectional LSTM neural network to convert the text character information into character vectors; (3) using the glove model to code the text phrase information into word vectors; (4) combining the character vectors and the word vectors into a context information vector and putting the context information vector into the bidirectional LSTM neural network; and (5) decoding the output of the bidirectional LSTM with a linear chain condition random field to obtain a text annotation entity. The invention uses a deep neural network to extract text features and decodes the textual features with the condition random field, therefore, the text feature information can be effectively extracted and good effects can be achieved in the entity recognition tasks of different languages.
Owner:南京安链数据科技有限公司

Multi-task named entity recognition and confrontation training method for medical field

The invention discloses a multi-task named entity recognition and confrontation training method for medical field. The method includes the following steps of (1) collecting and processing data sets, so that each row is composed of a word and a label; (2) using a convolutional neural network to encode the information at the word character level, obtaining character vectors, and then stitching withword vectors to form input feature vectors; (3) constructing a sharing layer, and using a bidirection long-short-term memory nerve network to conduct modeling on input feature vectors of each word ina sentence to learn the common features of each task; (4) constructing a task layer, and conducting model on the input feature vectors and the output information in (3) through a bidirection long-short-term network to learn private features of each task; (5) using conditional random fields to decode labels of the outputs of (3) and (4); (6) using the information of the sharing layer to train a confrontation network to reduce the private features mixed into the sharing layer. According to the method, multi-task learning is performed on the data sets of multiple disease domains, confrontation training is introduced to make the features of the sharing layer and task layer more independent, and the task of training multiple named entity recognition simultaneously in a specific domain is accomplished quickly and efficiently.
Owner:ZHEJIANG UNIV

Construction and utilization method for context-aware dynamic word or character vector on the basis of deep learning

The invention belongs to the technical field of the natural language processing of computers, in particular to a construction and utilization method for a context-aware dynamic word or character vector on the basis of deep learning. The dynamic construction method for the context-aware dynamic word or character vector on the basis of the deep learning comprises the following steps of: in massive texts, through an unsupervised learning method, simultaneously learning a global feature vector of a word or character and the feature vector representation of the global feature vector when a specific context appears, and combining the global feature vector with the context feature vector, and dynamically generating word or character vector representation. By use of the method, the word or character vector dynamically constructed on the basis of the context can be applied to a natural language processing system. The method is mainly used for solving a problem that the word or character vector expresses different meanings in different contexts, i.e. the problem that one word or one character has multiple meanings can be solved. The dynamic word or character vector can be used for obviously improving the performance of various natural language processing tasks of different languages, wherein the tasks comprise Chinese word segmentation, part-of-speech tagging, naming recognition, grammatical analysis, semantic role tagging, sentiment analysis, text classification, machine translation and the like.
Owner:FUDAN UNIV

A named entity identification method and apparatus

The embodiment of the present invention provides a named entity identification method and apparatus. The method includes obtaining a text to be recognize; obtaining a character vector corresponding toeach character in the text to be recognized by embedding the text input character into the model, and obtaining a phonetic vector corresponding to each character by embedding the text input characterinto the phonetic embedding model; obtaining combination vectors by combining each character vector and corresponding phonetic vectors, and inputting the combination vectors of all characters into BiLSTM for semantic coding to obtain the corresponding semantic information features of the text to be recognized; obtaining a corresponding entity label sequence in the text to be recognized accordingto the semantic information features. The apparatus is used for performing the method described above. The embodiment of the invention obtains the character vector and the phonetic vector of the textto be recognized respectively according to the character embedding model and the phonetic vector model, and combines the character vector and the phonetic vector and inputs the character vector and the phonetic vector into the BiLSTM for recognition, so the deficiency of the vector representation of the character can be well compensated, and the recognition accuracy is greatly improved.
Owner:CHENGDU SEFON SOFTWARE CO LTD

Fused attention model-based Chinese text classification method

The invention discloses a fused attention model-based Chinese text classification method. The method comprises the following steps of: respectively segmenting a text into a corresponding word set anda corresponding character set through word segmentation preprocessing and character segmentation preprocessing, and training a word vector and a character vector corresponding to the text by adoptionof a feature embedding method according to the obtained word set and character set; respectively carrying out semantic encoding on the word vector and the character vector by taking a bidirectional gate circulation unit neural network as an encoder, and obtaining a word attention vector and a character attention vector in the text by adoption of a word vector attention mechanism and a character vector attention mechanism; obtaining a fused attention vector; and predicting a category of the text through a softmax classifier. The method is capable of solving the problem that more redundant features exist in the classification process as existing Chinese text classification methods neglects character feature information of texts, the extracted texts are single in features, all the pieces of semantic information of the texts are difficult to cover and features having obvious contribution to the classification are not focused.
Owner:中国科学院电子学研究所苏州研究院

Text classification method based on feature information of characters and terms

The invention discloses a text classification method based on feature information of characters and terms. The method comprises the steps that a neural network model is utilized to perform character and term vector joint pre-training, and initial term vector expression of the terms and initial character vector expression of Chinese characters are obtained; a short text is expressed to be a matrixcomposed of term vectors of all terms in the short text, a convolutional neural network is utilized to perform feature extraction, and term layer features are obtained; the short text is expressed tobe a matrix composed of character vectors of all Chinese characters in the short text, the convolutional neural network is utilized to perform feature extraction, and Chinese character layer featuresare obtained; the term layer features and the Chinese character layer features are connected, and feature vector expression of the short text is obtained; and a full-connection layer is utilized to classify the short text, a stochastic gradient descent method is adopted to perform model training, and a classification model is obtained. Through the method, character expression features and term expression features can be extracted, the problem that the short text has insufficient semantic information is relieved, the semantic information of the short text is fully mined, and classification of the short text is more accurate.
Owner:SUN YAT SEN UNIV

Character detection method and apparatus

The embodiment of the invention discloses a character detection method and apparatus. The character detection method includes the steps: performing feature extraction of a plurality of abstraction layers on an image to be detected by means of a feature extraction network; predicting the probability that each pixel point in the image to be detected is a character pixel point, and when each pixel point is a character pixel point, the positional information of a bounding box of a character, opposite to the pixel point, by means of a character detection network; based on the prediction result of the character detection network, determining the positional information of the bounding box of each candidate character; inputting the extracted features into a character mapping network, and performing transformation on a feature diagram output from the character mapping network to generate a character vector; determining the neighbour candidate character of each candidate character in the image to be detected, and connecting the each candidate character with the related neighbour candidate character into a character set; and according to the positional information of the bounding box of eachcandidate character in the character set, determining the character region of the image to be detected. The character detection method improves accuracy of detection of irregular characters.
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Method and apparatus for increasing generalization capability of convolutional neural network

The invention belongs to the technical field of neural network, and provides a method and apparatus for increasing generalization capability of a convolutional neural network. The method includes the following steps: reading a group of images from a training set, mapping the group of images to a plurality of image character vectors, dividing the plurality of image characteristic vectors into a plurality of classes based on the types of the images; based on the image characteristic vectors of each class, calculating the intra-class loss function of all the characteristic vectors; based on the image characteristic vectors of each class, calculating the inter-class loss function of all the characteristic vectors; based on the intra-class loss function of all the characteristic vectors, using the back propagation algorithm to update the weight of each node of the convolutional neural network; repeating the above mentioned steps until the convolutional neural network converges on the training set or reaches preset repeating times. According to the invention, the method and the apparatus can save all data in long-tailed distribution, makes full usage of abundant inter-class information of tail data, and increases the generalization capability of the convolutional neural network.
Owner:SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

Non-reference image quality assessment method based on information entropy characters

InactiveCN103475898AHigh subjective consistencySmall time complexityTelevision systemsImaging qualityTime complexity
The invention relates to an image quality assessment method, in particular to a non-reference image quality assessment method based on information entropy characters, and belongs to the field of image analyzing. The method comprises the first step of carrying out Contourlet conversion on a distorted image to obtain N*M conversion sub-bands, the second step of dividing each conversion sub-band and the unconverted original distorted image, the third step of calculating null domain information entropy and frequency domain information entropy on each block coefficient matrix, and the fourth step of screening the blocking characters and calculating a mean value to obtain the quality character value of each conversion sub-band. The method of a support vector machine and the method of non-reference image quality assessment are utilized for testing on a test set, and quality prediction and assessment are carried out through quality character vectors corresponding to a disaggregated model, an evaluation model and the test set all of which are obtained through training. The non-reference image quality assessment method has the advantages of being high in subjective consistency, small in time complexity and good in university, can be embedded into application systems related to image quality, and has very high application value.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Legal document named entity recognition method and device and computer equipment

The invention relates to a legal document named entity identification method and apparatus, and a computer device. The method comprises the steps of obtaining a legal document to be identified; inputting the legal document to be identified into the deep neural network model for identification to obtain an identification result; wherein the deep neural network model is obtained by training a language model through a plurality of legal document data with labels, a bidirectional recurrent neural network and a conditional random place; wherein the language model is obtained by training a Google Bert model through a plurality of corpora. According to the invention, the deep neural network model is adopted to carry out entity identification; extracting a character vector from a Chinese charactersequence of the legal document to be identified by adopting a language model obtained by training a Google Bert model; and inputting the character vectors into a bidirectional recurrent neural network, inputting the output codes of the bidirectional recurrent neural network into a linear chain conditional random field, and obtaining a recognition result, so that the network for realizing named entity recognition is simple in structure, low in training cost and high in prediction capability.
Owner:深圳市华云中盛科技股份有限公司

Film and television entity identification method based on Bilstm-crf and knowledge graph

The invention discloses a film and television entity identification method based on Bilstm-crf and knowledge graph, which comprises the steps of obtaining a character vector and a part-of-speech vector of a to-be-recognized text, performing weighted summation on the character vector and the part-of-speech vector, and inputting a result into a target bidirectional LSTM model for processing to obtain a text feature sequence; inputting the text feature sequence into a target CRF model for processing to obtain a named entity recognition result of the to-be-recognized text; and inquiring the namedentity recognition result in the film and television knowledge graph to further verify the result. According to the method, entity extraction can be effectively carried out on the movie and televisionsearch text of the user, the movie and television knowledge graph is fully utilized to mine the abstract movie and television search intention of the user, and the use experience of the user is improved. Word vectors trained through the language model are used as bottom layer input of the neural network under the condition of less annotation data, so that the training efficiency is improved, andthe method has a good application prospect and can be widely applied to entity recognition scenes in various fields.
Owner:SICHUAN CHANGHONG ELECTRIC CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products