Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

593 results about "Linguistic model" patented technology

Linguistic model. [liŋ′gwis·tik ′mäd·əl] (computer science) A method of automatic pattern recognition in which a class of patterns is defined as those patterns satisfying a certain set of relations among suitably defined primitive elements. Also known as syntactic model.

Intelligent error correcting system and method in network searching process

InactiveCN101206673AMeets preferencesSolve the problem of pinyin error correctionSpecial data processing applicationsLinguistic modelAlgorithm
The invention relates to an intelligent error correction system of key words in the process of searching networks and a method thereof. On an Internet platform, firstly, a related linguistic model and a corresponding dictionary as well as a data index database are established through the training of related data information; secondly, a text is inputted, a Pinyin error correction part calculates the mistakes of Pinyin and characters, the error correction of characters is calculated by a fuzzy match; finally, all results are filtered according to the degree of association, a plurality of results are sorted to get the proximal results. The polyphone mistakes and character types as well as word types mistakes inputted by a user are corrected by means of the sound-character conversion and fuzzy error correction technical methods to correct the character replace mistakes, the unwanted character or the leakage of character mistakes, the character position mistakes, etc. in the input process. Moreover, the basic functions are expanded on the basis such as the English-Chinese and punctuations mixing error correction, the fuzzy match technique, the related prompt technique and the enhanced intelligence error correction.
Owner:北京当当网信息技术有限公司

Electronic medical record text named entity recognition method based on pre-trained language model

The invention belongs to the technical field of medical information data processing, and particularly relates to an electronic medical record text named entity recognition method based on a pre-training language model, which comprises the following steps: collecting an electronic medical record text from a public data set as an original text, and preprocessing the original text; labeling the preprocessed original text entity based on the standard medical term set to obtain a labeled text; inputting the annotation text into a pre-training language model to obtain a training text represented bya word vector; constructing a BiLSTM-CRF sequence labeling model, and learning the training text to obtain a trained labeling model; and taking the trained labeling model as an entity recognition model, and inputting a test text to output a labeled category label sequence. According to the method, text features and semantic information in the deep language model are obtained through training in the super-large-scale Chinese corpus, a better semantic compression effect can be provided, the problem that manual annotation is tedious and complex is avoided, the method does not depend on dictionaries and rules, and the recall ratio and accuracy of named entity recognition are improved.
Owner:SUZHOU INST OF BIOMEDICAL ENG & TECH CHINESE ACADEMY OF SCI

Linguistic model training method and system based on distributed neural networks

InactiveCN103810999AResolution timeSolving the problem of underutilizing neural networksSpeech recognitionLinguistic modelSpeech identification
The invention discloses linguistic model training method and system based on distributed neural networks. The method comprises the following steps: splitting a large vocabulary into a plurality of small vocabularies; corresponding each small vocabulary to a neural network linguistic model, each neural network linguistic model having the same number of input dimensions and being subjected to the first training independently; merging output vectors of each neural network linguistic model and performing the second training; obtaining a normalized neural network linguistic model. The system comprises an input module, a first training module, a second training model and an output model. According to the method, a plurality of neural networks are applied to training and learning different vocabularies, in this way, learning ability of the neural networks is fully used, learning and training time of the large vocabularies is greatly reduced; besides, outputs of the large vocabularies are normalized to realize normalization and sharing of the plurality of neural networks, so that NNLM can learn information as much as possible, and the accuracy of relevant application services, such as large-scale voice identification and machine translation, is improved.
Owner:TSINGHUA UNIV

Translation method integrating syntactic tree and statistical machine translation technology and translation device

The invention discloses a translation method integrating a syntactic tree and statistical machine translation technology and a translation device. The method comprises the following steps. First, a dictionary base, a grammatical rule base, a phrase translation probability table and a target language linguistic model between different languages are established. Then, segmentation, word property removing and grammatical analysis are conducted to an original input sentence, and a syntactic tree is generated. Then by adopting a top-down strategy, the syntactic tree is gone through, by means of each individual node and part of continuous nodes which cross the syntax, the original texts of leaf nodes are taken to be matched with the phrase translation probability table trained by the statistical machine translation, By utilizing the translated texts of the phrase translation table and the linguistic model of the target language, the purpose of improving the fluency and the accuracy of the output translated texts is achieved. By means of the translation method integrating the syntactic tree and the statistical machine translation technology and the translation device, not only is fine grit knowledge provided by the phrase translation table utilized, but also the advantages of the syntactic tree when solving the relevant problems of depth and long distance of a sentence are utilized, and the quality of the texts translated by the machine can be improved remarkably.
Owner:北京赛迪翻译技术有限公司

Pre-trained language model compression method and platform based on Knowledge distillation

The invention discloses a pre-trained language model compression method and platform based on knowledge distillation. The method comprises the steps of firstly designing a universal knowledge distillation strategy of feature migration, mapping and approaching features of each layer of a student model to features of a teacher in a process of distilling knowledge of a teacher model to the student model, focusing on the feature expression ability of a small sample in the middle layer of the teacher model, and guiding the student model by utilizing the features; constructing a distillation methodbased on self-attention crossover knowledge by utilizing the ability of self-attention distribution of a teacher model to detect semantics and syntax among words; and in order to improve the learningquality of the learning model at the early stage of training and the generalization ability of the learning model at the later stage of training, designing a linear migration strategy based on Bernoulli probability distribution to gradually complete feature mapping from teachers to students and knowledge migration of self-attention distribution. According to the method and the device, the multi-task-oriented pre-trained language model is automatically compressed, so that the compression efficiency of the language model is improved.
Owner:ZHEJIANG LAB

Voice synthesis method based on voice vector textual characteristics

The invention discloses a voice synthesis method based on voice vector textual characteristics. The voice synthesis method comprises the following steps: receiving an input text by a text analyzing module; carrying out regular processing on the textual characteristics and transmitting obtained text data to a text parameterization module; obtaining a parameterized text by adopting a single-bit heat code encoding method; receiving the parameterized text by a voice vector training module, and training a linguistic model based on voice vectors; then transmitting to a linguistic parameter training model to train a mapping model from the text to voice parameters; receiving the output text of the text parameterization module and the voice vector training module through a voice vector generation module, so as to generate the voice vectors of the text data; and transmitting the voice vectors of the text data and the mapping model from the text to the voice parameters to a linguistic parameter predication module to obtain the voice parameters corresponding to the voice vectors; and finally, synthesizing voices by a voice synthesis module. According to the voice synthesis method based on the voice vector textual characteristics, the accuracy of modeling of a voice synthesis system is improved; and the complexity and the manual participation degree of system realization are greatly reduced.
Owner:中科极限元(杭州)智能科技股份有限公司

Cardiovascular and cerebrovascular knowledge map questioning and answering method based on electronic medical records

The invention discloses a cardiovascular and cerebrovascular knowledge map questioning and answering method based on electronic medical records. The method comprises the following three parts: 1, constructing a crawler to extract encyclopedia and cardiovascular and cerebrovascular medical knowledge, fusing medical record data to construct a cardiovascular and cerebrovascular domain dictionary andcarrying out electronic medical record desensitization treatment; 2, constructing a knowledge graph based on an electronic medical record, namely realizing entity relationship extraction by using a language model, a bidirectional long-short-term memory network, a conditional random field and a convolutional neural network, screening entity relationships by using word2vec, and storing the entity relationships in Neo4j; and 3, constructing a question-answering method based on a traditional rule, constructing an ACTree and a question template by using the medical feature word set and the domain dictionary to perform question analysis and intention recognition, generating a Cypher query statement, and returning an answer. Based on the electronic medical record and the online knowledge base, cardiovascular and cerebrovascular knowledge graph construction and knowledge graph-based user complaint questioning and answering are realized.
Owner:HANGZHOU DIANZI UNIV

Question and answer processing method and device, language model training method and device, equipment and storage medium

The invention discloses a question and answer processing method and device, a language model training method and device, equipment and a storage medium, and relates to the field of natural language processing. The specific implementation scheme is as follows: obtaining at least one candidate table matched with a to-be-queried question, wherein each candidate table comprises a candidate answer corresponding to the question; processing the at least one candidate table to obtain at least one table text, the table text comprising text content of each domain in the candidate table, the domains comprising titles, headers and cells; respectively inputting the question and each table text into a preset language model to obtain a matching degree of the question and each candidate table; according to the matching degree of each candidate table, outputting a reply table, wherein the reply table is a candidate table of which the matching degree with the question is greater than a preset value or acandidate table corresponding to the maximum matching degree in the at least one candidate table. The language model is adopted to perform semantic matching on the questions and the texts, so that the matching accuracy and recall rate of the questions and the tables are improved.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Model fusion triad representation learning system and method based on deep learning

The invention discloses a model fusion triad representation learning system and method based on deep learning. The method comprises the following steps: carrying out the embedded representation of a word through a pre-trained BERT language model, and obtaining a more contextualized representation of the word; meanwhile, a masking language modeling task of a BERT structure is used for taking a triple of the masking language modeling task as sequence input; the method is used for solving the problem of multiple semantics of the same entity; the mapping entity relationship can be represented differently in different fields by using a projection or conversion matrix; however, the transformed BERT can take the triad or the description information thereof as text input and train the triad and the description information together; the mechanism of the BERT itself has different word vectors for the entity relationship in different sentences, and the problem of different semantics of the entityrelationship is effectively solved, so that the selection of TransE is not limited by the model itself. On the contrary, the model is simple enough to truly reflect the corresponding relationship among the triples. Meanwhile, the complexity of the model is reduced.
Owner:XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products