Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

53 results about "Sentence word" patented technology

A sentence word (also called a one-word sentence) is a single word that forms a full sentence. Henry Sweet described sentence words as "a variety of words which have the peculiarity of always forming a sentence by themselves" and gave words such as "Come!", "John!", "Alas!", "Yes." and "No." as examples of sentence words. The Dutch linguist J. M. Hoogvliet described sentence words as "volzinwoorden". They were also noted in 1891 by Georg von der Gabelentz, whose observations were extensively elaborated by Hoogvliet in 1903; he does not list "Yes." and "No." as sentence words. Wegener called sentence words "Wortsätze".

Method and device for detecting text information in video and electronic equipment

The invention discloses a method and a device for detecting text information in a video and electronic equipment. The method for checking the text information in the video comprises the following steps: extracting a target picture from a to-be-detected video, wherein the target picture comprises a key frame in the to-be-detected video; extracting text information from the target picture, and performing text sentence word segmentation on the text information to obtain sentences after word segmentation; further carrying out vector conversion on the sentences after word segmentation to obtain word vectors of segmented words in the sentences; and finally inputting the sentence subjected to word segmentation and the word vector obtained by conversion into a text classification model; carrying out semantic recognition through the text classification model, and outputting a semantic recognition result representing whether the text information contains characters with preset semantics or not so as to realize the illegal video character detection, namely, by extracting the key frame in the video and semantically identifying the character information, the problem that the illegal video character cannot be detected due to the change of a simple character expression mode is avoided, and the accuracy and the detection efficiency of the illegal video character detection are improved.
Owner:ALIBABA GRP HLDG LTD

Knowledge base construction method and device oriented to program design field question-answering system

The invention discloses a knowledge base construction method and device oriented to a program design field question-answering system. The method comprises the following steps that: according to the knowledge contents of an online evaluation system, preliminarily establishing an intelligent question-answering knowledge base; obtaining a user question, carrying out sentence word segmentation on thequestion, carrying out synonym replacement, and carrying out similarity calculation with contents in the knowledge base; if a user is satisfied with a current returning answer, according to question answering, synchronously updating an intelligent question-answering knowledge base, and applying a code similarity algorithm to detect data in a cache in real time; and otherwise, returning a suboptimum answer to the user until the user is satisfied. By use of the system, the problems in a traditional online after class question answering system that a time difference is in the presence, search accuracy is low, efficiency needs to be improved and the like, the information acquisition speed and accuracy of the user can be effectively improved, and meanwhile, the teaching effect of the online evaluation system is optimized.
Owner:NANJING UNIV OF POSTS & TELECOMM

Method and system for generating braille file

The invention provides a method and system for generating a braille file. The method includes the steps that text of an object file is obtained so as to obtain text content information; based on the text content information, sentence word segmentation is conducted on the text to obtain sentence word segmentation information; based on the sentence word segmentation information, segmented-word Pinyin labeling is conducted on the text so as to obtain segmented-word Pinyin information; according to the segmented-word Pinyin information, braille conversion is conducted on the test so as to obtain braille character information; the method for generating the braille file further includes correction processing; the correction processing includes at least one of the following steps that in response to a user selection, ambiguity correction is conducted on the test so as to update the text content information, and the updated text content information is used for subsequent sentence word segmentation processing; in response to a user selection, sentence word segmentation correction is conducted on the text so as to update the sentence word segmentation information, and the updated sentence word segmentation information is used for subsequent segmented-word Pinyin labelling; in response to the user selection, segmented-word Pinyin correction is conducted on the text so as to update segmented-word Pinyin information, and the updated segmented-word Pinyin information is used for subsequent braille conversion.
Owner:FUJI XEROX IND DEV(CHINA) CO LTD

Sentence semantic distance measurement method

The invention relates to a sentence semantic distance measurement method. The method comprises the following steps: firstly, carrying out word segmentation and stop word removal preprocessing on a sentence data set; selecting a word meaning similarity scheme, and setting a threshold value to execute normalization of synonymous words and synonymous words; then, calculating the vector space distanceof the two statements by combining smooth inverse frequency weighting and common component removal; measuring the word order distance of the two statements according to the out-of-order degree; calculating the semantic dependency distance of the two statements by combining the semantic dependency quintuple features; and finally, carrying out hybrid weighting calculation on the vector space distance, the word order distance and the semantic dependency distance. Measurement is from three dimensions of sentence vector representation, sentence word sequence and sentence component dependency, andfinally a final semantic distance is obtained in a weighted summation manner. A word level calculation means is utilized, and a sentence level operation idea is absorbed, and through introduction andcreative combination of a vector space distance, a word order distance and a semantic dependency distance, the semantic distance of the sentences is more comprehensively and reasonably measured.
Owner:网经科技(苏州)有限公司

Method for automatically mining corresponding citing fragment and citied literature original content fragment

InactiveCN106126497AResolve ambiguityAvoid the disadvantage of requiring large-scale corpus training in advanceNatural language data processingSpecial data processing applicationsSentence segmentationComputation complexity
The invention discloses a method for automatically mining a corresponding citing fragment and a citied literature original content fragment. The method comprises the following steps: extracting a sentence which cites references as a citing fragment; carrying out sentence segmentation and numbering on the references cited by the citing fragment; carrying out word segmentation on each sentence in the citing fragment and the references to form citing fragment word groups and reference sentence word groups, and calculating similarity between the sentence in the citing fragment and the sentence in the references; and according to the calculated similarity of the sentences, sorting the sentences, extracting the sentence, which has the highest similarity with the citing fragment, in the references, and taking the extracted sentence as the cited literature original content fragment corresponding to the citing fragment. By use of the method provided by the invention, corpus training does not need to be prepared in advance, calculation complexity is low, various similarity calculation methods can be flexibly realized, and high accuracy and a high recall rate are realized.
Owner:同方知网数字出版技术股份有限公司

Mixed multi-feature sentence similarity calculation method and system, and storage medium

The invention requests to protect a mixed multi-feature sentence similarity calculation method and system, and a storage medium. The mixed multi-feature sentence similarity calculation method comprises the following steps: obtaining a test set and a training set for sentence similarity calculation, and obtaining a word vector corresponding to each word through a word vector model; calculating sentence word vector similarity through a computer by utilizing weighted sum to remove non-information noise from the word vectors based on a smooth inverse frequency algorithm; based on a word dependencytriple structure, respectively calculating the similarity between the test sentence and the sentence dependency syntax with the top ten screened similarities; and based on the sentence mixing similarity calculated by the two obtained sentence vectors, adjusting an optimization coefficient beta by adopting a P@N and MRR (mean sorting reciprocal) parameter determination method to obtain a sentencewith the maximum sentence similarity with the sentence in the training set. According to the mixed multi-feature sentence similarity calculation method, the characteristics of keywords, word vectors,syntactic structures and the like in sentences are considered, so that the deep meanings of the sentences are expressed more accurately, and the similarity of sentence contents is judged correctly.
Owner:CHONGQING UNIV OF POSTS & TELECOMM

Candidate answer selection method based on depth learning

The invention provides a method for selecting candidate answers based on depth learning, which comprises the following steps: step S1, inputting a question and a candidate answers, respectively resolving the question and the candidate answers into a question word sequence and a candidate answer word sequence; step S2, model that question word sequence and the candidate answer word sequence throughthe long-short time memory network to obtain the semantic representation of the question sentence and the semantic representation of the candidate answer; step S3, selecting that word vector of the word with the highest weight value in the question sentence word sequence to initialize the knowledge memory module; step S4, calculating the knowledge representation of the question according to the knowledge information stored in the knowledge memory module and the semantic representation of the question; Step S5, calculating the similarity between the knowledge representation of the question andthe semantic representation of the candidate answer, and selecting the candidate answer with the highest similarity to output. The invention introduces a knowledge memory module in the depth learningnetwork to improve the connection between the question sentence and the candidate answer, and to improve the quality of the answer selection, so as to be better applied to the community question answering website and the question answering system.
Owner:EAST CHINA NORMAL UNIV

Bilingual sentence automatic alignment method and device

ActiveCN112668307AHigh precisionAccurately estimate translation probabilitiesNatural language translationInformation transferSentence word
The invention discloses a bilingual sentence automatic alignment method and device, and the method comprises the steps of obtaining an article pair set, enabling each article pair to comprise a source language article S and a target language article T, dividing sentences of an article, and carrying out the statistics of the relative length of each sentence and the relative position of each sentence in the article; determining word similarity between sentences si in the source language article S and sentences tj in the target language article T by utilizing a word vector model; calculating the distance between the sentence in the source language article S and the sentence in the target language article T by utilizing the inter-sentence word similarity, the sentence relative length difference and the relative position difference of the sentence in the article, taking the relative length of the sentence as the information amount, minimizing the sum of the products of the distance and the information amount as an information transfer optimization model, and solving the model to establish an alignment relationship. According to the invention, alignment between sentences is converted into searching for an optimal transportation strategy, and under the condition that work is minimum, all information of a source language article is transferred into a target language article.
Owner:TSINGHUA UNIV

Multi-round dialogue omission recovery method based on gated copying and masking

The invention provides a multi-round dialogue omission recovery method based on gated copying and masking. The method comprises the steps of obtaining original omission sentences and context text content of the original omission sentences; performing word segmentation on the text by using a word segmentation tool, and mapping a word sequence into a digital sequence by using a dictionary; using a pre-trained word vector file to represent words; based on a gating mechanism, fusing the multi-head self-attention information and a gating encoder of the Bi-GRU, and performing semantic encoding on the omitted sentence word vector sequence and the context word vector sequence; calculating soft mask features of the omitted sentence based on a soft mask mechanism; calculating probability distribution of the word list by using a mask decoder; calculating scores of the context words, and normalizing the scores by using a Softmax function to obtain context probability distribution; and adding the probability distribution of the word list and the context probability distribution by using a gating unit to obtain final omission word probability distribution, and selecting filling contents of omission sentences. The omission recovery result accuracy is improved.
Owner:中国科学院电子学研究所苏州研究院

Restatement generation method, system and equipment and computer readable storage medium

PendingCN109933806AVariety of typesTypes are flexibleSpecial data processing applicationsAlgorithmMode selection
The invention discloses a restatement generation method, system and equipment and a computer readable storage medium. The method comprises: based on a sequence comprising a replication mechanism to asequence framework, calculating word vectors of source sentence words and keyword words; a context vector and an attention vector, the mode selection of each target word in the target sentence is determined; if the mode is a duplication mode, the mode is a duplication mode;, if yes, calculating probability distribution of the keyword set through a softmax function according to the output of the decoder; selecting the keyword with the maximum probability as a predicted target word to be copied to a target sentence, if the keyword is in a write-in mode, calculating probability distribution of the whole dictionary through a softmax function according to the output of a decoder, and selecting the word with the maximum probability as the predicted target word to be written into the target sentence; sequentially generating target words to obtain a whole target sentence; the generated restatement keywords conforming to the user given keywords can be directly copied from the original sentences, the matching conforming to the user habits is generated from the dictionary, and the applicability is high.
Owner:CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY

Intelligent robot chat system and method based on similar-sentence searching

The invention discloses an intelligent robot chat system and method based on similar-sentence searching. The system includes a receiving unit used for receiving a sentence input by a user, a similar-sentence calculation unit used for calculating a sentence which is in sentences included in a question and answer knowledge library and is closest to the sentence input by the user, an output unit usedfor outputting a corresponding answer in the question and answer knowledge library, and a question and answer knowledge library unit used for storing dialogues in various situations. According to thesystem, on the one hand, a method of information retrieval is adopted, an inverted index is established in the question and answer knowledge library, to-be-found vocabulary can be quickly located ininverted indexing, and index space is saved at the same time; and on the other hand, weight calculation and sentence word order distance comparison are carried out according to different part-of-speech of vocabulary, cases where words used in sentences are similar but meanings are different can be distinguished, thus the more accurate similar sentence is obtained, a more accurate searching resultis provided, and interactive experience of the user is enhanced.
Owner:SHENZHEN SANBOT INNOVATION INTELLIGENT CO LTD

Emotion analysis method and device fused with emotion dictionary

The invention provides a sentiment analysis method and device fused with a sentiment dictionary, which can be applied to the field of artificial intelligence. The method comprises the following steps: inputting an obtained sentence in a source field and an obtained sentence in a target field into a pre-generated encoder to obtain a source field sentence word vector and a target field sentence word vector; inputting the source field word vector and the target field sentence word vector into a sentence vector encoder to respectively obtain source field sentence representation and target field sentence representation; and obtaining a sentiment classification result of the target domain by using the source domain sentence representation and the target domain sentence representation. According to the method, sentiment words are taken as prediction targets, corpora of a source domain and a target domain are jointly trained through a language model, an encoder capable of extracting sentiment analysis related features is generated, and then feature vectors of texts to be subjected to sentiment classification are generated by using a word encoder; therefore, the technical effect that the available classification result can be obtained by using less or no annotation data is achieved.
Owner:INDUSTRIAL AND COMMERCIAL BANK OF CHINA

Self-supervised public opinion comment viewpoint object classification method based on comparative learning

The invention relates to a self-supervised public opinion comment viewpoint object classification method based on comparative learning, and belongs to the field of natural language processing. The method comprises the following steps: constructing a data set of microblog comment viewpoint object classification; k-means clustering is carried out on the basis of Word2Vec word vectors, a special self-attention mechanism is fused to obtain vector representation of comments, comment sentence representation is reconstructed, positive and negative examples of comment sentences are constructed through comment sentence word vector representation and sentence representation in the aspect of reconstruction, text features related to comment viewpoint objects are enhanced through a contrastive learning method, and the comment viewpoint objects are obtained. The unrelated distance between the sentences and the non-viewpoint objects is enlarged, so that the comment sentences are deduced and classified through the model. And finally, the comment text is classified into four case aspects: a certain institution, a party, a certain name and others, and support is provided for subsequent microblog comment abstracts.
Owner:KUNMING UNIV OF SCI & TECH

Relationship extraction method combining neural network and feature calculation

The invention discloses a relationship extraction method combining a neural network and feature calculation. The method comprises the following steps of 1, performing vector mapping on a text based ona random word vector; 2, extracting atomic features in the sentence, performing feature calculation on the atomic features to obtain composite features, and performing vector mapping on the compositefeatures; 3, performing convolution pooling operation on the word vector matrix through a neural network to extract features; 4, splicing a result after convolution pooling with the composite featurevector in the sentence; and step 5, performing full connection, and predicting a result on a Softmax layer. On the basis of making full use of sentence text complete information, structure and semantic information is obtained in combination with a feature calculation method. Simultaneous introduction of neural network technology, according to the method, the characteristic that a neural network automatically extracts high-dimensional abstract features in a layered mode is brought into full play, a result obtained after sentence word vectors are input into a convolution pooling layer is combined with composite feature vectors, the problem of feature sparseness caused by the limited number of words in sentences is avoided to a certain extent, and therefore the experimental performance of arelation extraction task is effectively improved.
Owner:GUIZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products