75 results about "Sentence length" patented technology

In English grammar, sentence length refers to the number of words in a sentence. Most readability formulas use the number of words in a sentence to measure its difficulty. Yet in some cases, a short sentence can be harder to read than a long one.
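As a rough illustration of the definition above, sentence length can be counted as the number of whitespace-separated words per sentence. The function name `sentence_lengths` and the naive sentence splitter are assumptions made for this sketch; real readability tools handle abbreviations and other edge cases more carefully.

```python
import re


def sentence_lengths(text):
    """Return the word count of each sentence in `text`.

    Sentences are split naively after '.', '!' or '?' followed by whitespace,
    so abbreviations such as "e.g." will be split incorrectly.
    """
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    # A "word" here is any whitespace-separated token.
    return [len(s.split()) for s in sentences]


if __name__ == "__main__":
    print(sentence_lengths("Short one. This sentence is a bit longer!"))
```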

Online traditional Chinese medicine text named entity identifying method based on deep learning

The invention discloses a deep-learning-based method for identifying named entities in online traditional Chinese medicine text. Online traditional Chinese medicine text data are obtained through a web crawler, and the named entities in the data are labeled using existing terminological dictionaries with human assistance. A word2vec tool learns from large-scale unlabeled corpora to produce fixed-length word vectors, which form a corresponding glossary. The online text is word-segmented, each word is converted into its fixed-length word vector by glossary lookup, and the word vectors serve as input to a convolutional neural network, with a blank character used for padding when a sentence is too short. The output of the convolutional neural network serves as input to a bidirectional long short-term memory recurrent neural network, which outputs the identification result for the words to be identified. Compared with traditional named entity identification methods, the method reduces the complexity and workload of feature extraction, simplifies processing and remarkably improves identification efficiency.
Owner:SOUTH CHINA UNIV OF TECH
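The padding step in the abstract above (filling with a blank character when a sentence is shorter than the network's input length) might look like the following minimal sketch. The name `pad_sentence`, the `"<blank>"` token and the truncation of over-long sentences are assumptions for illustration; the abstract only describes padding.

```python
def pad_sentence(tokens, target_len, pad_token="<blank>"):
    """Pad a tokenized sentence to exactly `target_len` tokens.

    Shorter sentences are filled with `pad_token`. Longer sentences are
    truncated, which is an assumption not stated in the abstract.
    """
    if len(tokens) >= target_len:
        return tokens[:target_len]
    return tokens + [pad_token] * (target_len - len(tokens))


if __name__ == "__main__":
    print(pad_sentence(["ginseng", "tonifies", "qi"], 5))
```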

Chinese network review emotion classification method based on integrated study frame

The invention discloses a Chinese network review emotion classification method based on an integrated study frame. The method takes a part-of-speech combination mode, an order-preserving sub-matrix mode and a frequent word sequence mode as input characteristics. At the characteristic level it accounts for the influence of Chinese word order information, interval phrase characteristics and sentence length, and it addresses characteristic vector sparsity through semantic similarities. This handles the large number of review text characteristics, guarantees independence among base classifiers, and improves the classification performance of the base classifiers as much as possible. A base classifier algorithm constructed from product attributes comprehensively considers the emotion information of each attribute in a text, and the sentence-level emotional tendency of the review is then judged, so that the final classification result is more accurate. The method is applicable to e-commerce network review emotion classification in various fields; it lets a potential consumer learn how a commodity has been evaluated before purchase and lets a merchant better understand consumers' opinions, so that service quality is improved.
Owner:NANJING SILICON INTELLIGENCE TECH CO LTD

Text similarity calculation method and device, computer equipment and computer storage medium

The invention discloses a text similarity calculation method and device, and relates to the technical field of text processing, which can accurately calculate the similarity between texts in a text with complex expression. The method comprises the steps of obtaining training word segmentation corpora obtained after word segmentation is conducted on text corpora with different sentence lengths; inputting the training word segmentation corpora as training data into a supervision model for training, and constructing a sentence vector conversion model which is used for converting sentences in the text corpus into sentence vectors for representing text characteristics; adjusting characteristic parameters in the sentence vector conversion model according to the sentence vector which is obtained by training and represents the text characteristics; based on the adjusted sentence vector conversion model, performing sentence vector conversion on the plurality of target texts to obtain a plurality of sentence vectors representing the characteristics of the target texts; and calculating the similarity among the plurality of target texts according to the plurality of sentence vectors representing the characteristics of the target texts.
Owner:PING AN TECH (SHENZHEN) CO LTD
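The final step of the abstract above compares target texts through their sentence vectors. The abstract does not name the similarity measure; cosine similarity is a common choice and is used here purely as an assumption. The function name `cosine_similarity` and the toy vectors are illustrative only.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length numeric vectors.

    Returns 0.0 if either vector has zero norm (a convention chosen here).
    """
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    # Toy "sentence vectors"; a trained model would produce these.
    print(cosine_similarity([1.0, 2.0, 0.0], [2.0, 4.0, 0.0]))
```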

Song ci poetry text message hiding technology based on hybrid encryption

The invention provides a Song ci poetry text message hiding technology based on hybrid encryption, which belongs to the information hiding and data security directions in the field of computers. The Song ci poetry text message hiding technology comprises the steps of encrypting secret information to be hidden by using an advanced encryption standard (AES) in a hybrid manner, encrypting an AES secret key by using an elliptic curve cryptography (ECC) algorithm, passing all information after encryption processing through a 140 tune name template library of the complete collection of Song ci poetry, and hiding the information by means of the system which is composed of templates, a dictionary, a steganographic device and an extractor, wherein the system can generate steganographic Song ci poetry through a random selection or template designation method according to the length of a cryptograph, and the sentence length, grammatical style and intonation sentence pattern of the steganographic Song ci poetry conform to the original Song ci poetry completely, thereby achieving the purposes of obfuscating attackers and ensuring secure transmission of the hidden information. The Song ci poetry text message hiding technology disclosed by the invention can solve the security problem of data transmission in channels, can provide double security measures of information hiding and data encryption, and has high practical application value.
Owner:NANJING UNIV OF AERONAUTICS & ASTRONAUTICS

Method for grading Chinese electronic document reading on the Internet

The invention discloses a method for grading Chinese electronic document reading on the Internet, comprising firstly determining the frequency distributions of Chinese characters, word groups and sentence structure indexes in different grades of documents; selecting the Chinese characters and the word groups for grading document reading, and avoiding the interference of often-used words and little-used words, then analyzing the word composition of a to-be-graded target document, analyzing the document to be a two-tuple vector (of words and occurrence number); calculating the sentence structure indexes of the document comprising an average paragraph length, an average sentence length, the length difference between the longest sentence and the shortest sentence and the like; and finally using the Naive Bayes method for determining the reading grade of the document based on the word composition information and the sentence structure information of the Chinese document. The reading grade of a Chinese electronic document is efficiently determined by analyzing the Chinese characters and word group composition of the document, combining with the sentence structures of the document, reasoning from the frequency distribution of each word and the structure indexes in different reading grades of documents and applying the Naive Bayes method.
Owner:NANJING UNIV
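The sentence structure indexes named in the abstract above (average paragraph length, average sentence length, and the gap between the longest and shortest sentence) can be sketched as follows. Measuring length in characters, splitting paragraphs on newlines and splitting sentences on 。！？ are assumptions for this sketch, as is the function name `structure_indexes`; the Naive Bayes grading step is not shown.

```python
import re


def structure_indexes(document):
    """Compute simple sentence-structure indexes for a Chinese document.

    Lengths are counted in characters. Paragraphs are newline-separated and
    sentences end at 。, ！ or ？ (both choices are assumptions).
    """
    paragraphs = [p.strip() for p in document.split("\n") if p.strip()]
    sentences = [
        s.strip()
        for p in paragraphs
        for s in re.split(r"[。！？]", p)
        if s.strip()
    ]
    if not sentences:
        return {"avg_paragraph_length": 0.0,
                "avg_sentence_length": 0.0,
                "length_range": 0}
    lengths = [len(s) for s in sentences]
    return {
        "avg_paragraph_length": sum(len(p) for p in paragraphs) / len(paragraphs),
        "avg_sentence_length": sum(lengths) / len(lengths),
        # Difference between the longest and the shortest sentence.
        "length_range": max(lengths) - min(lengths),
    }


if __name__ == "__main__":
    print(structure_indexes("今天天气很好。我们去公园！\n你好吗？"))
```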

Electronic medical record entity relationship extraction method based on shortest dependency subtree

The invention provides an electronic medical record entity relationship extraction method based on a shortest dependency subtree. The method comprises the following steps: firstly, extracting an entity-based shortest subtree from the original sentence through dependency syntactic analysis to compress the sentence length; secondly, encoding the sentence through a bidirectional long short-term memory (BLSTM) neural network; then learning the final semantic representation of the sentence through a max pooling layer; and finally classifying the sentence through a softmax classifier to obtain the entity relationship. The method removes noise words and compresses sentence length while fully retaining the key words that express the relation between the entities, so that the semantic relations in the compressed sentence are clearer. This addresses the problem that existing electronic medical record entity relation extraction models cannot represent the semantic information of overly long sentences well, and it improves the performance of the relation extraction model.
Owner:SICHUAN UNIV

Example sentence searching method and system

Active | CN102890723A | Regularize output example sentences | Special data processing applications | User input | Calculation methods
The invention relates to the field of natural language processing, and provides a query-based example sentence searching method. The method comprises the following steps: obtaining the query input by a user; processing the query; searching an example sentence library for example sentences matching the query, and calculating the relevance between the query and the example sentences; adjusting the relevance scores according to a usage diversity or translation diversity principle, and sorting the example sentences; and outputting the example sentences and presenting the phrases in them. The invention further provides a query-based example sentence searching system. In the scheme provided by the invention, several factors are considered together when calculating the relevance between the query and the example sentences: the features of the phrases in the example sentences that relate to the query, the syntactic features, the structural integrity of the example sentences, the sentence length feature, and the digital noise feature of punctuation in the example sentences. The method is superior to other relevance calculation methods.
Owner:深圳宜搜天下科技股份有限公司

Translation model optimization method for dynamically adjusting length punishment and translation length

Active | CN111178092A | The optimization method is simple | The optimization method is convenient and effective | Natural language translation | Neural architectures | Data set | Algorithm
The invention discloses a translation model optimization method for dynamically adjusting the length penalty and translation length. The method comprises the steps of obtaining standard data in a specified language direction as a standard bilingual data set for predicting various indexes; performing word segmentation on the standard bilingual data set, and performing further training to obtain a new training data set; modifying the decoder part of a neural machine translation model so that it automatically predicts the optimal length penalty value for the current sentence pair; performing length statistics to obtain the target-language sentence length; preparing an independent feedforward neural network model so that the translation finally predicted by the model tends toward a translation result with the optimal length; and enabling the Transformer neural machine translation model to dynamically adjust the length penalty and the optimal translation sentence length for different sentences. The method realizes dynamic adjustment of the length penalty and translation length during model translation; it is simple to implement, effective and practical, and it clearly improves translation quality.
Owner:沈阳雅译网络技术有限公司
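For readers unfamiliar with length penalties in neural machine translation, the sketch below shows one widely used fixed form, the GNMT-style penalty ((5 + |Y|) / 6)^alpha that divides a hypothesis's log-probability. This is a swapped-in, well-known formula and not the patent's method: the abstract above describes predicting the penalty value per sentence pair, whereas here `alpha` is simply passed in by hand. The function names are hypothetical.

```python
def length_penalty(length, alpha):
    """GNMT-style length penalty: ((5 + length) / 6) ** alpha.

    alpha = 0 disables the penalty; larger alpha favors longer outputs.
    """
    return ((5 + length) / 6) ** alpha


def rescore(log_prob, length, alpha):
    """Length-normalized score for a hypothesis with total `log_prob`."""
    return log_prob / length_penalty(length, alpha)


if __name__ == "__main__":
    # Two hypotheses with different lengths, compared under the same alpha.
    for log_prob, length in [(-4.0, 4), (-5.0, 9)]:
        print(length, rescore(log_prob, length, alpha=0.6))
```

A per-sentence scheme such as the one the abstract describes would replace the constant `alpha` with a value predicted for each input.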

Mixed corpus word segment method based on LSTM (Long Short Term Memory)-CNN (Convolutional Neural Network)

The invention discloses a mixed corpus word segmentation method based on an LSTM (Long Short Term Memory)-CNN (Convolutional Neural Network). The method comprises the steps of converting the training mixed corpus data into character-level mixed corpus data; counting the characters of the mixed corpus data to obtain a character set, and numbering each character to obtain a character serial number set; counting the character labels to obtain a label set, and numbering the labels to obtain a label serial number set; segmenting the corpus according to sentence length, and grouping the obtained sentences according to sentence length to obtain a data set; randomly selecting a sentence subgroup from the data set and extracting a plurality of sentences from it, wherein the characters of each sentence form a datum w and the corresponding label set is y; converting the datum w into the corresponding serial numbers and sending them, together with the labels y, to the LSTM-CNN model to train the parameters of the deep learning model; and converting the to-be-predicted mixed corpus data into data matched with the deep learning model and sending it to the trained deep learning model to obtain a word segmentation result.
Owner:北京知道未来信息技术有限公司
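The grouping and sampling steps in the abstract above (group sentences by length, pick a group at random, draw several sentences from it) can be sketched as below. Exact-length grouping, the function names `group_by_length` and `sample_batch`, and the `batch_size` parameter are assumptions for illustration; the abstract does not say how groups are bounded.

```python
import random
from collections import defaultdict


def group_by_length(sentences):
    """Group sentences (strings of characters) by their character length."""
    groups = defaultdict(list)
    for sentence in sentences:
        groups[len(sentence)].append(sentence)
    return dict(groups)


def sample_batch(groups, batch_size, rng=random):
    """Pick one length group at random and draw up to `batch_size` sentences.

    Every sentence in the returned batch has the same length, so no padding
    is needed within the batch.
    """
    length = rng.choice(sorted(groups))
    bucket = groups[length]
    return rng.sample(bucket, min(batch_size, len(bucket)))


if __name__ == "__main__":
    groups = group_by_length(["ab", "cd", "efg", "hij", "klm"])
    print(sample_batch(groups, batch_size=2))
```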

Corpus processing method and device and storage medium

The invention discloses a corpus processing method. The corpus processing method comprises the steps: acquiring the phoneme frequency of each phoneme and the sentence length frequency of each sentence in an original corpus, wherein the phoneme frequency of each phoneme represents the number of the same phonemes in the original corpus, and the sentence length frequency of each sentence represents the number of sentences with the same sentence length in the original corpus; and calculating a frequency parameter of each sentence according to the phoneme frequency and the sentence length frequency, and taking the frequency parameter as a score of the sentence, wherein the frequency parameter is in negative correlation with the phoneme frequency, and is in negative correlation with the sentence length frequency. The invention further discloses a corpus processing device and a storage medium. According to the corpus processing method, a reliable standard is provided for corpus selection, so that the reliability of corpus sentence selection during screening can be improved, and the screening efficiency of a large number of text corpora is effectively improved, and the corpus processing method is suitable for large-scale corpus information screening tasks.
Owner:GUANGZHOU DUOYI NETWORK TECH +2
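The scoring idea in the abstract above can be illustrated with a small sketch. The abstract only states that the frequency parameter is negatively correlated with both phoneme frequency and sentence length frequency; the specific formula below (mean inverse phoneme frequency plus inverse sentence length frequency) is a hypothetical choice that satisfies those two properties, and the name `score_sentences` is likewise an assumption.

```python
from collections import Counter


def score_sentences(corpus):
    """Score each sentence; rarer phonemes and rarer lengths score higher.

    `corpus` is a list of sentences, each a list of phoneme strings.
    The formula is a hypothetical example, not the patent's formula.
    """
    phoneme_freq = Counter(p for sentence in corpus for p in sentence)
    length_freq = Counter(len(sentence) for sentence in corpus)
    scores = []
    for sentence in corpus:
        if not sentence:
            scores.append(0.0)
            continue
        # Mean inverse frequency of the sentence's phonemes.
        rarity = sum(1 / phoneme_freq[p] for p in sentence) / len(sentence)
        # Inverse of how many sentences share this sentence's length.
        scores.append(rarity + 1 / length_freq[len(sentence)])
    return scores


if __name__ == "__main__":
    corpus = [["a", "b"], ["a", "b"], ["c", "d", "e"]]
    print(score_sentences(corpus))
```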