Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

109 results about "Sentence extraction" patented technology

Sentence extraction is a technique used for automatic summarization of a text. In this shallow approach, statistical heuristics are used to identify the most salient sentences of a text. Sentence extraction is a low-cost approach compared to more knowledge-intensive deeper approaches which require additional knowledge bases such as ontologies or linguistic knowledge. In short "sentence extraction" works as a filter which allows only important sentences to pass.

Interactive speech recognition system and method

ActiveCN101923854AFix recognition errorsCandidate is accurateSpeech recognitionSpeech identificationAcoustic model
The invention discloses an interactive speech recognition system which comprises an acoustic model, a language model selection module, a speech and sentence extraction module, a speech recognition module, a word candidate generation and error correction module and an interaction module, wherein the acoustic model and the language model selection module are used for selecting an acoustic model which is the most similar to an object to be recognized in the pronunciation characteristic for the object to be recognized and a language model which is the most similar to the object to be recognized in the field for the whole recognition process according to the information of the object to be recognized; the speech and sentence extraction module is used for segmenting the whole section of a speech signal into a plurality of speeches and sentences, extracting the segmented speeches and sentences and sending to the speech recognition module; the speech recognition module is used for recognizing the speeches and the sentences extracted by the speech sentence extraction module and outputting an intermediate recognition result; the word candidate generation and error correction module is used for processing the intermediate recognition result to generate a candidate assembly and correcting recognition errors according to selected candidates or input correct data to obtain a final recognition result; and the interaction module is used for sending data input by a user to the acoustic model and the language model selection module and feeding back the recognition result of the word candidate generation and error correction module to the user.
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

Aspect-level sentiment analysis method and device based on graph convolutional neural network

ActiveCN112528672AMake up for the inaccurate defect of extracting syntactic featuresImprove accuracySemantic analysisNeural architecturesFeature extractionAlgorithm
The embodiment of the invention provides an aspect-level sentiment analysis method and device based on a graph convolutional neural network. The method comprises the steps of acquiring sentences to besubjected to aspect sentiment analysis and aspect words in the sentences to be subjected to aspect sentiment analysis; preprocessing the sentences and the aspect words to be subjected to aspect sentiment analysis to obtain input vector sequences and syntactic weighted graphs corresponding to the sentences to be subjected to aspect sentiment analysis; and inputting the input vector sequence and the syntax weighted graph into a pre-trained double-graph convolutional neural network to obtain an emotion analysis result corresponding to the aspect word. According to the embodiment of the invention, the dual-graph convolutional neural network not only pays attention to syntactic features of the sentences, but also pays attention to semantic features of the sentences and extracts semantic related features corresponding to the sentences, so that the defect that syntactic feature extraction of sentences insensitive to syntactics is inaccurate is overcome, and the accuracy of emotion analysis results is improved.
Owner:BEIJING UNIV OF POSTS & TELECOMM

Tendency analysis method of public opinion based on word2vec

The invention provides a tendency analysis method of a public opinion based on word2vec. The method comprises a vector training phase, a key sentence extraction phase and a tendency discrimination phase; by extracting news key sentences, the discriminant feature space is reduced, contents with relatively large relevance with the original theme are remained, useless information is eliminated and the accuracy of tendency analysis of the public opinion is improved; a depth learning model word2vec is introduced into the tendency analysis of the public opinion, and used for comparing the semantic similarity between words and comparing the semantic similarity through word vectors, so that words with the same emotional tendency but not in an emotion dictionary can be well identified, thus even if the emotion dictionary is not complete, a better analysis effect can be obtained, and meanwhile, weighted calculation is performed on the emotional tendency of the key sentences by fusing a grammatical rule; and combined with contextual information, the limitation of simply using the semantic similarity is compensated, and the tendency is analyzed integrally from the sentences, so that the emotional tendency and emotional intensity of news texts of the text level are accurately discriminated.
Owner:TONGJI UNIV

Automatic metaphor rhetoric sentence analysis and judgment method based on part of speech, syntax and dictionary

The invention discloses an automatic metaphor rhetoric sentence analysis and judgment method based on part of speech, syntax and a dictionary. A randomly input sentence is taken as a processing object, and the method comprises the following steps: (1) segmenting words and carrying out part-of-speed tagging; (2) deleting a modifier based on syntactic analysis; (3) deleting redundant ingredients based on a simple subordinate clause; (4) deleting redundant ingredients based on a metaphoric word; (5) reducing a range of a candidate body and a candidate metaphorical object by virtue of dependency relation; (6) screening the candidate body and the candidate metaphorical object according to the dependency relation formed by corresponding words of root nodes; (7) extracting the candidate body and the candidate metaphorical object according to a simple metaphoric sentence extraction rule; and (8) realizing automatic analysis and judgement on a metaphor rhetoric sentence based on automatic judgement of a metaphor rhetoric technique. The method disclosed by the invention is high in automation degree and high in judgement accuracy rate and can be widely applied to automatic metaphor rhetoric analysis and judgment systems of the fields of natural language deep understanding, machine translation, computer-assisted instruction and the like.
Owner:南城县工业与科技创新投资发展集团有限公司

Traditional Chinese medicine medical case naming identification method and system based on multi-feature template correction

The invention discloses a traditional Chinese medicine medical case naming identification method and system based on multi-feature template correction. The method comprises the following steps that: carrying out sentence extraction on a traditional Chinese medicine medical case text; classifying extracted sentences; carrying out word segmentation processing on each category of sentences; carryingout character feature, part of speech feature, left demonstrative word feature, right demonstrative word feature and term feature annotation on each word obtained by word segmentation in sequence; constructing training corpora; formulating a feature template; independently inputting the obtained corpora and the feature template into a conditional random field model, and training the conditional random field model to obtain the trained conditional random field model; constructing corpora to be predicted for a traditional Chinese medicine medical case to be predicted; taking constructed identification corpora as an input, inputting into the trained conditional random field model, and outputting a traditional Chinese medicine medical case category and a character position; and finally, according to the traditional Chinese medicine medical case category and the character position, identifying the four diagnostic methods, the pattern of syndrome and the therapeutic method of the traditionalChinese medicine.
Owner:山东管理学院

Chinese web page text deduplication system and method

The invention discloses a Chinese web page text deduplication system and a Chinese web page text deduplication method. The deduplication system comprises an index server and a search server, wherein the index server comprises a web page text preprocessing module, a combined characteristic sentence extraction module and a digital signature calculation module; and the search server comprises a web page text capture module and a Hash query module. The deduplication method comprises the following steps of: normalizing a web page text; extracting a combined characteristic sentence of the text; calculating a digital signature of the combined characteristic sentence; and comparing the digital signature with the existing digital signature in a Hash table, and judging whether the digital signature is duplicated or not. By the deduplication system and the deduplication method, a search engine can quickly and accurately determine and remove a large number of Chinese web pages with duplicated contents in the Internet; and when the search engine captures a new web page, the digital signature of the web page is calculated and compared with the digital signature of the web page, which has been stored by the search engine, whether the web page is duplicated or not is judged, and the web page is not stored if the web page is duplicated, so that the waste of a storage space is avoided, and the search accuracy of the search engine is improved simultaneously.
Owner:SHENGLE INFORMATION TECH SHANGHAI

Multi-feature fusion Chinese-over-the-sea news viewpoint sentence extraction method

The invention relates to a multi-feature fusion Chinese-overtopped news viewpoint sentence extraction method, and belongs to the technical field of natural language processing. Firstly, a cross-language representation learning method is adopted to construct a Chinese-Vietnamese bilingual word embedding model; and then calculating feature weights of the topic, emotion and position of the sentence,and fusing the feature weight information into a coding layer and an attention mechanism to obtain representation of the sentence in the aspects of topic, emotion, position and the like. And finally,viewpoint sentence classification is carried out according to the obtained sentence representation. Aiming at the problem that Chinese and Vietnamese marking resources are unbalanced, a Chinese-Vietnamese bilingual word embedding model is constructed; according to the method, the sentences are extracted from the sentences, then the weights of the topics, the positions and the sentiment features ofthe sentences are calculated respectively, the sentence weights are fused into the word vectors and the attention mechanism respectively, sentence semantic information and the sentiment, topic and position features are combined, and the accuracy of extracting the sentences of the Hami news viewpoints can be effectively improved.
Owner:KUNMING UNIV OF SCI & TECH

Method and system for processing text semantics by utilizing image processing technology and semantic vector space

The invention belongs to the technical field of text semantic information processing, and in particular relates to a method and a system for processing text semantics by utilizing an image processing technology and semantic vector space. The system comprises a text input and preprocessing module, a semantic vector construction module, a semantic information processing module and a semantic processing result display module, wherein the semantic information processing module is specifically used for semantic turning sentence extraction, semantic noise sentence detection, semantic range tracking and semantic scene segmentation. According to the method and the system, a text unit is mapped to a pixel in an image, and a semantic vector which describes the text unit is taken as pixel grayscale of the image, so that various technologies and methods in an image processing field can be introduced to process a text flexibly and intuitively without the influence of the diversification of word forms; meanwhile, the semantic vector is constructed by instructing a Word2Vec method, so that the lightweight of algorithms is ensured to meet the requirements on real-time application.
Owner:SHANGHAI JILIAN NETWORK TECH CO LTD

Text-summarization generating method based on neural network

The invention provides a text-summarization generating method based on a neural network. The text-summarization generating method based on the neural network includes the steps that an input documentis subjected to word segmentation and vectorization expression, and word vectors are obtained; all obtained word vectors of all sentences are input into a first layer of a first circulation neural network in sequence, and state vectors of the sentences after current word vectors of the sentences are input are obtained, wherein the state vectors of the corresponding sentences after the last word vectors of all the sentences are input represent sentence vectors of the sentences; all the sentence vectors are input into a second layer of the first circulation neural network in sequence, and corresponding document state vectors after all the sentences are input into the document are obtained, wherein the corresponding document state vector after the last sentence is input is a state vector of the whole document; expression of the input document is decoded through a second circulation neural network, and summarization is generated. According to the text-summarization generating method basedon the neural network, the cost problem when summarization is manually generated is solved, and meanwhile the information fragmentation problem and the information ambiguity problem which are caused by a sentence extraction method are solved.
Owner:北京牡丹电子集团有限责任公司数字科技中心

Key sentence extraction method and device for text paragraph

The invention provides a key sentence extraction method and device for a text paragraph, and relates to the technical field of data processing. The key sentence extraction method comprises the following steps: carrying out word segmentation on each line of text of each text paragraph in a corpus to obtain a first word segmentation result; selecting effective vocabularies from the first word segmentation result; selecting a key paragraph from each text paragraph according to the composition structure of the text paragraph; classifying the key paragraphs according to the effective vocabularies to obtain a plurality of classification categories; determining a target keyword of each classification category and the weight of each target keyword; and extracting a key sentence of each key paragraph according to the target keyword and the weight of the target keyword. According to the technical scheme, for the customer service industry, classification of customer service question and answer data and extraction of user demands and purposes are achieved, and the period of customer service knowledge accumulation is greatly shortened, and the cost of customer service knowledge accumulation isreduced, and meanwhile a complete problem set is provided for subsequent intelligent customer service.
Owner:HANGZHOU WEIMING XINKE TECH CO LTD +1

A shape filling type reading understanding analysis model and method based on reinforcement learning

The invention discloses a complete form filling type reading understanding analysis model and method based on reinforcement learning. The model comprises a coding layer, which is used for vectorizingwords of an original text, coding the words, taking a hidden vector of the last word of each sentence, outputting the hidden vector as a sentence vector, coding the text into a sequence of sentence vectors, and transmitting the sequence to a sentence extraction layer; a sentence extraction layer which is used for selecting sentence vectors, taking obtained sentences as current given text segmentsand encoding the current given text segments; a classification layer which takes each vacancy to be filled as a problem, takes the obtained text segment codes and the word vectors of the four candidate words as input, and calculates an output probability through a multi-feature classification network; a prediction layer which is used for normalizing the probability value obtained by the upper layer and the probability value of the language model to obtain the probabilities of the four final options; And an output layer which is used for calculating the cross entropy of the probability and theactual probability obtained by the previous layer, optimizing the classification network and updating the parameters of the network by taking the loss value as a delay reward.
Owner:SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products