Patents

Literature

Patsnap Eureka AI that helps you search prior art, draft patents, and assess FTO risks, powered by patent and scientific literature data.

30 results about "Sentence clustering" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

Usually sentence clustering is used to cluster sentences derived from different documents and can be considered as a transverse segmentation of the documents content. Thus, the number of clusters can exceed the number of documents.

Multiple-document automatic abstracting method based on frequent itemset

InactiveCN102043851ASimple and easy to implementImprove simplicitySpecial data processing applicationsDocument preparationSentence similarity

The invention discloses a multiple-document automatic abstracting method based on a frequent itemset. In the method, a frequent itemset excavating ideal in the association rules is introduced, and an associating method is utilized to excavate the frequent itemsets of effective itemset to serve as child themes; sentences are directly clustered to different child themes without carrying out sentence similarity computing; and multiple-document automatic abstracting is carried out on the basis of an SFI (sub-topics based on frequent item sets) method. In the method, the sentences are directly clustered to different child themes without carrying out sentence similarity computing, thus the method has the characteristics of high simplicity, high legibility, high practicability and the like.

Multiple-document automatic abstracting method based on frequent itemset

Multiple-document automatic abstracting method based on frequent itemset

Multiple-document automatic abstracting method based on frequent itemset

Owner:SICHUAN UNIV

News sentence clustering method based on semantic similarity, device and storage medium

ActiveCN107679144AAccurate clusteringEfficient clusteringSemantic analysisSpecial data processing applicationsComputational semanticsSemantic vector

The invention provides a news sentence clustering method based on semantic similarity. The method includes the following steps: preprocessing news sentences of a corpus, and extracting available words; utilizing the available words to train a continuous bag-of-words model to obtain an initial word vector of each available word; utilizing an initial sentence vector of each news sentence and the initial word vectors of the left and right adjoining available words of a certain available word in the news sentence to train the continuous bag-of-words model in an iterative manner to obtain a currentword vector of each available word in the news sentence and a final sentence vector of the news sentence; merging an average value of the word vectors of all the available words, one-hot vectors of high-frequency words and the final sentence vector of each news sentence to obtain a semantic vector of the news sentence; and calculating distances between the semantic vectors to obtain the semanticsimilarity between the different news sentences, and clustering the news sentences of the corpus in accordance therewith. The invention also provides an electronic device and a computer-readable storage medium.

News sentence clustering method based on semantic similarity, device and storage medium

News sentence clustering method based on semantic similarity, device and storage medium

News sentence clustering method based on semantic similarity, device and storage medium

Owner:PING AN TECH (SHENZHEN) CO LTD

Text data viewpoint summary mining method merging topic attributes and emotion information

ActiveCN108287922APrecise topic attributesSemantic analysisSpecial data processing applicationsFeature vectorViewpoints

The invention provides a text data viewpoint summary mining method merging topic attributes and emotion information. The method comprises the steps of preprocessing a text corpus set of a topic; inputting a topic corpus set and a background corpus set; extracting the topic attributes of the topic corpus set; adding emotional polarities to the obtained topic attributes, and vectorizing sentences; taking the obtained topic attributes as evaluation objects, obtaining emotional attribute features contained in the sentences, and conducting feature vectorization on one sentence by means of a topic attribute and emotion analysis method; utilizing an obtained topic attribute set and a text sentence feature vector set S to construct a three-layer graph structure, and clustering all the text sentences; selecting sentences from class clusters to form a viewpoint summary, and selecting the sentences with high scores to form a viewpoint summary. According to the text data viewpoint summary mining method, the extracted topic attributes are more accurate by adopting a topic attribute extraction method, and meanwhile the text data viewpoint summary mining method can be applied not only to the field of Chinese microblogs but also to the field of website news and product reviews.

Text data viewpoint summary mining method merging topic attributes and emotion information

Owner:FUZHOU UNIV

Method for structured processing of Chinese pathological text

InactiveCN104899260AImprove accuracyAdapt to data structuring needsSpecial data processing applicationsText database clustering/classificationSentence segmentationC-value

The present invention relates to a method for structured processing of a Chinese pathological text. The method comprises the following steps: extracting template information corresponding to each sample from a hierarchical stricture of a sample of text data of a pathological report text data and indicator; extracting the template information comprising short sentence segmentation and indicator name extraction; classifying the short sentences; with respect to each sample, in combination with a classification result cluster and a short sentence cluster, calculating a TF value, an IDF value and a C-value of each indicator name in an indicator name list in a short sentence language material, and screening out an indicator name whose TF value, IDF value and C-value satisfy a threshold, and using the obtained indicator name as a component of the final template. According to the present invention, a non-structured Chinese pathological text can be structured.

Method for structured processing of Chinese pathological text

Method for structured processing of Chinese pathological text

Method for structured processing of Chinese pathological text

Owner:DONGHUA UNIV +1

Multilingual automatic abstract method

ActiveCN109829161AFind quicklyNeural architecturesEnergy efficient computingEngineeringNetwork model

The invention relates to the technical field of text generation in natural language processing. The invention relates to a method, in particular to a multilingual automatic abstract method. INCLUDINGA Whole AUTOMATIC ABNORMATION SYSTEM, the automatic abstract system is divided into a model training module; a single-document abstract module and a multi-document abstract module, the model trainingmodule is divided into a text preprocessing module and a training module; wherein the single-document summary module is divided into a text preprocessing module and a summary generation module, the multi-document summary module is divided into a text preprocessing module, a multi-language sentence clustering module and a summary generation module, a model in the model training module is a seq2seqneural network model, and a training text is obtained through summary-summary generation. According to the invention, a multilingual generative automatic abstract system is designed and realized, a bilingual word embedding technology and a deep learning method are adopted, and a brief abstract is generated for a text or a text set specified by a user, so that the user is helped to browse intentions of an original text and quickly find out the most required information.

Multilingual automatic abstract method

Multilingual automatic abstract method

Multilingual automatic abstract method

Owner:YANBIAN UNIV

Information processing method and system for knowledge services

ActiveCN105373546AImprove experienceFit real needsSpecial data processing applicationsUser needsKnowledge services

The invention discloses an information processing method and system for knowledge servers. The method comprises the following steps: obtaining all or part of the knowledge points as a knowledge point set; determining the semantic information of each knowledge point in the knowledge point set; determining a sentence cluster set corresponding to the knowledge points according to semantic information; determining corresponding chapter information according to the sentence cluster set; and determining corresponding digital resources according to the chapter information. According to the method, the semantic information of the knowledge points is comprehensively considered, and the manner of correlating the corresponding knowledges through the keywords input by the users is not used, so that the method more fits the real demands of the users and is capable of correlating the corresponding knowledges mostly fitting the user demands according to the semantic information of the knowledge points, so that the organization of the knowledges in the field in a knowledge point manner is really realized and the user experience is improved.

Information processing method and system for knowledge services

Information processing method and system for knowledge services

Information processing method and system for knowledge services

Owner:NEW FOUNDER HLDG DEV LLC +2

Structural processing method for a thyroid ultrasound report based on a tree structure

ActiveCN109918672AReduce complexityIncrease coverageText database indexingSpecial data processing applicationsSentence segmentationPart of speech

The invention relates to a tree-shaped structured template established according to a part-of-speech dictionary and a dependency relationship tree, and a method for structuring a thyroid ultrasound report by referring to the template. The overall process mainly comprises a part-of-speech dictionary establishing module, a tree structure template establishing module and a tree template calling structuring stage. And the part-of-speech dictionary establishing module is used for carrying out short sentence segmentation on the report and carrying out short sentence clustering. And then a complete part-of-speech dictionary is established by using a named entity recognition technology according to the organ words ORG, the position words LOC, the attribute words ATT and the attribute names. And the tree template establishing module is used for analyzing by using a dependency syntactic to obtain a semantic relationship of each short sentence and obtaining a part-of-speech of each word by usinga part-of-speech dictionary. And a tree template establishment process is provided by combining the two steps. And the tree template calling module is used for carrying out text structuring by using atree template.

Structural processing method for a thyroid ultrasound report based on a tree structure

Structural processing method for a thyroid ultrasound report based on a tree structure

Structural processing method for a thyroid ultrasound report based on a tree structure

Owner:DONGHUA UNIV +1

Method for segmenting communication transcripts using unsupervised and semi-supervised techniques

InactiveUS7912714B2Digital data information retrievalNatural language data processingLexical similaritySentence clustering

A method is provided for forming discrete segment clusters of one or more sequential sentences from a corpus of communication transcripts of transactional communications that comprises dividing the communication transcripts of the corpus into a first set of sentences spoken by a caller and a second set of sentences spoken by a responder; generating a set of sentence clusters by grouping the first and second sets of sentences according to a measure of lexical similarity using an unsupervised partitional clustering method; generating a collection of sequences of sentence types by assigning a distinct sentence type to each sentence cluster and representing each sentence of each communication transcript of the corpus with the sentence type assigned to the sentence cluster into which the sentence is grouped; and generating a specified number of discrete segment clusters by successively merging sentence clusters according to a proximity-based measure between the sentence types assigned to the sentence clusters within sequences of the collection.

Method for segmenting communication transcripts using unsupervised and semi-supervised techniques

Method for segmenting communication transcripts using unsupervised and semi-supervised techniques

Method for segmenting communication transcripts using unsupervised and semi-supervised techniques

Owner:NUANCE COMM INC

Corpus generation device and method, human-machine interaction system

InactiveUS10268678B2Easy to operateSave resourcesNatural language translationSemantic analysisHuman–robot interactionParallel corpora

A corpus generation device and method, the device comprising: a segmentation module, connected to at least one monolingual parallel corpus for segmenting a sentence into words and processing the segmented words by a knowledge-driven approach; a classification module, for classifying sentences having different tag sequences but the same meaning into the same sentence cluster; a mapping module, for determining the categories of sentence structures of all the sentences in the sentence cluster, recording and storing a mapping mode for transforming tags between sentence structures when different categories of sentence structures in the same sentence cluster are transformed; a sentence structure generation module, for generating sentence structures according to a first mapping mode between a first category of sentence structures in one of the sentence clusters and other categories of sentence structures in the same sentence cluster; and a corpus generation module, for nesting a word corresponding to a sequence tag to generate a new monolingual parallel corpus.

Corpus generation device and method, human-machine interaction system

Corpus generation device and method, human-machine interaction system

Corpus generation device and method, human-machine interaction system

Owner:SHENZHEN GOWILD ROBOTICS CO LTD

Corpus generation device and method, human-machine interaction system

ActiveUS20180004730A1Improve translation accuracyAccurate corpusNatural language translationSemantic analysisText corpusSentence clustering

A corpus generation device and method, the device comprising: a segmentation module, connected to at least one monolingual parallel corpus for segmenting a sentence into words and processing the segmented words by a knowledge-driven approach; a classification module, for classifying sentences having different tag sequences but the same meaning into the same sentence cluster; a mapping module, for determining the categories of sentence structures of all the sentences in the sentence cluster, recording and storing a mapping mode for transforming tags between sentence structures when different categories of sentence structures in the same sentence cluster are transformed; a sentence structure generation module, for generating sentence structures according to a first mapping mode between a first category of sentence structures in one of the sentence clusters and other categories of sentence structures in the same sentence cluster; and a corpus generation module, for nesting a word corresponding to a sequence tag to generate a new monolingual parallel corpus.

Corpus generation device and method, human-machine interaction system

Corpus generation device and method, human-machine interaction system

Corpus generation device and method, human-machine interaction system

Owner:SHENZHEN GOWILD ROBOTICS CO LTD

Information extraction method based on deep semantic comprehension

PendingCN110889275AEfficient use ofAbstract highSemantic analysisText database clustering/classificationEntity typeRelationship extraction

The invention provides an information extraction method based on deep semantic comprehension, which comprises the following steps of: constructing a body and a basic relationship in the field, and manually labeling parts of corpora; processing the manually annotated corpora, identifying an entity type corresponding to a specific relationship, and mining new words and synonyms in the field at the same time; merging synonyms recognized in the sentences, abstracting the original sentences and making syntactic analysis; clustering the abstracted sentences into sentence templates, and performing template learning; making sentence template evaluation; and performing new relationship extraction on manually unlabeled corpora by utilizing the sentence template, and evaluating and filtering a new relationship. According to the method provided by the invention, the syntactic analysis result can be better utilized, so that the automatically mined template has higher-level abstraction and generalization capabilities.

Information extraction method based on deep semantic comprehension

Information extraction method based on deep semantic comprehension

Information extraction method based on deep semantic comprehension

Owner:鼎复数据科技(北京)有限公司

Theme information-based text segmentation method

ActiveCN110110326AEasy retrievalSemantic analysisCharacter and pattern recognitionFeature vectorFeature extraction

The invention discloses a theme information-based text segmentation method, which comprises the following specific operations of: preprocessing an input text and a training set to obtain a sentence consisting of a series of words; carrying out feature extraction to obtain feature vectors of the features; carrying out clustering operation on the input text according to semantic information contained in the sentence cluster to obtain a series of sentence clusters, and distributing a digital label for each cluster in sequence to obtain a series of simple sentences with the digital labels; distributing existing theme tags in a training set for each sentence, so that the existing theme tags in the training set are distributed to all sentences in the text. According to the invention, the digitallabel labeling result and the theme label labeling result are used for correction to obtain the text fragment with the theme label, and the theme label is distributed to the cut text, so that the theme described by the sentence can be clearly seen, the position for describing the theme in the text can be conveniently positioned according to the theme, and the retrieval is more convenient.

Theme information-based text segmentation method

Theme information-based text segmentation method

Theme information-based text segmentation method

Owner:XI AN JIAOTONG UNIV

Sentence cluster extract method and device based on object knowledge point

ActiveCN105512238AImprove accuracyNatural language data processingSpecial data processing applicationsData miningArtificial intelligence

The invention relates to a sentence cluster extract method and device based on object knowledge points; the method comprises the following steps: obtaining knowledge point accuracy attributes; using the accuracy attribute to extract attribute of the knowledge point from to-be processed digit resources; using the accuracy attribute and fuzzy attribute to do sentence cluster hitching of the knowledge points in the to-be processed digit resources; obtaining the knowledge point sentence cluster. The accuracy attribute and fuzzy attribute of the knowledge points are added so as to improve knowledge point sentence cluster extract accuracy.

Sentence cluster extract method and device based on object knowledge point

Sentence cluster extract method and device based on object knowledge point

Sentence cluster extract method and device based on object knowledge point

Owner:NEW FOUNDER HLDG DEV LLC +2

Method and device for clustering sentences

PendingCN111858916AClustering implementationRich clustering methodsSemantic analysisText database clustering/classificationPattern recognitionSemantic vector

The embodiment of the invention discloses a method and device for clustering sentences. One specific embodiment of the method comprises the steps of determining a set composed of semantic vectors corresponding to all sentences in a to-be-clustered sentence set as a semantic vector set; for each semantic vector in the semantic vector set, executing the following density calculation operation; for each semantic vector in the semantic vector set, executing the following clustering division operation; for each established cluster, determining the semantic vector with the maximum density in the semantic vectors divided into the cluster as the clustering center semantic vector of the cluster; and determining to-be-clustered sentences corresponding to the determined clustering center semantic vectors as a clustering center sentence set. According to the embodiment, the sentence clustering accuracy is improved.

Method and device for clustering sentences

Method and device for clustering sentences

Method and device for clustering sentences

Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Method for automatically analyzing user comments in application store and recommending comments to developers

PendingCN110827118AReduce intakeImprove experienceSemantic analysisBuying/selling/leasing transactionsEngineeringSoftware development

The invention relates to a method for automatically analyzing user comments in an application store and recommending the user comments to developers. The method is technically characterized by comprising the following steps: collecting user comment data and preprocessing the user comment data; carrying out intention classification on the user comments and establishing a classification model; carrying out topic classification on the user comments under each intention classification; performing sentence clustering on the user comments under each topic category, and calculating the clustering center position; establishing a mechanism for evaluating the priority of the user comments, calculating comprehensive scores of the user comments and recommending the comprehensive scores to a software developer. According to the method, intention classification, topic classification and sentence clustering are performed through comment information, and comments are processed in combination with timesequence and sentiment analysis; the hotspot top-k comments recommended and returned by the system are obtained, comment contents with reference values are provided for developers, so that referencesare provided for development and maintenance of applications, intake of redundant information of the developers is effectively reduced, user experience is improved, and the method has the characteristics of accurate and reliable content analysis, convenience in use and the like.

Method for automatically analyzing user comments in application store and recommending comments to developers

Method for automatically analyzing user comments in application store and recommending comments to developers

Method for automatically analyzing user comments in application store and recommending comments to developers

Owner:TIANJIN UNIV

Context mining method and device based on clustering algorithm and electronic equipment

PendingCN111291186AImprove analysis efficiencySpecial data processing applicationsText database clustering/classificationCluster algorithmAlgorithm

The invention provides a context mining method and device based on a clustering algorithm and electronic equipment. The method and the device specifically comprise the following steps: in response toa mining request of a user, screening from a pre-prepared call text according to a keyword specified by the mining request to obtain a plurality of key sentences containing the keyword; and intercepting a plurality of associated sentences directly connected with the key sentences from the call text; performing unsupervised clustering processing on the plurality of key sentences to obtain a plurality of sentence clusters; and for each statement cluster, performing context construction according to the keywords and the associated statements. According to the scheme, context construction for thecorresponding keywords is realized on the basis of the electronic equipment, so that a user can analyze important topics, verbal skills and the like of massive call texts according to the constructedcontext contents without viewing the text contents one by one, and the call text analysis efficiency is improved.

Context mining method and device based on clustering algorithm and electronic equipment

Context mining method and device based on clustering algorithm and electronic equipment

Context mining method and device based on clustering algorithm and electronic equipment

Owner:BEIJING SINOVOICE TECH CO LTD

A Text Automatic Summarization Method Based on Fusion Semantic Clustering

ActiveCN108197111BExpress co-occurrence relationshipEfficient removalSemantic analysisRelational databasesPattern recognitionSemantic vector

The invention discloses an automatic text summarization method based on fusion semantic clustering. The method comprises the steps of text preprocessing, wherein preprocessing is conducted on originaldocuments, and word frequency information of keywords in the text is counted; weight calculation, wherein local weights are combined, and global weights and introduced relevant weights are used for determining the contribution degree of the keywords in sentences; semantic analysis, wherein a text matrix is subjected to singular value decomposition to obtain a semantic analysis model to calculatea semantic vector of each sentence; clustering, wherein K sentence clusters are obtained through a clustering algorithm in a semantic space on the basis of the calculated sentence semantic vectors; sentence selection, wherein the sentence weights is calculated in each sentence cluster, the first n sentences are selected to compose an abstract according to ranking, and the redundancy is removed. The method is simple and practical, a characteristic representation is provided for the text, the semantic connection of the context is integrated, a co-occurrence relationship between the sentences andwords is more fully displayed, and the generated abstract can better in line with the theme of the text.

A Text Automatic Summarization Method Based on Fusion Semantic Clustering

A Text Automatic Summarization Method Based on Fusion Semantic Clustering

A Text Automatic Summarization Method Based on Fusion Semantic Clustering

Owner:SOUTH CHINA UNIV OF TECH

Method for segmenting communication transcripts using unsupervised and semi-supervised techniques

InactiveUS20090112571A1Digital data information retrievalNatural language data processingLexical similaritySentence clustering

A method is provided for forming discrete segment clusters of one or more sequential sentences from a corpus of communication transcripts of transactional communications that comprises dividing the communication transcripts of the corpus into a first set of sentences spoken by a caller and a second set of sentences spoken by a responder; generating a set of sentence clusters by grouping the first and second sets of sentences according to a measure of lexical similarity using an unsupervised partitional clustering method; generating a collection of sequences of sentence types by assigning a distinct sentence type to each sentence cluster and representing each sentence of each communication transcript of the corpus with the sentence type assigned to the sentence cluster into which the sentence is grouped; and generating a specified number of discrete segment clusters by successively merging sentence clusters according to a proximity-based measure between the sentence types assigned to the sentence clusters within sequences of the collection.

Method for segmenting communication transcripts using unsupervised and semi-supervised techniques

Method for segmenting communication transcripts using unsupervised and semi-supervised techniques

Method for segmenting communication transcripts using unsupervised and semi-supervised techniques

Owner:NUANCE COMM INC

Intelligent input method and device for electronic medical records

ActiveCN112883712AImprove the efficiency of medical record inputMuch timeMedical data miningNatural language data processingMedical recordEngineering

The invention provides an intelligent input method and device for electronic medical records. The intelligent input method comprises the following steps: classifying massive electronic medical record text data according to disease categories; performing similarity calculation on the sentences under each disease category, and clustering the sentences according to the similarity; extracting keywords or words within the first few words of the same kind of sentences as sentence heads, establishing a sentence head library, taking other parts of the same kind of sentences except the sentence heads as sentence tails, establishing a sentence tail library, counting the occurrence frequency of the same kind of sentence tails, and establishing a sentence frequency library; after a specific keyword or word is input and a sentence head library is matched ,determining a sentence clustering category, calling a sentence tail library and a sentence frequency library, carrying out sentence tail completion, and displaying a plurality of sentence tails with frequency ranks from high to low in the sentence frequency library on display equipment for a user to select; and obtaining the sentence tail selected by the user every time and updating the sentence frequency library in real time. The medical record input efficiency of medical staff can be effectively improved, so that more time is reserved for treating a patient, and the medical quality is improved.

Intelligent input method and device for electronic medical records

Intelligent input method and device for electronic medical records

Intelligent input method and device for electronic medical records

Owner:GENERAL HOSPITAL OF SOUTHERN THEATRE COMMAND OF PLA

News sentence clustering method, device and storage medium based on semantic similarity

ActiveCN107679144BAccurate clusteringEfficient clusteringSemantic analysisSpecial data processing applicationsSemantic vectorBag-of-words model

The invention provides a method for clustering news sentences based on semantic similarity, the method comprising the following steps: preprocessing the news sentences of the corpus to extract available words; using the available words to train the continuous bag-of-words model, Obtain the initial word vector of each available word; Utilize the initial sentence vector of each news sentence and the initial word vector of the left and right adjacent available words of a certain available word in this news sentence to iteratively train the described continuous word bag model, obtain the The current word vector of each available word in the news sentence and the final sentence vector of the news sentence; the average value of the word vectors of all available words of each news sentence, the one-hot vector of high-frequency words and the final sentence vector are merged, The semantic vector of the news sentence is obtained; the distance between the semantic vectors is calculated to obtain the semantic similarity between different news sentences, and the news sentences of the corpus are clustered accordingly. The invention also provides an electronic device and a computer-readable storage medium.

News sentence clustering method, device and storage medium based on semantic similarity

News sentence clustering method, device and storage medium based on semantic similarity

News sentence clustering method, device and storage medium based on semantic similarity

Owner:PING AN TECH (SHENZHEN) CO LTD

Method for constructing and processing human behavior text data set based on crowdsourcing

ActiveCN113407716AImprove accuracyCharacter and pattern recognitionEnergy efficient computingHuman behaviorData set

The invention discloses a method for constructing and processing a human behavior text data set based on crowdsourcing, which comprises the following steps of: firstly, determining a subject object needing to be collected, generating a task according to a specific requirement, publishing the task to a crowdsourcing platform, and obtaining a text data set of all possible human examples under a set subject; presenting the text of the same behavior or event in a plurality of sentences after being written by different persons, so that different sentences describing the same event need to be clustered together, and different text representations belonging to the same behavior are clustered into one class for the acquired data set by adopting a clustering mode; mining a precedence relation structure existing between behaviors by adopting a correlation analysis technology; adopting the mutual information technology to learn a mutual exclusion relation structure existing between behaviors, creating various relations existing in the human behaviors into a plot. What events can occur under a certain condition is indicated, the occurrence mode of the events is limited, and the analysis accuracy of the human behaviors is improved.

Method for constructing and processing human behavior text data set based on crowdsourcing

Method for constructing and processing human behavior text data set based on crowdsourcing

Method for constructing and processing human behavior text data set based on crowdsourcing

Owner:GUILIN UNIV OF ELECTRONIC TECH +1

Natural language-based airworthiness instruction problem feature extraction

PendingCN112115711AImprove accuracyTime consuming goodCharacter and pattern recognitionNatural language data processingEngineeringData pre-processing

The invention relates to the technical field of airworthiness certification, in particular to natural language-based airworthiness instruction problem feature extraction, which comprises the followingsteps of: extracting problem description chapters behind an airworthiness instruction, and carrying out text data preprocessing; detecting overlapped sentence clusters; selecting a given number of sentence clusters; extracting feature descriptors. The method for extracting the features by detecting the overlapped sentence clusters and directly selecting the phrases from the text description has higher accuracy. Meanwhile, the method has better performance in the aspect of time consumption compared with a comparison method selected in the prior art; key design features, expressed by the airworthiness instruction text, of aircraft products can also be found in feature extraction actually for airworthiness instructions.

Natural language-based airworthiness instruction problem feature extraction

Natural language-based airworthiness instruction problem feature extraction

Natural language-based airworthiness instruction problem feature extraction

Owner:中国民用航空上海航空器适航审定中心

Generating and using a sentence model for answer generation

ActiveUS20220075951A1Semantic analysisOther databases indexingAlgorithmTheoretical computer science

In an approach to generating and using a sentence model for answer generation, one or more computer processors ingest a first corpus of a plurality of text sentences. One or more computer processors convert the plurality of text sentences into a plurality of sentence vectors. One or more computer processors group the plurality of sentence vectors into a plurality of sentence clusters, wherein a sentence cluster is composed of sentences that are semantically similar. One or more computer processors receive a second corpus. One or more computer processors determine, for each sentence cluster of the plurality of sentence clusters, a frequency each sentence cluster appears in the second corpus. Based on the determined frequency, one or more computer processors calculate a probability of each sentence cluster of the plurality of sentence clusters. Based on the calculated probabilities, one or more computer processors generate a first sentence model.

Generating and using a sentence model for answer generation

Generating and using a sentence model for answer generation

Generating and using a sentence model for answer generation

Owner:IBM CORP

An information processing method and system for knowledge service

ActiveCN105373546BImprove experienceFit real needsSemantic tool creationKnowledge servicesInformation processing

The invention discloses an information processing method and system for knowledge servers. The method comprises the following steps: obtaining all or part of the knowledge points as a knowledge point set; determining the semantic information of each knowledge point in the knowledge point set; determining a sentence cluster set corresponding to the knowledge points according to semantic information; determining corresponding chapter information according to the sentence cluster set; and determining corresponding digital resources according to the chapter information. According to the method, the semantic information of the knowledge points is comprehensively considered, and the manner of correlating the corresponding knowledges through the keywords input by the users is not used, so that the method more fits the real demands of the users and is capable of correlating the corresponding knowledges mostly fitting the user demands according to the semantic information of the knowledge points, so that the organization of the knowledges in the field in a knowledge point manner is really realized and the user experience is improved.

An information processing method and system for knowledge service

An information processing method and system for knowledge service

An information processing method and system for knowledge service

Owner:NEW FOUNDER HLDG DEV LLC +2

A method and device for extracting sentence groups based on target knowledge points

InactiveCN105512238BImprove accuracyNatural language data processingText database indexingData miningArtificial intelligence

The invention relates to a sentence cluster extract method and device based on object knowledge points; the method comprises the following steps: obtaining knowledge point accuracy attributes; using the accuracy attribute to extract attribute of the knowledge point from to-be processed digit resources; using the accuracy attribute and fuzzy attribute to do sentence cluster hitching of the knowledge points in the to-be processed digit resources; obtaining the knowledge point sentence cluster. The accuracy attribute and fuzzy attribute of the knowledge points are added so as to improve knowledge point sentence cluster extract accuracy.

A method and device for extracting sentence groups based on target knowledge points

A method and device for extracting sentence groups based on target knowledge points

A method and device for extracting sentence groups based on target knowledge points

Owner:NEW FOUNDER HLDG DEV LLC +2

Key sentence extraction method, system, and computer-readable storage medium

ActiveCN113505213BEasy accessNo human intervention requiredSemantic analysisCharacter and pattern recognitionSentence segmentationSentence extraction

The invention discloses a key sentence extraction method, system, and computer-readable storage medium, wherein the key sentence extraction method includes the following steps: obtaining a target question and a target answer; performing sentence processing on the target answer to obtain several answer sentences; calculating each answer The correlation between the sentence and the target question is obtained to obtain the corresponding correlation score; the answer sentences are combined in pairs to obtain a number of answer pairs, and the coherence between the two answer sentences in the answer pair is calculated to obtain the corresponding coherence score; Based on the coherence score, each answer sentence is clustered to obtain several sets of sentence clusters; the correlation score corresponding to each answer sentence in the sentence cluster is extracted, and the relationship between the sentence cluster and the target question is calculated based on the extracted correlation score. Correlation degree: extract each answer sentence in the sentence cluster with the greatest correlation degree, and obtain the corresponding key sentence. The key sentences extracted by the present invention take both coherence and relevance into consideration, and can accurately express the central content of the target answer.

Key sentence extraction method, system, and computer-readable storage medium

Key sentence extraction method, system, and computer-readable storage medium

Key sentence extraction method, system, and computer-readable storage medium

Owner:无码科技(杭州)有限公司

A Method for Structural Processing of Chinese Pathological Texts

InactiveCN104899260BImprove accuracyAdapt to data structuring needsSpecial data processing applicationsText database clustering/classificationSentence segmentationComputer science

The present invention relates to a method for structured processing of a Chinese pathological text. The method comprises the following steps: extracting template information corresponding to each sample from a hierarchical stricture of a sample of text data of a pathological report text data and indicator; extracting the template information comprising short sentence segmentation and indicator name extraction; classifying the short sentences; with respect to each sample, in combination with a classification result cluster and a short sentence cluster, calculating a TF value, an IDF value and a C-value of each indicator name in an indicator name list in a short sentence language material, and screening out an indicator name whose TF value, IDF value and C-value satisfy a threshold, and using the obtained indicator name as a component of the final template. According to the present invention, a non-structured Chinese pathological text can be structured.

A Method for Structural Processing of Chinese Pathological Texts

A Method for Structural Processing of Chinese Pathological Texts

A Method for Structural Processing of Chinese Pathological Texts

Owner:DONGHUA UNIV +1

A text data opinion summarization mining method that integrates topic attributes and sentiment information

ActiveCN108287922BPrecise topic attributesSemantic analysisSpecial data processing applicationsWeb siteFeature vector

The present invention provides a text data viewpoint summary mining method that combines topic attributes and emotional information, including: preprocessing the text corpus of the topic; inputting the topic corpus and the background corpus; extracting the topic attributes of the topic corpus; and obtaining Add emotional polarity to the topic attribute of the sentence and vectorize the sentence; use the obtained topic attribute as the evaluation object to obtain the emotional attribute characteristics contained in the sentence, and use the topic attribute and sentiment analysis method to perform feature vectorization on a sentence; use the obtained topic The attribute set and text sentence feature vector set S construct a three-layer graph structure to cluster all text sentences; select sentences from clusters to form opinion summaries, and select sentences with high scores to form opinion summaries. The invention makes the topic attributes extracted by the method of extracting topic attributes more accurate, and also makes it not only applicable to the field of Chinese microblog, but also applicable to the fields of website news and commodity reviews.

A text data opinion summarization mining method that integrates topic attributes and sentiment information

Owner:FUZHOU UNIV

A Text Segmentation Method Based on Topic Information

ActiveCN110110326BEasy retrievalSemantic analysisCharacter and pattern recognitionFeature vectorFeature extraction

The invention discloses a text cutting method based on subject information. The specific operation is as follows: preprocessing the input text and the training set to obtain a sentence composed of a series of words; then performing feature extraction to obtain its feature vector; and then according to its implication The semantic information of the input text is clustered to obtain a series of sentence clusters, and a numerical label is assigned to each cluster in order to obtain a series of single sentences with numerical labels; each sentence is assigned an existing sentence in the training set Topic tags, so that the existing topic tags in the training set are assigned to all sentences in the text; use the digital tag labeling results and the topic tag labeling results to make corrections to obtain text fragments with topic tags, and assign topic tags to the cut text In this way, the topics described in the sentences are clearly visible, and the position in the text describing the topic can be easily located according to the topic, making retrieval more convenient.

A Text Segmentation Method Based on Topic Information

A Text Segmentation Method Based on Topic Information

A Text Segmentation Method Based on Topic Information

Owner:XI AN JIAOTONG UNIV

Systems and methods for discovering and exploring concepts

ActiveCN105745679BCustomer relationshipNatural language data processingPattern recognitionMachine learning

A method for identifying concepts in a plurality of interactions comprising: filtering, on a processor, the interactions based on intervals; creating, on the processor, a plurality of sentences from the filtered interactions; computing the prominence of each of said statements; deleting statements with low salience on said processor so as to produce a set of informative sentences; aggregating said set of informative sentences on said processor statement to generate a plurality of statement clusters, each of said clusters corresponding to one of said concepts; computing the salience of each of said clusters on said processor; and Each of the clusters is named on the processor.

Systems and methods for discovering and exploring concepts

Systems and methods for discovering and exploring concepts

Systems and methods for discovering and exploring concepts

Owner:GREENEDEN U S HLDG II LLC

Popular searches

Documentation One-hot High frequency High Frequency Waves Microblogging Analysis method Extraction methods Information retrieval Cut score Product reviews