Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

49 results about "Latent semantic indexing" patented technology

Latent semantic indexing is an indexing and retrieval method that uses a mathematical technique called singular value decomposition to identify patterns in the relationships between the terms and concepts contained in an unstructured collection of text. LSI is based on the principle that words that are used in the same contexts tend to have similar meanings. A key feature of LSI is its ability to extract the conceptual content of a body of text by establishing associations between those terms that occur in similar contexts. LSI is also an application of correspondence analysis, a multivariate statistical technique developed by Jean-Paul Benzécri in the early 1970s, to a contingency table built from word counts in documents. Called Latent Semantic Indexing because of its ability to correlate semantically related terms that are latent in a collection of text, it was first applied to text at Bellcore in the late 1980s. The method, also called latent semantic analysis, uncovers the underlying latent semantic structure in the usage of words in a body of text and how it can be used to extract the meaning of the text in response to user queries, commonly referred to as concept searches.

Network-based method for analyzing opinion information in discrete text

The invention relates to a network-based system for analyzing opinion information in a discrete text, belonging to the field of network information safety. The system comprises the following modules: a discrete text information acquisition module which acquires network information in a preset analysis cycle, a discrete text information tracking and restoring module which restores ellipsis and remote anaphora in the original content to obtain a text which contains a relatively complete text structure and semantic information, a semantic information mining and characteristic extracting module which realizes semantic information mining and characteristic extracting on text information by utilizing a latent semantic indexing technology, an opinion information clustering module which realizes information clustering by combining a niche genetic algorithm with a K-Means method, a hot opinion event discovery module which mines the hot opinion in the obtained topic and event, and a background information processing and data supporting center which analyzes data and provides a repertoire specially for a network, new words in the network, the existing class information and the existing hot topics. By applying the invention, the problem that information analysis is influenced as the text structure of the existing network opinion information is incomplete, ellipsis and remote anaphora are more and the new works in the network are more is solved, and the accuracy for discovery of the opinion and hot event is improved by adopting a high-efficiency clustering method.
Owner:GUILIN UNIV OF ELECTRONIC TECH

API (Application Programing Interface) tag recommendation method based on heterogeneous information

The invention discloses an API (Application Program Interface) tag recommendation method based on heterogeneous information, and mainly adopts a random walk algorithm based on the heterogeneous information. The API tag recommendation method comprises the following steps: firstly, according to a relationship among the API, mashup and a mashup tag, establishing a heterogeneous network, wherein the network comprises an inclusion relationship between the API and the mashup, a corresponding relationship between the mashup and the tag and an isomorphic relationship among three elements; then, according to the heterogeneous network, generating a corresponding transfer matrix, carrying out random walk with restart on the basis of the transfer matrix, iteratively transferring to a mashup layer and a tag layer from an API vertex, and finally achieving globally stable distribution so as to obtain a probability for the API to each tag vertex; and finally, importing text processing model (Latent Semantic Indexing) to calculate the semantic similarity of the API and the tag, combining with the obtained probability to generate a final tag sorting list to recommend a proper tag for the API so as to improve tag recommendation accuracy to a large extent.
Owner:ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products