331 results about "Reverse index" patented technology

Database management systems provide multiple types of indexes to improve performance and data integrity across diverse applications. Index types include b-trees, bitmaps, and r-trees. In database management systems, a reverse key index strategy reverses the key value before entering it in the index. E.g., the value 24538 becomes 83542 in the index. Reversing the key value is particularly useful for indexing data such as sequence numbers, where each new key value is greater than the prior value, i.e., values monotonically increase. Reverse key indexes have become particularly important in high volume transaction processing systems because they reduce contention for index blocks.
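
As a minimal sketch of the idea above, the following Python snippet reverses the decimal digits of a key, mirroring the 24538 to 83542 example. The function name `reverse_key` is hypothetical, and digit-level reversal is an assumption taken from that example; a real database engine may reverse the stored key at a different granularity (for example, bytes).

```python
def reverse_key(key: int) -> str:
    """Return the decimal digits of a non-negative key in reverse order.

    The result is a string so that trailing zeros of the original key
    survive as leading zeros of the reversed key.
    """
    if key < 0:
        raise ValueError("key must be non-negative")
    return str(key)[::-1]


# Monotonically increasing sequence numbers, as in the text above.
sequence = [24538, 24539, 24540, 24541]
reversed_keys = [reverse_key(k) for k in sequence]

for original, rev in zip(sequence, reversed_keys):
    print(original, "->", rev)
```

Consecutive sequence values share a long common prefix, so in an ordinary index they tend to land in the same block. After reversal the leading characters differ, which spreads the inserts across the index and is the reason for the reduced contention described above.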

Method for segmenting and indexing scenes by combining captions and video image information

The invention relates to a method for segmenting and indexing scenes by combining captions and video image information. The method is characterized in that, within the duration of each caption, the collection of video frames is used as the minimum unit of a scene cluster. The method comprises the steps of: after obtaining the minimum unit of the scene cluster, extracting at least three discontinuous video frames to form the video key frame collection of the caption; comparing the similarities of the key frames of a plurality of adjacent minimum units by using a bidirectional SIFT key point matching method, and establishing an initial attribution relationship between the captions and the scenes by combining a caption-related transition diagram; for consecutive minimum cluster units judged to be dissimilar, further judging whether they can be merged based on the relationship between the minimum cluster units and the corresponding captions; and extracting the video scenes according to the determined attribution relationships between the captions and the scenes. For the extracted video scene segments, the forward and reverse indexes generated from the caption texts contained in the segments are used as the foundation for indexing the video segments.
Owner:INST OF ACOUSTICS CHINESE ACAD OF SCI
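
The last step of the abstract above, building forward and reverse indexes from caption texts, can be sketched as follows. This is only an illustration: the function name `build_indexes`, the segment identifiers, and whitespace tokenization are assumptions, and the scene segmentation itself (SIFT matching, merging) is not shown.

```python
from collections import defaultdict


def build_indexes(segment_captions):
    """Build a forward index (segment -> terms) and a reverse index
    (term -> segments) from the caption texts of each video segment."""
    forward = {}
    reverse = defaultdict(list)
    for segment_id, captions in segment_captions.items():
        # Collect the distinct lowercase words of all captions in the segment.
        terms = sorted({word for caption in captions
                        for word in caption.lower().split()})
        forward[segment_id] = terms
        for term in terms:
            reverse[term].append(segment_id)
    return forward, dict(reverse)


# Hypothetical segments and captions, for illustration only.
segments = {
    "scene_1": ["the train arrives", "the doors open"],
    "scene_2": ["the train leaves"],
}
forward_index, reverse_index = build_indexes(segments)
print(reverse_index["train"])
```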

Music retrieval system based on audio fingerprint features

The invention belongs to the technical field of information retrieval, and particularly relates to a music retrieval system based on audio fingerprint features. The system is composed of a preprocessing module, a feature extraction module, a reverse index module and a fine matching module. The preprocessing module mainly carries out audio signal conversion, resampling and filtering. The feature extraction module represents audio files: the audio fingerprint features select the most stable points in the frequency spectrum as feature points through two rounds of screening based on dynamic thresholds, and each feature is represented by a point pair. In the reverse index module, the features are used as keywords, reverse indexes are built from the features of the song library, and the index result is returned according to the number of matching keywords. In the fine matching module, the sequential relationship of the audio features is taken into account and an improved edit distance is adopted as the similarity of two feature sequences, thereby refining the index result. The music retrieval system based on audio fingerprint features is suitable for retrieval over a large number of songs, and is particularly effective for recorded query segments.
Owner:FUDAN UNIV
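
The reverse index module described above, where fingerprint features act as keywords and candidates are ranked by the number of shared keywords, might look roughly like this. The feature tuples, song identifiers, and function names are hypothetical; feature extraction and the fine matching step with the improved edit distance are not covered.

```python
from collections import Counter, defaultdict


def build_reverse_index(library):
    """Map each fingerprint feature to the set of songs that contain it."""
    index = defaultdict(set)
    for song_id, features in library.items():
        for feature in features:
            index[feature].add(song_id)
    return index


def candidates(index, query_features, top_n=3):
    """Rank songs by how many distinct query features they share."""
    votes = Counter()
    for feature in set(query_features):
        for song_id in index.get(feature, ()):
            votes[song_id] += 1
    return votes.most_common(top_n)


# Hypothetical point-pair features, written here as 3-tuples.
library = {
    "song_a": [(1, 2, 3), (4, 5, 6), (7, 8, 9)],
    "song_b": [(1, 2, 3), (10, 11, 12)],
}
index = build_reverse_index(library)
print(candidates(index, [(1, 2, 3), (4, 5, 6)]))
```

The ranked candidates would then be passed to a finer sequence comparison, which is where the abstract's fine matching module comes in.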

Index generation method and index generation device based on MapReduce programming architecture

The invention relates to an index generation method and an index generation device based on a MapReduce programming architecture. The index generation method comprises the following steps: acquiring data, converting the data into a unified format and storing the prepared data as a record set; carrying out header encapsulation on each data record in the record set; inserting the encapsulated data records into an HBase cluster in batches; calling a MapReduce service and an HBase service in a Hadoop cluster and connecting to a Solr cluster; carrying out a Map operation and submitting a parallel index generation task to form a reverse index intermediate file; carrying out a Reduce operation to generate a reverse index file; and starting a new Map task to carry out a split operation on the reverse index file to generate the final index. The disclosed method and device enable efficient distributed storage of mass data and construction of the index, and offer extensibility, high fault tolerance and high performance.
Owner:XIAMEN MEIYA PICO INFORMATION
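
The Map and Reduce steps above can be illustrated with a small in-process sketch. It deliberately does not call Hadoop, HBase, or Solr; the function names, the whitespace tokenizer, and the `(doc_id, position)` posting format are assumptions made for illustration.

```python
from collections import defaultdict


def map_phase(doc_id, text):
    """Emit (term, (doc_id, position)) pairs for one record."""
    for position, term in enumerate(text.lower().split()):
        yield term, (doc_id, position)


def shuffle(pairs):
    """Group intermediate pairs by term, as the framework would between phases."""
    grouped = defaultdict(list)
    for term, posting in pairs:
        grouped[term].append(posting)
    return grouped


def reduce_phase(term, postings):
    """Produce one sorted posting list per term."""
    return term, sorted(postings)


def build_index(docs):
    pairs = [pair for doc_id, text in docs.items()
             for pair in map_phase(doc_id, text)]
    return dict(reduce_phase(term, postings)
                for term, postings in shuffle(pairs).items())


# Hypothetical records keyed by integer id.
docs = {1: "reverse index file", 2: "index split"}
print(build_index(docs)["index"])
```

In a real cluster the intermediate pairs would be written to files and partitioned across reducers; the grouping dictionary here only stands in for that step.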

Microblog-oriented emotion entity searching system

The invention relates to a microblog-oriented emotion entity searching system. The emotion entity searching system comprises a user interface (1), a query expansion module (2), a query processing module (3), an emotive information mining module (4), an emotive information judging and index building module (5) and a reverse index building module (6). The user interface (1) handles interaction between a user and the system; the user can submit a query request through the user interface and obtain a feedback result. The query expansion module (2) mines word relations from microblog corpus data and builds a weighted word relation graph in combination with a WordNet ontology base. The query processing module (3) converts the user's query request into query keywords or query statements that can be accepted by the index base, and carries out query expansion on the basis of the word relation graph built by the query expansion module (2). The emotive information mining module (4) performs emotion mining on the microblog corpus base and generates judging rules for emotion entities and emotion polarities. The emotive information judging and index building module (5) judges the emotion entities and emotion polarities, then builds and stores an emotive information index. The reverse index building module (6) builds and stores a reverse index for the microblog text information. The microblog-oriented emotion entity searching system addresses the difficulty of microblog emotion entity extraction, emotion polarity analysis and emotion entity search, and provides a novel intelligent searching product for analyzing and monitoring public opinion on social networks.
Owner:FOSHAN UNIVERSITY +2

Reverse index mixed compression and decompression method based on Hbase database

The invention discloses a reverse index mixed compression method based on an Hbase database. The reverse index mixed compression method comprises the steps of processing the Hbase database to obtain an Hbase reverse index data table including keys and values; compressing the key part by a key dictionary compression method; compressing the value part by a variable bytecode compression method; and writing the compressed content into files. The invention also discloses a method for decompressing the key part of a file compressed by this method. The decompression method comprises the steps of judging the length of each compressed data item and decompressing it according to one of two cases, the length being less than or equal to 13 or the length being greater than or equal to 25; otherwise, decompression fails. By adopting classified mixed compression together with classified decompression, the method improves the compression ratio while keeping the decompression rate as high as possible; file reading and data decompression are considered together; and the query efficiency of the reverse index is improved while storage space is saved.
Owner:CHENGDU UNIV OF INFORMATION TECH
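
To illustrate the value-side compression mentioned above, here is a sketch of standard variable-byte encoding, in which small integers occupy fewer bytes. It assumes that the abstract's "variable bytecode compression" refers to this general technique, which the abstract does not confirm; the byte layout (7 data bits per byte, high bit set on every byte except the last) is one common convention, not necessarily the patented one.

```python
def vbyte_encode(numbers):
    """Encode non-negative integers, 7 data bits per byte.

    The high bit is set on every byte except the last byte of a number.
    """
    out = bytearray()
    for n in numbers:
        if n < 0:
            raise ValueError("only non-negative integers are supported")
        while n >= 0x80:
            out.append((n & 0x7F) | 0x80)  # more bytes follow
            n >>= 7
        out.append(n)  # final byte, high bit clear
    return bytes(out)


def vbyte_decode(data):
    """Decode a byte string produced by vbyte_encode."""
    numbers, value, shift = [], 0, 0
    for byte in data:
        value |= (byte & 0x7F) << shift
        if byte & 0x80:
            shift += 7
        else:
            numbers.append(value)
            value, shift = 0, 0
    if shift:
        raise ValueError("truncated input")
    return numbers


postings = [0, 5, 127, 128, 300, 2 ** 20]
encoded = vbyte_encode(postings)
print(len(encoded), vbyte_decode(encoded) == postings)
```

Posting lists are often stored as gaps between sorted document ids before this step, which keeps most values small; whether the patented method does so is not stated.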

Cryptogram-based safe full-text indexing and retrieval system

The invention discloses a cryptogram-based safe full-text indexing and retrieval system. In the system, a cryptogram index library comprises a cryptogram entry reverse index and an internal document object set; a cryptogram document library is responsible for storing and managing encrypted XML documents; a word segmentation encryption server carries out Chinese word segmentation on a plaintext document and encrypts it item by item; a cryptogram full-text indexing server standardizes an original plaintext document into an XML document, encrypts the XML document and stores it in the cryptogram document library, creates a corresponding internal document object in the cryptogram index library by combining document metadata, and creates a cryptogram reverse index for the XML document through the cryptogram entries; and a cryptogram full-text retrieval server searches the cryptogram index library using user authority information and the cryptogram entry to obtain the internal document object set, fetches the corresponding encrypted XML document result set from the cryptogram document library according to a pointer, decrypts it, and returns the decrypted result set to the user. The Chinese word segmentation method, the safe and efficient indexing structure and the retrieval mechanism of the invention are based on the special requirements of cryptogram full-text indexing and realize cryptogram full-text indexing integrated with an access control strategy. The system has the advantages of a safe and efficient indexing process, no decrypted docuterms in the indexing process, and a high recall ratio and a high precision ratio in a cryptogram environment.
Owner:HUAZHONG UNIV OF SCI & TECH
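
The idea of indexing encrypted terms rather than plaintext, with access control applied at query time, can be sketched as below. This is an illustration under loud assumptions: it substitutes keyed HMAC-SHA256 tokens for the patent's item-by-item encryption, which the abstract does not specify, and the class name, role model, and document ids are hypothetical. It is not a security recommendation.

```python
import hashlib
import hmac
from collections import defaultdict


def term_token(key: bytes, term: str) -> str:
    """Derive a deterministic keyed token for a term (stand-in for encryption)."""
    return hmac.new(key, term.encode("utf-8"), hashlib.sha256).hexdigest()


class TokenIndex:
    """Reverse index keyed by term tokens, so plaintext terms are never stored."""

    def __init__(self, key: bytes):
        self._key = key
        self._postings = defaultdict(set)  # token -> document ids
        self._allowed = {}                 # document id -> roles allowed to read

    def add(self, doc_id, terms, allowed_roles):
        self._allowed[doc_id] = set(allowed_roles)
        for term in terms:
            self._postings[term_token(self._key, term)].add(doc_id)

    def search(self, term, role):
        """Return ids of matching documents that the given role may access."""
        hits = self._postings.get(term_token(self._key, term), set())
        return sorted(d for d in hits if role in self._allowed[d])


index = TokenIndex(b"demo-key")  # hypothetical key, for illustration only
index.add("doc1", ["index", "cipher"], {"admin", "analyst"})
index.add("doc2", ["index"], {"admin"})
print(index.search("index", "analyst"))
```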

Distributed type reverse index organization method based on user log analysis

The invention discloses a distributed reverse index organization method based on user log analysis. The method comprises the following steps: 1) analyzing the users' query logs, extracting high-frequency words and non-high-frequency words, establishing a correlation matrix of the high-frequency words, and establishing a high-frequency word relation graph according to the correlation of the high-frequency words; 2) calculating the load of each high-frequency word, and clustering the high-frequency words according to the high-frequency word relation graph and the loads of the high-frequency words; 3) distributing the clusters to nodes, establishing a high-frequency word index, hashing the non-high-frequency words to the nodes, and establishing a non-high-frequency word index; 4) establishing a global index table according to the high-frequency word index and the non-high-frequency word index, and routing queries according to the global index table. The disclosed method offers low query cost and high query efficiency, balances the throughput of the entire system against the response time of each query, and involves fewer nodes when a query contains multiple words.
Owner:ZHEJIANG UNIV
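
Steps 3) and 4) above, placing clustered high-frequency words on nodes, hashing the remaining words, and routing queries through a global table, can be sketched as follows. The clustering itself is taken as given input; the class name, the SHA-256 based hash, and the node numbering are assumptions for illustration.

```python
import hashlib


def hash_node(term: str, num_nodes: int) -> int:
    """Stable hash placement for non-high-frequency words."""
    digest = hashlib.sha256(term.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_nodes


class GlobalIndexTable:
    """Routes each query term to the node that holds its posting list."""

    def __init__(self, high_freq_assignment, num_nodes):
        # high_freq_assignment: high-frequency word -> node id, assumed to be
        # the output of the clustering and distribution steps.
        self._high_freq = dict(high_freq_assignment)
        self._num_nodes = num_nodes

    def node_for(self, term):
        if term in self._high_freq:
            return self._high_freq[term]
        return hash_node(term, self._num_nodes)

    def route(self, query_terms):
        """Group query terms by target node."""
        plan = {}
        for term in query_terms:
            plan.setdefault(self.node_for(term), []).append(term)
        return plan


# Hypothetical assignment: words that co-occur in queries share a node.
table = GlobalIndexTable({"python": 0, "java": 0, "hadoop": 1}, num_nodes=3)
print(table.route(["python", "java"]))
```

Because correlated high-frequency words are clustered onto the same node, a multi-word query over them touches one node instead of several, which matches the benefit claimed in the abstract.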

Multimedia courseware retrieval system based on voice keyword recognition

CN103956166A (Inactive) · Speech recognition · Special data processing applications · Habilitation training · Hidden Markov model
The invention provides a multimedia courseware retrieval system based on voice keyword recognition. First, the back end converts pre-provided text knowledge points into voice models; the courseware is labeled using a speech recognition technique based on a hidden Markov model to locate the exact positions of the knowledge points in the multimedia courseware; and a keyword-based reverse index is constructed and maintained in an index module. When a user enters text keywords in a prompt box, the results in the index are retrieved and displayed to the user if the keywords were previously labeled. If the keywords are not labeled, the system searches the courseware in real time, waits for the user's feedback on the results and collects statistics on that feedback. Adaptive training is then carried out on the keywords to relabel the courseware and update the indexes. Compared with a traditional online course learning system, the courseware retrieval system can quickly search for and locate the keywords of the knowledge points, improves retrieval accuracy through user interaction and ultimately improves the learning efficiency of students.
Owner:EAST CHINA UNIV OF SCI & TECH

A method for storage and near-real time query of time-sensitive data based on open source big data

The invention provides a method for storage and near-real time query of time-sensitive data based on open source big data. The method comprises the steps of: establishing a near-real time query processing platform having an internal storage space and an external storage space; defining a file storage strategy and processing the source data files in the internal storage space so that they are stored in the external storage space after being arranged according to their time-sensitive characteristics; building a reverse index with the time-sensitive characteristics of the data files as the filter conditions, establishing a point index and a range index to generate index information, storing the index information in the external storage space and caching it in the internal storage space; and querying the index information, searching the point index or the range index to obtain the relevant file path lists, and reading the source data files corresponding to the query requests according to those lists. Because the data filter strategy is designed entirely around the time-sensitive characteristics, the amount of data scanned is reduced, and near-real time query feedback on big data is achieved.
Owner:EAST CHINA NORMAL UNIV
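
The point index and range index described above, which map a time attribute to lists of file paths, can be sketched with a dictionary plus a sorted key list. The class name, the ISO date strings used as time buckets, and the file paths are hypothetical; storage tiers and caching are not modeled.

```python
import bisect
from collections import defaultdict


class TimeIndex:
    """Maps a time bucket to file paths and supports point and range lookups."""

    def __init__(self):
        self._paths = defaultdict(list)  # bucket -> file paths
        self._keys = []                  # sorted distinct buckets

    def add(self, bucket, path):
        if bucket not in self._paths:
            bisect.insort(self._keys, bucket)
        self._paths[bucket].append(path)

    def point(self, bucket):
        """Files for exactly one bucket."""
        return list(self._paths.get(bucket, []))

    def range(self, start, end):
        """Files for all buckets in the inclusive interval [start, end]."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_right(self._keys, end)
        return [p for key in self._keys[lo:hi] for p in self._paths[key]]


# ISO dates sort chronologically as strings; paths are made up.
idx = TimeIndex()
idx.add("2024-01-01", "/data/a")
idx.add("2024-01-03", "/data/c")
idx.add("2024-01-02", "/data/b")
print(idx.range("2024-01-02", "2024-01-03"))
```

Only the files returned by the lookup need to be read, which is how filtering on the time attribute reduces the amount of data scanned.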