Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

645 results about "Text retrieval" patented technology

Text retrieval is a branch of information retrieval where the information is stored primarily in the form of text.

Video content quick retrieving method based on object tag

The invention provides a video content quick retrieving method based on an object tag. The method comprises the following steps: extracting and analyzing the color feature, contour feature, scene feature and character feature of a moving object in each image frame of a video; processing a plurality of pictures of known types by using the feature extraction method, and training a contour classifier and a scene classifier by using the contour features and scene features of the pictures; processing a video to be retrieved by using the feature extraction and analysis method and the classifiers soas to generate type tags of objects in each image frame of the video, wherein the type tags are used for constructing an object tag database; and retrieving a response server to search the object tagdatabase to find videos related to a query request submitted by a user, and generating an ordered result for the user to browse and refer. The method provided by the invention can be used for retrieving the video content at a speed similar to that of the conventional text retrieval only by searching the object tag database and achieving the fine granularity retrieval of the video content, so thatthe method is more accurate than the conventional method.
Owner:SOUTH CHINA UNIV OF TECH

Spark SQL-based distributed full text retrieval system and method

The invention relates to a Spark SQL-based distributed full text retrieval system and method. The system comprises an SQL translation layer, a data source management layer, a parallel calculation layer and a distributed storage layer; an SQL-based full text retrieval method and translation processes, among modules of the SQL translation layer, of full text retrieval SQL statements are proposed; a full text retrieval process parallelization method is designed in a data source management module; and in a retrieval optimization module, two index storage models and corresponding primitive table data reduction strategies during query are designed, wherein a partition align connection algorithm which is used for reducing primitive table data during query and has a complexity of O (n) is designed for an index appointed column-based storage model. Under the two storage models, the index construction time is shortened to 0.6% / 0.5% of the traditional database, the query time is shortened to the 1% / 10% of the traditional database, and the index storage amount is decreased to 55.0% of the traditional database. According to the method, the Spark SQL data analysis function is strengthened, and the requirements for traditional business migration and full text retrieval carried out on mass data in the existing businesses can be satisfied.
Owner:INST OF SOFTWARE - CHINESE ACAD OF SCI

Cryptogram-based safe full-text indexing and retrieval system

The invention discloses a cryptogram-based safe full-text indexing and retrieval system. In the system, a cryptogram index library comprises a cryptogram entry reverse index and an internal document object set; a cryptogram document library is responsible for storing and managing an encrypted XML document; a word segmentation encryption server carries out Chinese word segmentation on a plaintext document and encrypts the plaintext document item by item; a cryptogram full-text indexing server standardizes an original plaintext document into an XML document, encrypts and stores the XML document in the cryptogram document library, creates a corresponding internal document object in the cryptogram index library by combining document metamessage, and creates a cryptogram reverse index for the XML document through the cryptogram entry; and a cryptogram full-text retrieval server retrieves the cryptogram index library to obtain the internal document object set through user authority information and the cryptogram entry, obtains a corresponding encrypted XML document result set from the cryptogram document library according to a pointer, decrypts the corresponding encrypted XML document result set, and returns the decrypted corresponding encrypted XML document result set to a user. The Chinese word segmentation method, the safe and high-efficiency indexing structure and the retrieval mechanism of the invention based on the special requirements of cryptogram full-text indexing can realize the cryptogram full-text indexing integrated with an access control strategy. The cryptogram-based safe full-text indexing and retrieval system has the advantages of a safe and high-efficiency indexing process, no decrypted docuterms in the indexing process, a high recall ratio and a high precision ratio in a cryptogram environment, and the like.
Owner:HUAZHONG UNIV OF SCI & TECH

Titan-based enterprise information analysis platform and construction method thereof

The invention discloses a Titan-based enterprise information analysis platform and a construction method thereof. The Titan-based enterprise information analysis platform comprises a web crawler, a Hadoop distributed system infrastructure, a Titan server, an Elasticsearch server, a Cassandra database and an application layer, wherein the Hadoop distributed system infrastructure is used for storing collected structured or non-structured original data; the Titan server stores an enterprise relationship atlas, and utilizes the Cassandra database as a data storage medium and the Elasticsearch server as a storage medium for full-text retrievals in the enterprise relationship atlas; the application layer is used for displaying enterprise data and relationships on a front-end page by establishing a foreground application frame. The Titan-based enterprise information analysis platform and the construction method thereof provided by the invention have the advantages that through the data visualization technology, interest relationships among enterprises, such as investment relationships, can be rapidly sorted out; long-time and real-time full-automatic collection, storage and analysis can be achieved.
Owner:山东合天智汇信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products