Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

2247 results about "Text categorization" patented technology

Text categorization (a.k.a. text classification) is the task of assigning predefined categories to free-text documents. It can provide conceptual views of document collections and has important applications in the real world.

Category based, extensible and interactive system for document retrieval

In information retrieval (IR) systems with high-speed access, especially to search engines applied to the Internet and / or corporate intranet domains for retrieving accessible documents automatic text categorization techniques are used to support the presentation of search query results within high-speed network environments. An integrated, automatic and open information retrieval system (100) comprises an hybrid method based on linguistic and mathematical approaches for an automatic text categorization. It solves the problems of conventional systems by combining an automatic content recognition technique with a self-learning hierarchical scheme of indexed categories. In response to a word submitted by a requester, said system (100) retrieves documents containing that word, analyzes the documents to determine their word-pair patterns, matches the document patterns to database patterns that are related to topics, and thereby assigns topics to each document. If the retrieved documents are assigned to more than one topic, a list of the document topics is presented to the requester, and the requester designates the relevant topics. The requester is then granted access only to documents assigned to relevant topics. A knowledge database (1408) linking search terms to documents and documents to topics is established and maintained to speed future searches. Additionally, new strategies are presented to deal with different update frequencies of changed Web sites.
Owner:COGISUM INTERMEDIA

Academic resource recommendation service system and method

The invention provides an academic resource recommendation service system and method. The method comprises the following steps: crawling academic resources on an internet by using an LDA (Latent Dirichlet Allocation)-based focused crawler, classifying the academic resources according to preset A types by using an LDA-based text classification model, and storing the academic resources in a local academic resource database, wherein the system further comprises an academic resource model, a resource quality value calculation module and a user interest module; implanting a tracking software module at a user terminal, combining interesting subjects and historical browsing behavior data of the user, respectively modeling the academic resource model and the user interest module by virtue of four dimensions such as the academic resource type, subject theme distribution, key word distribution and LDA latent theme distribution, calculating the similarity between the academic resource model and the user interest preference module, combining the resource quality value to calculate the recommendation degree, and finally perform academic resource Top-N recommendation for the user according to the recommendation degree. According to the method disclosed by the invention, personalized accurate recommendation of the academic resources is performed according to the identity, interest and browsing behaviors of users, and the working efficiency of scientific research personnel is improved.
Owner:NINGBO UNIV

Domain-knowledge-based short text classification method and text classification system

The invention discloses a domain-knowledge-based short text classification method and a domain-knowledge-based short text classification system used in the technical field of information. The method is used for overcoming the defect that the traditional text classification method cannot well classify short texts. Aiming at the characteristics that the short text description concept signals are relatively weak and the text features are seriously insufficient, the invention provides the short text data classification method and the text classification system suitable for commodity web page data. According to the embodiment, a commodity classifier with excellent classification effect is obtained by reforming the traditional classifier, introducing new elements and devoting to matching application of algorithm and data. The introduction of the new elements comprises the following steps of: introducing a concept of domain words and introducing the concept into the classifier so as to effectively increase the information quantity of the short texts; and performing different-lexical-item-set-based semantic analysis on the short text data, particularly the web page commodity data, and introducing the semantic analysis result into the classifier so as to introduce new information for the commodity data information and improve the accuracy of text classification.
Owner:SHANGHAI BIJIA DATA

Chinese text classification method based on super-deep convolution neural network structure model

The invention provides a Chinese text classification method based on a super-deep convolution neural network structure model. The method comprises the steps of collecting a training corpus of a word vector from the internet, combining a Chinese word segmentation algorithm to conduct word segmentation on the training corpus, and obtaining a word vector model; collecting news of multiple Chinese news websites from the internet, and marking the category of the news as a corpus set for text classification, wherein the corpus set is divided into a training set corpus and a test set corpus; conducting word segmentation on the training set corpus and the test set corpus respectively, and then obtaining the word vectors corresponding to the training set corpus and the test set corpus respectively by utilizing the word vector model; establishing the super-deep convolution neural network structure model; inputting the word vector corresponding to the training set corpus into the super-deep convolution neural network structure model, and conducting training and obtaining a text classification model; inputting the Chinese text which needs to be sorted into the word vector model, obtaining the word vector of the Chinese text which needs to be classified, and then inputting the word vector into the text classification model to complete the Chinese text classification.
Owner:HEBEI UNIV OF TECH

Video classification method and device and server

ActiveCN109359636AFully consider the characteristics of different dimensionsImprove accuracySemantic analysisVideo data clustering/classificationText categorizationClassification methods
The invention discloses a video classification method and device and a server. The method comprises the following steps of: obtaining a target video; The image frames in the target video are classified by the first classification model, and the image classification result is obtained. The first classification model is used for classification based on the image features of the image frames. The audio in the target video is classified by the second classification model, and the audio classification result is obtained. The second classification model is used to classify the audio based on the audio features. The text description information corresponding to the target video is classified by the third classification model, and the text classification result is obtained. The third classification model is used to classify the text information based on the text characteristics of the text description information. According to the image classification results, audio classification results andtext classification results, the target video target classification results are determined. In the present application, image features, audio features and text features are integrated for classification, and features of different dimensions of the video are fully considered, thereby improving the accuracy of the video classification.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Natural language processing-based multi-language analysis method and device

The invention discloses a natural language processing-based multi-language analysis method and device. The method comprises the following steps of: selecting to input a natural language text information language category through a language detection training model; obtaining word embedding expression information of corresponding words which can be recognized by a computer through a trained word vector model, and extracting a keyword of the obtained word embedding expression information through a TF-IDF manner; calculating an article vector and a category vector of each preset category according to the keyword and a keyword weight, and calculating a similarity between an article of natural language text information and each preset category so as to determine a text classification result ofthe natural language text information; and inputting the word embedding expression information of the natural language text information into a trained convolutional neural network and a parallel-framework text emotion analysis model of a bidirectional gate circulation unit, and obtaining a final emotion tendency value through calculation. According to the method and device, the problem that traditional multi-language analysis method needs to know domain knowledges of related linguistics and needs plenty of manpower to carry out operation is solved.
Owner:北京百分点科技集团股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products