
593 results about "Content extraction" patented technology

Content extraction is the task of separating boilerplate, such as comments, navigation bars, social media links, and ads, from the main body of text of an article formatted as HTML. The main content typically accounts for only a small portion of a page’s source code (highlighted in red in the image below).
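As a rough illustration of the definition above, the following sketch separates main text from boilerplate using only the Python standard library. It is not taken from any patent listed here. The tag sets, the `min_words` threshold and the `max_link_density` threshold are assumptions chosen for the example: blocks inside navigation-like tags are dropped, and remaining blocks are kept only if they are long enough and are not mostly link text.

```python
from html.parser import HTMLParser

# Assumed for this sketch: tags whose content is treated as boilerplate,
# and tags that start or end a text block.
SKIP_TAGS = {"script", "style", "nav", "header", "footer", "aside", "form"}
BLOCK_TAGS = {"p", "div", "article", "section", "li", "h1", "h2", "h3"}


class BlockCollector(HTMLParser):
    """Collects text blocks and the share of each block that is link text."""

    def __init__(self):
        super().__init__()
        self.skip_depth = 0   # > 0 while inside a SKIP_TAGS element
        self.link_depth = 0   # > 0 while inside an <a> element
        self.text = []        # text pieces of the current block
        self.link_chars = 0   # characters of link text in the current block
        self.blocks = []      # finished (text, link_density) pairs

    def flush(self):
        joined = " ".join(" ".join(self.text).split())
        if joined:
            self.blocks.append((joined, self.link_chars / len(joined)))
        self.text, self.link_chars = [], 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1
        elif tag == "a":
            self.link_depth += 1
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif tag == "a":
            self.link_depth = max(0, self.link_depth - 1)
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_data(self, data):
        if self.skip_depth:
            return  # ignore everything inside boilerplate containers
        self.text.append(data)
        if self.link_depth:
            self.link_chars += len(" ".join(data.split()))


def extract_main_text(html, min_words=8, max_link_density=0.3):
    """Returns the text blocks that look like main content."""
    parser = BlockCollector()
    parser.feed(html)
    parser.close()
    parser.flush()
    return [text for text, density in parser.blocks
            if len(text.split()) >= min_words and density <= max_link_density]
```

Real pages usually need more than this (nested markup, comment sections, malformed HTML), so the thresholds here should be read as starting points rather than recommended values.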

Meta-content analysis and annotation of email and other electronic documents

Provided are meta-content analysis and annotation of the body of email documents and other electronic documents, and the creation of a displayable index of the meta-content instances, sorted and annotated by type. In addition, the electronic document is enhanced with links from the semantic foci to external documents containing related information. An electronic document adapted for delivery to one or more recipients, the electronic document including a header and a body, is processed by: performing meta-content extraction of semantic foci within said header and said body, the semantic foci comprising a plurality of types of information including one or more of email addresses, URLs, dates, currency values, organization names, names of people, names of places, and phone numbers; creating a meta-content index for the document based upon said extracted semantic foci; arranging the meta-content index according to said plurality of types; combining said meta-content index with said header and said body to provide an enhanced document; and sending said enhanced document to said one or more recipients via a communication network. The process includes converting the electronic mail document to a markup language format, wherein said meta-content index comprises one or more objects expressed in said markup language adapted for presentation with the body in said enhanced document.
Owner:SAP AMERICA
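
The extraction, indexing and combining steps in the abstract above can be sketched as follows. This is an illustrative assumption, not the patented implementation: the regular expressions, the plain-text index layout and the names `PATTERNS`, `build_meta_index` and `enhance` are invented for the example, and only pattern-friendly types (email addresses, URLs, currency values, ISO-style dates) are covered. Names of people, places and organizations would need a named-entity recognizer and are left out.

```python
import re

# Assumed, simplified patterns for a few of the semantic-focus types.
PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "url": re.compile(r"https?://[^\s<>\"']+"),
    "currency": re.compile(r"[$€£]\s?\d[\d,]*(?:\.\d+)?"),
    "date": re.compile(r"\b\d{4}-\d{2}-\d{2}\b"),
}


def build_meta_index(header, body):
    """Extracts semantic foci from header and body, grouped by type."""
    text = header + "\n" + body
    index = {}
    for kind, pattern in PATTERNS.items():
        values = []
        for match in pattern.finditer(text):
            value = match.group(0).rstrip(".,;")  # drop trailing punctuation
            if value not in values:
                values.append(value)
        if values:
            index[kind] = values
    return index


def enhance(header, body):
    """Combines header, a type-sorted index and body into one document."""
    index = build_meta_index(header, body)
    lines = ["[Meta-content index]"]
    for kind in sorted(index):
        lines.append(kind + ": " + ", ".join(index[kind]))
    return header + "\n\n" + "\n".join(lines) + "\n\n" + body
```

The abstract describes the index as objects expressed in a markup language; the plain-text index here only stands in for that step.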

Customer service information providing method and device, electronic equipment and storage medium

The invention provides a customer service information providing method and device, electronic equipment and a storage medium. The method comprises the steps of receiving a Chinese text input by a user; inputting the input Chinese text into a Chinese customer service question-answering model based on a Bi-LSTM (Bidirectional Long Short-Term Memory) model and a CNN (Convolutional Neural Network) model to acquire an answering statement; inputting the input Chinese text into a content extraction and intention classification model based on a Bi-LSTM-CRF (Conditional Random Field) model and an LSTM classifier to acquire customer intention classification and key information; determining the service recommended to the user according to the customer intention classification and the key information; inputting the input Chinese text into a Chinese text emotion analysis model based on the CNN model to acquire a user emotion classification; adjusting the answering statement according to the user emotion classification; and, in combination with the adjusted answering statement and the determined service, providing customer service information to the user. According to the method and device provided by the invention, the model is optimized and automatic customer service answering is realized.
Owner:Shanghai Ctrip International Travel Service Co., Ltd. (上海携程国际旅行社有限公司)

Method for extracting, analyzing and searching network flow and content

The invention discloses a method for extracting, analyzing and searching network flow and content. The method comprises the following steps: shunting original flow into n data processing queues; having each data processing queue independently process its own original data messages, performing protocol recognition and filtration on the messages and performing conversation recombination on TCP (Transmission Control Protocol) flow in the messages; performing protocol resolving and decoding on each recombined TCP conversation and extracting the structured data information therein; and, for key information specified by requirements, performing searching labeling in the data content extracted by a content resolving and extracting module based on a multimode matching algorithm or a search engine technology, and submitting the labeling results to a searching labeling information database, thereby providing searching labeling results for multiple modes of applications. The method can be used for solving the problems of repeated data packets, serial number zero adjustment and the like in the TCP conversation recombination, realizing the character labeling for the original flow, and ensuring that a user can acquire effective information conveniently.
Owner:XI AN JIAOTONG UNIV
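
The shunting and recombination steps in the abstract above might look like the sketch below. The abstract does not say how flows are assigned to queues, so hashing a direction-independent flow key is an assumption of this example, as are the `Packet` fields and all function names. The recombination shown handles duplicate packets only; it ignores overlapping segments and sequence-number wraparound.

```python
import zlib
from collections import namedtuple

# Hypothetical packet record for this sketch.
Packet = namedtuple("Packet", "src_ip src_port dst_ip dst_port proto seq payload")


def flow_key(pkt):
    """Builds a key that is identical for both directions of a conversation."""
    low, high = sorted([(pkt.src_ip, pkt.src_port), (pkt.dst_ip, pkt.dst_port)])
    return f"{pkt.proto}|{low[0]}:{low[1]}|{high[0]}:{high[1]}"


def queue_index(pkt, n):
    """Maps a packet to one of n queues with a stable hash of its flow key."""
    return zlib.crc32(flow_key(pkt).encode("utf-8")) % n


def shunt(packets, n):
    """Distributes packets over n queues so each flow stays in one queue."""
    queues = [[] for _ in range(n)]
    for pkt in packets:
        queues[queue_index(pkt, n)].append(pkt)
    return queues


def reassemble(segments):
    """Orders one direction's segments by sequence number, dropping repeats."""
    seen = set()
    parts = []
    for pkt in sorted(segments, key=lambda p: p.seq):
        if pkt.seq in seen:
            continue  # repeated data packet
        seen.add(pkt.seq)
        parts.append(pkt.payload)
    return b"".join(parts)
```

Keeping both directions of a conversation in the same queue lets each queue recombine its TCP conversations without coordinating with the others, which matches the independent per-queue processing the abstract describes.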

Listed-company announcement classification and abstract generation method based on deep learning

The invention discloses a listed-company announcement classification and abstract generation method based on deep learning. The method comprises the following steps: step 1, acquiring announcement original-text data, extracting text, picture and form information, and establishing structured documents; step 2, establishing a classification rule word library of different announcements on the basis of industry knowledge of announcement fields according to various company operation change event keyword differences, and carrying out statistical judgment on announcement classes; and step 3, for the announcements of the different classes, extracting announcement document contents, combining the rule word library of corresponding class keywords to train an announcement content classification model, and automatically generating document abstract contents, wherein content extraction, training set selection, keyword model optimization, model training, model testing, result analysis and content generation are included. The method can solve technical problems such as automatically classifying the large amount of announcement information generated each day, automatically extracting key and important information according to the classification, and generating the abstract contents.
Owner:Beijing Wenyin Hulian Technology Co., Ltd. (北京文因互联科技有限公司)
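
Step 2 of the abstract above, statistical judgment of the announcement class from a rule word library, can be illustrated with a simple keyword count. The class labels and keywords in `RULE_WORDS` are hypothetical English stand-ins invented for this sketch; the patent's actual word library and the deep learning model of step 3 are not reproduced.

```python
# Hypothetical rule word library: class label -> keywords for that class.
RULE_WORDS = {
    "dividend": ["dividend", "payout", "distribution"],
    "personnel_change": ["resignation", "appointed", "director"],
    "acquisition": ["acquire", "acquisition", "merger"],
}


def classify(text, rules=RULE_WORDS):
    """Returns (best_label, scores); best_label is None if nothing matches."""
    lowered = text.lower()
    scores = {
        label: sum(lowered.count(word) for word in words)
        for label, words in rules.items()
    }
    best = max(scores, key=scores.get)
    return (best if scores[best] > 0 else None), scores
```

A keyword count like this would only give a first-pass label; the abstract combines the word library with a trained classification model for the final result.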

Webpage content extraction forwarding system for mobile communication terminal and application method thereof

Inactive | CN101674374A | Solve the technical problem of not being able to send to by SMS | Easy to share | Substation equipment | Special data processing applications | Hyperlink | Text message
The invention relates to the field of a mobile communication equipment terminal, in particular to a browse system for the mobile communication equipment terminal and an application method thereof. The invention provides the browse system for the mobile communication equipment terminal, which comprises a browse module, a short message converting module, a shortening module, an identifying module and a skipping module, wherein the browse module is arranged in the mobile communication equipment terminal and used for browsing a page, the short message converting module is arranged in the mobile communication equipment terminal and used for sending a hyperlink by a short message, the shortening module is arranged in the mobile communication equipment terminal and uses a short link for replacing the hyperlink, and the identifying module is arranged on a transferring server and used for transmitting the short link. The browse system transmits the hyperlink to users and friends in a short message mode, causes the users to conveniently share network resources, solves the technical problem that an overlong hyperlink cannot be sent by short message, and causes the users to send various hyperlinks by short message.
Owner:UCWEB
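
The shortening module in the abstract above replaces a long hyperlink with a short link so that it fits in a short message. One common way to do this, shown here purely as an assumption since the abstract gives no scheme, is to store the long URL under a counter and encode the counter in base 62. The class name `ShortLinkStore` and the base address `https://s.example/` are placeholders.

```python
import string

ALPHABET = string.digits + string.ascii_letters  # 62 symbols: 0-9, a-z, A-Z


def encode_base62(number):
    """Encodes a non-negative integer with the 62-symbol alphabet."""
    if number == 0:
        return ALPHABET[0]
    digits = []
    while number:
        number, remainder = divmod(number, 62)
        digits.append(ALPHABET[remainder])
    return "".join(reversed(digits))


class ShortLinkStore:
    """In-memory stand-in for a server that maps short links to hyperlinks."""

    def __init__(self, base="https://s.example/"):
        self.base = base
        self.by_code = {}  # code -> original hyperlink
        self.by_url = {}   # original hyperlink -> code

    def shorten(self, url):
        """Returns a short link for url, reusing the code if already stored."""
        if url not in self.by_url:
            code = encode_base62(len(self.by_code) + 1)
            self.by_code[code] = url
            self.by_url[url] = code
        return self.base + self.by_url[url]

    def resolve(self, short_link):
        """Looks up the original hyperlink so the client can be redirected."""
        return self.by_code[short_link[len(self.base):]]
```

For example, `ShortLinkStore().shorten(long_url)` returns a link only a few characters longer than the base address, and `resolve` recovers the original for the redirect.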