144 results about "Automatic summarization" patented technology

Automatic summarization is the process of shortening a text document with software, in order to create a summary with the major points of the original document. Technologies that can make a coherent summary take into account variables such as length, writing style and syntax.

Automatic microblog text abstracting method based on unsupervised key bigram extraction

The invention discloses an automatic microblog text abstracting method based on unsupervised key bigram extraction. The method comprises the steps of preprocessing the microblog posts; normalizing bigrams; extracting key bigrams with a hybrid of TF-IDF (term frequency-inverse document frequency), TextRank and LDA (latent Dirichlet allocation); ranking sentences by intersection similarity and a mutual information strategy; extracting abstract sentences subject to a similarity threshold; and generating the abstract by combining the extracted sentences. The method uses the bigram as its minimum vocabulary unit. Because a bigram carries richer textual information than a single word, sentence extraction based on key bigrams is more robust to noise and more accurate than extraction based on keywords. The similarity threshold applied when abstract sentences are extracted controls redundancy, so that the abstract achieves a higher recall rate. The generated abstract is accurate, concise and comprehensive, which improves the efficiency and quality of the user's knowledge acquisition and saves the user time.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI
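
The bigram-based pipeline described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: key bigrams are scored with a smoothed TF-IDF only (the TextRank and LDA components are left out), sentence ranking is reduced to counting key bigrams, and redundancy is controlled with a Jaccard similarity threshold. The function names and the parameters `top_k`, `max_sim` and `max_sentences` are assumptions made for illustration.

```python
import math
from collections import Counter


def bigrams(tokens):
    """Return adjacent token pairs, the unit used instead of single words."""
    return list(zip(tokens, tokens[1:]))


def key_bigrams(posts, top_k=5):
    """Score bigrams with a smoothed TF-IDF over the posts; return the top_k as a set."""
    tf, df = Counter(), Counter()
    for post in posts:
        counts = Counter(bigrams(post.lower().split()))
        tf.update(counts)          # total occurrences across all posts
        df.update(counts.keys())   # number of posts containing the bigram
    n = len(posts)
    scores = {b: tf[b] * (math.log((1 + n) / (1 + df[b])) + 1) for b in tf}
    ranked = sorted(scores, key=lambda b: (-scores[b], b))
    return set(ranked[:top_k])


def jaccard(a, b):
    """Jaccard similarity between two sets of bigrams."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def summarize(posts, top_k=5, max_sim=0.5, max_sentences=2):
    """Pick posts rich in key bigrams, skipping near-duplicates of earlier picks."""
    keys = key_bigrams(posts, top_k)
    sets = [set(bigrams(p.lower().split())) for p in posts]
    # Simplified ranking: more key bigrams first, earlier posts break ties.
    order = sorted(range(len(posts)), key=lambda i: (-len(sets[i] & keys), i))
    chosen = []
    for i in order:
        if len(chosen) == max_sentences:
            break
        # Redundancy control: reject a post too similar to one already chosen.
        if all(jaccard(sets[i], sets[j]) <= max_sim for j in chosen):
            chosen.append(i)
    return [posts[i] for i in chosen]


if __name__ == "__main__":
    demo = [
        "the quake hit the city",
        "the quake hit the city today",
        "rescue teams arrived quickly",
    ]
    print(summarize(demo, top_k=4))
```

In the demo, the second post shares four of its five bigrams with the first, so the threshold rejects it and the third post is selected instead.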

Funnel type data gathering, analyzing and pushing system and method for online public opinion

Inactive | CN104408157A | Implement topic tracking analysis | Realize network public opinion monitoring | Data processing applications | Web data indexing | Process module | The Internet
The invention discloses a funnel type data gathering, analyzing and pushing system and method for online public opinion. The system comprises an online public opinion gathering module, an online public opinion processing module and an online public opinion publishing module. The modules comprise a directed precise gathering sub-module, a non-directed gathering sub-module, a hot spot and sensitive topic identifying sub-module, a topic tracking sub-module, an automatic abstracting sub-module, a comprehensive analysis sub-module, a public opinion pre-warning sub-module and a multi-dimensional public opinion information display sub-module. The method uses a dedicated public opinion funnel algorithm together with lexicons for three types of keywords ("related to me", public opinion, and positive and negative aspects) to analyze, judge and classify the gathered data and to give early warnings, so that latent patterns of change can be grasped. The system and method reduce the burden of manually polling public opinion events, track the development trend of a public opinion event promptly and precisely, identify the latest, hottest and most sensitive topics of the recent period on the Internet, and detect the public opinion messages the user is concerned about and give an early warning as soon as they appear.
Owner:SICHUAN ESLITE ELECTRONICS COMMERCE CO LTD

Machine learning-based Chinese automatic summarization method

The invention provides a machine learning-based Chinese automatic summarization method which comprises the following steps: inputting a text, and preprocessing the text; performing text structure division on the preprocessed text information, dividing the preprocessed text into a plurality of semantic paragraphs representing different themes, and calculating the importance degrees of the semantic paragraphs and the importance degrees of paragraphs; performing concept acquisition on the preprocessed text, converting all word expressions in the text into concept expressions, and calculating the importance degree of a concept, the frequency of the concept and the position of the concept; calculating the importance degrees of sentences according to structure information acquired by text division, the frequency of the concept, the position of the concept, the importance degree of the paragraphs and the importance degree of the semantic paragraphs; extracting the sentences with the importance degrees greater than preset values from all the semantic paragraphs; and arranging the sentences with the importance degrees greater than the preset values according to the original order, and outputting the sentences as a summarization result. The machine learning-based Chinese automatic summarization method can automatically generate a summary of a Chinese text.
Owner:BEIJING DINGTAI ZHIYUAN TECH CO LTD
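
The scoring and selection steps in this abstract can be sketched as follows. This is an illustrative simplification, not the patented method: the `CONCEPTS` map is a hypothetical stand-in for whatever lexicon converts words to concepts, paragraph importance is passed in as `para_weights`, and the way frequency, position and paragraph weight are combined (a plain product with an assumed 1.5 bonus for a paragraph's first sentence) is an assumption.

```python
from collections import Counter

# Hypothetical word-to-concept map; the abstract does not name a lexicon.
CONCEPTS = {"car": "vehicle", "truck": "vehicle", "automobile": "vehicle"}


def to_concepts(sentence):
    """Replace each word with its concept, leaving unknown words unchanged."""
    return [CONCEPTS.get(word, word) for word in sentence.lower().split()]


def score_sentences(paragraphs, para_weights):
    """Return (sentence, score) pairs in original document order.

    paragraphs: list of paragraphs, each a list of sentence strings.
    para_weights: one assumed importance weight per paragraph.
    """
    freq = Counter(c for para in paragraphs for s in para for c in to_concepts(s))
    total = sum(freq.values())
    scored = []
    for para, weight in zip(paragraphs, para_weights):
        for pos, sentence in enumerate(para):
            concepts = to_concepts(sentence)
            if not concepts or total == 0:
                scored.append((sentence, 0.0))
                continue
            # Average relative frequency of the sentence's concepts.
            freq_score = sum(freq[c] for c in concepts) / (len(concepts) * total)
            # Assumed position feature: favor the first sentence of a paragraph.
            position_bonus = 1.5 if pos == 0 else 1.0
            scored.append((sentence, weight * position_bonus * freq_score))
    return scored


def summarize(paragraphs, para_weights, threshold):
    """Keep sentences scoring above the threshold, in their original order."""
    scored = score_sentences(paragraphs, para_weights)
    return [sentence for sentence, score in scored if score > threshold]


if __name__ == "__main__":
    doc = [["a car passed", "the weather was fine"], ["a truck followed"]]
    print(summarize(doc, [1.0, 1.0], threshold=0.2))
```

Because "car" and "truck" map to the same concept, the two sentences that mention them reinforce each other and both pass the threshold, while the unrelated sentence does not.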

Method and system for automatically abstracting pictures and texts from commodity-related network article

The invention provides a method and a system for automatically abstracting pictures and texts from commodity-related network articles. The method comprises the following steps: searching for network articles on the Internet; screening the found articles for those related to commodities of a particular topic, extracting the corresponding commodity names, associating each screened article with its commodity name, and storing them in a database of commodities of the particular topic; and, from that database, obtaining the pictures embedded in all network articles related to each commodity, screening a representative picture of each commodity from the pictures related to it, and storing the representative picture in the database. By adopting the automatic abstracting technique, the system summarizes different information sources and provides commodity information such as representative pictures and comment abstracts, giving the user intuitive data and making queries convenient.
Owner:FU TAI HUA IND SHENZHEN +1

Semantics-based sci-tech information processing method and system

The present invention discloses a semantics-based sci-tech information processing method and system, and belongs to the technical field of data processing. The method comprises the following steps: acquiring network data; according to a Chinese-English bilingual parallel corpus, translating the network data into Chinese / English by means of a decoding algorithm; generating an abstract according to the translated network data; performing classification according to the abstract, and generating a class tag; and storing the translated network data, the abstract and the class tag into a full-text retrieving database. According to the method and system disclosed by the present invention, by using technologies such as automatic search of sci-tech information, automatic abstracting of the sci-tech information and automatic classification of texts, sci-tech information related to scientific development, technical innovation and recent news can be automatically acquired by means of a public information channel from the Internet, so that acquisition accuracy is improved, the cross-language content understanding barrier is eliminated, the problem of information overload is solved, and the efficiency of reading and understanding information of the user is increased.
Owner:THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP

Mobile terminal task assessment method and system based on Internet of Things

The invention discloses a mobile terminal task assessment method and system based on the Internet of Things. In the method, a worker submits a task at a mobile terminal after completing it; the task information enters a background management unit to await evaluation; points or virtual gold coins are calculated according to a standard; and, once the task assessment is finished, a leader gives final approval. If the leader approves, the assessment information is sent to an assessment display module and a middle-level performance score management module for processing; if the leader does not approve, the standard is chosen again and the assessment is resubmitted. The assessment display module and the middle-level performance score management module automatically summarize the statistics and display the results in an assessment report and a middle-level performance report, and the points and virtual gold coins are sent to the personal accounts of the workers or users who completed the task. The method and system combine mobile Internet application technology with enterprise performance assessment management to realize real-time, objective, data-driven and information-based work task assessment on the mobile terminal.
Owner:广州时刻销销网络科技有限公司

Graph model text abstract generation method based on word frequency and semantics

The invention discloses a graph model text abstract generation method based on word frequency and semantics. The method comprises the following steps: 1) performing word segmentation on the sentences in a text and performing part-of-speech tagging; 2) filtering the lexical items and retaining only those with specific parts of speech; 3) training word vectors by using a Word2Vec model and a BM25 algorithm to form a feature word vector set, representing the sentences with it, and constructing a sentence-word text matrix; 4) constructing an undirected text graph model from the text matrix; and 5) iteratively computing sentence node weights with the TextRank algorithm until convergence, and selecting the top-K sentences to generate the text abstract. Experimental results show that the method is suitable for industrial production. Compared with a traditional automatic text abstracting method that considers a single word frequency feature of a text and is based on a text semantic feature, the method obtains a higher ROUGE value under the optimal combination of adjustment factors, showing that it effectively integrates word frequency and semantic features of the text and improves abstract generation accuracy through a TextRank algorithm based on contextual information.
Owner:LIAONING UNIVERSITY
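
Steps 4 and 5 of this abstract (building an undirected sentence graph and iterating TextRank until convergence) can be sketched in plain Python. The sketch swaps in bag-of-words cosine similarity for the Word2Vec and BM25 sentence representation, so it shows only the graph ranking part. The function names, the damping factor of 0.85, the tolerance, and the choice to return the selected sentences in their original order are assumptions.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def textrank(sentences, damping=0.85, tol=1e-6, max_iter=100):
    """Return one TextRank score per sentence."""
    n = len(sentences)
    if n == 0:
        return []
    vecs = [Counter(s.lower().split()) for s in sentences]
    # Symmetric weight matrix of the undirected sentence graph (no self-loops).
    w = [[cosine(vecs[i], vecs[j]) if i != j else 0.0 for j in range(n)]
         for i in range(n)]
    out = [sum(row) for row in w]  # total edge weight leaving each node
    scores = [1.0 / n] * n
    for _ in range(max_iter):
        new = [
            (1 - damping) / n
            + damping * sum(w[j][i] / out[j] * scores[j]
                            for j in range(n) if out[j] > 0)
            for i in range(n)
        ]
        converged = max(abs(a - b) for a, b in zip(new, scores)) < tol
        scores = new
        if converged:
            break
    return scores


def top_k_summary(sentences, k=2):
    """Select the k best-scoring sentences and return them in document order."""
    scores = textrank(sentences)
    best = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))[:k]
    return [sentences[i] for i in sorted(best)]


if __name__ == "__main__":
    text = [
        "text summarization shortens a document",
        "graph ranking scores each sentence in a document",
        "summarization with graph ranking keeps top sentences",
        "bananas are yellow",
    ]
    print(top_k_summary(text, k=2))
```

A sentence that shares no words with the others has no edges in the graph, so it keeps only the baseline score and is ranked last.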

Data mining-oriented text processing system and method

The present invention provides a data mining-oriented text processing system. The system comprises: a text extraction module, a text segmentation module, an index establishing module, an entity identification module, a keyword extraction module, an automatic summarization module, an automatic classification module and a service interface module. The text segmentation module performs code conversion, conversion between simplified and traditional Chinese, and a part-of-speech tagging operation on a text extracted by the text extraction module. The index establishing module, the entity identification module, the keyword extraction module, the automatic summarization module and the automatic classification module are used for obtaining an index file, an entity word, a keyword, an abstract and a classification result of the text content. The service interface module is used for publishing output results of the index establishing module, the entity identification module, the keyword extraction module, the automatic summarization module and the automatic classification module in the form of a service to other systems for calling. The present invention also provides a data mining-oriented text processing method. The method is capable of providing a more complete text processing capability.
Owner:NO 32 RES INST OF CHINA ELECTRONICS TECH GRP

Generating system of three-dimensional sheet metal welding technology

The invention discloses a generating system of a three-dimensional sheet metal welding technology. The generating system comprises a design technology synergy module, a technological compilation module and a three-dimensional sheet metal welding technology workshop application module. The design technology synergy module is used for achieving a three-dimensional design model based on MBD, and for achieving data sharing and data distribution, as well as creation of a sheet metal welding working procedure model, on a unified data source. The technological compilation module is used for structuring the technology elements, working procedures, working steps and the like of the sheet metal welding technology, and for automatically summarizing them to generate a two-dimensional technology procedure file. The workshop application module is used for exhibiting the three-dimensional technological structure of the sheet metal welding technology and the two-dimensional technology procedure file. The system achieves collaboration between the design department and the technological compilation department, so that technological compilation obtains an accurate, single data source from the design department, and, once the data are structured, design data and technology data become much easier to retrieve, trace and extract.
Owner:BEIJING POWER MACHINERY INST

Multilingual word segmentation method based on dictionaries and grammar analysis

The invention discloses a multilingual word segmentation method based on dictionaries and grammar analysis. The method achieves efficient and accurate word segmentation of mixed texts in Chinese, Japanese, Korean, Cantonese and the like, supports flexible lexicon expansion with words for different time periods and different professional fields, and keeps lexicon information effectively updated. A word segmentation sub-device for Chinese, Japanese, Korean, Cantonese and other language families, a Chinese quantum word segmentation device and a western language word segmentation device are embedded to segment the text of each language accurately. A text segment to be segmented is first split by a built-in language segment code identification mechanism, so that each resulting segment corresponds to one language family, and each segment is then processed by the corresponding word segmentation sub-device. Grammar analysis enables word segmentation of western inflectional languages and smart-mode word segmentation of Chinese, Japanese, Korean and Cantonese, and texts containing Arabic numerals can also be processed. The method can therefore segment texts that mix several languages, removing the limitation that a word segmentation tool handles only a single language or a few individual languages, and ensuring the security, accuracy, efficiency, flexibility and universality of text word segmentation. The method has wide application prospects in text word segmentation fields such as mass data text classification, text information extraction and automatic abstracting.
Owner:BEIJING SCISTOR TECH +1
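
The idea of splitting mixed text into per-language segments by character code before segmentation can be sketched as follows. The abstract does not describe its identification mechanism in detail, so this is an assumption: the sketch groups consecutive characters by Unicode block (CJK Unified Ideographs, Hiragana and Katakana, Hangul Syllables), and the script labels and function names are hypothetical. Han characters are shared by Chinese, Japanese and Cantonese, so code ranges alone cannot tell those languages apart; a real system would need dictionaries or grammar analysis for that.

```python
from itertools import groupby


def script_of(ch):
    """Classify one character by Unicode block (a coarse, assumed mapping)."""
    code = ord(ch)
    if 0x4E00 <= code <= 0x9FFF:   # CJK Unified Ideographs
        return "han"
    if 0x3040 <= code <= 0x30FF:   # Hiragana and Katakana
        return "kana"
    if 0xAC00 <= code <= 0xD7A3:   # Hangul Syllables
        return "hangul"
    if ch.isdigit():
        return "digit"
    if ch.isalpha():               # simplification: every other letter is "western"
        return "western"
    return "other"                 # whitespace, punctuation, symbols


def split_by_script(text):
    """Split text into (script, run) pairs; separators ("other") are dropped."""
    runs = []
    for script, chars in groupby(text, key=script_of):
        if script != "other":
            runs.append((script, "".join(chars)))
    return runs


if __name__ == "__main__":
    for script, run in split_by_script("東京でAI 2024 한국어"):
        print(script, run)
```

Each run could then be handed to the segmenter registered for its script, for example through a dictionary that maps script labels to segmentation functions.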