Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

41 results about "Multi-document summarization" patented technology

Multi-document summarization is an automatic procedure aimed at extraction of information from multiple texts written about the same topic. The resulting summary report allows individual users, such as professional information consumers, to quickly familiarize themselves with information contained in a large cluster of documents. In such a way, multi-document summarization systems are complementing the news aggregators performing the next step down the road of coping with information overload.

Method and device for generating multi-document summarization

ActiveCN108733682AImprove performanceImprove measurement capabilitiesSpecial data processing applicationsSemantic vectorVerb phrase
The embodiment of the invention discloses a method and a device for generating a multi-document summarization, relates to the field of data processing and solves the problem of poor performance of a summarization generated by an existing automatic multi-document summarization technology. A specific scheme of the method comprises the steps of dividing multiple documents into n sentences; generatingan input word bag vector; performing unsupervised training on each sentence represented by the input word bag vector to obtain an encoding hidden layer vector of each sentence and a potential semantic vector of each sentence; collecting m potential semantic vectors; obtaining m decoding hidden layer vectors and m output word bag vectors according to the m potential semantic vectors; updating them decoding hidden layer vectors and the m output word bag vectors; estimating an importance degree of each sentence; acquiring the importance degree and a redundancy degree of a verb phrase of each sentence and the importance degree and the redundancy degree of a noun phrase of each sentence; and generating the summarization of multiple documents according to the importance degree and the redundancy degree of all noun phrases and the importance degree and the redundancy degree of all verb phrases. The embodiment of the invention is used for a process for generating the multi-document summarization.
Owner:HUAWEI TECH CO LTD

Method for automatically generating unsupervised science and technology intelligence abstract based on multi-sentence compression

The invention relates to an unsupervised scientific and technological intelligence abstract automatic generation method based on multi-sentence compression, and belongs to the technical field of natural language generation. Aiming at multi-document text generation in the field of science and technology intelligence, firstly, source data are acquired based on a topic crawler of an LDA topic similarity word library extension method; and sorting all text paragraphs through a text information value evaluation model of three indexes of authority, timeliness and content correlation of the text information. And selecting a paragraph with a higher score as an original text for generating the final science and technology intelligence. Finally, an unsupervised multi-document abstract method based on spectral clustering and multi-sentence compression is adopted, and a science and technology intelligence abstract is automatically generated. According to the method, the problem that in the data screening process, scientific and technological information generation has high requirements for data timeliness and authority is effectively solved, and the problem that a traditional multi-document generation method based on a neural network cannot be applied due to lack of a data set in the field of scientific and technological information is effectively solved.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products