Document atlas extraction method and device based on machine learning and storage medium

A machine learning and document technology, applied in the field of knowledge graphs, to improve reading quality and reduce reading time

Inactive Publication Date: 2021-03-05
京华信息科技股份有限公司
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Many concepts in documents are not entities, usually some intangible and invisible concepts. Existing knowledge map technology cannot decompose a document into a structured, mind-mapping map (document map)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document atlas extraction method and device based on machine learning and storage medium
  • Document atlas extraction method and device based on machine learning and storage medium
  • Document atlas extraction method and device based on machine learning and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0052] In the description of the present invention, it should be understood that the orientation descriptions, such as up, down, front, back, left, right, etc. indicated orientations or positional relationships are based on the orientations or positional relationships shown in the drawings, and are only In order to facilitate the description of the present invention and simplify the description, it does not indicate or imply that the device or element referred to must have a specific orientation, be constructed and operated in a specific ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a document atlas extraction method and device based on machine learning and a storage medium. The method comprises the steps of obtaining a document text, wherein the documenttext comprises a document title, a document body and document content; carrying out fragmentization processing on the document text to obtain fragmentized data; and according to the fragmented data, extracting a document map by using a trained knowledge unit classification model. According to the method, the trained knowledge unit classification model is used for extracting the document text to obtain structured document atlas data, the document atlas of the brain graph structure can be automatically formed, the document content is clear at a glance, the reading time can be greatly shortened,and the reading quality is improved. The method can be widely applied to the technical field of knowledge maps.

Description

technical field [0001] The present invention relates to the technical field of knowledge graphs, in particular to a method, device and storage medium for extracting document graphs based on machine learning. Background technique [0002] A knowledge graph is a language network that reveals the relationship between entities. It is usually used to describe the relationship between objects, people, institutions, cities, etc. in the real world, and is used for intelligence analysis, semantic search, intelligent question answering, and recommendation systems. The core point of the knowledge map is to collect a series of large-scale structured or unstructured data, analyze and model the data based on domain expertise, and find out the rules through machine calculations to generate calculation rules for relevant data. Many concepts in documents are not entities, and are usually intangible concepts. Existing knowledge map technology cannot decompose a document into a structured, min...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/36
CPCG06F16/35G06F16/367
Inventor 蓝建敏李观春
Owner 京华信息科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products