Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Chinese Machine Reading System

A reading system, Chinese technology, applied in the direction of instruments, special data processing applications, electrical digital data processing, etc., to achieve the effect of large breadth, wide use and strong practicability

Active Publication Date: 2017-02-15
JIANGSU MINGTONG TECH
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In addition, the statistics of the co-occurrence frequency of the text in the existing technology is limited to sliding the window, and then counting the co-occurrence frequency of two words; or using the language model to count the frequency of words that appear continuously

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese Machine Reading System
  • Chinese Machine Reading System

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The present invention will be further described below in conjunction with the accompanying drawings.

[0027] Chinese machine reading system, including data capture module 1, data processing module 2, data extraction module 3, knowledge base 4, data integration module 5 and user interface 6, data capture module 1, data processing module 2, data extraction module 3 It is connected with the knowledge base 4 in turn, and the data integration module 5 and the user interface 6 are connected with the knowledge base 4 .

[0028] The data capture module 1 is used to capture the unstructured data of the text on the Internet. Data capture module 1 uses URL seeds to spread and capture web pages through graph propagation. For the captured web pages, analyze HTML structured data, extract unstructured text information, and use Hadoop framework to capture using URL data Take unstructured text information, use Lucene and Neo4J two storage frameworks, Lucene processes and retrieves uns...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese machine-reading system. The Chinese machine-reading system comprises a data grabbing module, a data processing module, a data extracting module, a knowledge base, a data integration module and a use interface, wherein the data extracting module comprises a wiki content extracting module, a template extracting module, an entity extracting module, a relation extracting module and a template matching module. Compared with the prior art, an open extracting method is used, the extracting field is not limited, unstructured text information widely existing on the Internet can be read, and the system is suitable for being popularized and used and can automatically adapt to evolution of Chinese language.

Description

technical field [0001] The invention relates to the technical field of Chinese reading, in particular to a Chinese reading system. Background technique [0002] With the advent of the era of big data, more and more data are published online in the form of text. How to understand network data has become a more urgent and urgent problem to be solved. One of the ways is to organize unstructured text data into structured data (such as the relationship between words) that machines can recognize and use, laying the foundation for a series of reasoning and recognition in the future. Structured data can be used for semantic disambiguation, and the meaning of words can be inferred based on the relationship between words. In addition, the statistics of the co-occurrence frequency of texts in the prior art is limited to sliding the window to count the co-occurrence frequency of two words; or to use the language model to count the frequency of consecutive words. With the improvement ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/313G06F16/367
Inventor 秦谦宋阳秋常凯斯
Owner JIANGSU MINGTONG TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products