Multi-language internet information analysis method

A technology of Internet information and analysis methods, applied in the field of multilingual Internet information analysis, can solve problems affecting Internet supervision and other issues, and achieve the effect of promoting and strengthening Internet information supervision

Inactive Publication Date: 2015-09-23
CHINA CHANGFENG SCI TECH IND GROUPCORP
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These languages ​​are very different in terms of morphemes, grammar, and context, which has brought great pressure to Internet governance and has affected managers' supervision of the Internet.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0013] 1. Data collection:

[0014] First determine the data source for data collection; then search for the data source.

[0015] The data source search adopts the design of web crawler, which is designed by combining breadth-first search and improved PageRank web page analysis algorithm.

[0016] Data analysis and search engine. Use the LIUS search engine to analyze and index documents in various formats / types (including MS Word, MS Excel, MS PowerPoing, RTF, PDF, XML, HTML, TXT, Open Office and JavaBeans, etc.), adding index Update and blend index functions.

[0017] Basic data database construction. Relying on the search engine, the data format is uniformly determined, and the first-level index is performed according to the character unicode code, and a word correspondence table is established at the same time, the original address of the retrieved word is established in the table, and an index table is established for the original address in the database, so that trac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A multi-language internet information analysis method comprises: (1) data acquisition: determining a data source for data acquisition and searching the data source; (2) data statistics: establishing a large corpus of each language by collecting mass data and establishing an atom character database of each language; (3) data analysis: performing automatic processing on the mass data by adopting an artificial intelligence technology, then obtaining accurate monitored contents through analysis and providing early warning information; and (4) language conversion: main translating a database and an acquired search result in different languages, wherein translated contents are words.

Description

technical field [0001] The invention relates to the fields of semantic analysis and machine translation, in particular to a method for analyzing multilingual Internet information. Background technique [0002] The information on the Internet is complex and diverse, good and bad, and needs to be properly regulated. But at the same time, the virtual society of the Internet, like the real society, presents a trend of language diversification. These languages ​​are quite different in terms of morphemes, grammar, and context, which has brought enormous pressure to Internet management and control, affecting managers' supervision of the Internet. Contents of the invention [0003] The purpose of the present invention is to provide a multilingual Internet information analysis method, which combines Internet information analysis technology and multilingual mutual translation technology to realize automatic real-time monitoring and analysis of multiple languages ​​involved in massi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 罗克刚陈加
Owner CHINA CHANGFENG SCI TECH IND GROUPCORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products