Data analysis method and apparatus

A data analysis and database technology, applied in the field of data analysis, can solve problems such as slow speed, inability to respond to business needs in a timely manner, and inability to meet the requirements of real-time analysis and real-time access to batch data at the same time, so as to reduce complexity, reduce learning and use Cost, the effect of improving the efficiency and accuracy of real-time analysis

Active Publication Date: 2018-11-02
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1. Some open source and free graph databases do not support distribution (such as the graph database neo4j), and some have stopped updating (such as the graph database titan), which is not enough to meet the real-time analysis and access functions of large-scale community relational data;
[0006] 2. Although HBase (non-relational distributed database), MySql (relational database management system), Cassandra (open source distributed non-relational database system) can meet the real

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data analysis method and apparatus
  • Data analysis method and apparatus
  • Data analysis method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

example 1

[0083] Example 1: Indicates the vertex of the policy, and queries all the policy information in the vertex

[0084] gremlin>g.V().hasLabel("category").valueMap()

[0085] Convert to SQL-like statement: SELECT*FROM Categories

example 2

[0086] Example 2: To represent the vertex of the strategy, query the name of the strategy of all vertices

[0087] gremlin>g.V().hasLabel("category").values("name")

[0088] Convert to SQL-like statement: SELECT CategoryName FROM Categories

[0089] like image 3 As shown, the workflow of batch relational data analysis performed by the interface adaptation layer of the data analysis method in the embodiment of the present invention is as follows:

[0090] Step 301: Input a SQL-like statement.

[0091] For example, analysts need to analyze all user relationship information between March 1 and March 2, that is, enter:

[0092] SELECT * FROM Categories where begin_time>='2017-03-01' and end_time<='2017-03-02'.

[0093] Step 302: Convert the SQL-like statement into a gremlin syntax format.

[0094] Step 303: calling the scanning interface of the columnar database through the gremlin statement to perform a data scanning operation.

[0095] Figure 4 It is a schematic diagram...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a data analysis method and apparatus, which can meet real-time analysis and real-time access requirements of batch data at the same time. According to the data analysis method,an interface adaptation layer is arranged between a column type database and a distributed graph inquiry engine. The method comprises the steps of converting a graph operation command of the distributed graph inquiry engine and sending the graph operation command to the column type database.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to a data analysis method and device. Background technique [0002] The community relationship network is a mesh connection between users through certain connections, such as the same combination of receiving mobile phone numbers, the same ordering IP (Internet Protocol Address) address, and the same mobile phone number and IP address. etc. By analyzing the network density, number of edges, ternary closure and other characteristics of the community relationship network, it is possible to analyze potential risk groups such as the scalper group and the order-swiping group. [0003] The community relationship network based on the graph model pays attention to the interrelationship between entities and is highly complex. It is more and more widely used in e-commerce, mobile operations, risk control and other fields. The demand for real-time access and real-time analysis of rel...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06Q50/00
CPCG06Q50/01
Inventor 游正朋梅尚健
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products