Unstructured data resource identification and locating method based on URL (Uniform Resource Locator)

An unstructured data and resource identification technology, applied in the field of URL-based unstructured data resource identification and location, can solve the problem of destroying the integrity of data information, low efficiency of XML processing unstructured data, and inability to realize resource location and access. and other issues to achieve the effect of improving accuracy and effectiveness and ensuring integrity

Active Publication Date: 2017-02-15
CHONGQING UNIV OF POSTS & TELECOMM
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1) Using traditional XML to process unstructured data is inefficient, and it is impossible to locate and access resources under complex conditions
[0005] 2) The current processing and extraction of unstructured data has destroyed the integrity of data information to a large extent
[0006] 3) Under complex access conditions, it is difficult for existing data models to accurately locate unstructured data resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unstructured data resource identification and locating method based on URL (Uniform Resource Locator)
  • Unstructured data resource identification and locating method based on URL (Uniform Resource Locator)
  • Unstructured data resource identification and locating method based on URL (Uniform Resource Locator)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0028] The identification model IDM (identification data model) of unstructured data includes data object space and attribute space. The data object space is the collection of unstructured data objects, and the attribute space is the collection of attributes of the data objects. In the identification model of this embodiment, an unstructured data is converted into a URL identification through its data model. The three attribute classes in the data model are: data resource basic attribute class, data resource content attribute class and data resource characteristic attribute class. Each data object has a unique identifier, and the identifier is the abstracted URL of the data resource. figure 1 is an unstructured data model diagram in the embodiment of the present invention.

[0029] The detailed attribute composition of the identification...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an unstructured data resource identification and locating method based on a URL (Uniform Resource Locator) and belongs to the technical field of unstructured data. According to the method, an abstract model comprising multiple aspects such as a characteristic attribute, a content attribute and a basic attribute is created for the unstructured data; a data resource is expressed through adoption of an URL identifier; and an identification rule is designed for the model of the unstructured data. For a complicated condition access submitted by a user, a data identification server resolves a condition, carries out similarity match on the condition and stored unstructured data identifiers to obtain a matched identification resource address and returns the identification resource address to a user. The user can access a data resource according to the returned resource address. Through application of the method, the unstructured data is uniformly abstracted as a URL identification resource, and the access and application of the unstructured data with described details can be supported well.

Description

technical field [0001] The invention belongs to the technical field of unstructured data, and relates to a URL-based unstructured data resource identification and positioning method. Background technique [0002] With the advent of the era of mobile Internet and big data, the degree of informatization continues to deepen. Emerging services such as cloud computing, the Internet of Things, and social networks have led to an unprecedented growth in the types and scale of data in human society. In recent years, driven by Internet giants at home and abroad, unstructured data has grown exponentially. Since the existing unstructured data does not have a unified data model, the data processing method is often based on XML files, and the unstructured data is converted into structured data through attribute feature extraction and other methods, and finally dumped into the traditional database. system. Due to the diversity of unstructured data, traditional processing methods may caus...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/33G06F16/38G06F16/61G06F16/686G06F16/71G06F16/7867G06F16/9566
Inventor 熊安萍李鸿健祝清意邹洋
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products