311 results about "Data scraping" patented technology

Data scraping is a technique in which a computer program extracts data from human-readable output coming from another program.

Product database forming method based on Internet data and system

The invention discloses a method and a system for building a product database from Internet data. The method uses focused-crawler technology to capture web pages whose topical relevance exceeds a preset threshold, stores the captured page data in structured form, and automatically classifies the stored data by product category. It then counts how often and when each product attribute occurs in the classified data, combines the frequency and time of occurrence using preset weights to obtain a decision value for each attribute, and determines the sort order of the attributes from those decision values. The system comprises a data capturing module, a structured storage module, a data classifying module and an attribute deciding module. Users obtain comprehensive, summarized product information without having to collect and sort it from the Internet themselves, and the data is kept current to meet users' real-time requirements.
Owner:广州市尊网商通资讯科技有限公司
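
The weighting step described in this abstract can be illustrated with a minimal Python sketch. The abstract does not give the formula or the weights, so the normalization, the recency term, and the names `rank_attributes`, `freq_weight` and `recency_weight` are all assumptions made for illustration.

```python
from collections import defaultdict

def rank_attributes(observations, now, freq_weight=0.7, recency_weight=0.3):
    """Rank product attributes by a weighted mix of frequency and recency.

    observations: iterable of (attribute, timestamp) pairs taken from
    classified page data. now: reference time in the same units.
    The weights and the recency formula are illustrative assumptions.
    """
    counts = defaultdict(int)
    latest = {}
    for attr, ts in observations:
        counts[attr] += 1
        latest[attr] = max(ts, latest.get(attr, ts))
    if not counts:
        return []
    max_count = max(counts.values())
    scores = {}
    for attr, n in counts.items():
        frequency = n / max_count                     # normalized to (0, 1]
        recency = 1.0 / (1.0 + (now - latest[attr]))  # newer is closer to 1
        scores[attr] = freq_weight * frequency + recency_weight * recency
    # Highest decision value first.
    return sorted(scores, key=scores.get, reverse=True)
```

With these assumed weights, an attribute that is mentioned often outranks one that was mentioned once but more recently; other weightings would change that trade-off.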

Data collection method based on intelligent grasping system for science and technology service information

The invention relates to a data collection method based on an intelligent crawling ("grasping") system for science and technology service information. The method comprises the following steps: 1, data crawling: to configure the crawler, a user publishes a crawl task through the configuration module and the start module of a client and sets the web pages to crawl and the corresponding rules; 2, loading of scheduled crawl tasks: the task published by the user is dynamically loaded into a scheduled crawl task list; 3, page downloading; 4, page parsing: pages in the queue are parsed; 5, adding newly found URLs to the set of URLs to be crawled; 6, data processing and storage: page data is parsed and extracted, and the extracted two-dimensional structured data is stored. The method meets the crawler's need for generality and the crawling requirements of the science and technology service system, and supports convenient extension and plug-in development; parser rule configuration, crawl width and depth, crawl threads, database configuration or index configuration can be added to specific service logic, so that information can be crawled and collected intelligently.
Owner:山东辰华科技信息有限公司
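
Steps 3 to 6 of this abstract (download, parse, enqueue new URLs, store structured rows) can be sketched in Python roughly as follows. This is a simplified illustration, not the patented system: the `fetch` callable, the `crawl` function, the depth and page limits, and the choice of the page title as the extracted field are assumptions.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin

class LinkAndTitleParser(HTMLParser):
    """Collects <a href> targets and the <title> text of one page."""
    def __init__(self):
        super().__init__()
        self.links = []
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data

def crawl(seed_urls, fetch, max_depth=1, max_pages=100):
    """Breadth-first crawl. fetch(url) returns HTML text or None (assumed)."""
    queue = deque((url, 0) for url in seed_urls)
    seen = set(seed_urls)
    rows = []
    while queue and len(rows) < max_pages:
        url, depth = queue.popleft()
        html = fetch(url)                      # step 3: page downloading
        if html is None:
            continue
        parser = LinkAndTitleParser()
        parser.feed(html)                      # step 4: page parsing
        rows.append({"url": url, "title": parser.title.strip()})  # step 6
        if depth < max_depth:
            for href in parser.links:          # step 5: enqueue new URLs
                link = urljoin(url, href)
                if link not in seen:
                    seen.add(link)
                    queue.append((link, depth + 1))
    return rows
```

Passing `fetch` in as a parameter keeps the sketch independent of any HTTP library; a dictionary's `get` method is enough to try it on canned pages.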

Single-image robot disordered target grabbing method based on pose estimation and correction

The invention particularly discloses a single-image robot disordered target grabbing method based on pose estimation and correction. The method comprises the steps: S1, generating an image data set of a to-be-grabbed object model; S2, constructing a convolutional neural network model according to the image data set in the step S1; S3, importing the two-dimensional image of the to-be-grabbed object into the trained convolutional neural network model to extract a corresponding confidence map and a vector field; S4, obtaining a predicted translation amount and a predicted rotation amount of the to-be-grabbed object; S5, finding the optimal grabbing point of the object to be grabbed and calculating the measurement translation amount of the depth camera; S6, performing grabbing safety distance correction according to the predicted translation amount of the object to be grabbed and the measured translation amount of the depth camera, executing correction data grabbing if the correction succeeds, and entering S7 if the correction fails; and S7, repeating the steps S3-S6. The disordered target grabbing method provided by the invention has the characteristics of high reliability, strong robustness and good real-time performance, can meet the existing industrial production requirements, and has a relatively high application value.
Owner:张辉

Commodity networked gene based brand intellectual property protection platform

The invention discloses a commodity networked gene based brand intellectual property protection platform comprising a data source module, a data collection module, a data integration module, a data storage module, a data analysis module, an object detection module, a visual module and a data application module. When the data collection module collects data from the data sources, an open-source Hadoop platform is used to build a distributed, whole-network commodity data capturing system; the data analysis module arranges a large volume of unstructured commodity comment data into structured form; the object detection module analyzes and detects suspected infringing commodities using an established infringing commodity identification model; the visual module displays the detected suspected infringing commodities through a visual interface. The platform requires fewer human and material resources, can handle a larger-scale market effectively, and reduces enterprises' intellectual property maintenance costs, so that economic benefits are increased.
Owner:HANGZHOU YEGOON TECH CO LTD

Recommendation method and system based on knowledge-aware hypergraph neural network

The invention discloses a recommendation method based on a knowledge-aware hypergraph neural network, which comprises the following steps: step 1, constructing a user hypergraph, and initializing a hyperedge by taking an article having an interaction relationship with a user as an entity node; constructing an article hypergraph, wherein the initial hyperedge takes an article having an interaction relationship with any user having the interaction relationship with the article as an entity node; step 2, convolution calculation is performed on the user hypergraph and each article hypergraph; and step 3, calculating the inner product of the user and all the articles to obtain the interaction score of the user and all the articles. The method has the beneficial effect that the auxiliary information of the articles is integrated into the vector representation of the user, so that the articles of which the user is more likely to have the interaction relationship can be selected from the numerous articles. The invention discloses a recommendation system based on a knowledge-aware hypergraph neural network. The recommendation system comprises: a data capture module; a knowledge perception hypergraph construction module; a domain convolution module; a hyperedge convolution module; and a prediction module. The method has the beneficial effects of simple operation and accurate prediction.
Owner:神行太保智能科技(苏州)有限公司
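
Step 3 of this abstract, scoring every article by its inner product with the user, can be shown with a short Python sketch. The vectors are assumed to be the outputs of the hypergraph convolution in step 2, which is not reproduced here; `interaction_scores`, `recommend` and the sample vectors are hypothetical.

```python
def interaction_scores(user_vec, item_vecs):
    """Inner product of one user vector with every item vector."""
    return {
        item: sum(u * v for u, v in zip(user_vec, vec))
        for item, vec in item_vecs.items()
    }

def recommend(user_vec, item_vecs, k=2):
    """Return the k items with the highest interaction score."""
    scores = interaction_scores(user_vec, item_vecs)
    return sorted(scores, key=scores.get, reverse=True)[:k]

# Toy embeddings standing in for the convolution output (assumed values).
items = {"x": [0.5, 1.0], "y": [2.0, 0.0], "z": [-1.0, 3.0]}
top = recommend([1.0, 0.0], items, k=2)
```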

Computer robot crawling task distribution method and device, and computer robot data crawling method and device

The embodiment of the invention discloses a computer robot crawling task distribution method and device, and a computer robot data crawling method and device. In the data crawling method, each crawled page is stored in the historical crawling data. When a crawling task arrives, the method checks whether the historical crawling data contains a historical page corresponding to the task's URL (Uniform Resource Locator) cluster. If such a page exists and is still usable, the first target page data is extracted directly from it, so that part of the pages does not need to be crawled again and system resources are saved. Meanwhile, the network bandwidth usage rates of all computer robots are sampled at fixed times, the mean E(w) and the variance D(w) of each robot's network bandwidth usage rate are calculated and stored, and each robot's availability is then calculated from E(w) and D(w). Among the available computer robots, the robot that executes a task is selected in descending order of availability probability, so that computer robot resources are distributed reasonably.
Owner:ALIBABA GRP HLDG LTD
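
The selection step can be sketched in Python as below. The abstract says only that availability is calculated from E(w) and D(w); the specific score used here (free bandwidth minus one standard deviation), the `threshold` parameter and the function names are assumptions, not the patented formula.

```python
from statistics import fmean, pvariance

def availability(samples):
    """Assumed availability score built from E(w) and D(w).

    samples: bandwidth usage rates in [0, 1] for one robot.
    Score = 1 - E(w) - sqrt(D(w)), floored at 0 (illustrative only).
    """
    mean = fmean(samples)                 # E(w)
    var = pvariance(samples, mu=mean)     # D(w)
    return max(0.0, 1.0 - mean - var ** 0.5)

def pick_robots(usage_by_robot, threshold=0.0):
    """Return available robots, most available first."""
    scored = {name: availability(s) for name, s in usage_by_robot.items()}
    available = [name for name, a in scored.items() if a > threshold]
    return sorted(available, key=scored.get, reverse=True)
```

Subtracting the standard deviation penalizes robots whose bandwidth usage fluctuates, which is one plausible reason for tracking the variance alongside the mean.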

A user data processing method and system

Active | CN109711890A | Analyze displacement changes | Satisfaction intuitively responds | Marketing | Data processing system | Residence
The invention discloses a user data processing system which comprises a user data capture module, a data integration processing module, a quantitative classification statistics module, a residence track analysis module, a store resource database, a store resource updating module, a management server and a guide push initiation module. The user data capturing terminal is respectively connected with the quantitative classification statistics module and the data integration processing module; wherein the quantitative classification statistics module is connected with the comprehensive quantitative evaluation module through the resident trajectory analysis module, the commodity resource database is connected with the comprehensive quantitative evaluation module, and the management server is respectively connected with the data integration processing module, the comprehensive quantitative evaluation module, the store resource database and the guide push initiation module. According to the invention, the user position information and the consumption record are analyzed to determine the preference and interest of the user, so that the guidance information can be accurately pushed to guide the user to shop, the satisfaction degree of the user is greatly improved, and the sales of commodities in the shopping mall is promoted.
Owner:SHANGHAI TRUELAND INFORMATION & TECH CO LTD

Data fetching method and device of OTT (over the top) application

Active | CN103986788A | Network Behavior Optimization | Implement synchronous fetching | Transmission | Special data processing applications | Time information | Data scraping
The invention provides a data fetching method and device of an OTT (over the top) application. The data fetching method comprises the following steps: indicating a to-be-tested terminal to fetch signaling layer data generated when the OTT application sends a signaling in business of the OTT application arranged in the to-be-tested terminal, wherein the signaling layer data comprise signaling information of a signaling sent every time and time information corresponding to signaling sent every time; fetching application layer data generated by the OTT application from the data output end of a simulative network device, wherein the application layer data comprise information of an application data packet sent every time and time information corresponding to the application data packet sent every time; and relating the signaling information of the signaling layer data matched with the time information with the application data packet information of the application layer data to form behavior data corresponding to the OTT application. By utilizing the data fetching method and device, the technical problem that the synchronous fetching of the signaling layer data and network layer data can not be realized so as not to guide an operator to optimize network behaviors of the OTT application in the prior art is effectively solved.
Owner:CHINA UNITED NETWORK COMM GRP CO LTD
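
The final step of this abstract, relating signaling records to application data packets through their time information, can be sketched in Python as a time-window join. The matching window `tolerance`, the record layout and the name `correlate` are assumptions; the abstract does not state how closely the times must agree.

```python
from bisect import bisect_left

def correlate(signaling, packets, tolerance=0.5):
    """Attach to each signaling record the packets sent within +/- tolerance.

    signaling, packets: lists of (timestamp, info) tuples (assumed layout).
    Returns one behavior record per signaling message.
    """
    packets = sorted(packets, key=lambda p: p[0])
    times = [t for t, _ in packets]
    behavior = []
    for ts, sig in sorted(signaling, key=lambda s: s[0]):
        start = bisect_left(times, ts - tolerance)  # first packet in window
        matched = []
        for t, info in packets[start:]:
            if t > ts + tolerance:
                break
            matched.append(info)
        behavior.append({"time": ts, "signaling": sig, "packets": matched})
    return behavior
```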

Database system applied to real-time big data scene

Active | CN104834719A | High-speed real-time data access capability | Reduce hardware costs | Special data processing applications | Three level | Internal memory
The invention discloses a database system applied to a real-time big data scene. The database system comprises a data capturing module which is used for defining the range of data sources to capture, automatically capturing webpage data, performing content extraction and duplicate removal, and performing context analysis; a feature database module which is used for storing Cookie data, advertisement position data and linked data; a real-time database module which is used for indexing the data in the feature database module and storing it in shards; and an advertisement putting module which is used for obtaining advertisement feature data from the real-time database module, comparing it with the data in the feature database module, and finally determining whether to serve advertisements to specific users and which advertisement to serve. Because a three-level storage structure of internal memory, SSD and hard disk is adopted, the system not only provides high-speed real-time data access, but can also cope effectively with very large scale data storage and has disaster recovery capability.
Owner:BEIJING BIKU TIANDI CULTURE CO LTD
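
The three-level storage structure (internal memory, SSD, hard disk) can be illustrated with a small Python sketch in which plain dictionaries stand in for the three tiers. The least-recently-used demotion policy, the capacities and the class name `TieredStore` are assumptions; the abstract does not describe how data moves between levels.

```python
from collections import OrderedDict

class TieredStore:
    """Toy three-tier key-value store: memory -> SSD -> disk (all simulated)."""

    def __init__(self, memory_capacity=2, ssd_capacity=4):
        self.memory = OrderedDict()   # fastest, smallest tier
        self.ssd = OrderedDict()      # middle tier
        self.disk = {}                # slowest tier, unbounded here
        self.memory_capacity = memory_capacity
        self.ssd_capacity = ssd_capacity

    def put(self, key, value):
        # Keep a single copy: drop stale copies from the lower tiers.
        self.ssd.pop(key, None)
        self.disk.pop(key, None)
        self.memory[key] = value
        self.memory.move_to_end(key)
        self._demote()

    def get(self, key):
        if key in self.memory:
            self.memory.move_to_end(key)   # mark as recently used
            return self.memory[key]
        for tier in (self.ssd, self.disk):
            if key in tier:
                value = tier.pop(key)
                self.put(key, value)       # promote hot data to memory
                return value
        return None

    def _demote(self):
        # Push least recently used entries down one tier at a time.
        while len(self.memory) > self.memory_capacity:
            k, v = self.memory.popitem(last=False)
            self.ssd[k] = v
        while len(self.ssd) > self.ssd_capacity:
            k, v = self.ssd.popitem(last=False)
            self.disk[k] = v
```

Reads that hit a lower tier promote the entry back to memory, so frequently accessed data stays on the fastest level while the bulk of the data rests on disk.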