Data processing method and device

A data processing and calculation method, applied in the Internet field, that improves the efficiency of word access and saves resources.

Status: Inactive
Publication Date: 2015-04-29
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

However, the prior art provides no way to mine and recommend the self-internal link words related to a word to be processed.

Examples

Embodiment 1

[0049] See Figure 1, a flowchart of the method provided by Embodiment 1 of the present invention. As shown in Figure 1, the method includes the following steps:

[0050] Step 101: determine the feature vector words of the word to be processed.

[0051] In the present invention, the word to be processed may include at least one word.

[0052] How to determine the feature vector words of the word to be processed is described later, so step 101 is not detailed here.

[0053] Step 102: take the configured internal link words that appear on the result page dedicated to the word to be processed as the candidate internal link words of the word to be processed.

[0054] In the present invention, the word to be processed is a word in the preset knowledge base. When the knowledge base is set up, the present invention can assign each word in the knowledge base a dedicated result page that explains or describes the word.

[0055] Based on...
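The candidate collection in step 102 can be illustrated with a minimal sketch. The `ResultPage` type, the dictionary-based knowledge base, and the function name are hypothetical and do not come from the patent; skipping the word itself and removing duplicates are also assumptions made for the illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ResultPage:
    """Hypothetical dedicated result page for one knowledge-base word."""
    word: str
    # Internal link words that appear on this page, in page order.
    internal_link_words: List[str] = field(default_factory=list)


def candidate_internal_link_words(word: str,
                                  knowledge_base: Dict[str, ResultPage]) -> List[str]:
    """Return the internal link words found on the word's dedicated result page."""
    page = knowledge_base.get(word)
    if page is None:
        return []  # the word has no dedicated result page
    seen = set()
    candidates = []
    for link_word in page.internal_link_words:
        # Assumption: a word is not a candidate for itself, and each
        # candidate is listed only once.
        if link_word != word and link_word not in seen:
            seen.add(link_word)
            candidates.append(link_word)
    return candidates
```

Under these assumptions, a page for "python" that links to "language", "python", "interpreter", and "language" again would yield the candidates "language" and "interpreter".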

Embodiment 2

[0117] Embodiment 2 is described below:

[0118] See Figure 5, a flowchart of the method provided by Embodiment 2 of the present invention. Compared with Embodiment 1 above, Embodiment 2 does not need to calculate feature vector words for the word to be processed. Instead, it relies on how frequently other words in the preset knowledge base are accessed by users to determine the self-internal link words related to the word to be processed, which makes it simpler than Embodiment 1. It is described in detail below:

[0119] As shown in Figure 5, the process may include the following steps:

[0120] Step 501: take the other words in the preset knowledge base as the candidate internal link words of the word to be processed.

[0121] Step 502: obtain the number of times each candidate internal link word is accessed by users within a set time.

[0122] Usually, when any word in the knowledge base is accessed, the knowl...
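Steps 501 and 502 can be sketched as a count over an access log. This is only an illustration: the log format (pairs of word and timestamp), the function name, and the final ranking by access count are assumptions, since the remainder of Embodiment 2 is not shown here.

```python
from collections import Counter
from datetime import datetime
from typing import Iterable, List, Tuple


def rank_by_access_count(access_log: Iterable[Tuple[str, datetime]],
                         word: str,
                         window_start: datetime,
                         window_end: datetime,
                         top_n: int) -> List[str]:
    """Count accesses per candidate word inside the time window, most accessed first."""
    counts = Counter(
        accessed_word
        for accessed_word, timestamp in access_log
        # Step 501: every other knowledge-base word is a candidate.
        # Step 502: only accesses within the set time are counted.
        if accessed_word != word and window_start <= timestamp <= window_end
    )
    # Assumption: the most frequently accessed candidates are the ones kept.
    return [candidate for candidate, _ in counts.most_common(top_n)]
```

The window bounds are inclusive in this sketch; a real system would need to decide how the "set time" is defined.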

Abstract

The invention provides a data processing method and device. The method includes: determining the feature vector words of a word to be processed; taking the configured internal link words that appear on the result page dedicated to the word to be processed as its candidate self-internal link words; calculating, according to a set recommendation score calculation method, a recommendation score for each candidate self-internal link word from that candidate's feature vector words and the feature vector words of the word to be processed; and selecting a set number of candidates with the highest recommendation scores as the self-internal link words related to the word to be processed. The advantage of the method is that the self-internal link words of a word can be mined automatically while the word is being processed.
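The scoring and selection stages of the abstract can be sketched as follows. The patent only refers to "a set recommendation score calculation method", so the Jaccard overlap used here is a stand-in chosen for illustration, and all function and parameter names are hypothetical.

```python
from typing import Dict, Iterable, List


def recommendation_score(candidate_features: Iterable[str],
                         target_features: Iterable[str]) -> float:
    """Assumed score: Jaccard overlap between two sets of feature vector words."""
    a, b = set(candidate_features), set(target_features)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def recommend(target_features: Iterable[str],
              candidate_features_by_word: Dict[str, List[str]],
              top_n: int) -> List[str]:
    """Return the top_n candidate words with the highest recommendation scores."""
    target = list(target_features)
    scored = [
        (recommendation_score(features, target), word)
        for word, features in candidate_features_by_word.items()
    ]
    # Highest score first; ties are broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [word for _, word in scored[:top_n]]
```

Any other similarity measure over feature vector words could be swapped in for `recommendation_score` without changing the selection step.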

Description

Technical field

[0001] This application relates to Internet technology, and in particular to a data processing method and device.

Background

[0002] To make this application easier to understand, the technical terms it uses are first described below:

[0003] Word segmentation: dividing a sequence into individual words. The sequence may be a sequence of Chinese characters, or a sequence of Chinese characters and proprietary English words.

[0004] Knowledge base: a collection of many semantic trees. A semantic tree is composed of a set of words with the same or similar semantics.

[0005] Feature vector word: a word used to represent a feature of a certain document; it includes at least one word.

[0006] Internal link word: a word that appears in the text of the Q&A community as a link, which users can click to jump to a description on another page. It can be used as a feature vector word of a document.

[0007] Self-internal link word:...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06F17/30
Inventor: 程刚
Owner: TENCENT TECH (SHENZHEN) CO LTD