Method and device for processing a set of related words

A processing method and word technology, applied in the Internet field, can solve the problem of small vocabulary and achieve the effect of improving the collection of related words
CN106649334BActive Publication Date: 2020-09-15BEIJING GRIDSUM TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
BEIJING GRIDSUM TECH CO LTD
Publication Date
2020-09-15

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses a conjunction word set processing method and device, wherein the processing method comprises the steps of crawling a web text from a target data source on the basis of conjunction words in a conjunction word set of an object to be analyzed; performing word segmentation on the web text to obtain a plurality of text vocabularies, and obtaining the vocabulary information of each text vocabulary, wherein the vocabulary information includes conjunction index data of each text vocabulary and / or information of part of speech of each text vocabulary, and the conjunction index data is used for indicating the conjunction degree of each text vocabulary and the conjunction words; screening the conjunction index data of a plurality of text vocabularies and / or information of part of speech of a plurality of text vocabularies, and obtaining the screened conjunction vocabularies; and updating the conjunction word set by using the screened conjunction vocabularies. The method and the device provided by the invention solve the technical problem of small vocabulary quantity of the existing word bag accumulating method.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The present application relates to the Internet field, and in particular, relates to a processing method and device for a set of associated words. Background technique

[0002] When an enterprise releases a product or service, or a government department promulgates a certain policy, or an instant event that attracts social attention occurs, there will inevitably be some relevant news reported by online media on the Internet. These online news will be Arouse the attention and discussion of netizens. In the process of collecting Internet public opinion content (i.e., web texts related to the object) for an analysis object (such as current events, products, characters, policies, etc.), if a web crawler is used to crawl the web texts related to the analysis object To collect information, since crawling does not distinguish whether the content is related to the object of analysis, after crawling the web text, it needs to be filtered to filter out the conte...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More