Method and device for extracting keywords of business field

A keyword and field technology, applied in information retrieval and extraction business fields, can solve problems such as inaccurate judgment of keywords, and achieve the effect of solving inaccurate judgment of keywords

Inactive Publication Date: 2018-06-12
BEIJING GRIDSUM TECH CO LTD
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] The embodiment of the present invention provides a method and device for extracting keywords in the business field, so as to at least solve the technical problems in the prior art that the vocabulary of words that should be deleted needs to be manually maintained and the keywords are judged to be inaccurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for extracting keywords of business field
  • Method and device for extracting keywords of business field
  • Method and device for extracting keywords of business field

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0027] According to an embodiment of the present invention, an embodiment of a method for extracting keywords in a business field is provided.

[0028] figure 1 is a flowchart of a method for extracting keywords in a business field according to an embodiment of the present invention, such as figure 1 As shown, the method includes the following steps:

[0029] Step S102, acquiring at least one text in the business domain.

[0030] In the above steps, the above-mentioned business field may be a business field of any industry, for example, manufacturing industry, tourism industry, transportation and logistics industry and so on. The text in the above business field may be knowledge text information on the Internet, for example, a blog on Weibo. Through the above steps, a text collection that needs to extract keywords in a specific business field can be obtained.

[0031] Step S104, calculating the word frequency and inverse document frequency of each keyword contained in each...

Embodiment 2

[0090] According to an embodiment of the present invention, an embodiment of an apparatus for extracting keywords in a business field is provided, wherein the method in Embodiment 1 above can be run in the apparatus provided in this embodiment.

[0091] Figure 5 is a schematic structural diagram of a device for extracting keywords in a business field according to an embodiment of the present invention, such as Figure 5 As shown, the device includes: an acquisition module 501 , a first calculation module 503 , a second calculation module 505 , a first selection module 507 and a second selection module 509 .

[0092] An acquisition module 501, configured to acquire at least one text in the business domain.

[0093]In the above acquisition module, the above-mentioned business field may be a business field of any industry, for example, manufacturing industry, tourism industry, transportation and logistics industry and so on. The text in the above business field may be knowledg...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and device for extracting keywords of a business field. The method comprises the steps of obtaining at least one text in the business field; calculating the word frequency and the inverse document frequency of each keyword included in each text; according to the word frequency of each keyword and the inverse document frequency, determining the critical degree indexof each keyword included in each text; according to the critical degree index of each keyword included in each text, screening to obtain the keywords satisfying predetermined conditions from the texts; according to the screened result, determining the keywords in the business field. The technical problem in the prior art that a word table of words which should be deleted needs to be manually maintained, and the judgment of the keywords is inaccurate are solved.

Description

technical field [0001] The invention relates to the field of information retrieval, in particular to a method and device for extracting keywords in the business field. Background technique [0002] There is a large amount of knowledge text information about various industries on the Internet, such as news reports on the automobile industry, discussions on car models in forums, advertising news in the tourism industry, travel strategies and other news. It is difficult to extract the key information from the text. Therefore, how to quickly and effectively summarize the key information of the text in a certain field or topic has become an important problem that information viewers need to face. [0003] Referring to our usual reading of papers and other documents, there are usually keyword information in the first paragraph of the document to mark the main content and main points of this document, so that readers can search and quickly obtain the general content information of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/313G06F40/216G06F40/289
Inventor 贺达
Owner BEIJING GRIDSUM TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products