
Dictionary construction method and device

A dictionary construction method and device, applied in the computer field, that addresses problems such as poor labeling consistency, low accuracy and coverage, and low efficiency, with the effects of improving quality, improving efficiency, and avoiding poor consistency.

Pending Publication Date: 2021-12-07
BEIJING WODONG TIANJUN INFORMATION TECH CO LTD +1

AI Technical Summary

Problems solved by technology

At present, product element dictionaries are usually built by manual labeling. Because the volume of product description text is huge, manual labeling is slow, which limits how efficiently a product element dictionary can be built. In addition, because different annotators perceive product elements differently, labeling consistency is poor and accuracy and coverage are low, so the resulting product element dictionary performs poorly in practice.




Detailed Description of Embodiments

[0070] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0071] Figure 1 is a schematic diagram of the main flow of the dictionary construction method according to an embodiment of the present invention. As shown in Figure 1, the dictionary construction method may specifically include the following steps:

[0072] Step S101: divide each sentence in the target text into one or more clauses according to the punctuation marks...
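Step S101 can be illustrated with a minimal sketch. The delimiter set and the function name `split_into_clauses` are assumptions made for illustration; the text above does not say which punctuation marks the method uses.

```python
import re
from typing import List

# Assumed delimiter set: common Chinese and English punctuation marks.
# Note that splitting on "." also breaks decimals such as "6.1", so a real
# system would likely need a more careful rule.
_CLAUSE_DELIMITERS = re.compile(r"[，。！？；、,.!?;]+")


def split_into_clauses(text: str) -> List[str]:
    """Split text into clauses at punctuation marks, dropping empty pieces."""
    parts = _CLAUSE_DELIMITERS.split(text)
    return [part.strip() for part in parts if part.strip()]


if __name__ == "__main__":
    print(split_into_clauses("The screen is bright, the battery lasts long. Great value!"))
```

Each resulting clause is then the unit passed to the classification step.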


Abstract

The invention discloses a dictionary construction method and a device, and relates to the technical field of computers. A specific embodiment of the method comprises the following steps: dividing a sentence in a target text into one or more clauses according to punctuation marks; using a text classification model trained based on a semi-supervised learning algorithm to predict a first probability that clauses in the target text belong to commodity elements contained in a pre-constructed commodity element dictionary; under the condition that the first probability that the clause belongs to the commodity element is greater than a first threshold probability, calculating a second probability that words, except element words currently contained in the commodity element, in the clause belong to the commodity element; and under the condition that the second probability that the word belongs to the commodity element is greater than a second threshold probability, adding the word to the commodity element dictionary as an element word of the commodity element. According to the embodiment, automatic expansion of the dictionary is realized, and the construction efficiency of the dictionary is improved.
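The two-threshold expansion described in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: `expand_dictionary`, `clause_prob`, `word_prob`, `tokenize`, and the default threshold values are all hypothetical. In particular, `clause_prob` stands in for the semi-supervised text classification model, and `word_prob` stands in for the second-probability calculation, whose formula is not given in the text above.

```python
from typing import Callable, Dict, List, Set


def expand_dictionary(
    clauses: List[str],
    dictionary: Dict[str, Set[str]],
    clause_prob: Callable[[str, str], float],
    word_prob: Callable[[str, str, str], float],
    tokenize: Callable[[str], List[str]] = str.split,
    first_threshold: float = 0.8,   # assumed value
    second_threshold: float = 0.5,  # assumed value
) -> Dict[str, Set[str]]:
    """Return a copy of `dictionary` extended with newly accepted element words."""
    expanded = {element: set(words) for element, words in dictionary.items()}
    for clause in clauses:
        for element in dictionary:
            # First probability: does the clause belong to this element?
            if clause_prob(clause, element) <= first_threshold:
                continue
            for word in tokenize(clause):
                # Skip words that are already element words of this element.
                if word in expanded[element]:
                    continue
                # Second probability: does this word belong to the element?
                if word_prob(word, clause, element) > second_threshold:
                    expanded[element].add(word)
    return expanded
```

The function returns a new mapping rather than mutating its input, so the seed dictionary can be compared with the expanded one afterwards.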

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a dictionary construction method and device.

Background

[0002] In the field of e-commerce, to help users quickly understand a product's performance and to stimulate their desire to buy, it is often necessary to generate product summaries for users from the product's detailed information, and these summaries include product element words. When generating a product summary, the product elements contained in the product's detailed information must be determined based on the element words of the product elements in a pre-built product element dictionary, and a product summary including those product elements is then generated for the user. Here, element words are synonyms, near-synonyms, or hyponyms of a product element. For example, "mobile phone" has multiple commodity elements such as "screen" and "battery", and the comm...
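The dictionary lookup described in paragraph [0002] can be sketched as a simple substring match. The function name `find_elements` and the element words in the usage example are hypothetical, and case-insensitive substring matching is an assumption, since the text does not say how element words are matched against the detailed information.

```python
from typing import Dict, List, Set


def find_elements(detail_text: str, element_dictionary: Dict[str, Set[str]]) -> List[str]:
    """Return the product elements whose element words occur in the text."""
    text = detail_text.lower()
    return sorted(
        element
        for element, words in element_dictionary.items()
        if any(word.lower() in text for word in words)
    )


if __name__ == "__main__":
    # Hypothetical dictionary for a mobile phone.
    phone_dictionary = {
        "screen": {"screen", "display"},
        "battery": {"battery", "endurance"},
    }
    print(find_elements("A 6.1-inch OLED display with vivid colors", phone_dictionary))
```

A lookup like this finds an element only if one of its element words is already in the dictionary, which is why the coverage of the dictionary matters.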


Application Information

IPC(8): G06F40/242, G06F40/216, G06F16/35
CPC: G06F40/242, G06F40/216, G06F16/35
Inventors: 李浩然, 袁鹏
Owner: BEIJING WODONG TIANJUN INFORMATION TECH CO LTD