Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Word classification model training method based on artificial intelligence and word processing method and device

A classification model and artificial intelligence technology, applied in the fields of electronic equipment and storage media, word processing methods, devices, and word classification model training methods, can solve problems such as low accuracy, improve accuracy, enrich the number of samples, and improve training effects Effect

Active Publication Date: 2020-07-10
TENCENT TECH (SHENZHEN) CO LTD
View PDF18 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the small number of initial entity words, the rule template constructed in this way is too broad, and the accuracy of entity word classification is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word classification model training method based on artificial intelligence and word processing method and device
  • Word classification model training method based on artificial intelligence and word processing method and device
  • Word classification model training method based on artificial intelligence and word processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0075] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with the accompanying drawings, and the described embodiments should not be considered as limiting the present invention, and those of ordinary skill in the art do not make any All other embodiments obtained under the premise of creative labor belong to the protection scope of the present invention.

[0076] In the following description, references to "some embodiments" describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or a different subset of all possible embodiments, and Can be combined with each other without conflict.

[0077] In the following description, the term "first\second" is only used to distinguish similar objects, and does not represent a specific order for objects. Understandably, "first\second\third" is allowed The ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a word classification model training method and device based on artificial intelligence, a word processing method and device, electronic equipment and a storage medium. The method comprises the steps: obtaining a seed entity word set composed of a plurality of seed entity words, wherein the plurality of seed entity words belong to a to-be-mined entity type; combining any twoseed entity words in the seed entity word set to obtain a positive example sample pair; obtaining a historical text including the seed entity words, and constructing a negative example sample pair according to the seed entity words and the historical text excluding the seed entity words; updating a word classification model through the positive example sample pair and the negative example samplepair, wherein the updated word classification model is used for determining the probability that the entity words to be recognized belong to the entity types to be mined. Through the method and the device, the richness of model training samples can be improved, the corpus annotation cost required by entity mining is reduced, and meanwhile, the training effect of the word classification model can also be improved.

Description

technical field [0001] The present invention relates to artificial intelligence technology, in particular to an artificial intelligence-based word classification model training method, word processing method, device, electronic equipment and storage medium. Background technique [0002] Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. Natural Language Processing (NLP, Nature Language Processing) is an important direction of artificial intelligence. It mainly studies various theories and methods that can realize effective communication between humans and computers using natural language. [0003] Entity word classification is an important application of natural language processing. By determining the entity type of entity words in t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/295G06F16/35
CPCG06F16/35
Inventor 邵纪春孙钟前胡海峰
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products