Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Word segmentation processing method and device, electronic equipment and storage medium

A word segmentation processing and word segmentation technology, applied in the Internet field, can solve the problems of high access cost, poor word segmentation effect and word segmentation performance, and achieve the effect of improving word segmentation effect and word segmentation performance, improving stability and word segmentation speed, and low cost of word segmentation

Pending Publication Date: 2021-11-05
BEIJING DAJIA INTERNET INFORMATION TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present disclosure provides a word segmentation processing method, device, electronic equipment, and storage medium to at least solve the problems of high access cost, poor word segmentation effect, and poor word segmentation performance in related technologies

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word segmentation processing method and device, electronic equipment and storage medium
  • Word segmentation processing method and device, electronic equipment and storage medium
  • Word segmentation processing method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0099] In order that those skilled in the art better understand the technical solution of the present disclosure, in conjunction with the accompanying drawings below, the technical solutions in the embodiments of the present disclosure will be clearly and fully described.

[0100]It should be noted that the specification and claims of the present disclosure and the terms "first", "second", "second", etc. of the drawings are used to distinguish similar objects without having to describe a particular order or ahead order. It is to be understood that the data such as use can be interchangeable in appropriate, so that the embodiments of the present disclosure described herein can be implemented in the order other than those illustrated or described herein. The embodiments described in the exemplary embodiments described below do not represent all embodiments consistent with the present disclosure. Instead, they are only examples of apparatus and methods consistent with some aspects of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a word segmentation processing method and device, electronic equipment and a storage medium. The method comprises: obtaining target feature vectors corresponding to information to be subjected to word segmentation of multiple target words arranged in sequence; performing label prediction processing on the target feature vector to obtain first prediction results of the plurality of target words belonging to preset word segmentation labels; on the basis of a preset word segmentation tag, combining target words in a target word segmentation fragment corresponding to the information to be subjected to word segmentation, obtaining target word sequences corresponding to the target words, carrying out tag prediction processing on the target word sequences, and obtaining second prediction results that the target words belong to the preset word segmentation tag; according to the first prediction result and the second prediction result, determining target word segmentation labels corresponding to the target characters; and determining a word segmentation result of the to-be-segmented information according to the to-be-segmented information and the target word segmentation tag. By utilizing the scheme provided by the embodiment of the invention, the word segmentation effect and performance of the to-be-segmented information can be improved, and the word segmentation cost is reduced.

Description

Technical field [0001] The present disclosure relates to the field of Internet technology, and more particularly to a method, apparatus, electronic device, and storage medium. Background technique [0002] The Chinese word in natural language processing refers to the process of re-combining a consecutive word sequence in accordance with a certain specification. [0003] The relevant techniques are generally based on matching, statistical, depth learning, etc. However, the matching word algorithm (such as forward, reverse matching algorithm) is too dependent on the dictionary, the maintenance cost of the dictionary is high, the system resource consumption is large, and the matching word algorithm based on the word words and unregistered words. Treatment effect (such as the stability of the word boundary) is poor; based on the statistical word algorithm, the complexity is large, the word performance (for example, the vocabulary rate) is poor, and a large number of artificial labels...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/289
CPCG06F40/289
Inventor 胡羽蓝
Owner BEIJING DAJIA INTERNET INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products