Word segmentation method, device, equipment and storage medium for bom text

A word segmentation method and text technology, applied in the direction of instruments, calculations, electrical digital data processing, etc., can solve the problem of inaccurate word segmentation of BOM files, achieve fast word segmentation speed, and solve the effect of inaccurate word segmentation

Active Publication Date: 2022-04-15
ALLCHIPS LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] The main purpose of the present invention is to solve the technical problem of inaccurate word segmentation of existing BOM files

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word segmentation method, device, equipment and storage medium for bom text
  • Word segmentation method, device, equipment and storage medium for bom text
  • Word segmentation method, device, equipment and storage medium for bom text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058]Embodiments of the present invention provide a word segmentation method, device, equipment and storage medium for BOM text.

[0059] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above drawings are used to distinguish similar objects, and not necessarily Used to describe a specific sequence or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments described herein can be practiced in sequences other than those illustrated or described herein. Furthermore, the term "comprising" or "having" and any variations thereof, are intended to cover a non-exclusive inclusion, for example, a process, method, system, product or device comprising a sequence of steps or elements is not necessarily limited to those explicitly listed instead, may include other steps or elements not explicitly listed or inherent to the process, metho...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of text word segmentation, and discloses a word segmentation method, device, equipment and storage medium for BOM text. The method includes: obtaining BOM text data to be segmented, and splitting the BOM text data into Chinese and English to obtain a cut text set; reading the cut text in the cut text set; judging whether the cut text is Chinese text; if it is Chinese text, Then, according to the preset jieba function, the word segmentation process is performed on the cut text to obtain the cut word set, and the cut word set is determined as the word segmentation data; Perform screening and splitting to obtain the word segmentation data of English numbers; combine all the word segmentation data into a word segmentation data set, and determine the word segmentation data set as the word segmentation result of the BOM text data.

Description

technical field [0001] The present invention relates to the field of text word segmentation, in particular to a method, device, equipment and storage medium for word segmentation of BOM text. Background technique [0002] The BOM file is a semi-structured text file, and the user will write in the BOM file the parameter information of the hardware to be purchased, including model, brand, precision, etc. [0003] Natural Language Processing (NLP, Natural Language Processing) is an important direction in the field of artificial intelligence. It mainly studies various theories and methods for effective communication between humans and computers using natural language. The underlying tasks of natural language processing can be roughly divided into lexical analysis, syntactic analysis and semantic analysis from easy to difficult. Word segmentation is the most basic task in lexical analysis (including part-of-speech tagging and named entity recognition), and it is also an essentia...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/289G06F40/284G06F40/242
CPCG06F40/289G06F40/284G06F40/242
Inventor 杜飞高宇鹏刘武刘松山王园园王安李六七
Owner ALLCHIPS LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products