Data processing method and device, and electronic equipment

A data processing method, applied in the field of information technology, that addresses the low efficiency of word segmentation processing. It improves word segmentation accuracy and professionalism, improves processing efficiency, and narrows the word segmentation scope.

Pending Publication Date: 2020-12-11
XINYANG BRANCH HENAN CO LTD OF CHINA MOBILE COMM CORP +1

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a data processing method, device, and ele



Embodiment Construction

[0027] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0028] Figure 1 is a flowchart of a data processing method provided by an embodiment of the present invention. The method shown in Figure 1 can be performed by a data processing device and includes:

[0029] S110. Obtain the text to be processed in the target webpage, where the category of the target webpage is an instance class.

[0030] In S110, the target webpage is obtained from the DPI log webpage by using the crawler crawl...


Abstract

The invention discloses a data processing method and device, and electronic equipment. The method comprises: obtaining a to-be-processed text in a target web page; determining the target industry category to which the to-be-processed text belongs; preloading M preset classification word banks from a database into a cache, wherein the M preset classification word banks correspond to M industry categories; dynamically matching the category name of the target industry category against the lexicon names of the M preset classification word banks in the cache to determine a target classification word bank; and dynamically loading the target classification word bank from the cache into memory and performing word segmentation on the to-be-processed text based on the target classification word bank in memory to obtain a word segmentation result. Because the category name of the target industry is dynamically matched against the lexicon names of the preset classification word banks, the target classification word bank can be loaded dynamically to achieve dynamic adaptation. The word segmentation range is also narrowed, and word segmentation accuracy, professionalism, and processing efficiency are improved.
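The flow in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the source does not say how names are matched or which segmentation algorithm is used, so exact case-insensitive name matching and forward maximum matching are stand-ins chosen here. The `DATABASE` contents and all function names are hypothetical.

```python
from typing import Dict, FrozenSet, List, Set

# Hypothetical stand-in for the database of M preset classification
# word banks (M = 2 here), keyed by lexicon name.
DATABASE: Dict[str, Set[str]] = {
    "finance": {"mutualfund", "fund", "yield"},
    "medical": {"bloodpressure", "blood", "dose"},
}


def preload_to_cache(database: Dict[str, Set[str]]) -> Dict[str, FrozenSet[str]]:
    """Copy every preset classification word bank from the database into a cache."""
    return {name: frozenset(words) for name, words in database.items()}


def match_word_bank(category_name: str, cache: Dict[str, FrozenSet[str]]) -> str:
    """Match the category name against cached lexicon names.

    Assumption: an exact, case-insensitive comparison; the source only says
    the names are "dynamically matched".
    """
    wanted = category_name.strip().lower()
    for lexicon_name in cache:
        if lexicon_name.lower() == wanted:
            return lexicon_name
    raise KeyError(f"no word bank for category {category_name!r}")


def segment(text: str, word_bank: Set[str]) -> List[str]:
    """Forward maximum matching: take the longest lexicon word at each position.

    Characters not covered by any lexicon word are emitted one at a time.
    """
    max_len = max((len(word) for word in word_bank), default=1)
    tokens: List[str] = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in word_bank:
                tokens.append(piece)
                i += size
                break
    return tokens


def process(text: str, category_name: str, cache: Dict[str, FrozenSet[str]]) -> List[str]:
    """Load only the matched word bank into working memory, then segment."""
    target_name = match_word_bank(category_name, cache)
    in_memory_bank = set(cache[target_name])  # only the target bank is loaded
    return segment(text, in_memory_bank)


cache = preload_to_cache(DATABASE)
print(process("mutualfundyield", "Finance", cache))
```

Only the matched word bank is consulted during segmentation, which is how this sketch narrows the segmentation range: words from the other industry lexicons cannot produce spurious matches.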

Description

Technical field

[0001] The present invention relates to the field of information technology, in particular to a data processing method, device and electronic equipment.

Background

[0002] As business data keeps expanding, in-depth analysis of DPI log web page content faces huge challenges, and demand for large-scale data analysis is growing explosively. Ensuring the accuracy of word segmentation over large volumes of data has gradually become a thorny problem.

[0003] A relatively common word segmentation technique in existing large-scale data recognition is to perform word segmentation and statistics on the content of a large number of DPI log web pages based on a general thesaurus. With this technique, commonly used high-frequency words all rank at the top of the word segmentation results, the word segmentation accuracy and professionalism are low, and the number of commonly used high-fr...


Application Information

IPC(8): G06F16/33, G06F16/36, G06F40/289
CPC: G06F16/3344, G06F16/3346, G06F16/374
Inventor: 朱建浩, 白琳, 崔刚
Owner: XINYANG BRANCH HENAN CO LTD OF CHINA MOBILE COMM CORP