Unlock instant, AI-driven research and patent intelligence for your innovation.

Character string processing method and device

A processing method and string technology, applied in the Internet field, can solve the problems of long time consumption, slow calculation speed of effective character ratio, low efficiency of water post recognition, etc., and achieve the effect of improving the ratio calculation speed

Active Publication Date: 2016-01-27
BEIJING GRIDSUM TECH CO LTD
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most watering posts are long in length and may contain tens of thousands or even hundreds of thousands of characters, which makes the process of traversing the entire string time-consuming, and the calculation of the proportion of valid characters is slow, which in turn leads to the identification of watering posts. Low
[0004] For the above problems, no effective solution has been proposed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Character string processing method and device
  • Character string processing method and device
  • Character string processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] In order to enable those skilled in the art to better understand the solution of the application, the technical solutions in the embodiments of the application will be clearly and completely described below in conjunction with the drawings in the embodiments of the application. Obviously, the described embodiments are only It is a part of the embodiments of this application, not all the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work should fall within the protection scope of this application.

[0022] It should be noted that the terms "first" and "second" in the description and claims of the application and the above-mentioned drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It should be understood that the data used in this way can be interchanged under appropriate circumstances so that the embodi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a character string processing method and device. The method comprises the following steps: obtaining a target character string; randomly selecting a preset quantity of characters on the target character string; determining the effective characters from the selected preset quantity of characters by utilizing a preset effective character set, and counting the quantity of the effective characters; and selecting the proportion of the effective characters in the selected preset quantity of characters according to a preset quantity and the quantity of the effective characters, and taking the proportion as the proportion of the effective characters in the target character string. According to the character string processing method and device, the technical problem that the calculation of the proportion of the effective characters in the character strings of spamming is low in speed is solved.

Description

Technical field [0001] This application relates to the Internet field, and specifically to a string processing method and device. Background technique [0002] In the Internet field, before analyzing network data, a large amount of data, such as forums and microblogs, needs to be crawled from the Internet. In the body of a forum post, many links are often maliciously injected, and the number may reach thousands or even tens of thousands, which are called water posts. This kind of forum may be occupied by irrigation posts and updated daily. After these posts are crawled, in the process of parsing the content of the forum posts, the efficiency of parsing will be greatly reduced. Therefore, it is necessary to find out the posts and remove them. [0003] At present, the recognition of watered posts is usually: given a valid character set, traverse the string in the entire post content, calculate the number of valid characters in the entire string, and then determine the string accord...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/90344
Inventor 石岱曦何鑫
Owner BEIJING GRIDSUM TECH CO LTD