Unlock instant, AI-driven research and patent intelligence for your innovation.

A method for extracting a telephone number from a text

A phone number and text technology, applied in the field of computer information processing, can solve problems such as errors in extracting information, low accuracy, and user inability to use, and achieve the effects of improving efficiency, accurate phone numbers, and high usability.

Inactive Publication Date: 2019-06-14
陈包容
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the existing technical problems raised in the background technology, the present invention provides a method for extracting phone numbers from text, aiming to solve the problem of low accuracy in the phone number identification and extraction proposed in the above background technology, errors in extracted information, and inability of users to problem of use

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for extracting a telephone number from a text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0040] see figure 1 , a method for extracting phone numbers from text, comprising the following steps:

[0041] S1. Create a dedicated database group in advance;

[0042] S2. Perform word segmentation on the obtained text content to obtain a word segmentation data set;

[0043] S3. For the word segmentation data set, the phone numbers are extracted step by step according to the following rules:

[0044] First, retrieve the continuous combination strings of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of computer information processing, and discloses a method for extracting a telephone number from a text, which comprises the following steps of: pre-creating a special database group; performing word segmentation on the obtained text content to obtain a segmented word data set; aiming at the word segmentation data set, extracting telephone numbers in the word segmentation data set step by step according to the following rules: retrieving a continuous combined character string which is not lower than three Arabic numbers in the word segmentation data set, and summarizing the numbers into a digital text combination; starting to retrieve a continuous sequence combination of the first number and the number on the right side of the first number fromthe first Arabic number of the number text combination; and finally, speculating and extracting telephone numbers according to a pre-created special database group. The word segmentation data set isobtained for the text content, the digital text combination is screened out from the word segmentation data set, matching screening is carried out item by item according to the formats of the mobile phone number and the fixed telephone number, the telephone number obtained from the text content is more accurate, and the user usability is high.

Description

technical field [0001] The invention relates to the technical field of computer information processing, in particular to a method for extracting telephone numbers from text. Background technique [0002] Text data mining is an application-driven discipline that uses computer processing technology to extract valuable information and knowledge from text data. The type of data processed by text data mining is text data, which belongs to a branch of data mining and is closely related to machine learning, natural language processing, mathematical statistics and other disciplines. Text mining plays an important role in many applications, such as data collection, information extraction (such as Internet search), etc. [0003] Text information extraction is a basic technology of text data mining. Text information extraction is a technique to extract specific information from text data. Text data is composed of some specific units, such as sentences, paragraphs, and chapters, whil...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F16/35
Inventor 陈包容
Owner 陈包容