Unlock instant, AI-driven research and patent intelligence for your innovation.

News Text Processing Method Based on Automatic Word Segmentation

A text processing and automatic word segmentation technology, applied in the field of text processing, can solve problems such as difficulty in understanding text and inability to retrieve it, and achieve the effect of convenient current affairs content and easy learning

Active Publication Date: 2022-02-08
东华理工大学南昌校区
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Secondly, there are some innovative vocabulary in some news, which cannot be searched in the existing thesaurus, which brings difficulty to the understanding of the text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • News Text Processing Method Based on Automatic Word Segmentation
  • News Text Processing Method Based on Automatic Word Segmentation
  • News Text Processing Method Based on Automatic Word Segmentation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the drawings in the embodiments of the present invention.

[0042] Such as Figure 1 to Figure 4 Shown is the news text processing method based on automatic word segmentation of the present invention, according to the phrase database and current affairs text, the news text can be divided into multiple character strings to achieve the purpose of fast word segmentation. The main steps of the present invention are as follows.

[0043] Step1, generate a phrase database, the phrase database has a phrase dictionary with arbitrary domain labels, and the phrase dictionary contains multiple basic phrases. First, collect basic phrases corresponding to the fields of philosophy, economics, law, education, literature, history, science, engineering, agriculture, medicine, military science, management, and art, and construct the fields of the phrase...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a news text processing method based on automatic word segmentation. The method first generates a database of phrases. Then extract the current affairs text, field tags, and news texts, compare the current affairs text with the basic phrase, and determine multiple active phrases and passive phrases of the current affairs text. Then retrieve the same strings to be processed as the active phrases and passive phrases in the news text, and generate the first string, the second string and the intermediate text. Then compare the middle text with the basic phrase, determine the third character string and the fourth character string, and finally splice the first, second, third, and fourth character strings to complete word segmentation processing of the news text. This method provides a technical basis for text semantic recognition through word segmentation of news text. It is also conducive to mining the value of news and enabling more precise positioning and search.

Description

technical field [0001] The invention relates to text processing technology, in particular to a news text processing method based on automatic word segmentation. Background technique [0002] With the development of the Internet, information acquisition has become very easy. There are various news reports on the same current event. The process of users understanding real events is often filled with a lot of irrelevant information, and they cannot really see the information they want to know. . In the prior art, the news processing system of CN201610114278.0 divides news into multiple categories by multi-level classification of news titles, thereby improving the use value of news. However, just by categorizing the titles cannot guarantee the identification of the content. Maybe the news titles match the keywords searched by the user, but the content has nothing to do with it. Before understanding the content of the news, it is necessary to combine the classification technolo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33G06F16/36G06F40/289
CPCG06F16/3344G06F16/36G06F40/289
Inventor 黄振华李惠惠
Owner 东华理工大学南昌校区