Unlock instant, AI-driven research and patent intelligence for your innovation.

Text content quality assessment method and system

A quality assessment and text technology, applied in instruments, natural language translation, data processing applications, etc., can solve the problems of consuming energy and time of users, occupying system resources by useless data, and uneven quality of network data, so as to save storage resources and improve The effect of reading quality

Active Publication Date: 2021-10-01
GLOBAL TONE COMM TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the quality of these network data is uneven. The storage of a large amount of useless data takes up valuable system resources, and the reading of invalid information will also consume a lot of energy and time of users.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text content quality assessment method and system
  • Text content quality assessment method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings, but it should be understood that the protection scope of the present invention is not limited by the specific embodiments.

[0019]Unless expressly stated otherwise, throughout the specification and claims, the term "comprise" or variations thereof such as "includes" or "includes" and the like will be understood to include the stated elements or constituents, and not Other elements or other components are not excluded.

[0020] In order to accurately and quickly extract valuable information from the text data, the inventor thought about it and found that the text data in the current open source sites often contains some advertisements, subscription information or some rich text information, which are often related to the text topic Irrelevant, it will reduce the quality of the entire text, affect the user's reading, and increase the cost of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and system for assessing the quality of text content. The idea of ​​constructing feature engineering and establishing a classification model through the N-gram of part-of-speech tags is used to effectively identify invalid information in the text and score the overall text content. The invention can Applied in an intelligent data mining system, as part of preprocessing, it removes valueless information, retains valuable information in the text body to the greatest extent, obtains valuable text and serves downstream tasks, and can effectively save system storage resources. Improve the reading quality of users.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a text content quality assessment method and system. Background technique [0002] With the deepening of the application and development of the Internet industry, all aspects of people's production and life are deeply affected, and the various text data generated thereupon is explosively growing. The number of these texts has become extremely large and contains a lot of important information. However, the quality of these network data varies. The storage of a large amount of useless data takes up valuable system resources, and the reading of invalid information also consumes a lot of energy and time for users. Therefore, how to accurately and quickly extract valuable information from these text data is an urgent problem to be solved. [0003] The information disclosed in this Background section is only for enhancing the understanding of the general background...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/284G06F40/44G06K9/62G06Q10/06
CPCG06F40/284G06F40/44G06Q10/06395G06F18/2415G06F18/214
Inventor 张力文
Owner GLOBAL TONE COMM TECH