Method for automatically writing specific manuscript
A manuscript-writing technology, applied in the field of automatically writing specific manuscripts, which addresses the problems that prior methods cannot intelligently learn parameter features and cannot be applied directly to Chinese text
Active Publication Date: 2017-05-31
李鹏
Cites: 3 · Cited by: 3
AI-Extracted Technical Summary
Problems solved by technology
[0004] 1. Automatic writing of specific manuscripts in the prior art is based on English text and wiki encyclopedias; Chinese natural language processing differs in many details, so those methods cannot be applied directly;
[0005] 2. The classification method used in the automatic writing of specific ...
Abstract
The invention relates to a method for automatically writing a specific manuscript and belongs to the field of information processing. Text documents are clustered according to title similarity, and a vector space model (VSM) is built based on TF-IDF so that the text is converted into vectors; this scheme takes the meaning of words into account and makes Chinese manuscript writing more reasonable and accurate. Mutual information (MI) is used to reduce the dimensionality of the VSM and to select the effective features supplied to a machine learning classifier. The classifier learns intelligently and outputs a first-draft article; an ILP processor then integrates the sentences of the first draft, automatically removing repeated sentences across the whole article, and produces a higher-quality final article. When a Chinese manuscript is written, the semantics and meaning of words are taken into account, intelligent learning is achieved, and through sentence optimization and article integration the method is suitable for writing specific manuscripts in multiple fields.
Application Domain
Machine learning; Special data processing applications
Technology Topic
tf–idf; Degree of similarity
Examples
- Experimental program (1)
Example Embodiment
[0051] In order to make the objectives, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be described in detail below. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other implementations obtained by a person of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.
[0052] Glossary:
[0053] TF-IDF is the abbreviation of Term Frequency-Inverse Document Frequency. It is a weighting scheme commonly used in information retrieval and data mining. The main idea of TF-IDF is: if a word or phrase appears with high frequency (TF) in an article and rarely appears in other articles, that word or phrase is considered to have good category-discriminating ability.
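For reference only, a commonly used formulation of this weighting (the patent does not specify which variant it uses) is:

tfidf(t, d) = tf(t, d) × log(N / df(t))

where tf(t, d) is the frequency of term t in document d, N is the total number of documents, and df(t) is the number of documents that contain t.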
[0054] VSM is the abbreviation of Vector Space Model.
[0055] MI is the abbreviation of Mutual Information. In probability theory and information theory, the mutual information of two random variables is a measure of their mutual dependence.
[0056] ILP is the abbreviation of Integer Linear Programming. A program in which all or some of the variables are restricted to integers is called an integer program; if the model is linear and the variables are restricted to integers, it is called an integer linear program.
[0057] As shown in Figure 1, the present invention provides a method for automatically writing a specific manuscript, including the following steps:
[0058] Step S1: Determine the domain of the specific manuscript to be written, and use a web crawler to crawl k webpages in that domain from the web, k > 2. Each webpage has n subtitles, n ≥ 2. For each webpage, extract the i-th subtitle and the body under the i-th subtitle and generate the i-th text document, where the i-th subtitle serves as the title of the i-th text document and the body under the i-th subtitle serves as the body of the i-th text document, i = 1, ..., n;
[0059] Step S1 shows that a text library for learning is obtained from the network. The present invention generates a text document from each subtitle of a crawled webpage and the body under that subtitle: the title of the text document corresponds to a subtitle in the webpage, and the body of the text document is the body under that subtitle. On the one hand, the text documents can be clustered by their titles; on the other hand, because the body of a text document corresponds one-to-one with its title, after clustering the bodies of the text documents in a cluster also belong together, so the problem of unrelated bodies within a cluster does not arise.
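As an illustration only, step S1 could be implemented roughly as in the following Python sketch; the URL list, the use of <h2> headings as subtitles, and the TextDocument structure are assumptions, not part of the patent.

import requests
from bs4 import BeautifulSoup
from dataclasses import dataclass

@dataclass
class TextDocument:
    title: str   # the i-th subtitle of a webpage
    body: str    # the body text under that subtitle

def crawl_text_documents(urls):
    """Build one TextDocument per subtitle of every crawled webpage (step S1)."""
    docs = []
    for url in urls:
        html = requests.get(url, timeout=10).text
        soup = BeautifulSoup(html, "html.parser")
        # Assumption: subtitles are <h2> headings; the body is the text of the
        # sibling elements up to the next <h2>.
        for h2 in soup.find_all("h2"):
            parts = []
            for sib in h2.find_next_siblings():
                if sib.name == "h2":
                    break
                parts.append(sib.get_text(" ", strip=True))
            body = " ".join(parts).strip()
            if body:
                docs.append(TextDocument(title=h2.get_text(strip=True), body=body))
    return docs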
[0060] Step S2: Set a title similarity threshold, compute the title similarity of every pair of text documents, and group the text documents into multiple clusters such that the title similarity of any two text documents in a cluster is greater than or equal to the title similarity threshold; each cluster uses the title that occurs most frequently within it as the name of the cluster;
[0061] Step S2 shows that the present invention clusters text documents by title similarity and filters them with the title similarity threshold, ensuring that the text documents forming a cluster are highly similar, so that effective features for machine learning can be obtained more accurately and are concentrated rather than scattered. In the present invention, the Levenshtein algorithm can be used to calculate the title similarity of any two text documents. To ensure that the text documents forming clusters are sufficiently similar, that the effective features for machine learning are obtained accurately, and that the effective features are concentrated, the similarity threshold in the present invention can be set to 0.5.
[0062] In addition, in step S2 the text documents that do not form clusters may interfere with the present invention. Such text documents can be deleted, removing the interference points and excluding text documents that are not relevant.
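A minimal sketch of step S2 follows, assuming Levenshtein similarity is defined as 1 minus the edit distance divided by the longer title length and using the python-Levenshtein package; the greedy single-pass clustering shown here is only an illustration, since the patent does not fix a particular clustering procedure.

from collections import Counter
import Levenshtein  # pip install python-Levenshtein

def title_similarity(a, b):
    """Normalized Levenshtein similarity in [0, 1] (assumed definition)."""
    if not a and not b:
        return 1.0
    return 1.0 - Levenshtein.distance(a, b) / max(len(a), len(b))

def cluster_by_title(docs, threshold=0.5):
    """Greedy clustering: a document joins the first cluster whose titles are all
    at least `threshold`-similar to its own title (step S2).
    `docs` are objects with a .title attribute, as in the step S1 sketch."""
    clusters = []
    for doc in docs:
        placed = False
        for cluster in clusters:
            if all(title_similarity(doc.title, d.title) >= threshold for d in cluster):
                cluster.append(doc)
                placed = True
                break
        if not placed:
            clusters.append([doc])
    # Drop single-document clusters (interference points, paragraph [0062]).
    clusters = [c for c in clusters if len(c) > 1]
    # Name each cluster after its most frequent title.
    names = [Counter(d.title for d in c).most_common(1)[0][0] for c in clusters]
    return clusters, names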
[0063] Step S3: Count the number of text documents in each cluster, sort the clusters by the number of text documents they contain in descending order, and take the names of the first m clusters as the subtitles of the specific manuscript to be written, where m = (n1 + n2 + … + nk)/k, k represents the number of webpages crawled from the web, and nk represents the number of subtitles of the k-th webpage;
[0064] Step S3 determines the subtitle framework of the specific manuscript to be written. To make the present invention write the most suitable article, the clusters are sorted by the number of documents they contain, and the names of the first m clusters are preferably used as the subtitles of the specific manuscript, so that the subtitles have the highest degree of match. This step also provides a preferred way to choose the number of subtitles m: it is set to the average number of subtitles of the webpages crawled from the web, so that the number of subtitles of the specific manuscript is close to that of prior articles and is therefore reasonable and appropriate. For example, if k = 3 crawled webpages have 4, 6 and 5 subtitles respectively, then m = (4 + 6 + 5)/3 = 5.
[0065] Step S4: Process the text documents in the first m clusters with the TF-IDF algorithm to obtain the feature words of the text documents in each cluster, build a vector space model (VSM) from all the text documents in those clusters, use MI to reduce the dimensionality of the VSM, and select the effective features supplied to the machine learning classifiers;
[0066] Step S4 converts the text into vectors, so that the present invention considers the meaning and semantics of words more comprehensively. This overcomes the shortcoming of the prior art, in which the acquired parameters are based on word counts, the number of digits in the text and the like while the influence of word meaning on classification is ignored, and it makes Chinese manuscript writing more reasonable and accurate. MI is then used to reduce the dimensionality of the VSM and to select the effective features supplied to the machine learning classifiers.
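For illustration, the TF-IDF vector space model of step S4 could be built with scikit-learn as sketched below; the use of jieba for Chinese word segmentation and of TfidfVectorizer are assumptions, since the patent does not name specific tools.

import jieba  # Chinese word segmentation (assumed, not specified in the patent)
from sklearn.feature_extraction.text import TfidfVectorizer

def build_vsm(cluster_docs):
    """Return the TF-IDF document-term matrix and feature words of one cluster.
    `cluster_docs` are objects with a .body attribute, as in the step S1 sketch."""
    corpus = [" ".join(jieba.cut(d.body)) for d in cluster_docs]
    vectorizer = TfidfVectorizer()
    X = vectorizer.fit_transform(corpus)      # rows: documents, columns: feature words
    return X, vectorizer.get_feature_names_out()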
[0067] For step S4, the present invention also provides a preferred procedure for reducing the dimensionality of the vector space model VSM by using MI:
[0068] I.
[0069] II.
[0070] III.
[0071] IV. F = P(t | ci)
[0072] C = P(ci | t)
[0073] In I–IV, fi(t) represents the total number of documents in cluster ci that contain feature t, the average number of documents containing feature t per cluster is also used, α represents the balance factor, F represents the probability that the word t appears in class ci, and C represents the probability that a paragraph containing feature t belongs to class ci.
[0074] With the above preferred solution, if the influence of word frequency is not considered there will be a tendency to blindly select low-frequency words. The present invention therefore also provides the following solution:
[0075] The method of using MI to reduce the dimensionality of the vector space model VSM further includes:
[0076] V. BMI = α · F · C · MI
[0077] BMI represents the mutual information criterion with balance-factor correction that is finally used for feature selection. Through this scheme the influence of word frequency is taken into account, and the tendency to blindly select low-frequency words is balanced out.
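The sketch below illustrates one possible reading of this balanced feature-selection score; the pointwise mutual information formula, the balance factor derived from fi(t) and the per-cluster average, and the top-k cut-off are all assumptions, because formulas I–III are not reproduced above.

import numpy as np

def balanced_mi_scores(X_binary, labels, cluster_id, eps=1e-12):
    """Score features of one cluster with a balance-factor-corrected MI (step S4, V).

    X_binary: (n_docs, n_features) 0/1 matrix, 1 if the document contains the feature.
    labels:   cluster id of each document.
    """
    in_c = (labels == cluster_id)
    n_clusters = len(np.unique(labels))

    f_i = X_binary[in_c].sum(axis=0)               # documents in cluster c_i containing t
    f_avg = X_binary.sum(axis=0) / n_clusters      # average per cluster (assumed definition)
    alpha = f_i / (f_avg + eps)                    # assumed form of the balance factor

    p_t = X_binary.mean(axis=0) + eps              # P(t)
    p_c = in_c.mean() + eps                        # P(c_i)
    p_tc = X_binary[in_c].mean(axis=0) * p_c + eps # P(t, c_i)

    F = p_tc / p_c                                 # P(t | c_i)
    C = p_tc / p_t                                 # P(c_i | t)
    MI = np.log(p_tc / (p_t * p_c))                # pointwise mutual information (assumed)
    return alpha * F * C * MI                      # BMI = α · F · C · MI

def select_effective_features(scores, k=500):
    """Keep the indices of the k highest-scoring features (assumed cut-off)."""
    return np.argsort(scores)[::-1][:k]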
[0078] Step S5: For the first m clusters processed in step S4, each cluster corresponds to a unique machine learning classifier. The text documents in each cluster are divided into two parts: one part of the text documents is annotated and used to train the machine learning classifier; the other part is used to test the trained machine learning classifier to obtain the corresponding error rate, and the machine learning classifier is adjusted according to the error rate;
[0079] Step S5 uses supervised machine learning classifiers for classification. Such classification learns parameter features more dynamically and therefore yields more effective and intelligent classification results. In this step, the different machine learning classifiers may use the same machine learning algorithm, for example one of SVM or Naive Bayes, or they may use different machine learning algorithms.
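As a sketch only, one classifier per cluster could be trained and evaluated as below; the use of scikit-learn's LinearSVC on the TF-IDF features, the 70/30 split and the error-rate computation are illustrative assumptions.

from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC

def train_cluster_classifier(X, y, test_size=0.3, random_state=0):
    """Train one classifier for one cluster and report its error rate (step S5).

    X: feature matrix of the annotated text documents (e.g. the reduced VSM).
    y: annotated labels for the documents of this cluster.
    """
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=random_state)
    clf = LinearSVC()            # Naive Bayes would be an equally valid choice
    clf.fit(X_train, y_train)
    error_rate = 1.0 - clf.score(X_test, y_test)
    return clf, error_rate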
[0080] Step S6: Construct the query sentence of the specific manuscript to be written, crawl candidate content from the Internet according to the query sentence, classify the paragraphs of the crawled candidate content with the adjusted machine learning classifiers, and output the first-draft article;
[0081] Step S6 produces the first-draft article. To make the crawled candidate content more applicable, the present invention also provides the following improvement: in step S6, a length threshold is set for crawling the candidate content, and only candidate content whose length is greater than or equal to the length threshold is crawled. By setting this length threshold the candidate content is filtered during crawling; on the one hand, low-quality candidate content is largely filtered out, and on the other hand, crawling efficiency is improved. In the present invention, to make the threshold reasonable and obtain useful candidate content, the length threshold of the candidate content can be set to 15 characters.
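A minimal illustration of the length filter and the paragraph routing in step S6 follows; how candidate paragraphs are obtained from the query results and how each cluster's classifier signals a match are simplified assumptions.

import jieba  # assumed segmentation tool, as in the earlier sketch

MIN_LENGTH = 15  # length threshold of candidate content, in characters ([0081])

def filter_candidates(paragraphs, min_length=MIN_LENGTH):
    """Keep only candidate paragraphs whose length reaches the threshold (step S6)."""
    return [p for p in paragraphs if len(p.strip()) >= min_length]

def assemble_first_draft(candidates, subtitles, classifiers, vectorizers):
    """Route each sufficiently long candidate paragraph to a subtitle using the
    per-cluster classifiers, producing the first-draft article (step S6)."""
    draft = {name: [] for name in subtitles}
    for para in filter_candidates(candidates):
        x_text = " ".join(jieba.cut(para))
        for name, clf, vec in zip(subtitles, classifiers, vectorizers):
            x = vec.transform([x_text])
            if clf.predict(x)[0] == 1:   # assumption: label 1 means the paragraph fits
                draft[name].append(para)
    return draft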
[0082] Step S7: When the machine learning classifiers classify the candidate content and output the first-draft article, each machine learning classifier scores the paragraphs it outputs, and the score of a paragraph is used as the score of each sentence in that paragraph;
[0083] According to the scores of the sentences in the first-draft article, an objective function and a first constraint condition are established, and the sentences of the first-draft article are integrated to form the final article;
[0084] wherein,
[0085] the objective function is:
[0086] max Σ_i w_si · x_si
[0087] x_si ∈ {0, 1}
[0088] where x_si represents the sentence indicator variable (1 if the sentence is present in the final article, 0 if it is absent), w_si represents the score of the sentence, and s_i represents the sentence number.
[0089] The first constraint is:
[0090] x_si + x_sj ≤ 1 for any two sentences s_i and s_j that repeat each other,
[0091] where s_i and s_j represent the two sentences respectively.
[0092] The present invention obtains the first-draft article in step S6. Because the first draft is assembled from paragraphs of many different sources, those paragraphs may contain repeated sentences. In the prior-art automatic writing of specific manuscripts, if two paragraphs contain repeated sentences, one of the two paragraphs is deleted, which easily removes particularly suitable paragraphs and keeps only paragraphs with low similarity rather than the most appropriate ones; because the prior art handles repeated sentences at the paragraph level, the quality of the resulting manuscript is not high. To solve this problem and improve the quality of the specific manuscript, the present invention introduces step S7, which automatically filters out the repeated sentences across the entire article.
[0093] In step S7, the similarity between two sentences is handled by the first constraint condition; however, deleting too many sentences may leave a paragraph oversimplified. For this, the present invention also provides the following improvement: step S7 further includes a second constraint condition:
[0094] Σ_{si ∈ e} x_si ≥ t · N(e) for each selected paragraph e,
[0095] where N(e) represents the total number of sentences in the selected paragraph e, and t represents the minimum proportion of sentences that must be kept in each paragraph.
[0096] By setting the second constraint condition, the above improvement ensures that at least a certain proportion of the sentences in each paragraph is retained. Preferably, t can be set to 1/3 in the second constraint, so that at least one third of the sentences in each paragraph are kept.
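A hedged sketch of the sentence-integration ILP of step S7 follows, using the PuLP solver; the similarity test deciding which sentence pairs count as "repeated" (a simple word-overlap check here) is an assumption, since the patent only states that the first constraint handles repeated sentences.

import pulp

def integrate_sentences(paragraphs, scores, t=1/3, repeated=None):
    """Select sentences for the final article by integer linear programming (step S7).

    paragraphs: list of paragraphs, each a list of sentences.
    scores:     score of each paragraph, reused as the score of its sentences ([0082]).
    repeated:   function deciding whether two sentences repeat each other (assumption).
    """
    if repeated is None:
        # Assumed similarity test: large word overlap counts as repetition.
        def repeated(a, b):
            wa, wb = set(a.split()), set(b.split())
            return len(wa & wb) / max(1, min(len(wa), len(wb))) > 0.8

    prob = pulp.LpProblem("sentence_integration", pulp.LpMaximize)
    x = {}   # x[(p, i)] = 1 if sentence i of paragraph p is kept
    for p, sentences in enumerate(paragraphs):
        for i, _ in enumerate(sentences):
            x[(p, i)] = pulp.LpVariable(f"x_{p}_{i}", cat="Binary")

    # Objective: maximize the total score of the kept sentences.
    prob += pulp.lpSum(scores[p] * x[(p, i)] for (p, i) in x)

    # First constraint: of any two repeated sentences, keep at most one.
    keys = list(x)
    for a in range(len(keys)):
        for b in range(a + 1, len(keys)):
            (pa, ia), (pb, ib) = keys[a], keys[b]
            if repeated(paragraphs[pa][ia], paragraphs[pb][ib]):
                prob += x[keys[a]] + x[keys[b]] <= 1

    # Second constraint: keep at least a proportion t of each paragraph.
    for p, sentences in enumerate(paragraphs):
        prob += pulp.lpSum(x[(p, i)] for i in range(len(sentences))) >= t * len(sentences)

    prob.solve(pulp.PULP_CBC_CMD(msg=False))
    return [[s for i, s in enumerate(sentences) if x[(p, i)].value() == 1]
            for p, sentences in enumerate(paragraphs)]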
[0097] The specific manuscripts of the present invention may be mobile phone evaluation articles, automobile evaluation articles, real estate promotion articles, and so on.
[0098] The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Any change or substitution that a person skilled in the art can readily conceive within the technical scope disclosed by the present invention shall be covered by the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.