
News content recognition method, model training method and device

A content recognition and model training technique, applied in the field of text recognition, that addresses the low recognition accuracy of new and unpopular words in news content.

Pending Publication Date: 2021-07-16
SHENZHEN IPANEL TECH LTD

AI Technical Summary

Problems solved by technology

For example, when the leadership of a city changes, recognition accuracy on news videos from the local TV station is low, because the new leader's name has not appeared in the original news corpus.
As another example, before the Chinese film "The Wandering Earth" was released in 2019, the name of its author, Liu Cixin, rarely appeared in the news. After the release, the name appeared far more often as the film became popular, but because it had rarely appeared in the news corpus, recognition accuracy was low.

Examples

Detailed Description of the Embodiments

[0038] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0039] Referring to Figure 1, an embodiment of the present invention discloses a training method for a news content recognition model, intended to improve the recognition accuracy of new words and unpopular words in news content. The training method includes:

[0040] Step S01: Use web crawler technology to fetch, from the Internet, news text content from within a preset time period.

[0041] Specifically, a web crawler is a program or script that automatically grabs web page information a...
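
As a rough illustration of Step S01, the sketch below keeps only pages whose publication time falls inside a preset window and reduces each page to plain text. It is not the patent's implementation: the `Page` record, the `extract_text` and `latest_news_corpus` helpers, and the way a publication time is obtained are all assumptions made for the example, and the network fetch itself is omitted.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> content."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.parts.append(data.strip())


def extract_text(html):
    """Return the visible text of an HTML string, joined by single spaces."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


@dataclass
class Page:
    """Hypothetical record for one already-fetched news page."""
    url: str
    published: datetime
    html: str


def latest_news_corpus(pages, now, window):
    """Keep the text of pages published within `window` before `now`."""
    cutoff = now - window
    return [extract_text(p.html) for p in pages if cutoff <= p.published <= now]
```

A caller would pass the pages collected by its own crawler together with, for instance, `timedelta(days=7)` as the preset time period; the length of the window is a tuning choice that the text above does not fix.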

Abstract

The invention discloses a news content recognition method, a model training method, and a device, which aim to improve the recognition accuracy of new words and unpopular words in news content. The news content recognition model training method comprises the following steps: using web crawler technology to capture, from a network, news text content from within a preset time period before the present; taking the currently captured news text content as the latest news corpus and combining it with pre-stored background pictures and a word stock to synthesize a training set; and training a news content recognition model with the training set.
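
The synthesis step can be pictured with the following minimal sketch, which pairs short text snippets from the latest corpus with background pictures to form sample specifications. Several things here are assumptions rather than details from the abstract: the "word stock" is treated as a set of characters that can be rendered, `SampleSpec` and `build_training_specs` are hypothetical names, and the actual rendering of text onto a background image and the model training are left out.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class SampleSpec:
    """Hypothetical description of one synthetic training sample."""
    text: str        # label text to be rendered
    background: str  # path of the pre-stored background picture


def build_training_specs(corpus_lines, backgrounds, word_stock, max_len=20, seed=0):
    """Pair corpus snippets with randomly chosen backgrounds.

    Characters outside `word_stock` are dropped (assumption: they cannot be
    rendered), and long lines are split into chunks of at most `max_len`.
    """
    rng = random.Random(seed)
    specs = []
    for line in corpus_lines:
        cleaned = "".join(ch for ch in line if ch in word_stock)
        for start in range(0, len(cleaned), max_len):
            chunk = cleaned[start:start + max_len]
            specs.append(SampleSpec(text=chunk, background=rng.choice(backgrounds)))
    return specs
```

Because the snippets come from recently captured news, names that were missing or rare in the original corpus would show up among the labels, which is the effect the abstract is after.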

Description

Technical field

[0001] The present invention relates to the technical field of character recognition, and more specifically to a news content recognition method, a model training method, and a device.

Background

[0002] The continuous emergence of new words in natural language is an objective law. The rapid development of society and the economy, increasingly frequent international exchanges, and especially the widespread use of the Internet provide ample room for new words to emerge and spread. In addition, some unpopular words may become hot words because of sudden surges in public attention.

[0003] In the news field, vocabulary is updated frequently. Because new words have never appeared in the corpus, and unpopular words rarely appear in it, recognition based on the original news corpus has low accuracy for them. For example, if the news video of a TV station in a certain city changes, the recognition accu...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/33, G06F16/951, G06F40/258, G06K9/00, G06N3/04, G06N3/08
CPC: G06F16/951, G06F16/3344, G06N3/084, G06V30/40, G06N3/044, G06N3/045
Inventors: 徐佳宏, 朱吕亮
Owner: SHENZHEN IPANEL TECH LTD