Player and character code detection method and device for subtitle file

A character encoding detection technology, applied to players and subtitle files, that achieves accurate subtitle display and a good playback experience.

Active Publication Date: 2011-09-21
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0005] The purpose of the embodiments of the present invention is to provide a character encoding detection method for subtitle files, aiming to solve the problem that, in the prior art, the character encoding of a subtitle file must be analyzed manually.



Examples


Embodiment 1

[0035] Figure 1 shows the implementation flow of the character encoding detection method for subtitle files provided by the first embodiment of the present invention. The details are as follows:

[0036] In step S101, the character encodings that contain all of the code values in the subtitle file are selected from the set of candidate character encodings.

[0037] In the embodiment of the present invention, so that the player can correctly output subtitle files that use different character encodings, a character encoding set containing one or more character encodings is generally configured in the player in advance. When the character encoding of a subtitle file to be played needs to be detected, the set configured in the player is used as the candidate character encoding set, and the character encodings that contain all of the code values in the subtitle file are selected from it. Examples are as follow...
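The selection in step S101 can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: it assumes that "contains all code values" can be approximated by checking that the file's bytes decode without error under a candidate encoding, and the function name `select_candidate_encodings` and the sample candidate list are hypothetical.

```python
def select_candidate_encodings(data: bytes, candidates: list[str]) -> list[str]:
    """Keep only the encodings under which every byte sequence in `data` is valid.

    Assumption: strict decoding stands in for the patent's check that an
    encoding contains all code values found in the subtitle file.
    """
    selected = []
    for name in candidates:
        try:
            data.decode(name)  # strict mode raises on any invalid code value
        except UnicodeDecodeError:
            continue  # at least one code value is not part of this encoding
        selected.append(name)
    return selected


# Hypothetical usage: subtitle text saved as GBK, checked against a small candidate set.
sample = "字幕文件".encode("gbk")
print(select_candidate_encodings(sample, ["utf-8", "gbk", "big5", "latin-1"]))
```

Single-byte encodings such as Latin-1 accept every byte value, so they always pass this filter. That is one reason a second, probability-based step is needed to choose among the remaining candidates.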

Embodiment 2

[0066] Figure 2 shows the implementation flow of the character encoding detection method for subtitle files provided by the second embodiment of the present invention. Steps S202, S203 and S204 in Figure 2 are the same as steps S101, S102 and S103 in Figure 1, respectively; the only difference is that the method also includes the following step:

[0067] In step S201, a large amount of data in different languages is collected and the probability of occurrence of each character in each language is counted. From these character probabilities, the distribution probability of the code values of each character encoding is calculated, yielding a code value distribution probability table for each character encoding. Examples are as follows:

[0068] By collecting a large number of language data such as webpages and boo...
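One way to picture the table built in step S201 is the sketch below. It is an assumption-laden illustration: a "code value" is taken to be the byte sequence that one character encodes to, the corpus is a plain Python string, and the function name `build_code_value_table` is hypothetical. The patent's actual table layout is not shown in this excerpt.

```python
from collections import Counter


def build_code_value_table(corpus: str, encoding: str) -> dict[bytes, float]:
    """Map each code value (the encoded bytes of one character) to its
    relative frequency in `corpus`.

    Assumption: characters that the encoding cannot represent are skipped.
    """
    counts: Counter = Counter()
    for ch in corpus:
        try:
            counts[ch.encode(encoding)] += 1
        except UnicodeEncodeError:
            continue  # character has no code value in this encoding
    total = sum(counts.values())
    if total == 0:
        return {}
    return {code: n / total for code, n in counts.items()}


# Hypothetical usage with a tiny corpus; a real table would need far more text.
table = build_code_value_table("aab", "latin-1")
print(table)
```

In practice one such table would be built per candidate encoding from a large corpus in the languages that encoding is used for, and stored with the player.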

Embodiment 3

[0070] Figure 3 shows the implementation flow of the character encoding detection method for subtitle files provided by the third embodiment of the present invention. Steps S301 and S302 in Figure 3 are the same as steps S101 and S102 in Figure 1, respectively; the only difference is that the method also includes the following steps:

[0071] In step S303, it is judged whether the largest of the probabilities that the subtitle file corresponds to each selected character encoding is greater than a preset threshold. If yes, step S304 is executed; otherwise, step S305 is executed.

[0072] In the embodiment of the present invention, in order to make the detection result more accurate, after the probability that the subtitle file corresponds to each selected character encoding is obtained, it is judged whether the largest...
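The threshold test in step S303 can be sketched as below. What steps S304 and S305 actually do is cut off in this excerpt, so the sketch only reports which branch would be taken. The function name `check_best_candidate`, the score dictionary, and the threshold values are all hypothetical.

```python
def check_best_candidate(scores: dict[str, float], threshold: float) -> tuple[str, bool]:
    """Return the highest-scoring encoding and whether its score exceeds `threshold`.

    True corresponds to the "yes" branch (step S304), False to the
    "otherwise" branch (step S305); the branches themselves are not modeled.
    """
    best = max(scores, key=scores.get)
    return best, scores[best] > threshold


# Hypothetical probabilities for two candidate encodings.
print(check_best_candidate({"gbk": 0.8, "big5": 0.3}, threshold=0.5))
```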


Abstract

The invention is applicable to the field of multimedia processing and provides a player and a character encoding detection method and device for subtitle files. The method comprises the following steps: selecting, from a set of candidate character encodings, the character encodings that contain all of the code values in the subtitle file; calculating the probability that the subtitle file corresponds to each selected character encoding, according to the subtitle file and a pre-stored code value distribution probability table for each character encoding; and determining the character encoding with the maximum probability as the character encoding of the subtitle file. With the embodiments provided by the invention, the character encoding of a subtitle file can be detected automatically, quickly and accurately. When a video file is played, the character encoding of the corresponding subtitle file is detected automatically as the file is loaded, so the player can parse the subtitle file using that encoding and display the subtitle content accurately.
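The three steps in the abstract can be combined into a rough end-to-end sketch. The scoring formula is an assumption: this excerpt does not give the patent's probability calculation, so the sketch substitutes the average log-probability of the file's code values, with a small floor for code values missing from the table. The names `score`, `detect_encoding`, and `floor`, and the toy tables, are hypothetical.

```python
import math


def score(data: bytes, encoding: str, table: dict[bytes, float], floor: float = 1e-6) -> float:
    """Average log-probability of the code values in `data` under `table`.

    Assumption: this stands in for the patent's probability calculation.
    Raises UnicodeError if `data` is not valid in `encoding`.
    """
    text = data.decode(encoding)
    if not text:
        return float("-inf")
    total = sum(math.log(table.get(ch.encode(encoding), floor)) for ch in text)
    return total / len(text)


def detect_encoding(data: bytes, tables: dict[str, dict[bytes, float]]):
    """Pick the candidate encoding with the highest score, or None if none fits."""
    scores = {}
    for encoding, table in tables.items():
        try:
            scores[encoding] = score(data, encoding, table)
        except UnicodeError:
            continue  # encoding does not contain every code value in the file
    if not scores:
        return None
    return max(scores, key=scores.get)


# Toy tables for illustration only; real tables would come from large corpora.
toy_tables = {
    "latin-1": {b"a": 0.9, b"b": 0.1},
    "utf-8": {b"a": 0.1, b"b": 0.9},
}
print(detect_encoding(b"aaab", toy_tables))
```

Averaging the log-probabilities keeps the score independent of file length and avoids numeric underflow on long subtitle files, which is why it was chosen for this sketch.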

Description

Technical field

[0001] The invention belongs to the field of multimedia processing, and in particular relates to a player and to a character encoding detection method and device for subtitle files.

Background technique

[0002] When a player plays a video file, a corresponding subtitle file is generally produced for the video file in order to achieve a better playback effect. To make the subtitle file easy to find, the video file and its subtitle file generally use the same name. Subtitle files can use different character encodings for different languages, such as GB2312, GBK, and GB18030 for simplified Chinese, BIG5 for traditional Chinese, Latin1 for Western European languages, CJK for East Asian (Chinese, Japanese, and Korean) scripts, and UNICODE (UTF-8, UTF-16), which covers most of the world's languages. The character encoding refers to the digital representation rules of charact...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G11B27/10
Inventor: 赵东
Owner: TENCENT TECH (SHENZHEN) CO LTD