Method and system for distinguishing language of document image
A technique for discriminating the language of a document image, applied in character and pattern recognition, instruments, computing, etc. It addresses two problems: Simplified Chinese and Traditional Chinese are difficult to tell apart, and language discrimination is unacceptably slow.
Embodiment Construction
[0072] First, some basic concepts used in this specification are explained below.
[0073] - Language family / language set
[0074] In this specification, a language family / language set refers to either the Chinese-based or the Latin-based group of languages. The Chinese-based language family includes three East Asian languages: Chinese (both Simplified and Traditional), Japanese, and Korean. The Latin-based language family mainly includes European languages.
[0075] - Connected domain
[0076] In an undirected graph, a connected domain is a maximal connected subgraph. Two vertices are in the same connected domain if and only if there is a path between them. When the graph is drawn, each connected domain can be drawn separately, with empty space between them. A non-empty graph has at least one connected domain.
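The definition above can be illustrated on a binarized document image. The following is a minimal sketch, not the method of this specification. It assumes that foreground pixels are the vertices of the graph and that neighboring foreground pixels are joined by an edge. The function name `connected_domains`, the `connectivity` parameter, and the sample `page` array are hypothetical and introduced only for illustration.

```python
from collections import deque


def connected_domains(image, connectivity=8):
    """Group foreground pixels (truthy values) into connected domains.

    Assumption for illustration: pixels are graph vertices, and two
    foreground pixels share an edge when they are 4- or 8-neighbors.
    Returns a list of domains, each a list of (row, col) tuples.
    """
    if connectivity == 4:
        offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    elif connectivity == 8:
        offsets = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                   if (dr, dc) != (0, 0)]
    else:
        raise ValueError("connectivity must be 4 or 8")

    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    domains = []

    for r in range(rows):
        for c in range(cols):
            if not image[r][c] or seen[r][c]:
                continue
            # Breadth-first search collects every pixel reachable from (r, c),
            # i.e. every vertex that has a path to the starting vertex.
            queue = deque([(r, c)])
            seen[r][c] = True
            domain = []
            while queue:
                cr, cc = queue.popleft()
                domain.append((cr, cc))
                for dr, dc in offsets:
                    nr, nc = cr + dr, cc + dc
                    if (0 <= nr < rows and 0 <= nc < cols
                            and image[nr][nc] and not seen[nr][nc]):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            domains.append(domain)
    return domains


if __name__ == "__main__":
    # Hypothetical 4x5 binary image: 1 = ink, 0 = background.
    page = [
        [1, 1, 0, 0, 0],
        [1, 0, 0, 1, 1],
        [0, 0, 0, 1, 0],
        [0, 1, 0, 0, 0],
    ]
    for index, domain in enumerate(connected_domains(page)):
        print(index, sorted(domain))
```

The choice between 4- and 8-neighbor adjacency changes which pixels count as connected: two pixels that touch only diagonally form one domain under 8-connectivity and two domains under 4-connectivity. The text does not state which convention is intended, so the default of 8 here is an assumption.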
[0077] - Circular / Circular Connected Domain
[0078] The shape of a circular / circular connected domain resembles a circle or ellipse, not a rectangle...