Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for processing multilingual text

A multi-language and text technology, applied in the search field, to achieve the effect of increasing the search hit rate and improving the quality of language processing

Active Publication Date: 2021-10-08
EMC IP HLDG CO LLC
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] With the continuous deepening of internationalization, multilingual search is becoming more and more important, but at the same time multilingual search is also a big challenge

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for processing multilingual text
  • Method and device for processing multilingual text
  • Method and device for processing multilingual text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Preferred embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although preferred embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure can be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

[0019] As used herein, the term "comprise" and its variants mean open inclusion, ie "including but not limited to". The term "or" means "and / or" unless otherwise stated. The term "based on" means "based at least in part on". The terms "one example embodiment" and "one embodiment" mean "at least one example embodiment." The term "another embodiment" means "at least one further embodiment". The terms "first", "second", etc. may refer to di...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present disclosure relate to methods and apparatus for processing multilingual text. According to an embodiment of the present disclosure, for multilingual text including content in multiple languages, the multilingual text may be encoded using Unicode. The method also includes dividing the multilingual text into multiple parts based on the Unicode of the multilingual text, and the contents in the multiple parts have different languages. In addition, multilingual texts can be processed on a multi-part basis.

Description

technical field [0001] Embodiments of the present disclosure generally relate to the field of search, and in particular to methods and devices for processing multilingual texts. Background technique [0002] With the continuous deepening of internationalization, multilingual search is becoming more and more important, but at the same time multilingual search is also a big challenge. Taking an enterprise search system as an example, at present, language is usually detected based on the front part of the entire text, and the detected main language is used for language processing. Consequently, users who have documents in multiple languages ​​fail to hit desired documents in their searches because the content of interest in those documents is not processed in the corresponding language but in other languages ​​in the document. This problem is very common for users of Chinese, Japanese and Korean (CJK) languages, as they use more than one official language in their daily work. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/38
CPCG06F16/38G06F40/263G06F40/53G06F40/58G06F16/951
Inventor 黄坤武张磊陈超刘晶晶代洪涛
Owner EMC IP HLDG CO LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products