Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

129 results about "Transliteration" patented technology

Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus trans- + liter-) in predictable ways (such as α → a, д → d, χ → ch, ն → n or æ → ae). For instance, for the Modern Greek term "Ελληνική Δημοκρατία", which is usually translated as "Hellenic Republic", the usual transliteration to Latin script is "Ellēnikḗ Dēmokratía", and the name for Russia in Cyrillic script, "Россия", is usually transliterated as "Rossiya".

Foreign language abbreviation translation in an instant messaging system

A system for automatically providing foreign language abbreviation translation in an instant messaging system that identifies a foreign language abbreviation translation database based on a user indicated source culture. The foreign abbreviation translation database stores abbreviation translations for foreign language abbreviations frequently used by people from the user indicated source culture. The system locates a candidate term in an instant message and compares the candidate term to the foreign language abbreviations in the foreign language abbreviation translation database. In the event that the candidate term matches one of the foreign language abbreviations in the identified foreign language abbreviation translation database, the corresponding translation is retrieved and displayed. The comparison of the candidate term with the foreign language abbreviations may include automatically obtaining a transliteration of the candidate term. The disclosed system advantageously enables translation of foreign language abbreviations to be performed in real-time.
Owner:IBM CORP

Method and device for transliteration

A method for transliteration includes receiving input such as a word, a sentence, a phrase, and a paragraph, in a source language, creating source language sub-phonetic units for the word and converting the source language sub-phonetic units for the word to target language sub-phonetic units, retrieving ranking for each of the target language sub-phonetic units from a database and creating target language words for the word in the source language based on the target language sub-phonetic units and ranking of the each of the target language sub-phonetic units. The method further includes identifying candidate target language words based predefined criteria, and displaying candidate target language words.
Owner:SAMSUNG ELECTRONICS CO LTD

Machine Learning For Transliteration

Methods, systems, and apparatus, including computer program products, for performing transliteration between text in different scripts. In one aspect, a method includes generating a transliteration model based on statistical information derived from parallel text having first text in an input script and corresponding second text in an output script; and using the transliteration model to transliterate input characters in the input script to output characters in the output script. In another aspect, a method includes performing word level transliterations. In another aspect, a method includes using an entry-aligned dictionary of source and target script pairs, in which, whenever a particular source word is mapped to multiple target words, the dictionary includes an entry for each target word including the same source word repeated in each entry. In another aspect, a method includes using phonetic scores of words in different scripts to identify corresponding parallel text.
Owner:GOOGLE LLC

Method for transliterating and suggesting arabic replacement for a given user input

A method for suggesting transliteration for user inputs, comprising: receiving an original user input composed of alpha-numeric characters; identifying the possibility of transliterating the input; determining at least one potential transliteration by performing at least one of the following (1) replacing a sequence of characters in the original input to a possible sequence of Arabic characters (2) determining the probabilities of the potential transliterated alternatives to the user input; and electing the most likely transliteration according to some predetermined criteria (3) verifying the suggested output against a validation repository, the validation repository having a large corpus of Arabic words.
Owner:SHERIKAT LINK LETATWEER ELBARMAGUEYAT E

Transliteration for query expansion

Methods, systems, and apparatus, including computer program products, for identifying candidate synonyms of transliterated terms for query expansion. In one aspect, a method includes identifying multiple transliterated terms in a target language. For each transliterated term of the multiple transliterated terms in the target language, the transliterated term is mapped to one or more terms in a source language. For a first transliterated term of the multiple transliterated terms in the target language, one or more second transliterated terms of the multiple transliterated terms in the target language are identified as candidate synonyms of the first transliterated term, where each of the one or more second transliterated terms is mapped to at least one term in the source language that is also mapped from the first transliterated term.
Owner:GOOGLE LLC

Method and apparatus for converting phonetic language input to written language output

ActiveUS9471566B1High conversion precisionReduce and minimize numberNatural language translationSpecial data processing applicationsAutocorrectionTransliteration
Method and apparatus for converting language text input into a different language text output. One embodiment may be configured to convert Pinyin into written Chinese language text. Embodiments may provide a language input mechanism that accepts phonetic language input text and generates written language output text, and that also accepts modifications to existing input text and automatically corrects the corresponding output text. Embodiments may display one or more alternative output text words and / or sentences as candidates for the transliteration of the input text. The candidate output words may be ranked according to calculated probabilities, with a most probable candidate appearing first. The user may select from the candidate output text to build an output sentence that represents the correct or preferred transliteration of the input text. A lexicon, language model and history cache of previously selected output text words may be referenced in determining the probabilities of candidate output words.
Owner:SUN MICROSYSTEMS INC

Methods and apparatus for implementing multilingual software applications

The invention provides, in some aspects, a system for implementing a multilingual rules-based user interface including a reference tracker (executing on a digital data processor) that identifies translatable elements—by way of example, static text, enumerated variable text, images and icons—referenced by one or more user interface rules in a rules base and that generates a translation package containing those translatable elements for at least one target locale. A rules engine (which may execute on the same or another digital data processor) retrieves, from one or more transliteration rules based on the translation facilitation package, one or more translated elements that are associated with a locale and with one or more translatable elements referenced by the user interface rules. The rules engine processes those user interface rules using those one or more translated elements and, as a result, generates a user interface with the one or more translated elements.
Owner:PEGASYSTEMS

Determining a compact model to transcribe the arabic language acoustically in a well defined basic phonetic study

In the development of an automatic speech recognition (ASR) system, an extensive study of the basic phonetic alphabet is performed to collect information regarding phonology and phonetics of the language or dialect in question (modern standard Arabic or MSA in this case). In addition, terminological and transcriptional problems are identified with respect to the language or dialect in question. Next, based on feature description (rather than symbol shapes), the symbols in the literature are mapped to a single or more recent phonetic alphabet. Lastly, from a maximal set containing all the phonemes, allophones, and transliteration symbols, a reduced set is created with a compact set of phonetic alphabets. Memory consumption is greatly reduced in a computer system by using this compact set of phonetic alphabets.
Owner:SAKHR SOFTWARE COMPANY

Text analysis, transliteration and translation method and apparatus for hieroglypic, hieratic, and demotic texts from ancient egyptian

A system for translating a body of text in an initial language into a target language, that comprises an input device for imputing a body of text comprising a plurality of elements, a first analyzer for analyzing said text and identifying one or more possible characters of said initial language corresponding to respective said elements, a second analyzer for analyzing a stream of said identified characters to identify known characters or recognizable patterns thereof and assign thereto one or more potential translations into said target language, a third analyzer for analyzing the potential translations in conjunction with one or more determinatives representative of a conceptual framework of said text to determine which potential translations correspond with said conceptual framework, and an output device for outputting one or more potential translations determined to correspond with said conceptual framework.
Owner:HIERODICTION SOFTWARE

Method, system and computer program product for storing transliteration and/or phonetic spelling information in a text string class

A multi-field text string data structure is employed to encapsulating identification, meaning, and pronunciation information for a text string. A first field contains the Unicode characters for the text string in a language in which the text string is entered, which may be latin characters, characters which sound-map to latin characters, or one or more ideographs. A second field contains either the same characters or an intermediate representation of the text string, such as syllabary characters for a phonetic spelling of the characters within the first field. A third field contains either the same characters as the first field or a latin character phonetic spelling of the characters in the first field. The first field thus contains the text string in the language in which the text string was entered, while the second and third field contains information about the meaning and pronunciation of the text string. When the characters in the first field are unrecognizable to a user, or when the characters in the first field have more than one meaning or more than one pronunciation, the contents of the second and third fields allow the user to recognize the text string and / or perceive the correct meaning and pronunciation of the text string.
Owner:CERENCE OPERATING CO

System, method to generate transliteration and method for generating decision tree to obtain transliteration

InactiveUS20080270111A1Fast and efficient and accurateEasily transliterateNatural language data processingSpecial data processing applicationsProgramming languageTransliteration
The transliteration scheme developed as the present invention is contingent upon two languages, namely the keyboard language and the target language. While the invention is independent of the actual languages in question, save to the extent that they must be phonetic or pseudo-phonetic and the keyboard language must possess a script, it is nonetheless convenient to cast the invention within the framework of two specific languages. In this regard, English has been chosen as the keyboard language, and Hindi (a language widely spoken in India) has been chosen as the target language. An understanding of the current invention is possible without any knowledge of Hindi by a practitioner skilled in the art.
Owner:TACHYON TECH PVT

Method and apparatus for training transliteration model and parsing statistic model, method and apparatus for transliteration

The present invention provides a method and apparatus for training a parsing statistic model, a method and apparatus for transliteration. Said parsing statistic model is to be used in transliteration between a single-syllable language and a multi-syllable language and includes sub-syllable parsing probabilities of said multi-syllable language. Said method for training the parsing statistic model comprising: inputting a bilingual proper name list as corpus, said bilingual proper name list includes a plurality of proper names of said multi-syllable language and corresponding proper names of said single-syllable language respectively; parsing each of said plurality of proper names of multi-syllable language in said bilingual proper name list into a sub-syllable sequence using parsing rules; determining whether said parsing is correct according to the corresponding proper name of said single-syllable language in said bilingual proper name list; and training said parsing statistic model base on the result of parsing that is determined as correct.
Owner:KK TOSHIBA

Chinese name recognition method based on recurrent neural network

The invention provides a Chinese name recognition method based on a recurrent neural network. The method includes the steps of S1, corpus pretreatment; S2, word vector training, wherein a word2vec tool is used for word vector training; S3, Chinese name recognition model training, wherein data obtained after processing in S1 and word vectors obtained after training in S2 are used for training a neural network model; S4, name recognition and aftertreatment, wherein the model obtained after modeling in S3 carries out name recognition on test corpuses, names recognized by the model are subjected to aftertreatment through a context rule and a diffusion algorithm, and finally names are obtained. By means of the method, the complexity of feature selection during Chinese name recognition can be effectively lowered, rich syntax and grammar information included in Chinese texts is fully utilized through the word vectors, the generalization ability of the model is improve accordingly, Japanese names and foreign transliteration names are recognized at the same time, and the extent of Chinese name recognition is widened.
Owner:DALIAN UNIV OF TECH

Computer-implemented methods and systems for entering and searching for non-Roman-alphabet characters and related search systems

A computer-implemented method for selecting a desired Roman or non-Roman-alphabet character or objects from a set of non-Roman characters or objects may include steps of providing an association database that includes, for each non-Roman-alphabet character of the set, a Roman alphabet or other phonetic transliteration associated with each said non-Roman-alphabet character and a plurality of entries that are associated with each said non-Roman-alphabet character; receiving a phonetic transliteration of the desired non-Roman-alphabet character or data object and at least one associated entry that is associated with the desired non-Roman-alphabet character or other similar symbolic input; accessing the association database and identifying as candidate characters those characters of the set that are associated with the received phonetic transliteration and with the at least one received associated entry; if a number of candidate characters is greater than one, receiving additional associated entries and repeating the accessing and identifying step until a number of candidate characters is narrowed down to a single candidate character, and providing the single candidate character as the desired non-Roman-alphabet character. Also, derived from the principles described above, this invention includes a variety of methods for improving the efficiency of search engines through use of associations and other means of providing context for the item(s) being searched.
Owner:ORACLE INT CORP

Method and apparatus to transliterate text using a portable device

A method, apparatus, and computer instructions for a method in a portable device for transliterating text. An image of the text is generated using a camera function in the portable device. The image is sent with an identification of a source language and a target language to a transliteration service using a wireless communications link. A response is received from the transliteration service, wherein the response contains a transliteration of the text in the target language. The transliteration is then presented on the portable device. The presentation may be, for example, text displayed on the portable device or in the form of speech presented audibly to the user.
Owner:NUANCE COMM INC

System and method for automatically translating an imaged surface of an object

A system that incorporates teachings of the subject disclosure may include, for example, receiving a graphical representation of an object surface having a number of inscriptions indicative of information encoded upon the object surface. The graphical representation of the object surface is transcribed by the system to determine markings from the inscriptions. A number of alternative corresponding symbols are determined by the system. The alternative corresponding cuneiform symbols are determined from a number of the markings. A number of alternative transliterations are determined by the system. The alternative transliterations are determined from of a symbol of the multiple alternative corresponding symbols. A number of alternative translations are also determined by the system of a transliteration of the multiple translations transliterations. Other embodiments are disclosed.
Owner:LEARNING SITES

Word input method and input method system having translation function

The invention provides a word input method and an input method system having a translation function. The word input method comprises the following steps of: receiving a coded string input by a user; dissolving the coded string and obtaining a syllable unit sequence of a source language corresponding to the coded string; converting the syllable unit sequence of the source language and obtaining a syllable unit sequence of a target language; converting the syllable unit sequence of the target language and obtaining a transliteration result, and displaying the transliteration result; and outputting the appointed transliteration result according to an instruction of the user. In the word input method, a mapping relation between pronunciations of the two languages is constructed, so an effect that the user can obtain a correct spelling result through pronunciation can be achieved; under the help of the input method system, the user who forgets spelling can input a correctly spelled word or sentence; furthermore, some words which cannot be paraphrased or the paraphrase is not correct enough can be translated more accurately.
Owner:BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

Computer-implemented methods and systems for entering and searching for non-Roman-alphabet characters and related search systems

A computer-implemented method for selecting a desired Roman or non-Roman-alphabet character or objects from a set of non-Roman characters or objects may include steps of providing an association database that includes, for each non-Roman-alphabet character of the set, a Roman alphabet or other phonetic transliteration associated with each said non-Roman-alphabet character and a plurality of entries that are associated with each said non-Roman-alphabet character; receiving a phonetic transliteration of the desired non-Roman-alphabet character or data object and at least one associated entry that is associated with the desired non-Roman-alphabet character or other similar symbolic input; accessing the association database and identifying as candidate characters those characters of the set that are associated with the received phonetic transliteration and with the at least one received associated entry; if a number of candidate characters is greater than one, receiving additional associated entries and repeating the accessing and identifying step until a number of candidate characters is narrowed down to a single candidate character, and providing the single candidate character as the desired non-Roman-alphabet character. Also, derived from the principles described above, this invention includes a variety of methods for improving the efficiency of search engines through use of associations and other means of providing context for the item(s) being searched.
Owner:ORACLE INT CORP

An improved institutional electronic messaging system

An electronic messaging system for both participating institutions and nonparticpating institutions and their populations. When an institution is participating in the system of the present invention, both message senders and recipients are authenticated and known and verified before any communication can take place. Messages for institutions not currently participating in the system are converted to printed mail for delivery to the non-participating institution using traditional postal systems. The present invention also teaches novel message processing techniques that take advantage of the electronic format of the messages to provide automated security and translation and / or transliteration services and further teaches the novel feature of marking incoming messages with a code to assist in accurately and secure delivery of reply messages to the appropriate party.
Owner:MICROWORKS

Multimedia transliteration method and system

The invention provides a multimedia transliteration method is applied to a multimedia transliteration system and comprises the following steps of: S1, receiving a demonstration manuscript, and constructing a key information tree of the demonstration manuscript; S2, receiving voice data, carrying out voice identification on the voice data, and obtaining transliteration texts of the voice data; S3, synchronizing the voice data and the transliteration texts with the demonstration manuscript by means of the key information tree; and S4, displaying the demonstration manuscript with the synchronized voice data and transliteration texts to a user. The user can hear the voices of a speaker and see the texts transliterated by the voices of the speaker while seeing the demonstration manuscript, and furthermore, the transliteration texts are segmented according to sub-themes included in each page of the transliteration texts, the transliteration texts of the same sub-theme is in one segment, and the transliteration texts of different sub-theme serve as different segments, so that the user can conveniently understand the transliteration texts, and the experience of the user is further improved.
Owner:ANHUI IFLYTEK INTELLIGENT SYST

Method, apparatus, and program for transliteration of documents in various indian languages

ActiveUS20050043941A1Easily transliterate the selected word or phrase into a target language or scriptNatural language translationSpeech analysisWeb browserTransliteration
A transliteration mechanism is provided that allows a user to view a text in one Indian language, to highlight a word or phrase, and to easily transliterate the selected word or phrase into a target language or script. The mechanism may be an application, an applet, or a plug-in to another application, such as a Web browser. The target language and / or script may be stored in a user profile. Preferably, the source language may be any known Indian language in any known script.
Owner:IBM CORP

Reverse Chinese-English transliteration method and device thereof

The invention relates to a reverse Chinese-English transliteration method and a device thereof. A Chinese transliteration name to be translated is converted into a pinyin sequence, and a statistics transliteration module is used for generating transliteration candidates; the transliteration candidates are revised into real English words by a revision module, and the revision module uses real English words collected from a great quantity of webpages to form a vocabulary; revised real English words are used to be inquiry to verify translation results, and webpage resources obtained with a search engine are used to rearrange the revised transliteration candidates; words which appear as a named entity role on the webpage are given high marks so as to filter common English words. The method can overcome the problems that a statistic model loses aphonic syllables or chooses wrong same pronouncing syllables in the process of transliteration and the like, and effectively improves precision rate of transliteration. Experiences prove that the precision rate of transliteration is improved by 17.55% in open beta.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI

Text analysis, transliteration and translation method and apparatus for hieroglypic, hieratic, and demotic texts from ancient egyptian

A system for translating a body of text in an initial language into a target language, that comprises an input device for imputing a body of text comprising a plurality of elements, a first analyzer for analyzing said text and identifying one or more possible characters of said initial language corresponding to respective said elements, a second analyzer for analyzing a stream of said identified characters to identify known characters or recognizable patterns thereof and assign thereto one or more potential translations into said target language, a third analyzer for analyzing the potential translations in conjunction with one or more determinatives representative of a conceptual framework of said text to determine which potential translations correspond with said conceptual framework, and an output device for outputting one or more potential translations determined to correspond with said conceptual framework.
Owner:HIERODICTION SOFTWARE

Phonetic transliteration card display

A phonetic transliteration card display includes a display board to which a plurality of display cards can be secured removably. The card display enables a person who speaks a first language, but not a second language, to speak simple phrases in the second language and be understood by a listener who speaks only the second language. This result is accomplished by displaying a source phrase in the first language on each display card, along with a target phrase that constitutes a translation of the source phrase in the second language. A phonetic transliteration of the target phrase is displayed in the first language. When the phonetic transliteration is read aloud by the speaker, the listener will hear, and therefore understand, the target phrase. The cards can be color-coded by subject matter, if desired. The display board contains a chamber that permits a number of display cards to be stored. The display board is constructed such that a number of similar boards can be connected to each other. This permits a number of display cards to be viewed simultaneously.
Owner:CREATIVE ACTION

Cyrillic to Latin script transliteration system and method

Embodiments of the present invention relate to methods, systems and computer-readable media for transliteration between Cyrillic and Latin script in a software product. An embodiment of this transliteration system and method comprises loading a text of characters and words in one of a Cyrillic or Latin script into a character transliteration module. This module converts each character in the one of a Cyrillic or Latin script into a corresponding opposite transliterated Cyrillic or Latin character. Then each word is examined in a word capitalization and exception module that compares each transliterated word against a set of predetermined grammatical rules to determine whether there are exceptions in capitalization. If there are, then appropriate internal capitalization of characters is added. Each word of the text to be transliterated is sequentially examined and converted until all words have been examined.
Owner:MICROSOFT TECH LICENSING LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products