Patents

Literature

Patsnap Eureka AI that helps you search prior art, draft patents, and assess FTO risks, powered by patent and scientific literature data.

183 results about "Language identification" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

In natural language processing, language identification or language guessing is the problem of determining which natural language given content is in. Computational approaches to this problem view it as a special case of text categorization, solved with various statistical methods.

Method and system for language identification

InactiveUS7818165B2Computationally efficientNatural language data processingSpecial data processing applicationsFeature setRelevant information

A method and system for language identification are provided. The system includes a feature set of a plurality of character strings of varying length with associated information. The associated information includes one or more significance scores for a character string for one or more of a plurality of languages. Means are provided for detecting character strings from the feature set within a token from an input text. The system uses a finite-state device and the associated information is provided as glosses at the final nodes of the finite-state device for each character string. The associated information can also include significance scores based on linguistic rules.

Method and system for language identification

Method and system for language identification

Method and system for language identification

Owner:LINKEDIN

Method, apparatus and computer program product for providing flexible text based language identification

ActiveUS7552045B2Accurate analysisHighly configurable multilingualSpeech analysisNatural language data processingProcessing elementHuman language

An apparatus for providing flexible text based language identification includes an alphabet scoring element, an n-gram frequency element and a processing element. The alphabet scoring element may be configured to receive an entry in a computer readable text format and to calculate an alphabet score of the entry for each of a plurality of languages. The n-gram frequency element may be configured to calculate an n-gram frequency score of the entry for each of the plurality of languages. The processing element may be in communication with the n-gram frequency element and the alphabet scoring element. The processing element may also be configured to determine a language associated with the entry based on a combination of the alphabet score and the n-gram frequency score.

Method, apparatus and computer program product for providing flexible text based language identification

Method, apparatus and computer program product for providing flexible text based language identification

Method, apparatus and computer program product for providing flexible text based language identification

Owner:NOKIA TECH OY

Voice-controlled data system

ActiveUS20070198273A1Speech recognitionControl dataSpeech identification

A voice-controlled data system may include a data storage unit including media files having associated file identification data, and a vocabulary generating unit generating phonetic data corresponding to the file identification data, the phonetic data being supplied to a speech recognition unit as a recognition vocabulary, where one of the media files may be selected according to a recognized speech control command on the basis of the generated phonetic data, where the file identification data include a language identification part for identifying the language of the file identification data, and where the vocabulary generating unit generates the phonetic data for the file identification data of a media file based on its language identification part.

Voice-controlled data system

Voice-controlled data system

Voice-controlled data system

Owner:HARMAN BECKER AUTOMOTIVE SYST

Language identification from short strings

ActiveUS20160357728A1Natural language translationSemantic analysisNatural language processingShort string

Systems and processes for language identification from short strings are provided. In accordance with one example, a method includes, at a first electronic device with one or more processors and memory, receiving user input including an n-gram and determining a similarity between a representation of the n-gram and a representation of a first language. The representation of the first language is based on an occurrence of each of a plurality of n-grams in the first language and an occurrence of each of the plurality of n-grams in a second language. The method further includes determining whether the similarity between the representation of the n-gram and the representation of the first language satisfies a threshold.

Language identification from short strings

Language identification from short strings

Language identification from short strings

Owner:APPLE INC

Language model adaptation via network of similar users

InactiveUS6484136B1Natural language translationSpeech recognitionComputer usersHuman language

A language recognition system, method and program product for recognizing language based input from computer users on a network of connected computers. Each computer includes at least one user based language model trained for a corresponding user for automatic speech recognition, handwriting recognition, machine translation, gesture recognition or other similar actions that require interpretation of user activities. Network computer users are clustered into classes of similar users according to user similarities such as, nationality, profession, sex, age, etc. User characteristics are collected by sensors and from databases and, then, distributed over the network during user activities. Language models with similarities among similar users on the network are identified. The language models include a language model domain, with similar language models being clustered according to their domains. Language models identified as similar are modified in response to user production activities. After modification of one language model, other identified similar language models are compared and adapted. Also, user data, including information about user activities and language model data, is transmitted over the network to other similar users. Language models are adapted only in response to similar user activities, when these activities are recorded and transmitted over the network. Language models are given a global context based on similar users that are connected together over the network.

Language model adaptation via network of similar users

Language model adaptation via network of similar users

Language model adaptation via network of similar users

Owner:NUANCE COMM INC

Statistical method and apparatus for learning translation relationships among phrases

InactiveUS7249012B2Natural language translationSpecial data processing applicationsLanguage identificationHuman language

The present invention learns phrase translation relationships by receiving a parallel aligned corpus with phrases to be learned identified in a source language. Candidate phrases in a target language are generated and an inside score is calculated based on word association scores for words inside the source language phrase and candidate phrase. An outside score is calculated based on word association scores for words outside the source language phrase and candidate phrase. The inside and outside scores are combined to obtain a joint score.

Statistical method and apparatus for learning translation relationships among phrases

Statistical method and apparatus for learning translation relationships among phrases

Statistical method and apparatus for learning translation relationships among phrases

Owner:MICROSOFT TECH LICENSING LLC

Identifying keyword occurrences in audio data

ActiveUS20100179811A1Speech recognitionSpeech identificationHuman language

Occurrences of one or more keywords in audio data are identified using a speech recognizer employing a language model to derive a transcript of the keywords. The transcript is converted into a phoneme sequence. The phonemes of the phoneme sequence are mapped to the audio data to derive a time-aligned phoneme sequence that is searched for occurrences of keyword phoneme sequences corresponding to the phonemes of the keywords. Searching includes computing a confusion matrix. The language model used by the speech recognizer is adapted to keywords by increasing the likelihoods of the keywords in the language model. For each potential occurrences keywords detected, a corresponding subset of the audio data may be played back to an operator to confirm whether the potential occurrences correspond to actual occurrences of the keywords.

Identifying keyword occurrences in audio data

Identifying keyword occurrences in audio data

Identifying keyword occurrences in audio data

Owner:CENT DE RECH INFORMATIQUE DE MONTREAL

Language identification using recurrent neural networks

ActiveUS20180067918A1Semantic analysisSpeech analysisAlgorithmMachine learning

Systems and processes for language identification using recurrent neural networks are provided. An example method includes, at an electronic device, receiving a first typed character of a character sequence and determining a character context of the first typed character based on the first typed character and a second typed character of the character sequence. The method further includes determining a confidence level that the character sequence is associated with a language of a plurality of languages based on the character context of the first typed character, and determining whether the confidence level exceeds a threshold, in accordance with a determination that the confidence level exceeds the threshold, providing the language as a candidate language, and in accordance with a determination that the confidence level does not exceed the threshold, forgoing providing the language as a candidate language.

Language identification using recurrent neural networks

Language identification using recurrent neural networks

Language identification using recurrent neural networks

Owner:APPLE INC

Efficient language identification

ActiveUS20060184357A1Improve performanceReduce in quantityAnti-theft devicesCharacter and pattern recognitionPresent methodMore language

A system and methods of language identification of natural language text are presented. The system includes stored expected character counts and variances for a list of characters found in a natural language. Expected character counts and variances are stored for multiple languages to be considered during language identification. At run-time, one or more languages are identified for a text sample based on comparing actual and expected character counts. The present methods can be combined with upstream analyzing of Unicode ranges for characters in the text sample to limit the number of languages considered. Further, n-gram methods can be used in downstream processing to select the most probable language from among the languages identified by the present system and methods.

Efficient language identification

Efficient language identification

Efficient language identification

Owner:MICROSOFT TECH LICENSING LLC

Language identification from short strings

ActiveUS10127220B2Natural language translationSemantic analysisShort stringUser input

Systems and processes for language identification from short strings are provided. In accordance with one example, a method includes, at a first electronic device with one or more processors and memory, receiving user input including an n-gram and determining a similarity between a representation of the n-gram and a representation of a first language. The representation of the first language is based on an occurrence of each of a plurality of n-grams in the first language and an occurrence of each of the plurality of n-grams in a second language. The method further includes determining whether the similarity between the representation of the n-gram and the representation of the first language satisfies a threshold.

Language identification from short strings

Language identification from short strings

Language identification from short strings

Owner:APPLE INC

Language Identification Equipment, Translation Equipment, Translation Server, Language Identification Method, and Translation Processing Method

InactiveUS20080281577A1Natural language data processingSpecial data processing applicationsCode tableCollation

In some preferred embodiments, a language identification apparatus, comprises a storing means 32 configured to store undefined character code lists of a plurality of languages to which no character is allotted in a character code table of each language, a collating means 34 configured to collate a character code of each character contained in a character string of a language identification target with the undefined character code list of each language stored in the storing means, and an identification means 34 configured to identify a language in which a character corresponding to the undefined character code is not contained in the character string as a language of the character string among the plurality of languages as a result of collation by the collating means.

Language Identification Equipment, Translation Equipment, Translation Server, Language Identification Method, and Translation Processing Method

Language Identification Equipment, Translation Equipment, Translation Server, Language Identification Method, and Translation Processing Method

Language Identification Equipment, Translation Equipment, Translation Server, Language Identification Method, and Translation Processing Method

Owner:IMPULSE JAPAN

Intelligent household control system based on multichannel interaction manner

InactiveCN102932212AReduce false positive rateIncrease audio input gainCharacter and pattern recognitionData switching by path configurationControl systemComputer module

The invention discloses an intelligent household control system based on a multichannel interaction manner. The intelligent household control system comprises a controller and a control network, wherein the controller comprises a front-end sensor, a signal preprocessing module and a core processing chip, wherein the front-end sensor comprises a front-end camera and an audio sensor; the signal preprocessing module comprises a video preprocessing module and an audio preprocessing module; the core processing chip comprises a lip language identification module, a human face identification module and a hand gesture identification module and a voice identification module; and the control network comprises a controller communication module arranged on the controller and a household appliance communication module arranged on a household appliance. The intelligent household control system disclosed by the invention makes use of advantages of various technologies in respective application field, learns from each other, makes various technical modules relatively independent and mutually fused and realizes stability and operation comfortability.

Intelligent household control system based on multichannel interaction manner

Intelligent household control system based on multichannel interaction manner

Intelligent household control system based on multichannel interaction manner

Owner:SOUTH CHINA UNIV OF TECH

Enhanced multilingual speech recognition system

InactiveUS20050197837A1Increase the number ofImprove speech recognition accuracySpeech recognitionSpeech synthesisSpeech soundHuman language

A speech recognition system comprising: a language identification unit for identifying the language of a text item entry; at least one separate pronunciation modelling unit including a phoneme set and pronunciation model for at least one language; means for activating the pronunciation modelling unit including the phoneme set and pronunciation model for the language corresponding to the language identified in the language identification unit for obtaining a phoneme transcription for the entry; and a multilingual acoustic modelling unit for creating a recognition model for the entry.

Enhanced multilingual speech recognition system

Enhanced multilingual speech recognition system

Enhanced multilingual speech recognition system

Owner:NOKIA CORP

Method and Apparatus for Multi-Lingual End-to-End Speech Recognition

ActiveUS20190189111A1Enhanced Feature RepresentationUnnecessary to prepareMathematical modelsSpeech recognitionData setMulti language

A method for training a multi-language speech recognition network includes providing utterance datasets corresponding to predetermined languages, inserting language identification (ID) labels into the utterance datasets, wherein each of the utterance datasets is labelled by each of the language ID labels, concatenating the labeled utterance datasets, generating initial network parameters from the utterance datasets, selecting the initial network parameters according to a predetermined sequence, and training, iteratively, an end-to-end network with a series of the selected initial network parameters and the concatenated labeled utterance datasets until a training result reaches a threshold.

Method and Apparatus for Multi-Lingual End-to-End Speech Recognition

Method and Apparatus for Multi-Lingual End-to-End Speech Recognition

Method and Apparatus for Multi-Lingual End-to-End Speech Recognition

Owner:MITSUBISHI ELECTRIC RES LAB INC

Method and apparatus for multi-language user selection for system user interface

ActiveUS20070017971A1Complete banking machinesAcutation objectsMulti languageDisplay device

A method and apparatus for a credit card and debit card transaction terminal including a processor, a display and a memory. A card reader is configured to read data stored on a portable memory medium. An operator interface is produced by an operator interface program and executable by the processor. The operator interface being displayable on the display in an operator language. A customer interface is produced by a customer interface program and executable by the processor. The customer interface being displayable in a customer language. A language identification module is stored in the memory and executable on the processor. The language identification module is configured to select the operator language and to select a customer language in which displayable for the operator and customer, respectively. The operator and customer languages are selected from a plurality of languages stored in the module. The operator and customer languages are different.

Method and apparatus for multi-language user selection for system user interface

Method and apparatus for multi-language user selection for system user interface

Method and apparatus for multi-language user selection for system user interface

Owner:GOLDMINE WORLD

Head carried stereo vision hand gesture identifying device

InactiveCN1648840ARealize human trackingAchieving identifiabilityInput/output for user-computer interactionCharacter and pattern recognitionLanguage identificationIdentification device

The present invention provides a head carried stereo vision hand gesture identifying device and belongs to the field of computer vision and man-computer interaction technology. The head carried stereo vision hand gesture identifying device consists of four parts: stereo vision imaging head, stereo vision board, head carried display and worn computer. The stereo vision imaging head obtains the hand grey image of the wearer; the stereo vision board calculates the real-time the dense parallax map and transmits the dense parallax map and the grey image to the worn computer; and the worn computer completes hand tracking and hand gesture identifying operation and showing the calculation results in the head carried display. The present invention can realize the hand tracking and hand gesture identifying under worn computer environment, complete hand gesture mouse, hand language identification, hand written character identification, etc. and intelligent man-computer interface based on hand gesture.

Head carried stereo vision hand gesture identifying device

Head carried stereo vision hand gesture identifying device

Head carried stereo vision hand gesture identifying device

Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Method of Multilingual Speech Recognition by Reduction to Single-Language Recognizer Engine Components

InactiveUS20050187758A1Natural language data processingSpeech recognitionComputation complexityLanguage speech

In some speech recognition applications, not only the language of the utterance is not known in advance, but also a single utterance may contain words in more than one language. At the same time, it is impractical to build speech recognizers for all expected combinations of languages. Moreover, business needs may require a new combination of languages to be supported in short period of time. The invention addresses this issue by a novel way of combining and controlling the components of the single-language speech recognizers to produce multilingual speech recognition functionality capable of recognizing multilingual utterances at a modest increase of computational complexity.

Method of Multilingual Speech Recognition by Reduction to Single-Language Recognizer Engine Components

Method of Multilingual Speech Recognition by Reduction to Single-Language Recognizer Engine Components

Method of Multilingual Speech Recognition by Reduction to Single-Language Recognizer Engine Components

Owner:KHASIN ARKADY

Language identification for documents containing multiple languages

InactiveUS20100125447A1Convenient and accurateShorten the lengthNatural language translationSpecial data processing applicationsHypothesisDocument preparation

Multiple nonoverlapping languages within a single document can be identified. In one embodiment, for each of a set of candidate languages, a set of non-overlapping languages is defined. The document is analyzed under the hypothesis that the whole document is in one language and that part of the document is in one language while the rest is in a different, non-overlapping language. Language(s) of the document are identified based on comparing these competing hypotheses across a number of language pairs. In another embodiment, transitions between non-overlapping character sets are used to segment a document, and each segment is scored separately for a subset of candidate languages. Language(s) of the document are identified based on the segment scores.

Language identification for documents containing multiple languages

Language identification for documents containing multiple languages

Language identification for documents containing multiple languages

Owner:MICRO FOCUS LLC

Privacy-preserving text language identification using homomorphic encryption

InactiveUS9288039B1Digital data protectionCommunication with homomorphic encryptionMore languagePrivacy preserving

A system and method for text language identification allow private information of a server and a client to be kept secret from each other. An encrypted score for each of a plurality of languages is received by the server from the client. The encrypted scores are generated by homomorphic addition of encrypted frequencies of n-grams in a list of n-grams extracted from text. The unencrypted list is not provided to the server. The encrypted frequencies of the n-grams in the list are extracted using encrypted resources which, for each of the plurality of languages, include an encrypted frequency for each of a set of n-grams. At the server, the encrypted scores are decrypted to generate unencrypted scores and information is provided to the client based on the unencrypted scores from which the client is able to identify a language for the text.

Privacy-preserving text language identification using homomorphic encryption

Privacy-preserving text language identification using homomorphic encryption

Privacy-preserving text language identification using homomorphic encryption

Owner:XEROX CORP

Method and apparatus for performing machine translation using a unified language model and translation model

InactiveUS6990439B2Natural language translationSpeech recognitionMachine translation systemHuman language

The present invention is a method and apparatus for processing a phrase in a first language for translation to a second language. A plurality of possible linguistic patterns are identified in the second language, that correspond to the phrase in the first language. For each of the patterns identified, a probability for the pattern is calculated, based on a combination of the language model probability for the pattern and a translation model probability for the pattern. In one embodiment, an output is also provided which is indicative of a translation of the phrase in the first language to the second language based upon the translation probabilities calculated for the patterns.

Method and apparatus for performing machine translation using a unified language model and translation model

Method and apparatus for performing machine translation using a unified language model and translation model

Method and apparatus for performing machine translation using a unified language model and translation model

Owner:MICROSOFT TECH LICENSING LLC

Method, Apparatus and Computer Program Product for Providing Flexible Text Based Language Identification

ActiveUS20080147380A1Accurate analysisHighly configurableSpeech analysisNatural language data processingMore languageProcessing element

An apparatus for providing flexible text based language identification includes an alphabet scoring element, an n-gram frequency element and a processing element. The alphabet scoring element may be configured to receive an entry in a computer readable text format and to calculate an alphabet score of the entry for each of a plurality of languages. The n-gram frequency element may be configured to calculate an n-gram frequency score of the entry for each of the plurality of languages. The processing element may be in communication with the n-gram frequency element and the alphabet scoring element. The processing element may also be configured to determine a language associated with the entry based on a combination of the alphabet score and the n-gram frequency score.

Method, Apparatus and Computer Program Product for Providing Flexible Text Based Language Identification

Method, Apparatus and Computer Program Product for Providing Flexible Text Based Language Identification

Method, Apparatus and Computer Program Product for Providing Flexible Text Based Language Identification

Owner:NOKIA TECHNOLOGLES OY

A Human-Computer Interaction Recognition System Applied to Remote Information Service

InactiveCN102298694ANotable featuresSignificant progressCharacter and pattern recognitionSpeech identificationVisual perception

The invention belongs to the field of computers, in particular to a human-computer interaction identification system applied to remote information services, characterized in that the service system providing network information services or its client has at least one human-computer interaction identification device; The computer-interactive recognition device includes: an auditory collection module for collecting user voice signals; a visual collection module for collecting user face images; a first preprocessing module, a face feature extraction module, and a face recognition module; a second Preprocessing module, speech feature extraction module, speech recognition module; third preprocessing module, facial expression feature extraction module, facial expression recognition module; fourth preprocessing module, language emotion feature extraction module, language emotion recognition module; The gender recognition fusion module is used to fuse the recognition results of the face recognition module and the voice recognition module to form a recognition result that combines face features and voice features; the implicit recognition fusion module is used to integrate the facial expression recognition module It is fused with the recognition results of the language emotion recognition module to form a recognition result that combines facial expression features and voice emotion features.

A Human-Computer Interaction Recognition System Applied to Remote Information Service

A Human-Computer Interaction Recognition System Applied to Remote Information Service

Owner:GUANGDONG IKER DIGITAL TECH

Coordinated training-based dual-language named entity identification method

ActiveCN103853710AEasy to identifyImprove recallSpecial data processing applicationsData setNamed-entity recognition

The invention discloses a dual-language coordinated training-based named entity identification method, and belongs to the technical field of natural language processing in computer science. Parallel Chinese and English sentence datasets are considered as two different view of a dataset for dual-language coordinated training, a log-linear model is used for correcting projection marks in a projection process, and named entity dual-language aligned annotation consistency is introduced as a measurement index for mark confidence estimation when the model is used for predicting an unseen case. Compared with the prior art, the method has the advantages that the domain dependence of named entity identification is reduced, the advantages of dual-language identification are fused, the problem of partial identification ambiguity in single-language identification is solved, and the method is particularly suitable for the dual-language named entity synchronous identification of large-scale language materials.

Coordinated training-based dual-language named entity identification method

Coordinated training-based dual-language named entity identification method

Coordinated training-based dual-language named entity identification method

Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Modeling method and modeling device for language identification

InactiveCN101894548AImprove accuracyReduce operational complexitySpeech recognitionAlgorithmLanguage type

The embodiment of the invention provides a modeling method for language identification, which comprises the following steps of: inputting voice data, preprocessing the voice data to obtain a characteristic sequence, mapping a characteristic vector to form a super vector, performing projection compensation on the super vector, and establishing a training language model through an algorithm of a support vector machine; and adopting the steps to obtain a super vector to be measured of the voice to be measured, performing the projection compensation on the super vector to be measured, grading the super vector to be measured by utilizing the language model, and identifying language types of the voice to be measured. The embodiment of the invention also provides a modeling device for the language identification, which comprises a voice preprocessing module, a characteristic extraction module, a multi-coordinate system origin selection module, a characteristic vector mapping module, a subspace extraction module, a subspace projection compensation module, a training module and an identification module. According to the method and the device which are provided by the embodiment of the invention, information which is invalid to the identification in high-dimension statistics is removed, the correction rate of the language identification is improved, and the computational complexity on an integrated circuit is reduced.

Modeling method and modeling device for language identification

Modeling method and modeling device for language identification

Modeling method and modeling device for language identification

Owner:TSINGHUA UNIV

Method and apparatus for presenting caller identification information with geographical and/or source language information

InactiveUS20050084078A1Multiplex system selection arrangementsAutomatic call-answering/message-recording/conversation-recordingTelecommunicationsCommunication device

A method, apparatus, and computer instructions for processing calls. Initiation of a call by a user is detected in the communications device. In response to detecting the call, a geographic identification based on a number assigned to the communications device and / or a language identification based on a user of the communications device is identified. The geographic identification and the language identification are transmitted with caller identification information for use by a called party.

Method and apparatus for presenting caller identification information with geographical and/or source language information

Method and apparatus for presenting caller identification information with geographical and/or source language information

Method and apparatus for presenting caller identification information with geographical and/or source language information

Owner:IBM CORP

Language identification method of scene text image in combination with global and local information

ActiveCN110334705ATake advantage ofImprove accuracyCharacter and pattern recognitionNatural language data processingLocal information systemsPattern recognition

The invention discloses a language identification method of a scene text image in combination with global and local information. Basic features of a character image are extracted, and then global andlocal feature representations are extracted respectively; the global extraction branch uses global maximum pooling to express the whole graph as a vector, and category score prediction is carried out;probability prediction is performed on the local blocks of the image by the local aggregation branches respectively, and then the series of probability distributions are combined to obtain a categoryprediction score of a local level; and finally, global and local prediction scores are dynamically fused according to the branch prediction conditions to obtain a final identification result. According to the method, overall features and local differentiated features of the character images are noticed at the same time, and end-to-end training can be achieved in one step. Compared with an existing technology utilizing local features, the method has the advantages that the local differentiated features can be accurately extracted, excellent effects are achieved in the aspects of accuracy, operation efficiency and universality, and high practical application value is achieved.

Language identification method of scene text image in combination with global and local information

Language identification method of scene text image in combination with global and local information

Language identification method of scene text image in combination with global and local information

Owner:HUAZHONG UNIV OF SCI & TECH

Web Page Translation Device and Web Page Translation Method

InactiveUS20080172218A1Natural language translationSpecial data processing applicationsTerminal equipmentHTML

A WEB page translation method includes a description language identification means 33 for discriminating whether the translation target portion of the WEB page accessed by a user via a terminal device 1 is described in HTML, a first translation means 35 for translating the character string displayed on the WEB page into another language on a translation target portion discriminated to be described in HTML by the description language identification means 33, and a second translation means 36 for translating the character string displayed on the WEB page into another language on a translation target portion discriminated to be described not in HTML by the description language identification means 33.

Web Page Translation Device and Web Page Translation Method

Web Page Translation Device and Web Page Translation Method

Web Page Translation Device and Web Page Translation Method

Owner:IMPULSE JAPAN

System language setting method and device

InactiveCN105511857ASpeech recognitionExecution for user interfacesIntelligent equipmentSpeech sound

The invention discloses a system language setting method and device, and belongs to the field of language processing. The system language setting method comprises steps of acquiring voice information; carrying out language identification to the voice information, and determining the language kind corresponding to the voice information; and switching the system language of an intelligent device to a language corresponding to the obtained language kind. According to the invention, a microphone of the intelligent device is employed to acquire and identify the voice information to determine the language kind corresponding to the voice information;, and the system language of the intelligent device is switched to the language corresponding to the obtained language kind, the intelligent device can directly switch the system language in dependence on the voice information obtained by the microphone, and therefore the problem that the manner of obtaining the language by means of SIM card information is not accurate is solved, and the effect that the intelligent device switches the system language in dependence on the user's actual language usage case, and the switching accuracy of the system language of the intelligent device is improved is achieved.

System language setting method and device

System language setting method and device

System language setting method and device

Owner:XIAOMI INC

Intelligent cross-language speech recognition and conversion method

ActiveCN107945805ASolve efficiency problemsSolve balance problemsSpeech recognitionSpeech identificationLanguage speech

An intelligent cross-language speech recognition and conversion method claimed by the present invention classifies speech data based on the language family, establishes inter-language distances, initially determines a language family to which to-be-recognized speech data belongs, and then performs language refined identification in the language family. After the first language identification error, neighbor language families can be further found based on established inter-language distances, and the language family is established; and after the language is identified, the language is convertedinto a standardized text, then the text is subjected to word segmentation, word frequency statistics and other processing to establish the mapping relationship, which facilitates subsequent voice query. On the one hand, the method effectively solves the disadvantages that the efficiency and the speed cannot be balanced in the current speech recognition. On the other hand, the processing of the speech text conversion is more reasonable, and the establishment of mapping relationship makes the recognition conversion efficiency more accurate.

Intelligent cross-language speech recognition and conversion method

Intelligent cross-language speech recognition and conversion method

Intelligent cross-language speech recognition and conversion method

Owner:北京烽火万家科技有限公司

Balance method of actual scene linguistic data and finite state network linguistic data

InactiveCN101593518AEasy to identifyComprehensive vocabularySpeech recognitionData balancingSpeech identification

The invention provides a balance method of actual scene linguistic data and finite state network (FSN) linguistic data. In order to train the language model of a continuous speech recognizer, training linguistic data is produced according to the application field of the speech recognizer. Linguistic data mainly has two resources: one part is an actual scene linguistic data which is obtained by tidying records in actual scene, and the other part is FSN linguistic data generated with a finite state network syntactic rule method. Two linguistic data balance methods are mainly researched in the invention, and the invention provides a method that probability comparison of the keyword shared by the actual scene linguistic data and the FSN linguistic data is taken as basis, and a certain multiple of parts of actual scene linguistic data are used for expanding the FSN linguistic data to obtain the final method of language model training linguistic data. The language model for training linguistic data, which is obtained with the method, greatly improves the recognition performance of the continuous speech recognizer.

Balance method of actual scene linguistic data and finite state network linguistic data

Balance method of actual scene linguistic data and finite state network linguistic data

Balance method of actual scene linguistic data and finite state network linguistic data

Owner:INST OF AUTOMATION CHINESE ACAD OF SCI

Popular searches

Linguistic rule System usage Finite state Multiple language Computer program N-gram Data system Data store Voice data Speech control