50 results about "Tibetan language" patented technology

Standard Tibetan is the most widely spoken form of the Tibetic languages. It is based on the speech of Lhasa, an Ü-Tsang (Central Tibetan) dialect. For this reason, Standard Tibetan is often called Lhasa Tibetan. Tibetan is an official language of the Tibet Autonomous Region of the People's Republic of China.

On-line identification method and recognition system for 'ding' of handwriting Tibet character

The invention discloses an on-line handwritten Tibetan recognition method and recognition system. The method comprises the following steps: preprocessing the handwritten character pattern; extracting the direction features of the handwriting points and the features of the space occupied by the characters, then reducing and transforming the high-dimensional features by linear discriminant analysis to obtain the recognition features of the Tibetan characters; speeding up a standard quadratic classifier to obtain a high-speed quadratic classifier, which is used to recognize the Tibetan characters; associating syllables with the recognized characters; and finally entering the standard Tibetan syllables into the text. Through character preprocessing, feature extraction, classifier design and syllable association, the invention forms a complete on-line handwritten Tibetan recognition system. The method and system can be used for on-line handwritten Tibetan input on computers and mobile phones, and offer a high recognition rate, fast input and stable performance.
Owner:NORTHWEST UNIVERSITY FOR NATIONALITIES
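
The "direction features of the handwriting points" mentioned in the abstract above can be illustrated with a small sketch. The abstract does not say how the features are computed, so the 8-bin, length-weighted direction histogram below is an assumption chosen for illustration, and the function name `direction_histogram` is hypothetical. It is not the patented method.

```python
import math


def direction_histogram(stroke, bins=8):
    """Return a normalized histogram of pen-movement directions for one stroke.

    `stroke` is a list of (x, y) points in writing order. Each segment between
    consecutive points votes for the bin nearest to its direction, weighted by
    the segment length. (Illustrative assumption, not taken from the patent.)
    """
    counts = [0.0] * bins
    bin_width = 2 * math.pi / bins
    for (x0, y0), (x1, y1) in zip(stroke, stroke[1:]):
        dx, dy = x1 - x0, y1 - y0
        length = math.hypot(dx, dy)
        if length == 0:
            continue  # skip repeated points
        angle = math.atan2(dy, dx) % (2 * math.pi)
        # Shift by half a bin so that bin 0 is centered on angle 0.
        index = int((angle + bin_width / 2) / bin_width) % bins
        counts[index] += length
    total = sum(counts)
    return [c / total for c in counts] if total else counts


# A purely horizontal stroke puts all of its weight in bin 0.
features = direction_histogram([(0, 0), (1, 0), (2, 0)])
```

In a full system, vectors like this would be concatenated per character and passed to the dimensionality reduction and classification stages the abstract describes.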

Tibetan language-based multi-modal emotion calculation method and system

The embodiment of the invention provides a Tibetan language-based multi-modal emotion calculation method and system, and a server. The method comprises: first, obtaining the Tibetan language data to be classified and collecting video signals, voice signals and text information from that data; then extracting high-level video features, high-level voice features and text features respectively, and learning high-level fusion features from them with a deep learning model; and finally, classifying the high-level fusion features with an SVM and storing them in the classification emotion corpus. The method thereby fills the gap in sentiment analysis for the Tibetan language and provides a basic corpus for Tibetan multi-modal sentiment analysis. Recognizing sentiment from these three modalities benefits the development of Tibetan multi-modal sentiment analysis, promotes the natural language processing and intelligent sentiment recognition capabilities for Tibetan, and improves artificial-intelligence information processing for the language. In addition, fusing the three modalities can effectively increase the sentiment recognition rate for Tibetan language data.
Owner:QINGHAI UNIVERSITY

Tibetan language thesis copying detection method and Tibetan language thesis copying detection system based on Tibetan language sentence levels

The invention discloses a Tibetan language thesis copying detection method and a Tibetan language thesis copying detection system based on Tibetan language sentence levels. The Tibetan language thesis copying detection method includes: subjecting Tibetan language text characters to code conversion and noise removal preprocessing; segmenting a text into text blocks according to sentences through boundary identification of Tibetan language sentences and establishing a temporary table of segmented text blocks; extracting and computing text features from a sentence-document inverted index table and the temporary table according to the number of the sentences to obtain sentence similarity; establishing an adjacency list in accordance with the sentence similarity, computing text block similarity and detecting copying of two Tibetan language theses according to a text block similarity value. The Tibetan language thesis copying detection system comprises a Tibetan language thesis copying detection device and a database, wherein the Tibetan language thesis copying detection device is connected to a client terminal server through the Internet, and the database is connected to the server and used for storing Tibetan language theses. The Tibetan language thesis copying detection device comprises a preprocessing module for code conversion and noise removal of the text characters, a temporary table module for constructing the segmented text blocks, an extracting module for constructing sentence text features and a copying detection module for detecting whether the theses have similar copied data or not.
Owner:QINGHAI UNIV FOR NATIONALITIES
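
The sentence-level comparison in the abstract above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented procedure: it assumes sentences end at the Tibetan shad (U+0F0D) and syllables are separated by the tsheg (U+0F0B), it uses Jaccard overlap of syllable sets as a stand-in for the unspecified sentence similarity, and it indexes syllables rather than whole sentences. All function names and the 0.6 threshold are hypothetical, and the text-block (adjacency list) stage is omitted.

```python
from collections import defaultdict

SHAD = "\u0f0d"   # Tibetan mark commonly ending a clause or sentence
TSHEG = "\u0f0b"  # Tibetan mark separating syllables


def split_sentences(text):
    """Split text into sentences at the shad mark."""
    return [s.strip() for s in text.split(SHAD) if s.strip()]


def syllables(sentence):
    """Return the set of syllables in a sentence."""
    return {s.strip() for s in sentence.split(TSHEG) if s.strip()}


def build_index(docs):
    """Build an inverted index: syllable -> {(doc_id, sentence_no), ...}."""
    index = defaultdict(set)
    sentences = {}
    for doc_id, text in docs.items():
        for n, sent in enumerate(split_sentences(text)):
            syl = syllables(sent)
            sentences[(doc_id, n)] = syl
            for s in syl:
                index[s].add((doc_id, n))
    return index, sentences


def similar_sentences(query_text, index, sentences, threshold=0.6):
    """Return (query_sentence_no, (doc_id, sentence_no), score) matches."""
    hits = []
    for n, sent in enumerate(split_sentences(query_text)):
        syl = syllables(sent)
        # Only sentences sharing at least one syllable are candidates.
        candidates = set().union(*(index.get(s, set()) for s in syl))
        for key in sorted(candidates):
            other = sentences[key]
            score = len(syl & other) / len(syl | other)
            if score >= threshold:
                hits.append((n, key, score))
    return hits


# Placeholder Latin "syllables" joined with the real Tibetan delimiters.
stored = TSHEG.join(["ka", "kha", "ga"]) + SHAD + TSHEG.join(["nga", "ca"]) + SHAD
index, sentences = build_index({"thesis_1": stored})
matches = similar_sentences(TSHEG.join(["ka", "kha", "ga"]) + SHAD, index, sentences)
```

The inverted index keeps the comparison from being quadratic in the number of stored sentences, since only sentences sharing a syllable with the query are scored.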

Semantic ontology creation and vocabulary expansion method for Tibetan language

The invention relates to a method for processing Chinese minority scripts, in particular to a semantic ontology creation and vocabulary expansion method for the Tibetan language. The method comprises the following steps: (1) establishing an upper-level ontology on the basis of the HowNet Chinese dictionary; (2) expanding the conceptual synonyms appearing in the upper-level ontology by using definitions in an electronic dictionary; (3) applying a conceptual hyponymy pattern-matching algorithm to the upper-level ontology in a multi-language ontology library to expand the concepts of the upper-level ontology; (4) searching for conceptual synonyms in the expanded ontology; (5) ranking the results from highest to lowest similarity on the basis of an ontology conceptual lexical semantic similarity algorithm; (6) revising the ranked results and editing the ontology. In this method, the upper-level ontology is established on the basis of the HowNet Chinese dictionary, the levels of different concepts are defined according to the hyponymy relations in the ontology, and new semantic words can be obtained from those relations, so that the vocabulary of the existing Tibetan language ontology is expanded and the accuracy of Tibetan language information processing is greatly increased.
Owner:MINZU UNIVERSITY OF CHINA

Intelligent rule multi-language type interpretation system and creation method for same

The invention discloses an intelligent rule multi-language type interpretation system and a creation method for the same, applied to the smart home field. The system lays a foundation for solving the current problem of artificial intelligence in the smart home field, which at present is essentially a remote control system or local intelligence. The system is an integrated language interpretation environment comprising a rule editor, rule lexical checking, rule grammar checking, intermediate code generation, executable code generation, executable code file management, rule debugging, a virtual machine execution environment and the like. The system and the method are simple and easy to learn: the language grammar resembles that of human natural languages, so a user can understand and use them without any professional knowledge. Apart from operators, a whole program can be written entirely in Chinese (or in Tibetan, Mongolian and the like). Execution efficiency is high, and the executable code is very close to machine language. By combining such a language system with a smart home cloud service system, the user can speak to the system and simply ask it to complete a complex process automatically. Given current market conditions, the technology has broad market prospects.
Owner:谢玮琦

Method and system for constructing Tibetan emotional dictionary based on Tibetan language features

Inactive | CN107122465A | Sentiment Accurate Analysis | Emotionally accurate determination | Natural language data processing | Special data processing applications | Pattern recognition | Microblogging
The present invention discloses a method and a system for constructing a Tibetan emotional dictionary based on Tibetan language features. The method comprises: matching the Chinese vocabulary ontology with emotion classification with a Chinese-Tibetan dictionary to obtain a Tibetan basic emotional dictionary; carrying out corpus training on the preliminarily collected Tibetan microblogging information by using the Word2vec tool to obtain a synonym set of the corpus training vocabulary, and taking the synonym set as an extended candidate word set; calculating the weight variance of each extended candidate word; and screening the extended candidate words according to the weight variance to obtain emotional extension words. According to the method for constructing the Tibetan emotional dictionary based on Tibetan language features disclosed by the present invention, the Chinese vocabulary ontology is matched with a Chinese-Tibetan dictionary to obtain the Tibetan basic emotional dictionary, corpus training and screening is carried out on the Tibetan microblogging information by using the Word2vec tool, and extension is carried out based on the Tibetan basic emotional dictionary, so that more Tibetan emotional vocabulary is provided, and emotion of the current Tibetan microblogging information expression is accurately analyzed.
Owner:MINZU UNIVERSITY OF CHINA
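
The screening step in the abstract above ("calculating the weight variance of each extended candidate word" and filtering on it) can be sketched as follows. The abstract does not define the weights or the direction of the filter, so this sketch makes two loudly labeled assumptions: each candidate carries one weight per emotion category (for example, an embedding similarity to that category's seed words), and a candidate is kept when the variance of those weights is high, meaning it leans clearly toward one category. The function name, the threshold and the sample data are all hypothetical.

```python
from statistics import pvariance


def screen_candidates(candidate_weights, min_variance=0.02):
    """Keep candidates whose per-category weights vary enough to be informative.

    `candidate_weights` maps word -> {emotion_category: weight}.
    Returns word -> the category with the largest weight.
    (The weight definition and the threshold are illustrative assumptions.)
    """
    kept = {}
    for word, weights in candidate_weights.items():
        values = list(weights.values())
        if len(values) < 2:
            continue  # variance is not meaningful for a single category
        if pvariance(values) >= min_variance:
            kept[word] = max(weights, key=weights.get)
    return kept


# Placeholder words and made-up weights, purely for illustration.
candidates = {
    "word_a": {"joy": 0.9, "anger": 0.1, "sadness": 0.1},
    "word_b": {"joy": 0.3, "anger": 0.3, "sadness": 0.35},
}
extension_words = screen_candidates(candidates)
```

Under these assumptions `word_a` is kept and assigned to `joy`, while `word_b` is dropped because its weights are nearly uniform across categories.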

Tibetan word vector representation method fusing components and character information

The invention belongs to the technical field of Tibetan language information processing and discloses a Tibetan word vector representation method fusing components and character information. The method comprises: a model TCCWEI, in which the components and the character information are fused directly into the Tibetan word vector representation; a model TCCWEII, in which the component information is first fused into the vector representation of the character, and the component and character information is then fused into the Tibetan word vector representation; and a model TCCWEII + P, in which character position information is additionally fused into the TCCWEII model. Compared with the current best word vector representation, the proposed TCCWE model improves by 8% on the Tibetan similarity evaluation set TWordSim215 and by 7% on the Tibetan relatedness evaluation set TWordRel215, and the similar and related words it retrieves show very high semantic similarity and relatedness.
Owner:QINGHAI NORMAL UNIV

Method for converting gestures into Chinese-Tibetan bilingual speech

Inactive | CN108665898A | Facilitate daily communication activities | Troubleshoot voice output issues | Speech recognition | Speech synthesis | Computer-aided | Broadcasting
The present invention provides a method for converting gestures into Chinese-Tibetan bilingual speech. The method comprises: using sample data to perform gesture recognition on the gestures to be recognized, obtaining the meanings of the gestures; expressing those meanings in Chinese-Tibetan bilingual form to obtain a semantic definition of the gestures, and generating context-dependent annotations of the gestures from that definition; obtaining a speaker-dependent Tibetan or Mandarin model through speaker adaptive training on a specific speaker's Mandarin or Tibetan training corpus; and synthesizing Tibetan or Mandarin speech from the speaker-dependent model and the context-dependent annotations. The method can convert input static and dynamic gestures into Mandarin or Tibetan speech, facilitates daily communication between people with speech impairments and people without, solves the speech output problem in such communication, and can further be applied to areas such as computer-assisted instruction for deaf and speech-impaired learners and bilingual broadcasting of TV programs.
Owner:NORTHWEST NORMAL UNIVERSITY

Service system for automatic generation of Tibetan reading questions in primary schools

The invention relates to a service system for automatic generation of Tibetan reading questions in primary schools. The system comprises a Tibetan reading corpus construction model and a Tibetan reading text question generation model. The corpus construction model obtains the Tibetan reading corpus by extracting feature data from primary school Tibetan articles and applying a purpose-designed mixed multi-strategy text screening model. The question generation model comprises an encoder and a decoder: the encoder uses a bidirectional RNN with an attention mechanism, and the decoder uses a unidirectional RNN with an attention mechanism and a copy mechanism. The mixed multi-strategy text screening model can select Tibetan-language articles suitable for primary school reading from large-scale encyclopedic Tibetan-language texts. The end-to-end automatic question generation model addresses problems such as the scarcity of Tibetan reading materials in primary schools, their slow updating and the small volume of manually written questions, and promotes the development of Tibetan language teaching in ethnic minority regions.
Owner:MINZU UNIVERSITY OF CHINA

Translation management and evaluation method for Chinese and Tibetan language data under domestic operating system

The invention provides a translation management and evaluation method for Chinese and Tibetan language data under a domestic operating system. The method can manage, as a whole, the tens of millions of Tibetan language data items belonging to the large amount of software in a Linux-based domestic operating system, effectively reducing the cost of maintaining Chinese and Tibetan language support in that system. The method comprises the following steps: analyzing the source code of all software under the Linux-based domestic operating system to obtain source entries and constructing a source entry data set; in response to an update of a source entry's translation data, constructing a translation mapping relationship between the newly added translation data and the source entry; according to the translation mapping relationship, checking across software in the domestic operating system for source entries to be processed, and updating or adding translation data for the software in which such entries are detected; and evaluating the translation quality of the newly added translation data and outputting an evaluation result of translation correctness.
Owner:NAT UNIV OF DEFENSE TECH
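
The cross-software check described in the abstract above can be sketched in a few lines. This is only an illustration under assumptions: the abstract does not specify data structures, so the nested dictionaries, the function names `find_pending_entries` and `apply_pending`, and the placeholder translation strings are all hypothetical. The translation quality evaluation step is not covered.

```python
def find_pending_entries(installed, translation_map):
    """Find source entries whose translation is missing or out of date.

    `installed` maps software name -> {source_entry: current_translation or None}.
    `translation_map` maps source_entry -> newest translation.
    Returns software name -> {source_entry: translation_to_apply}.
    """
    pending = {}
    for software, entries in installed.items():
        for source, current in entries.items():
            newest = translation_map.get(source)
            if newest is not None and newest != current:
                pending.setdefault(software, {})[source] = newest
    return pending


def apply_pending(installed, pending):
    """Write the pending translations back into each software's entries."""
    for software, updates in pending.items():
        installed[software].update(updates)


# Placeholder strings stand in for real Tibetan translations.
installed = {
    "editor": {"Open": None, "Save": "<bo:Save>"},
    "browser": {"Open": "<bo:Open, old>"},
}
translation_map = {"Open": "<bo:Open>"}

pending = find_pending_entries(installed, translation_map)
apply_pending(installed, pending)
```

Keying the mapping on the source entry is what lets one new translation propagate to every program that contains the same entry, which is the cost saving the abstract points to.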