Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

31 results about "Phonetic form" patented technology

In the field of linguistics, specifically in syntax, phonetic form (PF), also known as phonological form or the articulatory-perceptual (A-P) system, is a certain level of mental representation of a linguistic expression, derived from surface structure, and related to Logical Form. Phonetic form is the level of representation wherein expressions, or sentences, are assigned a phonetic representation, which is then pronounced by the speaker. Phonetic form takes surface structure as its input, and outputs an audible (or visual, in the case of sign languages), pronounced sentence.

Distributed real time speech recognition system

InactiveUS20050080625A1Facilitates query recognitionAccurate best responseNatural language translationData processing applicationsFull text searchTime system
A real-time system incorporating speech recognition and linguistic processing for recognizing a spoken query by a user and distributed between client and server, is disclosed. The system accepts user's queries in the form of speech at the client where minimal processing extracts a sufficient number of acoustic speech vectors representing the utterance. These vectors are sent via a communications channel to the server where additional acoustic vectors are derived. Using Hidden Markov Models (HMMs), and appropriate grammars and dictionaries conditioned by the selections made by the user, the speech representing the user's query is fully decoded into text (or some other suitable form) at the server. This text corresponding to the user's query is then simultaneously sent to a natural language engine and a database processor where optimized SQL statements are constructed for a full-text search from a database for a recordset of several stored questions that best matches the user's query. Further processing in the natural language engine narrows the search to a single stored question. The answer corresponding to this single stored question is next retrieved from the file path and sent to the client in compressed form. At the client, the answer to the user's query is articulated to the user using a text-to-speech engine in his or her native natural language. The system requires no training and can operate in several natural languages.
Owner:NUANCE COMM INC

Speech-to-speech translation system with user-modifiable paraphrasing grammars

The present invention discloses a speech-to-speech translation device which allows one or more users to input a spoken utterance in one language, translates the utterance into one or more second languages, and outputs the translation in speech form. Additionally, the device allows for translation both directions, recognizing inputs in the one or more second languages and translating them back into the first language. The device recognizes and translates utterances in a limited domain as in a phrase book translation system, so the translation accuracy is essentially 100%. By limiting the domain the system increases the accuracy of the speech recognition component and thus the accuracy of the overall system. However unlike other phrase book systems, the device also allows wide variations and paraphrasing in the input, so that the user is much more likely to find the desired phrase from the stored list of phrases. The device paraphrases the input to a basic canonical form and performs the translation on that canonical form, ignoring the non-essential variations in the surface form of the input. The device can provide visual and / or auditory feedback to confirm the recognized input and makes the system usable for non-bilingual users with absolute confidence.
Owner:EHSANI FARZAD +2

On-line touch-and-talk pen system and touch reading method thereof

InactiveCN103236195ARealize random readingPracticalElectrical appliancesData informationLine search
The invention discloses an on-line touch-and-talk pen system, which comprises a scanning module, a storage module, an on-line searching module and an output module, wherein the scanning module is used for scanning feature information of books to be read; the storage module is used for receiving and storing the feature information transmitted by the scanning module; the on-line searching module is used for realizing connection of the storage module and a network on-line high-volume database; data information matched with the feature information stored in the storage module is acquired through on-line searching; and the output module is used for outputting data information acquired by the on-line searching module, and playing the data information in a phonetic form. The on-line touch-and-talk pen system is combined with an on-line searching function, the books are scanned, and the features of the books are extracted, and through on-line searching matching, the corresponding information data are acquired, are fed back to the storage module, are output and are subjected to voice playing, so that random touch reading on any book is realized, the application range is enlarged, and the practicability is strong. The invention also provides a touch reading method of the on-line touch-and-talk pen system.
Owner:SUN YAT SEN UNIV

Chinese word similarity detection algorithm based on pronunciation, shape and meaning

The invention provides a Chinese word similarity detection algorithm based on pronunciation, shape and meaning, which detects the overall similarity of Chinese character strings by comprehensively considering three characteristics of pronunciation, shape and meaning of Chinese characters, and comprises the following steps of: firstly, converting the pinyin of each Chinese character of the Chinesecharacter strings s1 and s2 into a corresponding phonetic code, and converting each Chinese character of the Chinese character strings s1 and s2 into a shape code; then respectively calculating the phonetic code similarity and the shape code similarity between the Chinese character strings s1 and s2, then independently calculating the similarity of the Chinese character string meanings, and finally setting contribution parameters for an application scene in combination with the phonetic form meanings to calculate the overall similarity of the final Chinese character strings s1 and s2. The algorithm can meet complex application scenarios, can be applied to detection of the repetition degree of structured data items, especially in the case of manual input errors, and can also be applied to detection of sensitive words hidden in wrongly written characters and the like. Compared with a Chinese character similarity detection algorithm of the same type, the detection effect on the Chinese character string similarity is greatly enhanced.
Owner:HAINAN UNIVERSITY

Chatting equipment, information output method of chatting equipment, chatting system and information interactive method of chatting system

The invention discloses chatting equipment, an information output method of the chatting equipment, a chatting system and an information interactive method of the chatting system. The chatting equipment comprises a controller, a display panel and a numerical matrix library; the display panel is electrically connected with the controller; the numerical matrix library is in communication connection with the controller; the numerical matrix library comprises a plurality of numerical matrixes in one-to-one correspondence with a plurality of emotion icons; the display panel comprises a plurality of lifting parts; the chatting equipment is used for receiving the emotion icons; and the controller is used for controlling the lifting parts at the corresponding positions on the display panel to lift according to positions and values of various elements in the matrixes to be displayed, so that the received emotion icons can be displayed on the display panel. According to the invention, the emotion icons can be vividly displayed on a hardware structure; the display manner of chatting information is enriched; practical chatting scenes are presented more really; furthermore, received texts or voice can be output in a voice form; and thus, chatting information is relatively rapid and convenient to obtain.
Owner:SHANGHAI CHUANGGONG COMM TECH

Display device, text error correction method and server

The embodiment of the invention provides a display device, a text error correction method and a server, the display device comprises a display and a controller, the controller is configured to: in response to receiving a voice command input by a user, performing voice conversion on the voice command to obtain a text to be corrected; controlling a display to display the to-be-corrected text; performing error correction on the to-be-corrected text based on the confusion set with similar phonetic forms and a graph attention mechanism to obtain an initial error-corrected text, performing candidate recall on the to-be-corrected text and the initial error-corrected text, and obtaining a final error-corrected text according to a sorting result of the recalled texts; and controlling the display to refresh the to-be-corrected text into the final corrected text. According to the embodiment of the invention, the pronunciation similar knowledge graph and the shape similar knowledge graph are generated according to the confusion set corresponding to the to-be-corrected text, the pinyin and font related knowledge of Chinese characters is fused into the graph neural network, and the deep semantic information among the similar characters is extracted, so that the knowledge of similar pronunciation and shape can be effectively utilized, and the correct rate and recall rate of error detection and correction are improved.
Owner:HISENSE VISUAL TECH CO LTD

WeChat public platform-based Chinese-Mongolian corpus crowdsourcing construction method

ActiveCN110472948ASolving the problem of collecting open-domain natural spoken corpusImprove experienceOffice automationResourcesSpoken languageThe Internet
The invention discloses a WeChat public platform-based Chinese-Mongolian corpus crowdsourcing construction method, and belongs to the field of corpus resource construction. The method comprises the following specific operation steps: 1) obtaining a multi-body cut open domain original corpus; 2) screening and filtering the users participating in the translation task through a Mongolian level test questionnaire; 3) sending a crowdsourcing translation task to a user following the WeChat official account in a subscription account pushing mode; 4) enabling each WeChat client to translate one or more source sentences into Mongolian and feed back the Mongolian to the background in a voice form; 5) evaluating the corpus quality in a manner of combining background administrator auditing and crowdsourcing quality evaluation to realize corpus quality control. The WeChat public platform-based Chinese-Mongolian corpus crowdsourcing construction method completes corpus collection online, is simple in interaction, good in user experience and high in user participation degree, effectively solves the problem of collecting open domain natural spoken language corpora in a real Mongolian language environment, and shows an extremely high practical prospect under an Internet mobile platform.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Leadless group discussion system

PendingCN114881024AImprove interview skillsImprove interview efficiencyNatural language data processingSpeech recognitionPart of speechSpeech sound
The invention discloses a leadless group discussion system, which comprises the following steps of: converting a captured speech in a voice form into a speech in a text form, performing word segmentation processing on the speech in the text form, and filtering out stop words in the speech and retaining words with specified parts of speech in the word segmentation processing process to obtain a filtered word segmentation set; constructing a vertex set by using the segmented words in the filtered segmented word set, constructing an edge between any two points of the vertex set by adopting a co-occurrence relationship to obtain candidate keyword graphs of the filtered segmented word set, calculating the weight of each vertex set in the candidate keyword graphs, and sorting the weights of the candidate keyword graphs from large to small to obtain the candidate keyword graphs of the filtered segmented word set. The N vertexes with the weight values in the top are selected as the keywords of the speaking, so that an interviewer or an examiner is assisted to quickly understand whether the speaking of the interviewer deviates from the theme or not by extracting the keywords, and the interview ability of the interviewer or the interview efficiency of the interviewer is assisted to be improved.
Owner:CENT SOUTH UNIV

Elderly user use disorder reporting and solving method of mobile phone terminal

The invention relates to software engineering and data mining technologies, in particular to an elderly user use obstacle report and solution method for a mobile phone terminal, which comprises the following steps: 1, identifying an action, converting the action into a character string, and further processing the character string only when an identification result shows that a problem is encountered; 2, searching a character string, and converting a search result into a vector; 3, performing data training, converting all known search results into vectors according to the constructed Chinese word bank, and constructing a prediction model; and 4, carrying out data prediction, predicting a search result of the new character string through a prediction model, giving a solution to a problem, and returning the solution in a voice form. According to the method, the elder user does not need to perform input operation from beginning to end, and all operations are established on the basis of automatic elder action recognition. When the old people encounter difficulties in using the mobile phone, a solution can be obtained at the first time, and meanwhile, a result is returned in a voice form, so that the user experience of the old people in using the smart phone is improved.
Owner:WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products