90 results about "Voice Recognition Software" patented technology

Voice recognition. Alternatively referred to as speech recognition, voice recognition is a computer software program or hardware device with the ability to decode the human voice. Voice recognition is commonly used to operate a device, perform commands, or write without having to use a keyboard, mouse, or press any buttons.

Interactive multimedia tour guide

An interactive multimedia tour guide provides a user with packaged tours in a multimedia format that includes directions and useful information about a selected tour. The packaged tours are composed of principal and ancillary points of interest. A user profile is developed which is used to generate a preference mask for the user. The preference mask is used to select only those ancillary points of interest that would be of most interest to the user. The selected tour is stored on a portable self-contained electronic system which includes a GPS navigation system and cell phone. The system includes voice recognition software and speech synthesis software to provide the user with a verbal interface that provides directions and information on various points of interest during the tour. Combined with an optional camera, the interactive multimedia tour guide allows for rapid identification and editing of pictures or videos made on a tour.
Owner:WHITHAM HLDG
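
One way to picture the preference mask is as a set of bit flags compared against tags on each ancillary point of interest. The sketch below is only an illustration under that assumption; the abstract does not say how the mask is encoded, and the `Interest` categories and `select_stops` function are hypothetical names.

```python
from enum import IntFlag


class Interest(IntFlag):
    # Hypothetical interest categories; not taken from the patent.
    HISTORY = 1
    FOOD = 2
    ART = 4
    NATURE = 8


def select_stops(principal, ancillary, mask):
    """Keep every principal stop, plus ancillary stops whose tags overlap the mask.

    principal: list of stop names.
    ancillary: list of (name, Interest tags) pairs.
    mask: Interest flags derived from the user profile.
    """
    chosen = [name for name, tags in ancillary if tags & mask]
    return list(principal) + chosen
```

For example, a user whose mask is `Interest.FOOD | Interest.ART` would keep a bakery tagged `FOOD` and drop a trail tagged `NATURE`, while every principal stop stays on the tour.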

Method for Transforming Language Into a Visual Form

A computer-assisted design system (100) includes a computer system (102) and a text input device (103) that may be provided with text elements from a keyboard (104). A user may also provide oral input (107) to the text input device (103) or to voice recognition software with in-built artificial intelligence algorithms (110), which can convert spoken language into text elements. The computer system (102) includes an interaction design heuristic engine (116) that acts to understand and translate text and language into a visual form for display to the end user.
Owner:MOR F DYNAMICS

Modifying electronic documents with recognized content or other associated data

Systems and methods enhance editing capabilities associated with a wide variety of different types of electronic documents. Such systems and methods may include a processor that maintains an electronic document having a first portion (e.g., an individual word, character, character string, or the like) provided by a recognizer (e.g., by handwriting or speech recognition software), and they may provide access to potential alternative characters, words, or character strings generated by the recognizer during various user editing functions and operations. Other data associated with the first portion of the document also may be stored and made available to the user during various functions and operations. This invention further relates to computer-readable media including instructions for performing various methods and / or operating various systems for editing electronic documents, including systems and methods like those described above.
Owner:MICROSOFT TECH LICENSING LLC

Method and apparatus for improving the transcription accuracy of speech recognition software

The present invention involves the dynamic loading and unloading of relatively small text-string vocabularies within a speech recognition system. In one embodiment, sub-databases of high likelihood text strings are created and prioritized such that those text strings are made available within definable portions of computer-transcribed dictations as a first-pass vocabulary for text matches. Failing a match within the first-pass vocabulary, the voice recognition software attempts to match the speech input to text strings within a more general vocabulary. In another embodiment, the first-pass text string vocabularies are organized and prioritized and loaded in relation to specific fields within an electronic form, specific users of the system and / or other general context-based interrelationships of the data that provide a higher probability of text string matches than those otherwise provided by commercially available speech recognition systems and their general vocabulary databases.
Owner:COIFMAN ROBERT E +1
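
The two-pass lookup described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: it works on already-decoded text rather than audio, uses Python's standard `difflib` for fuzzy matching, and the function name `match_text` and the `cutoff` value are assumptions.

```python
from difflib import get_close_matches


def match_text(heard, first_pass, general, cutoff=0.8):
    """Try the small first-pass vocabulary, then fall back to the general one.

    heard: text hypothesis from the recognizer.
    first_pass: short list of high-likelihood strings (e.g. for one form field).
    general: larger general-purpose vocabulary.
    Returns (matched string or None, name of the vocabulary that matched).
    """
    hit = get_close_matches(heard, first_pass, n=1, cutoff=cutoff)
    if hit:
        return hit[0], "first_pass"
    hit = get_close_matches(heard, general, n=1, cutoff=cutoff)
    if hit:
        return hit[0], "general"
    return None, "none"
```

In a form-driven setting, the caller would swap in a different `first_pass` list each time the active field or user changes, which corresponds to the dynamic loading and unloading the abstract mentions.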

Computer interface system for tracking of radio frequency identification tags

A method for operating with multiple protocols for handling communications comprising the steps of obtaining information from sensors and related input devices utilizing specialized tamper resistant passive transceivers working with active pulse type transceivers to create historical maps of information on people or objects. This includes steps of: a) identifying recording information, b) sending and receiving prompts, c) associating the call with timers, d) monitoring passive transceivers with low level diagnostic information, e) monitoring the transceivers with voice recognition software, f) recording associated data, g) identifying the users, the key words or phrases within the recorded data, h) naming the recording and i) saving the data in a protected format.
Owner:GLOBAL TELLINK

User intent analysis extent of speaker intent analysis system

Inactive | US20120262296A1 | Rapid and remote processing of facial expression | Least possible labor | Special service for subscribers | Digital computer details | Pager | Speech identification
A speaker intent analysis system and method for validating the truthfulness and intent of a plurality of participants' responses to questions. A computer stores, retrieves, and transmits a series of questions to be answered audibly by participants. The participants' answers are received by a data processor. The data processor analyzes and records the participants' speech parameters for determining the likelihood of dishonesty. In addition to analyzing participants' speech parameters for distinguishing stress or other abnormality, the processor may be equipped with voice recognition software to screen responses that while not dishonest, are indicative of possible malfeasance on the part of the participants. Once the responses are analyzed, the processor produces an output that is indicative of the participant's credibility. The output may be sent to proper parties and / or devices such as a web page, computer, e-mail, PDA, pager, database, report, etc. for appropriate action.
Owner:BEZAR DAVID

Interactive multimedia book

An interactive multimedia book provides hands-on multimedia instruction to the user in response to voiced commands. The book is implemented on an easy to use computer system which is suitable to various environments in which the book might be used. The interactive multimedia book is published on a computer readable medium with the necessary software to support the interactive operation of the book. Alternatively, the book may be downloaded from a remote site using a network, such as the Internet, in which case the content of the book and the necessary software are copied to a local medium, such as a computer hard disk. The content includes both text and audio / video clips. The interactive multimedia book is accessed by a computer system which is equipped with a microphone and voice recognition software. Voiced commands and natural language queries are the primary user input to the computer system. The computer system is also equipped with a high resolution display, a voice synthesizer and a speaker or headphone system to provide output to the user. A combination headphone and directional microphone can be especially convenient in some environments as, for example, the wood shop where the headphones allow the user to better hear the instruction over the din of machine noise while at the same time protecting the user's hearing. The displayed text is written in a markup language, such as HyperText Markup Language (HTML), and contains hyperlinks which link the current topic with other related topics. The user may command the book to read the text and, as the text is read by the voice synthesizer, a word which is also a hyperlink will change its attributes upon being spoken. The user will be able to observe or hear this and, without having to click a mouse button, simply utter the word which is the hyperlink to navigate to the linked topic.
Owner:WHITHAM HLDG

Speaker intent analysis system

Inactive | US20110066436A1 | High and low probability | Eliminate need | Speech recognition | Pager | Speech identification
A speaker intent analysis system and method for validating the truthfulness and intent of a plurality of participants' responses to questions. A computer stores, retrieves, and transmits a series of questions to be answered audibly by participants. The participants' answers are received by a data processor. The data processor analyzes and records the participants' speech parameters for determining the likelihood of dishonesty. In addition to analyzing participants' speech parameters for distinguishing stress or other abnormality, the processor may be equipped with voice recognition software to screen responses that while not dishonest, are indicative of possible malfeasance on the part of the participants. Once the responses are analyzed, the processor produces an output that is indicative of the participant's credibility. The output may be sent to proper parties and / or devices such as a web page, computer, e-mail, PDA, pager, database, report, etc. for appropriate action.
Owner:BEZAR FAMILY IRREVOCABLE TRUST +1

Virtual Trainer

An interactive virtual training system including at least one exercise equipment device including a video user interface, an audio input, an audio output, voice recognition software for interpreting a user's spoken commands and software that monitors a user's exercise pattern for consistency, where changes in the exercise pattern trigger a query to the user asking whether assistance is necessary, and software for adjusting the workout routine if a user indicates that modification is needed.
Owner:DEL GIORNO RALPH J
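
The consistency check that triggers a query to the user could look something like the sketch below. It assumes the exercise pattern is reduced to a single numeric series (for example, pedal cadence per interval) and flags a sample that departs from the recent average by more than a fixed fraction; the function name, window size, and tolerance are all illustrative assumptions rather than details from the abstract.

```python
from statistics import mean


def needs_check_in(samples, window=5, tolerance=0.2):
    """Return True when the latest sample departs from the recent baseline.

    samples: chronological readings of one exercise metric.
    window: number of preceding samples used as the baseline.
    tolerance: allowed relative deviation before the user is asked
               whether assistance is needed.
    """
    if len(samples) <= window:
        return False  # not enough history to form a baseline
    baseline = mean(samples[-window - 1:-1])
    if baseline == 0:
        return False
    return abs(samples[-1] - baseline) / baseline > tolerance
```

When the function returns True, the system described here would ask the user by voice whether help is needed and adjust the workout only if the user says so.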

Relay for personal interpreter

A relay is described to facilitate communication through the telephone system between hearing users and users who need or desire assistance in understanding voice communications. To overcome the speed limitations inherent in typing, the call assistant at the relay does not type most words but, instead, re-voices the words spoken by the hearing user into a computer operating a voice recognition software package trained to the voice of that call assistant. The text stream created by the computer and the voice of the hearing user are both sent to the assisted user so that the assisted user can be supplied with a visual text stream to supplement the voice communications. A time delay in the transmission of the voice of the hearing user through the relay is of assistance to the assisted user in comprehending the communications session.
Owner:ULTRATEC INC

System and method for transcribing audio files of various languages

System, method and program product for transcribing an audio file included in or referenced by a web page. A language of text in the web page is determined. Then, voice recognition software of the language of text is selected and used to transcribe the audio file. If the language of the text is not the language of the audio file, then a related language is determined. Then, voice recognition software of the related language is selected and used to transcribe the audio file. The related language can be related geographically, by common root, as another dialect of the same language, or as another language commonly spoken in the same country as the language of the text. Another system, method and program product is disclosed for transcribing an audio file included in or referenced by a web page. A domain extension or full domain of the web page and an official language of the domain extension or full domain are determined. Then, voice recognition software of the official language is used to attempt to transcribe the audio file. If the official language is not a language of the audio file, then a language related to the official language is determined. Then, voice recognition software of the related language is selected and used to transcribe said audio file. The related language can be related geographically, by common root, as another dialect of the same language, or as another language commonly spoken in the same country as the official language.
Owner:NUANCE COMM INC
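
The language fallback in this abstract amounts to trying the page's language first and then a list of related languages. A minimal sketch follows, assuming each recognizer is a callable that returns text or `None` when it cannot transcribe the audio; the `RELATED` table, `make_stub`, and `transcribe_with_fallback` are hypothetical names used only for illustration.

```python
# Hypothetical table of related languages (by geography, root, or dialect).
RELATED = {"pt": ["es", "gl"], "nl": ["de", "af"]}


def make_stub(lang):
    """Build a stand-in recognizer that only 'understands' one language."""
    return lambda audio: audio["text"] if audio["lang"] == lang else None


def transcribe_with_fallback(audio, page_lang, recognizers, related=RELATED):
    """Try the page language first, then each related language in order.

    Returns (language used, transcript), or (None, None) if every
    available recognizer fails.
    """
    for lang in [page_lang] + related.get(page_lang, []):
        recognize = recognizers.get(lang)
        if recognize is None:
            continue  # no recognizer installed for this language
        text = recognize(audio)
        if text is not None:
            return lang, text
    return None, None
```

The second variant in the abstract would call the same function with the official language of the page's domain in place of `page_lang`.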

Method and system for measuring and valuing contributions by group members to the achievement of a group goal

A method and system for human or computer-based group-members to interact with peers to craft an action sequence to achieve a group goal. Method includes means for guiding group members on how to integrate their activities in pursuit of a specific pre-defined group goal, when given only partial understanding of how they can achieve said goal. The method identifies, selects, values and integrates group-member actions that are causal to a group achievement. The system incorporates the method along with means for recording, assigning value and reporting contributions by group members. System also includes an apparatus consisting of head-mounted microphone, voice recognition software and miniature video screen in field of view to aid data collection in applications where events occur in rapid sequence. For computer-based group members, system includes unsupervised neural network embodied in a computer mechanism and means to evaluate the instant activity and immediately relate processed information to guide the integration of group members' actions.
Owner:OBJECT POWER

Apparatus and method for processing service interactions

An interactive voice and data response system directs input to a voice, text, and web-capable software-based router, which is able to intelligently respond to the input by drawing on a combination of human agents, advanced speech recognition and expert systems, connected to the router via a TCP / IP network. The digitized input is broken down into components so that the customer interaction is managed as a series of small tasks rather than one ongoing conversation. The router manages the interactions and keeps pace with a real-time conversation. The system utilizes both speech recognition and human intelligence for purposes of interpreting customer utterance or customer text. The system may use more than one human agent, or both human agents and speech recognition software, to interpret simultaneously the same component for error-checking and interpretation accuracy.
Owner:ARES VENTURE FINANCE

Metadata tagging of moving and still image content

A method and apparatus for tagging image content with rich metadata is provided. The metadata is generated from keyword descriptions of image content spoken by human taggers whilst viewing the content. Voice recognition software is employed to identify the keywords in an audio stream and the resultant metadata is associated in a synchronous manner with the relevant image content. A control console allows the human tagger to rapidly navigate onscreen menus and select different taglines for providing multilevel metadata tagging of the image content. An integrated system provides for the storage of tagged digital image content, with near immediate access to tagged raw footage for viewing and editing, and for easy searching and accessing of finalized footage. A method of serving the tagged content is also provided, which allows the content to be streamed over the web at an acceptable image resolution whilst maintaining the associated metatags.
Owner:EXCESSION TECH LTD

Telephone call inbox

A system and method for extracting and presenting useful data from calls received by a client is disclosed. The resulting “telephone call inbox” is a way for a client to view pay-per-call advertising as a stream of consumers with information available to understand the call activity of the consumers and for the client to navigate their call history. The system automatically filters non-consumer fraudulent calls, extracts the identity of a consumer, aggregates several calling entities into a single consumer, transcribes the call into a call stream using voice recognition software, extracts patterns and draws conclusions from the call stream, and presents a list of call streams in a user friendly set of web pages configured as the telephone call inbox. The telephone call inbox includes, for each call, the caller ID, one or more key words, phrases or major conclusions concerning the call, and the voice recognized call stream.
Owner:FELIX CALLS

Short voice message (SVM) service method, apparatus and system

Tiresome entry of numerous letters of the alphabet into a hand-held device for assembling a short text message for transmission via a short message service (SMS) to a second terminal is avoided by the sending of a short voice message (SVM). The SVM is recorded in the sending terminal and sent to a SVM service center (SVMSC). The SVMSC may notify the intended recipient of the arrival of the SVM and await acceptance before sending it, or send it immediately if the presence of the intended recipient is detected. The second terminal may then commence a bidirectional communication so that an instant voice message session can be established. Alternatively, the problem can be overcome by converting the spoken SVM to text in the user terminal by means of voice recognition software and sending the converted text to the recipient by means of the traditional SMS infrastructure for display as text or for playback as text converted to voice.
Owner:NOKIA CORP

Point-of-sale customer order system utilizing an unobtrusive transmitter/receiver and voice recognition software

A point-of-sale order system utilizes a relatively non-obtrusive transmitter-receiver device which includes a microphone for receiving order information and a speaker for receiving confirmation of the order information. The system utilizes voice recognition software in order to control processing and data flow during order taking operations, and to receive order information from the server in real time during interaction with the customer. A relatively limited database of recognizable words defines computing commands and order information. An audible feedback is provided to the server which is relatively unobtrusive, and which provides the server with a positive indication that the order information has been fully and correctly received and processed.
Owner:NEGREIRO MANUEL

Method and apparatus for improving the transcription accuracy of speech recognition software

A virtual vocabulary database is provided for use with a particular user database as part of a speech recognition system. Vocabulary elements within the virtual database are imported from the user database and are tagged to include numerical data corresponding to the historical use of the vocabulary element within the user database. For each speech input, potential vocabulary element matches from the speech recognition system are provided to the virtual database software which creates virtual sub-vocabularies from the criteria according to predefined criteria templates. The software then applies vocabulary element weighting adjustments according to the virtual sub-vocabulary weightings and applies the adjustment to the default weighting provided by the speech recognition system. The modified weightings are returned with the associated vocabulary elements to the speech engine for selection of an appropriate match to the input speech.
Owner:COIFMAN ROBERT E
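
The weighting adjustment can be illustrated with a small re-ranking function. The abstract does not give a formula, so the multiplicative boost below (proportional to each candidate's share of historical use) is purely an assumption, as are the names `reweight` and `boost`.

```python
def reweight(candidates, usage_counts, boost=0.5):
    """Adjust the engine's default weights using historical usage counts.

    candidates: list of (vocabulary element, default weight) from the engine.
    usage_counts: dict mapping element -> times it was used in the user database.
    boost: assumed strength of the adjustment (not from the patent).
    Returns the candidates sorted by adjusted weight, highest first.
    """
    total = sum(usage_counts.get(word, 0) for word, _ in candidates)
    adjusted = []
    for word, weight in candidates:
        share = usage_counts.get(word, 0) / total if total else 0.0
        adjusted.append((word, weight * (1.0 + boost * share)))
    return sorted(adjusted, key=lambda pair: pair[1], reverse=True)
```

With candidates `[("there", 0.5), ("their", 0.48)]` and a history in which "their" was used nine times and "there" once, the adjusted ranking puts "their" first even though the engine's default weight favored "there".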

Method for correcting error-prone words in voice interaction

Inactive | CN107305768A | Enhance typo-correcting ability | Increase weight | Speech recognition | Named-entity recognition | Context recognition
The invention provides a method for correcting error-prone words in voice interaction. The method comprises the steps of context recognition, automatic error correction based on semantic restriction, and artificial error correction based on semantic feedback. Through voice interaction with users and perception and recognition of the context of a topic, automatic error correction of an entity with a specific meaning can be achieved by using the named entity recognition technology within the limited semantic range, additional semantics can be obtained through artificial feedback so as to conduct error correction, and higher input efficiency and more convenient wrong word correction than existing voice recognition software are realized.
Owner:SHANGHAI JIAO TONG UNIV

Interactive personal security system

An interactive personal security system utilizing a portable object having embedded therein all or a combination of a microphone, still and video cameras, distance sensor, a timer, speakers, a motion sensor, a tracking transponder, a receiver, and a transmitter operably connected to a power source and a conventional microprocessor including input devices, random access memory, read only memory and a database. Sounds and images are transmitted to remote monitoring stations by radio waves or microwaves for radio or television broadcasting or recording on a tape recorder or VCR. Alarms are transmitted to telephones and beepers. Face and voice recognition software identifies the people speaking, playing or attempting to kidnap the child. Sensors identify the presence of persons or animals or a child wandering out of a restricted area. Speakers allow guardians to communicate two-way with the child, thereby responding to the immediate needs of the child. The tracking transponder allows for a pinpoint location of the portable object.
Owner:DEOME DENNIS E +1

Relay for personal interpreter

A relay is described to facilitate communication through the telephone system between hearing users and users who need or desire assistance in understanding voice communications. To overcome the speed limitations inherent in typing, the call assistant at the relay does not type most words but, instead, re-voices the words spoken by the hearing user into a computer operating a voice recognition software package trained to the voice of that call assistant. The text stream created by the computer and the voice of the hearing user are both sent to the assisted user so that the assisted user can be supplied with a visual text stream to supplement the voice communications. A time delay in the transmission of the voice of the hearing user through the relay is of assistance to the assisted user in comprehending the communications session.
Owner:ULTRATEC INC

Method for object selection

The invention is a method for object selection at a location comprising the steps of using a mobile computer having a bar code reader, a display, an audio output device, an audio input device, a tactile input device, text-to-speech software, voice recognition software, objects selection applications software, and a radio frequency identification (RFID) reader, wherein said mobile computer is adapted for communication between an order systems server and a user and the order systems server is adapted for communication between the mobile computer and at least one external computer system.
Owner:SYST APPL ENG

Software code comments management method and system supporting speech recognition technology

A system and method for enabling audio comments to be used when writing and executing code, during design time and run time. A code writer is hereby enabled to simultaneously write code and compose voice comments. These comments, divided into help comments, test items and variable comments, are subsequently recorded, stored, analyzed, prescribed and displayed using text-to-speech and voice recognition software. It is therefore possible to define test cases, execute vocal follow-up on changes, and listen to comments and variable values while running a program.
Owner:MAXIMA BLUE

Apparatus and method for radiological image interpretation using different time zones

A method and apparatus for high quality, timely medical interpretations of radiological images acquired in one time zone and interpreted in a different time zone. The use of a different time zone allows images acquired at night to be interpreted during regular working daylight hours. The images can include images created by conventional x-ray technology, computed radiography, magnetic resonance imaging (MRI), computed tomography (CT), ultrasound imaging, and nuclear medicine equipment. The invention includes the transmission of these images, the interpretation of these images, and the transmission of the interpretations back to the originating facility. The interpretation is performed on high-resolution workstations and the written report is created either by voice recognition software or dictation and typed transcription.
Owner:WILCOX JOHN RICHARDSON JR

Sound identifying method for geographic information and its application in navigation system

A speech recognition method for geographic data includes using an existing speech recognition module and its calling interface to obtain a recognized random character string and convert it into a phonetic character string, converting geographic data character strings extracted from a geographic database into phonetic character strings, calculating the degree of matching between the two, and taking the source string with the closest match as the result character string, i.e., the geographic data name. The method can be applied in a navigation system to raise its level of intelligence.
Owner:NANJING NORMAL UNIVERSITY
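
The phonetic matching step can be sketched as below. This is a toy illustration only: the tiny `PINYIN` lookup table stands in for a real character-to-phonetic converter, and `difflib.SequenceMatcher` stands in for whatever matching-degree measure the patent actually uses. Both are assumptions.

```python
from difflib import SequenceMatcher

# Toy character-to-pinyin table (tones omitted); a real system would use a
# full conversion dictionary. This table is an assumption for illustration.
PINYIN = {
    "南": "nan", "男": "nan", "京": "jing", "经": "jing",
    "北": "bei", "上": "shang", "海": "hai",
}


def to_phonetic(text):
    """Convert each character to its phonetic spelling; keep unknown ones as-is."""
    return "".join(PINYIN.get(ch, ch) for ch in text)


def best_place(recognized, place_names):
    """Return the place name whose phonetic string is closest to the input."""
    target = to_phonetic(recognized)

    def score(name):
        return SequenceMatcher(None, target, to_phonetic(name)).ratio()

    return max(place_names, key=score)
```

Because comparison happens on phonetic strings, a recognizer output made of the wrong homophones, such as "男经", still resolves to the database entry "南京".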

Method for improving text and voice matching efficiency

The invention relates to a method for improving the efficiency of matching text with voice, which includes the following steps. Step 1: voice recognition software recognizes an audio file to produce a text with timestamps. Step 2: the timestamped text is compared with a text input by a user. Step 3: the timestamps of the recognized text are assigned to the text input by the user. The method is efficient, needs no manual intervention, and can match voice and text files in large batches.
Owner:陈健全
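
The three steps above can be sketched as a word-level alignment. The abstract does not specify the comparison algorithm, so this example assumes a longest-matching-blocks alignment via Python's `difflib`; the function name `transfer_timestamps` and the data layout are illustrative.

```python
from difflib import SequenceMatcher


def transfer_timestamps(recognized, user_words):
    """Copy timestamps from recognized words onto matching user words.

    recognized: list of (word, start time in seconds) from the recognizer (Step 1).
    user_words: list of words from the user's own text.
    Returns a list of (user word, timestamp or None); words that the
    recognizer got wrong stay None and could be interpolated later.
    """
    rec_words = [word for word, _ in recognized]
    stamps = [None] * len(user_words)
    # Step 2: compare the two word sequences.
    matcher = SequenceMatcher(None, rec_words, user_words, autojunk=False)
    # Step 3: assign each matched recognized word's timestamp to the user word.
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            stamps[block.b + k] = recognized[block.a + k][1]
    return list(zip(user_words, stamps))
```

Here the user's text is treated as the trusted wording and the recognizer output is used only for timing, so recognition errors do not corrupt the final text.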