141 results about "Voice Tag" patented technology

Voice tags are used for automated speech recognition in voice command devices, allowing the user to speak commands, for example to respond to an IVR telephone prompt or to dial a contact on a mobile phone.

Application of Voice Tags in a Social Media Context

According to a present invention embodiment, a system utilizes a voice tag to automatically tag one or more entities within a social media environment, and comprises a computer system including at least one processor. The system analyzes the voice tag to identify one or more entities, where the voice tag includes voice signals providing information pertaining to one or more entities. One or more characteristics of each identified entity are determined based on the information within the voice tag. One or more entities appropriate for tagging within the social media environment are determined based on the characteristics and user settings within the social media environment of the identified entities, and automatically tagged. Embodiments of the present invention further include a method and computer program product for utilizing a voice tag to automatically tag one or more entities within a social media environment in substantially the same manner described above.
Owner:IBM CORP

Methods and Apparatus for Detecting Fraud with Time Based Computer Tags

Active | US20090083184A1 | Detection of potential | Prevention of potential | Computer security arrangements | Payment architecture | Temporal information | Bank account
Systems and methods for creating and analyzing computer tag information for the prevention or detection of potential fraud. Computers and other devices accessing the Web carry device tags with date and time information describing when they were issued by a security tag server. A server time stamp indicating when they were created may be inserted into time-based computer tags such as cookies. Such time stamp information can be encrypted and analyzed during future attempts to access a secure network, such as a customer attempting to log into an online banking account. When the time stamp information from the tag is compared to other selected information about the user, device and / or account, including but not limited to last account log-in date / time or account creation date, the invention may be used to detect suspicious activity.
Owner:THE 41ST PARAMETER
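
The time stamp comparison in this abstract can be illustrated with a minimal sketch. The abstract does not state a decision rule, so the rule used here (a tag issued less than an hour before a login to an account that already existed when the tag was issued) is an assumption, as are the function name `is_suspicious` and the `min_tag_age` parameter.

```python
from datetime import datetime, timedelta


def is_suspicious(tag_issued_at, account_created_at, login_at,
                  min_tag_age=timedelta(hours=1)):
    """Hypothetical rule: flag a login from a device whose tag is very new
    relative to an account that existed before the tag was issued."""
    tag_age = login_at - tag_issued_at
    account_predates_tag = account_created_at < tag_issued_at
    return tag_age < min_tag_age and account_predates_tag


login = datetime(2024, 1, 10, 12, 0)
fresh_tag = datetime(2024, 1, 10, 11, 50)   # issued 10 minutes before login
old_tag = datetime(2023, 5, 1, 9, 0)        # issued months before login
account_created = datetime(2023, 6, 1, 8, 0)

print(is_suspicious(fresh_tag, account_created, login))  # fresh tag, older account
print(is_suspicious(old_tag, account_created, login))    # long-lived tag
```

A real system would presumably combine several such signals; this sketch only shows how a tag time stamp can be compared against account dates.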

Search capabilities for voicemail messages

Methods, systems, and products for voicemail searching that include storing, in association with voicemail messages, voiceprints of callers who leave voicemail messages for voicemail users in a voicemail system; storing caller speech tags in association with the voiceprints; identifying, in dependence upon caller voiceprints, callers who leave new voicemail messages; receiving, from a particular voicemail user, search keywords entered as speech and converted to text through automated speech recognition; and selecting, in dependence upon the search keywords and the caller speech tags, one or more selected voicemail messages from a multiplicity of voicemail messages for the particular voicemail user.
Owner:IBM CORP

Hands-free circuit and method for communicating with a wireless device

A hands-free circuit (10) and method produces audio information (90) corresponding to voice tag information (60) stored either in the hands-free circuit (10) or in a wireless device (320). The hands-free circuit includes a wireless local area network (WLAN) transceiver (20), memory (40) and a speech processor (30). For example, the wireless local area network transceiver (20) may be a Bluetooth transceiver that communicates with the wireless device (320) and receives wireless device information (80). The wireless device information (80) may represent caller identification information, call processing information relating to the status of a call, phone status information or any suitable information. The speech processor (30) produces audio information (90) corresponding to the voice tag information (60). According to one embodiment, the speech processor (30) produces speech synthesized wireless device information (80), in-band ring tone information or any suitable information.
Owner:GOOGLE TECH HLDG LLC

Smart voice interaction method and system

The invention discloses a smart voice interaction method and system. The method includes receiving voice data; performing voice recognition on the voice data and acquiring a voice recognition result; performing recognition rejection judgment on the voice recognition result according to a pre-constructed recognition rejection judgment model based on a semantic layer and acquiring a model output result; determining whether the voice data is man-machine interaction voice data or not according to the model output result; if the voice data is man-machine interaction voice data, performing semantic understanding on the voice recognition result and generating an interaction result according to a semantic understanding result, wherein the interaction result includes a response text. By utilizing the method and system provided by the invention, influence on man-machine interaction by noise-containing voice data can be reduced and error response of the man-machine interaction system is reduced.
Owner:IFLYTEK CO LTD

Generating and relating text to audio segments

A method, apparatus and system for generating speech minutes. The method comprises the steps of displaying status indicators of respective audio (speech) stream chunks received and text information thereof on a GUI display and establishing the tagging between each audio stream chunk and the corresponding text information by dragging and dropping the status signs of the respective speech stream chunks onto the corresponding text information on the GUI, such that the speech stream, the text information and the corresponding tagging relation form voice tagged meeting minutes.
Owner:NUANCE COMM INC

Audio caller ID for mobile telephone headsets

A caller ID feature on the mobile telephone identifies the telephone number of an incoming call and correlates the number with any corresponding voice tag stored in the personal directory. The voice tag associated with the incoming caller is delivered from the telephone to the headset where it is played to provide the user with an audio caller identification. In the absence of a voice tag, voice-synthesized numerals corresponding to the telephone number of the incoming call can be provided to the headset as an audio caller ID.
Owner:LOGITECH EURO SA
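
The lookup and fallback described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the directory is modelled as a plain dictionary from telephone number to a voice tag file name, and `audio_caller_id` and `DIGIT_WORDS` are hypothetical names. Speech synthesis is represented by the words that would be spoken.

```python
# Words standing in for voice-synthesized numerals (assumption).
DIGIT_WORDS = {
    "0": "zero", "1": "one", "2": "two", "3": "three", "4": "four",
    "5": "five", "6": "six", "7": "seven", "8": "eight", "9": "nine",
}


def audio_caller_id(number, directory):
    """Return the audio to play on the headset for an incoming call.

    If the personal directory holds a voice tag for the number, use it;
    otherwise fall back to the spoken digits of the number.
    """
    voice_tag = directory.get(number)
    if voice_tag is not None:
        return ("voice_tag", voice_tag)
    spoken = [DIGIT_WORDS[ch] for ch in number if ch in DIGIT_WORDS]
    return ("synthesized", " ".join(spoken))


directory = {"5551234": "mom.wav"}
print(audio_caller_id("5551234", directory))
print(audio_caller_id("555-0199", directory))
```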

Apparatus and method for voice-tagging lexicon

A voice-tag editor develops voice-tag “sounds like” pairs for a voice-tagging lexicon. The voice-tag editor is receptive of alphanumeric characters input by a user. The alphanumeric characters are indicative of a voice tag and / or “sounds like” text. The voice-tag editor is configured to allow the user to view and edit the alphanumeric characters. A text parser connected to the voice-tag editor generates normalized text corresponding to the “sounds like” text. The normalized text serves as recognition text for the voice tag and is displayed by the voice-tag editor. A storage mechanism is connected to the editor. The storage mechanism updates the lexicon with the alphanumeric characters which represent voice-tag “sounds like” pairs.
Owner:PANASONIC CORP

Program for voice talking, voice talking method, and voice talking apparatus

A computer program for voice talking causes an information terminal to execute voice talking while managing an ID of the information terminal and a first address over a first network. The program stores instructions that enable a computer system to determine that the information terminal has moved to a second network different from the first network, acquire a second address over the second network, and transmit a request to re-register a combination of the second address and the ID, instead of the first address, to another device in which a combination of the first address and the ID is registered.
Owner:KK TOSHIBA

RFID reader enabled intelligent traffic signalling and RFID enabled vehicle tags (number plates)

A system and method for regulating the flow of traffic at a roadway intersection having one or more traffic signals by positioning a processor in the vicinity of the intersection to store cycle times of the traffic flow directions, mounting an RFID reader in the vicinity of each traffic signal in communication with the processor, mounting a plurality of RFID tags in the vicinity of a license plate so as to be within the communication range of an RFID reader at the intersection and so that the RFID readers interrogate the RFID tags of the vehicles, calculating an unused time slice of the cycle time for at least one of the traffic flow directions at the intersection; and, reducing the cycle time for the traffic flow.
Owner:RAMASUBBU SRIDHARA SUBBIAH

Voice endpoint detection method and device, computer equipment and storage medium

The invention relates to a voice endpoint detection method and device, computer equipment and a storage medium. The method comprises the steps that a voice signal with noise is acquired, and an acoustic characteristic and a spectrum characteristic which correspond to the voice signal with the noise are extracted; the acoustic characteristic and the spectrum characteristic are converted, and a corresponding acoustic characteristic vector and a corresponding spectrum characteristic vector are obtained; a classifier is acquired, the acoustic characteristic vector and the spectrum characteristic vector are input to the classifier, and an acoustic characteristic vector with a voice label and a spectrum characteristic vector with a voice label are obtained; the acoustic characteristic vector with the voice label and the spectrum characteristic vector with the voice label are parsed, and corresponding voice signals are obtained; according to time sequences of the voice signals, an initial point and a termination point which correspond to the voice signals are determined. By adopting the method, the accuracy of voice endpoint detection can be effectively improved.
Owner:SHENZHEN RAISOUND TECH

Facilitation of speech recognition in user interface

Items are represented to a user through a user interface with each item having a respective perceivable range value and associated label by which the item can be addressed. To address a particular item, the user speaks its label at a loudness indicative of its perceived range. A loudness-to-range function of the interface determines on the basis of the loudness of the user input, a range gate expected to encompass the range value of the addressed item. A speech recogniser is used to recognise the spoken label and thus the addressed item, the label search space of the recogniser being restricted to exclude the labels of items having a range value outside of the determined range gate. In one embodiment, the user interface is an audio interface in which the items are represented in an audio field through corresponding synthesized sound sources, the depth at which each sound source is rendered in the audio field being the range value associated with the corresponding item.
Owner:SAMSUNG ELECTRONICS CO LTD
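
The range gating described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions: the loudness-to-range function is taken to be linear with louder speech mapped to a greater range, and the decibel limits, maximum range, gate width, and the names `range_gate` and `candidate_labels` are all hypothetical.

```python
def range_gate(loudness_db, quiet_db=40.0, loud_db=80.0,
               max_range=10.0, gate_width=4.0):
    """Map input loudness to a (low, high) range gate.

    Assumption: a linear mapping in which louder speech addresses
    items perceived as farther away.
    """
    frac = (loudness_db - quiet_db) / (loud_db - quiet_db)
    frac = min(1.0, max(0.0, frac))          # clamp to [0, 1]
    centre = frac * max_range
    half = gate_width / 2.0
    return max(0.0, centre - half), min(max_range, centre + half)


def candidate_labels(items, loudness_db):
    """Restrict the recogniser's label search space to items in the gate."""
    low, high = range_gate(loudness_db)
    return [label for label, rng in items.items() if low <= rng <= high]


# Item label -> perceived range value (hypothetical units).
items = {"mail": 1.0, "news": 5.0, "music": 9.0}
print(candidate_labels(items, 40.0))  # quiet speech
print(candidate_labels(items, 60.0))  # medium speech
print(candidate_labels(items, 80.0))  # loud speech
```

The speech recogniser would then only need to match the spoken label against the returned candidates.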

System and method for a remotely accessible web-based personal address book

A computer implemented method for providing a remotely accessible web-based address book includes the following steps. First, a user registers with a web-server and sets up an account. The web-server is configured to generate, store and provide access services to web-based address books. Next, the user uploads personal address book information and contacts in the account. Next, the web-server generates a personal web-based address book for the user based on the address book information and contacts and then adds voice tags and text tags to each entry in the user's personal web-based address book. Next, the web-server cross-correlates and matches the uploaded names and contact information of the user's personal contacts with information in other users' profiles stored in a central directory database. If a match exists between one of the uploaded user's personal contacts and a pre-existing user's profile in the central directory database, the web-server updates the pre-existing user's profile in the central directory database. If a match does not exist, the web-server generates a new user's profile in the central directory database. Next, the user accesses the personal web-based address book by placing a phone-call via a voice transmitting connection. Next, the web-server verifies the user's identity. Next, the user selects a personal contact in the user's personal web-based address book and the web-server places a phone-call to the selected personal contact.
Owner:HUMANBOOK

User interface identification and service tags for a document processing system

A tag-based user interface scheme for digitizing and processing hardcopy documents utilizes a sticker that includes a printed data code representative of a user identity code and a service code. When the sticker is applied to a hardcopy document and scanned, the sticker is located, the data code is parsed, and a desired service is performed based upon the information stored in the data code.
Owner:XEROX CORP

Voice recognition apparatus and voice recognition method

Inactive | US20180308483A1 | Voice recognition is able to be performed efficiently | Efficient execution | Sound input/output | Speech recognition | Speech sound | Voice data
Disclosed is a voice recognition apparatus including: an audio input unit configured to receive a voice; a communication module configured to transmit voice data received from the audio input unit to a server system which performs voice recognition processing, and receive recognition result data on the voice data from the server system; and a controller configured to control the audio input unit and the communication module, wherein, when first voice data is received, the controller performs control to transmit the first voice data to the server system, and wherein, when the first voice data is a conversation command, the controller performs control to receive a first audible answer message including a first question from the server system and output the first audible answer message. In this manner, voice recognition may be performed efficiently.
Owner:LG ELECTRONICS INC

Digital audio recorder

An audio recording device includes a memory storing pre-recorded audio data that include a plurality of voice tags; a detector that is operative to produce a signal upon detection of a substantial similarity between a first portion of a statement spoken by a user and one of the voice tags; and a controller operative, in accordance with the signal produced by the detector, to store, in the memory, a second portion of the statement in association with the voice tag. In accordance with the scope of the invention, audio data may further include instruction commands, such that in response to a substantial similarity detected by the detector between a first portion of the statement and one of the instruction commands, the instruction command is applied in association with the second portion of the statement.
Owner:SANDISK IL LTD
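
The tag matching described in this abstract can be sketched as follows. This is a rough illustration only: text strings stand in for audio, the first word of the statement stands in for its "first portion", and "substantial similarity" is approximated with `difflib.SequenceMatcher` and a threshold of 0.8. All of these choices, and the name `file_statement`, are assumptions rather than details from the patent.

```python
from difflib import SequenceMatcher


def file_statement(statement, voice_tags, store, threshold=0.8):
    """Store the rest of a statement under the voice tag that its
    first word resembles; return the matched tag or None."""
    first, _, rest = statement.partition(" ")
    for tag in voice_tags:
        similarity = SequenceMatcher(None, first.lower(), tag.lower()).ratio()
        if similarity >= threshold:
            store.setdefault(tag, []).append(rest)
            return tag
    return None


voice_tags = ["groceries", "meetings"]
store = {}
print(file_statement("groceries buy milk and eggs", voice_tags, store))
print(file_statement("xyz unrelated note", voice_tags, store))
print(store)
```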

Method for Parsing Natural Language Text

A parser for natural language text is provided. The parser is trained by accessing a corpus of labeled utterances. The parser extracts details of the syntactic tree structures and part of speech tags from the labeled utterances. The details extracted from the tree structures include Simple Links which are the key to the improved efficiency of this new approach. The parser creates a language model using the details that were extracted from the corpus. The parser then uses the language model to parse utterances.
Owner:NEW ROBERT D

System and method of using POS tagging for symbol assignment

Systems and methods for automatically discovering and assigning symbols for identified text in a software application include identifying text for which symbol assignment is desired. The words within the identified text and selected surrounding words defining an observation sequence are subjected to a part of speech tagging algorithm to electronically determine one or more most likely part of speech tags for the identified text. Context relations between the identified text and selected surrounding keywords may also be identified. The identified text, part of speech tag(s) and / or determined relations are then analyzed to map the identified text to one or more identified word senses. Related word senses may also be analyzed to determine if any related word senses have symbols. One of the determined symbols may then be associated with the identified text such that the symbol is thereafter displayed in conjunction with or instead of the text in the application.
Owner:DYNAVOX SYST

Automated voice and speech labeling

A system and method for voice and speech analysis which correlates a speaker signal source and a normalized signal comprising measurements of input acoustic data to a database of language, dialect, accent, and / or speaker attributes in order to create a transcription of the input acoustic data.
Owner:SRC INC

Locating digital images in a portable electronic device

The present invention provides systems and methods for the creation and use of voice tags in an electronic device. When a tag is created, an image handling unit receives a user selection of a voice tag to be provided for locating at least one digital image, and a sound recording unit records sound emitted by the user; the sound is stored as a sound file to be used as a tag for locating images. When image files are to be located, the image handling unit receives the user's selection to search for digital images using name tags, the sound recording unit records sound emitted by the user, and a voice recognition unit compares the sound with stored sound files and indicates the sound file corresponding to the received sound. The image file associated with the indicated sound file is then located.
Owner:SONY ERICSSON MOBILE COMM AB

Voice recognition method and system based on deep learning

Inactive | CN109147768A | Reduce waiting time for replies | Reduce workload | Speech recognition | Data set | Acoustic model
The application discloses a voice recognition method and system based on deep learning. The method includes the following steps: acquiring a training data set, wherein the training data set includes a training voice data set, a voice label and dialogue text information; training the training data set through a training process, and establishing an acoustic model and a language model; acquiring voice query request data; carrying out voice recognition on the voice query request data according to the acoustic model, the language model and a preset dictionary; and finally, outputting a voice recognition text result of the voice query request data. Through the voice recognition method based on deep learning provided by the application, voice consulting content input by customers can be accurately identified, the workload of the manual customer service staff needing to listen to all the consulting requests is reduced, and the time for customers to wait for response is reduced.
Owner:YUNNAN POWER GRID +1

Voice state data generating device, voice state visualizing device, voice state data editing device, voice data reproducing device, and voice communication system

A speech situation data creating device provides data that is convenient for the user when the user uses speech data collected from sound sources and recorded over time. A direction / speaker identifying section (3) of a control unit (1) observes a variation of direction data acquired from speech communication data and sets single-direction data and combination direction data on a combination of directions in speaker identification data if no variation of the direction data indicating a single direction or direction data indicating directions over a predetermined time occurs. If any variation of the direction data occurs within a predetermined time, the direction / speaker identifying section (3) reads speech feature value data Sc from a speaker speech DB (53), identifies the speaker by comparing the speech feature value data Sc with the speech feature value analyzed by a speech data analyzing section (2), sets speaker name data in the speaker identification data if the speaker is identified, and sets direction undetection data in the speaker identification data if the speaker is not identified. A speech situation data creating section (4) creates speech situation data according to the variation with time of the speaker identification data.
Owner:YAMAHA CORP

Switch with packet services processing

Virtual machine environments are provided in the switches that form a network, with the virtual machines executing network services previously performed by dedicated appliances. The virtual machines can be executed on a single multi-core processor in combination with normal switch functions or on dedicated services processor boards. Packet processors analyze incoming packets and add a services tag containing services entries to any packets. Each switch reviews the services tag and performs any network services resident on that switch. This allows services to be deployed at the optimal locations in the network. The network services may be deployed by use of drag and drop operations. A topology view is presented, along with network services that may be deployed. Services may be selected and dragged to a single switch or multiple switches. The management tool deploys the network services software, with virtual machines being instantiated on the switches as needed.
Owner:AVAGO TECH INT SALES PTE LTD
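
The per-switch handling of a services tag described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions: a packet is modelled as a dictionary, network services as plain functions on the payload, and a switch removes an entry from the tag once it has performed that service. The removal step and all names (`process_at_switch`, `edge`, `core`) are hypothetical.

```python
def process_at_switch(packet, resident_services):
    """Perform every tagged service that is resident on this switch and
    leave the remaining entries in the services tag for later switches."""
    pending = []
    for name in packet["services_tag"]:
        handler = resident_services.get(name)
        if handler is None:
            pending.append(name)            # not resident here, pass along
        else:
            packet["payload"] = handler(packet["payload"])
    packet["services_tag"] = pending
    return packet


# A packet processor has added a services tag with two entries.
packet = {"services_tag": ["firewall", "load_balance"], "payload": "data"}

# Two switches, each hosting a different (toy) network service.
edge = {"firewall": lambda p: p + "|fw-checked"}
core = {"load_balance": lambda p: p + "|lb-routed"}

packet = process_at_switch(packet, edge)
packet = process_at_switch(packet, core)
print(packet)
```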

Chinese speech recognition system and method

A Chinese speech recognition system and method is disclosed. Firstly, a speech signal is received and recognized to output a word lattice. Next, the word lattice is received, and word arcs of the word lattice are rescored and reranked with a prosodic break model, a prosodic state model, a syllable prosodic-acoustic model, a syllable-juncture prosodic-acoustic model and a factored language model, so as to output a language tag, a prosodic tag and a phonetic segmentation tag, which correspond to the speech signal. The present invention performs rescoring in a two-stage way to promote the recognition rate of basic speech information and labels the language tag, prosodic tag and phonetic segmentation tag to provide the prosodic structure and language information for the rear-stage voice conversion and voice synthesis.
Owner:NAT CHIAO TUNG UNIV

Voice taxi calling method, voice taxi calling device and voice taxi calling system

The invention belongs to the technical field of mobile terminals, and discloses a voice taxi calling method comprising the steps that voice information of a user is detected in real time; when the mobile terminal responds to preset awakening information included in the voice information of the user under the standby state, a taxi calling software client side is awakened; and when the taxi calling software client side responds to destination information included in the voice information of the user, current position information of the mobile terminal is acquired, and the current position information and the destination information are transmitted to a taxi calling software server so that the taxi calling software server is enabled to start the taxi calling flow. The voice information of the user is identified, the awakening information and the destination information are acquired from the voice information of the user, the taxi calling software client side is awakened according to the awakening information and the current position information of the mobile terminal is acquired, and the destination information and the current position information are transmitted to the taxi calling software server so as to start the taxi calling flow. The taxi calling service can be realized by inputting the destination information for one time through the voice information.
Owner:LETV HLDG BEIJING CO LTD +1