140 results about "Text stream" patented technology

Automatically scrolling handwritten input user interface for personal digital assistants and the like

A handheld device 100 such as a personal digital assistant (PDA) or the like, a handwritten input user interface (HIUI), a method of interfacing handwritten text and a program product therefor. A lower portion of a touch enabled display is designated as a handwriting input area 104. Action icons 106, 108, 110, 112 and 114 are disposed at a right side of the handwriting user interface 102. Recognized text is displayed on the screen in a text display area located between a file management tool bar 116 and the handwritten input area 104. A scroll bar 118 is disposed at the right side of the display 112. As text is continuously entered each individual word may be recognized, and inserted into the end of the text stream. A word separator 120 may demarcate or bracket individual words in a continuous input stream. A secondary list of potential recognition candidates may be available for display in a box 128 and offered for substitution for or in lieu of the recognized word. Handwritten text may be continuously entered and displayed in the handwriting input area 104 as digital ink, the input point staying approximately fixed with the ink display automatically scrolling. The input area behaves as a "treadmill" or "ticker tape" that is moving from right to left, thereby giving the illusion of a continuous writing space. The speed of the "treadmill" automatically adapts to writing speed. The device 100 may include a communications function and, in particular the device may include an antenna 122 for wireless communication. Individual function switches, buttons and other controls are disposed about the device.
Owner:GOOGLE TECHNOLOGY HOLDINGS LLC

System and method for automatic speech to text conversion

Speech recognition is performed in near-real-time and improved by exploiting events and event sequences, employing machine learning techniques including boosted classifiers, ensembles, detectors and cascades and using perceptual clusters. Speech recognition is also improved using tandem processing. An automatic punctuator injects punctuation into recognized text streams.
Owner:SCTI HLDG INC

Automatically scrolling handwritten input user interface for personal digital assistants and the like

A handheld device 100 such as a personal digital assistant (PDA) or the like, a handwritten input user interface (HIUI), a method of interfacing handwritten text and a program product therefor. A lower portion of a touch enabled display is designated as a handwriting input area 104. Action icons 106, 108, 110, 112 and 114 are disposed at a right side of the handwriting user interface 102. Recognized text is displayed on the screen in a text display area located between a file management tool bar 116 and the handwritten input area 104. A scroll bar 118 is disposed at the right side of the display 112. As text is continuously entered each individual word may be recognized, and inserted into the end of the text stream. A word separator 120 may demarcate or bracket individual words in a continuous input stream. A secondary list of potential recognition candidates may be available for display in a box 128 and offered for substitution for or in lieu of the recognized word. Handwritten text may be continuously entered and displayed in the handwriting input area 104 as digital ink, the input point staying approximately fixed with the ink display automatically scrolling. The input area behaves as a "treadmill" or "ticker tape" that is moving from right to left, thereby giving the illusion of a continuous writing space. The speed of the "treadmill" automatically adapts to writing speed. The device 100 may include a communications function and, in particular the device may include an antenna 122 for wireless communication. Individual function switches, buttons and other controls are disposed about the device.
Owner:GOOGLE TECH HLDG LLC

Use of metadata to post process speech recognition output

A method of utilizing metadata stored in a computer-readable medium to assist in the conversion of an audio stream to a text stream. The method compares personally identifiable data, such as a user's electronic address book and / or Caller / Recipient ID information (in the case of processing voice mail to text), to the n-best results generated by a speech recognition engine for each word that is output by the engine. A goal of this comparison is to replace a possibly misrecognized spoken proper noun, such as a name or company, with its proper textual form, or a spoken phone number with a correctly formatted phone number in Arabic numerals, and thereby improve the overall accuracy of the output of the voice recognition system.
Owner:YAP +1
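
The n-best comparison described in the abstract above can be illustrated with a minimal sketch. It assumes the recognizer returns a best-first list of candidate strings per word and that the address book is a plain list of names; the function name, the `min_ratio` threshold and the use of `difflib` string similarity are illustrative assumptions, not the patented method.

```python
from difflib import SequenceMatcher


def correct_with_metadata(n_best, contact_names, min_ratio=0.8):
    """Return an address-book spelling if any n-best candidate resembles it.

    n_best: candidate strings for one word, best first (assumed format).
    contact_names: names from the user's address book (assumed format).
    Falls back to the recognizer's top candidate when nothing matches.
    """
    for candidate in n_best:
        for name in contact_names:
            ratio = SequenceMatcher(None, candidate.lower(), name.lower()).ratio()
            if ratio >= min_ratio:
                return name  # prefer the spelling stored in the metadata
    return n_best[0]
```

With the contacts `["Jon", "Maria"]`, the candidates `["john", "jon", "joan"]` resolve to the stored spelling `"Jon"`, while a word with no similar contact is left as the recognizer's first choice.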

Methods, systems, and computer program products for securely transforming an audio stream to encoded text

A method, system, computer program product, and method of doing business by providing improved audio compression wherein an audio stream is securely transformed to an encoded text stream (such as an ASCII, EBCDIC, or Unicode text stream). One or more components which are involved in the transformation process are authenticated. A unique identifier of each such component is included within cryptographically-protected information that is provided for the encoded text stream. A digital signature is preferably used for the cryptographic protection, thereby digitally notarizing the encoded text stream. The authenticity and integrity of the encoded text stream can therefore be verified. In preferred embodiments, the authenticated identities of components performing the transformation can also be determined from the cryptographically-protected information. The encoded text stream will typically require much less storage space than the audio stream, and providing the digital notarization along with the encoded text stream serves to reliably establish evidence of the contents of the audio stream (even though a perfect speech-to-text transformation might not be achieved).
Owner:MICROSOFT TECH LICENSING LLC
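
The idea of binding component identifiers to the encoded text so that integrity can be verified later can be sketched as follows. Note the substitution: the abstract prefers a digital signature, whereas this sketch uses an HMAC tag from the Python standard library as a stand-in, so it demonstrates tamper detection with a shared key rather than true notarization. The record layout and the function names are hypothetical.

```python
import hashlib
import hmac
import json


def _payload(text, component_ids):
    # Canonical byte form of the text plus the IDs of the transforming components.
    return json.dumps({"text": text, "components": component_ids},
                      sort_keys=True).encode("utf-8")


def notarize(text, component_ids, key):
    """Attach an HMAC-SHA256 tag covering the text and the component IDs."""
    tag = hmac.new(key, _payload(text, component_ids), hashlib.sha256).hexdigest()
    return {"text": text, "components": component_ids, "tag": tag}


def verify(record, key):
    """Recompute the tag and compare it in constant time."""
    expected = hmac.new(key, _payload(record["text"], record["components"]),
                        hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, record["tag"])
```

Changing either the text or the listed component IDs after `notarize` makes `verify` return `False`.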

Method and system for filtering sensitive web page based on multiple classifier amalgamation

The invention discloses a system and a method for filtering sensitive webpages based on multi-classifier fusion. The processing object is a webpage, and the processing result is whether the webpage contains sensitive content, which may be pornographic, reactionary, violent or other unhealthy Internet content harmful to society. The system comprises a data stream obtaining and preprocessing unit, an image and text stream filtering unit, and an information fusion unit for the image filter and the text filter. Through the cooperation of multiple classifiers, the system acquires the source code of a webpage by using the URL of the webpage; text and images are separated at the preprocessing stage to obtain text information and effective image information; the input webpage is assigned to one of three modes by a decision tree algorithm; the webpage is recognized by using a consecutive text classifier, a discrete sensitive text classifier and an image classifier; the outputs of the classifiers are fused, a judgment factor is computed, and the final result is returned to a browser.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI
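
The fusion step in the abstract above could look roughly like the sketch below. The abstract does not say how the classifier outputs are combined, so the weighted average, the weights, the threshold and all names here are assumptions chosen only to show the shape of a fusion rule.

```python
def fuse_scores(scores, weights, threshold=0.5):
    """Weighted-average fusion of per-classifier sensitivity scores in [0, 1].

    scores:  {classifier_name: score} for the classifiers that ran.
    weights: {classifier_name: weight}; both layouts are hypothetical.
    Returns the fused score and whether it reaches the threshold.
    """
    total_weight = sum(weights[name] for name in scores)
    fused = sum(scores[name] * weights[name] for name in scores) / total_weight
    return fused, fused >= threshold


fused, is_sensitive = fuse_scores(
    {"consecutive_text": 0.9, "discrete_text": 0.7, "image": 0.2},
    {"consecutive_text": 0.4, "discrete_text": 0.3, "image": 0.3},
)
```

Normalizing by the weights of the classifiers that actually produced a score lets the same rule serve pages that contain only text or only images.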

Method selecting actions or phases for an agent by analyzing conversation content and emotional inflection

A method and apparatus are provided for accepting a call by an automatic call distributor and for automatic call handling of the call. The apparatus for automatic call handling has: a call receiving system that outputs at least one voice signal; a text voice converter having an input for the at least one voice signal, the text voice converter converting the voice signal to a text stream and providing the text stream on an output thereof; an emotion detector having an input for the at least one voice signal, the emotion detector detecting at least one emotional state in the voice signal and producing at least one tag indicator indicative thereof on an output of the emotion detector; and a scripting engine having inputs for the text stream and the at least one tag indicator, the scripting engine providing on an output thereof at least one response based on the text stream and on the at least one tag indicator. The method and apparatus provide the agents with scripts that are based not only on the content of the call from a caller, but also on the emotional state of the caller. As a result, call duration decreases, which decreases the cost of operating a call center. The savings follow from the reduced time an agent spends on each call, given the agent's hourly rate, and from the reduced time usage of inbound phone lines or trunk lines.
Owner:FIRSTPOINT CONTACT TECH +1
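
A scripting engine that takes both the text stream and an emotion tag, as the abstract above describes, can be sketched as a simple rule lookup. The rule table, the tag values and the script wording are all hypothetical; a real system would presumably use richer matching than single keywords.

```python
# (keyword in the text stream, emotion tag) -> suggested agent script
SCRIPTS = {
    ("billing", "angry"): "Apologize, then offer to review the charge.",
    ("billing", "neutral"): "Ask for the invoice number.",
}
DEFAULT_SCRIPT = "Ask the caller how you can help."


def choose_script(text_stream, emotion_tag):
    """Pick a script from the call content and the detected emotional state."""
    words = set(text_stream.lower().split())
    for (keyword, emotion), script in SCRIPTS.items():
        if keyword in words and emotion == emotion_tag:
            return script
    return DEFAULT_SCRIPT
```

The same sentence yields different scripts depending on the tag, which is the point of feeding the emotion detector's output into the engine.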

Relay for personal interpreter

A relay is described to facilitate communication through the telephone system between hearing users and users who need or desire assistance in understanding voice communications. To overcome the speed limitations inherent in typing, the call assistant at the relay does not type most words but, instead, re-voices the words spoken by the hearing user into a computer operating a voice recognition software package trained to the voice of that call assistant. The text stream created by the computer and the voice of the hearing user are both sent to the assisted user so that the assisted user can be supplied with a visual text stream to supplement the voice communications. A time delay in the transmission of the voice of the hearing user through the relay is of assistance to the assisted user in comprehending the communications session.
Owner:ULTRATEC INC

Technique for improved audio compression

A method, system, computer program product, and method of doing business by providing improved audio compression wherein an audio stream is securely transformed to an encoded text stream (such as an ASCII, EBCDIC, or Unicode text stream). One or more components which are involved in the transformation process are authenticated. A unique identifier of each such component is included within cryptographically-protected information that is provided for the encoded text stream. A digital signature is preferably used for the cryptographic protection, thereby digitally notarizing the encoded text stream. The authenticity and integrity of the encoded text stream can therefore be verified. In preferred embodiments, the authenticated identities of components performing the transformation can also be determined from the cryptographically-protected information. The encoded text stream will typically require much less storage space than the audio stream, and providing the digital notarization along with the encoded text stream serves to reliably establish evidence of the contents of the audio stream (even though a perfect speech-to-text transformation might not be achieved).
Owner:NUANCE COMM INC

Integrated speech recognition, closed captioning, and translation system and method

A system and method that integrates automated voice recognition technology and speech-to-text technology with automated translation and closed captioning technology to provide translations of “live” or “real-time” television content is disclosed. It converts speech to text, translates the converted text to other languages, and provides captions through a single device that may be installed at the broadcast facility. The device accepts broadcast quality audio, recognizes the speaker's voice, converts the audio to text, translates the text, processes the text for multiple caption outputs, and then sends multiple text streams out to caption encoders and / or other devices in the proper format. Because it automates the process, it dramatically reduces the cost and time traditionally required to package television programs for broadcast into foreign or multi-language U.S. markets.
Owner:GLOBAL TRANSLATION

Method and system context-aware for identifying, activating and executing software that best respond to user requests generated in natural language

Inactive | US20100004924A1 | Limiting its applicability and use | Reduce the potential mismatch for cases | Digital data information retrieval | Natural language data processing | Typing | Text stream
A computer-implemented method capable of identifying, activating, and executing commands, methods, functions, interfaces, and software-based applications that can satisfy a specific natural language user request represented by a text stream and generated from any means such as typing, voice, gestures, signs or by human thoughts.
Owner:PAEZ YURI LUIS

Relay for personal interpreter

A relay is described to facilitate communication through the telephone system between hearing users and users who need or desire assistance in understanding voice communications. To overcome the speed limitations inherent in typing, the call assistant at the relay does not type most words but, instead, re-voices the words spoken by the hearing user into a computer operating a voice recognition software package trained to the voice of that call assistant. The text stream created by the computer and the voice of the hearing user are both sent to the assisted user so that the assisted user can be supplied with a visual text stream to supplement the voice communications. A time delay in the transmission of the voice of the hearing user through the relay is of assistance to the assisted user in comprehending the communications session.
Owner:ULTRATEC INC

Emergent topic detecting method and system facing text streams of micro-blog platform

The invention provides an emergent topic detecting method and system for text streams of a micro-blog platform. The method comprises the following steps: (1) user data and user-generated information data of the micro-blog platform are collected in real time, and information text and images are extracted; (2) a time window is set, the information text is divided, and real-time data streams and historical data are obtained; (3) characteristics are selected, and a popularity evaluation model and a long micro-blog extraction model are trained; (4) popularity evaluation is carried out on the real-time data streams by means of the popularity evaluation model, long micro-blog extraction is carried out on the real-time data streams by means of the long micro-blog extraction model, the information which is evaluated to be popular is put into popular information sets, and extracted long micro-blog contents are put into long micro-blog sets; (5) whether the number of the popular information sets and the number of the long micro-blog sets reach preset threshold values is judged; if yes, topic extraction is carried out through an LDA model or in a weighting summation mode, and emergent topics are extracted from data of the popular information sets and the long micro-blog sets; if no, the method goes back to step (1).
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

Method and system for operational improvements in dispatch console systems in a multi-source environment

A method and system for operational improvements in a dispatch console in a multi-source environment includes receiving (310) a plurality of audio streams simultaneously from a plurality of mobile devices, transcribing received audio streams by means of speech-to-text conversion, presenting real-time transcriptions to the user and determining (320) if a first keyword is present in at least one of the plurality of audio and / or text streams. Upon determining the presence of the first keyword, the dispatch console automatically performs (330) at least one predefined dispatch console operation from a list of predefined dispatch console operations. The dispatch console further receives (340) a second keyword based on determining the presence of the first keyword and checks (350) for the presence of the second keyword within the audio and / or text streams, thereby enabling additional automated dispatch console operations.
Owner:MOTOROLA SOLUTIONS INC
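
The two-stage keyword check in the abstract above can be sketched on a single transcribed stream. The rule table, the operation names and the `follow_up:` label are hypothetical placeholders; the abstract does not specify what the predefined console operations are.

```python
# first keyword -> (console operation to perform, second keyword to look for)
RULES = {"fire": ("raise_priority", "injured")}


def scan_transcript(transcript, rules):
    """Return the operations triggered by keywords found in one transcript."""
    words = transcript.lower().split()
    operations = []
    for first, (operation, second) in rules.items():
        if first in words:
            operations.append(operation)  # step 330: automatic operation
            if second in words:           # step 350: check the second keyword
                operations.append(f"follow_up:{second}")
    return operations
```

The second keyword is only consulted once the first one has been detected, mirroring the order of steps 320 to 350.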

System Apparatus Circuit Method and Associated Computer Executable Code for Natural Language Understanding and Semantic Content Discovery

Disclosed are systems, apparatuses, circuits and methods for extrapolating meaning from vocalized speech or otherwise obtained text. Speech of a speaking user is sampled and digitized, the digitized speech is converted into a text stream, the text stream derived from speech or otherwise obtained is analyzed syntactically and semantically, a knowledgebase in the specific context domain of the text stream is utilized to construct one or more semantic / syntactic domain specific query analysis constraints / rule-sets, and a “Domain Specific Knowledgebase Query” (DSKQ) or set of queries is built at least partially based on the domain specific query analysis constraints / rule-sets.
Owner:JINNI MEDIA

System and method for analysing text stream message thereof

A system and method for analyzing text stream messages for a micro-blog are provided. The system includes a sliding window module, storing a plurality of text stream messages from the micro-blog and updating the plurality of text stream messages once every preset duration; a dynamic text weight module, receiving the plurality of text stream messages and calculating a burst weight for the plurality of text stream messages according to a dynamic text stream weight algorithm; a clustering module, clustering the plurality of text stream messages to generate a plurality of clusters by a clustering algorithm according to the plurality of text stream messages and the burst weight; and a memory device, storing the clusters.
Owner:IND TECH RES INST
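
The sliding window and burst weight modules described above can be sketched together. The abstract does not define the "dynamic text stream weight algorithm", so the formula here (a term's count in the current window divided by one plus its average count in earlier windows) is an assumption, as are the class and parameter names. The clustering module is left out.

```python
from collections import Counter, deque


class SlidingWindow:
    def __init__(self, max_windows=3):
        # Term counts of the most recent earlier windows.
        self.history = deque(maxlen=max_windows)

    def burst_weights(self, messages):
        """Weight each term in the new window against its recent baseline."""
        current = Counter(word for msg in messages for word in msg.lower().split())
        weights = {}
        for term, count in current.items():
            past = [window[term] for window in self.history]
            baseline = sum(past) / len(past) if past else 0.0
            weights[term] = count / (baseline + 1.0)
        self.history.append(current)  # the new window becomes history
        return weights
```

A term that suddenly appears far more often than in earlier windows gets a high weight, while a term that is merely common stays near or below 1.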

Method for detecting burst topic in user generation text stream based on graph clustering

The invention relates to a method for detecting a burst topic in a user-generated text stream based on graph clustering and belongs to the technical field of internet data mining. The method offers a new graph-based perspective on the conventional topic detection problem: detecting the burst topic in the text stream is converted into a typical graph clustering problem, so the problem can be solved by using conventional graph theory methods. The method comprises the following main steps: acquiring the text stream; detecting the burst topic; constructing a burst word graph; and clustering burst words. The method targets the detection of burst topics in user-generated text streams and outperforms conventional methods based on document clustering, probabilistic topic models and burst feature clustering.
Owner:TSINGHUA UNIV
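
The last two steps listed above, building a burst word graph and clustering the burst words, can be sketched as follows. The abstract does not name the graph clustering algorithm, so this sketch substitutes the simplest option: words that co-occur in a document are linked, and each connected component (found with union-find) is treated as one topic. The burst words are assumed to have been detected already and to be lowercase.

```python
from itertools import combinations


def cluster_burst_words(documents, burst_words):
    """Group burst words into connected components of a co-occurrence graph."""
    parent = {word: word for word in burst_words}

    def find(word):
        # Follow parent links to the component root, compressing the path.
        while parent[word] != word:
            parent[word] = parent[parent[word]]
            word = parent[word]
        return word

    for doc in documents:
        present = sorted(set(doc.lower().split()) & set(burst_words))
        for a, b in combinations(present, 2):
            parent[find(a)] = find(b)  # an edge merges two components

    clusters = {}
    for word in burst_words:
        clusters.setdefault(find(word), set()).add(word)
    return sorted(sorted(cluster) for cluster in clusters.values())
```

Connected components merge topics that share even one word, so a real system would likely weight edges and cut weak ones; the sketch only shows how topic detection reduces to a graph problem.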

Searching method and searching device based on text

The invention discloses a searching method and searching device based on text. The searching method based on text includes: step 1, acquiring feature words included in text streams which are sent by users; step 2, acquiring, from a pre-built feature probability lexicon, the feature probability of each business corresponding to each feature word; step 3, calculating a joint probability of each business corresponding to the text streams according to the feature probabilities of that business for the feature words; and step 4, outputting the final matching business according to the calculated joint probability, storing each feature word in the feature probability lexicon and updating the corresponding feature probability of the feature word in the feature probability lexicon. The searching method and searching device can increase searching efficiency and reduce operating costs.
Owner:ZUNYI BRANCH OF CHINA MOBILE GRP GUIZHOU COMPANY
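
Steps 2 to 4 above can be sketched as a lookup followed by a product. The abstract does not state how the joint probability is formed, so this sketch assumes the feature words are independent and multiplies their per-business probabilities (summing logarithms for numerical stability). The lexicon layout, the `floor` value for unseen words and the function name are hypothetical, and the lexicon update in step 4 is omitted.

```python
import math


def match_business(feature_words, lexicon, floor=1e-6):
    """Score each business by the product of its feature probabilities.

    lexicon: {feature_word: {business: probability}} (assumed layout).
    floor:   probability used when a word has no entry for a business.
    """
    businesses = {b for probs in lexicon.values() for b in probs}
    scores = {}
    for business in businesses:
        log_joint = 0.0
        for word in feature_words:
            p = lexicon.get(word, {}).get(business, floor)
            log_joint += math.log(p)
        scores[business] = math.exp(log_joint)
    best = max(scores, key=scores.get)
    return best, scores
```

For a lexicon where "data" favors a data plan but "abroad" strongly favors roaming, the message containing both words is matched to roaming.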

System and method for providing call and chat conferencing

An approach is disclosed for providing an integrated call and chat conferencing system. A first participant joins in a conference, wherein the first participant communicates over a voice session. The voice session is converted into a text stream and stored. A second participant joins in the conference, wherein the second participant communicates over a chat session. The stored converted text stream is presented to the second participant.
Owner:VERIZON PATENT & LICENSING INC

Swoopy text for connecting annotations in fluid documents

A swoopy text method and system for generating and displaying curved text to connect primary source data with secondary data, including alternatively connecting different text streams, to augment the meaning of original text and / or to replace the meaning of the original text stream with secondary data.
Owner:XEROX CORP

Method, apparatus and computer program product for synchronizing separate compressed video and text streams to provide closed captioning and instant messaging integration with video conferencing

A method, apparatus and computer program product are provided for synchronizing separate compressed video and text streams to provide lightweight closed captioning and instant messaging integration with video conferencing. A video encoder encodes a video stream and periodically generates a synchronization frame event. Each generated synchronization frame event has a unique ID. A text recording agent receives the periodically generated synchronization frame events, and generates text packets associating stored text with the synchronization frame event. A video decoder decodes the video stream, periodically generating the synchronization frame event having the unique ID. A text display agent receives the periodically generated synchronization frame events and associates stored text packets with the synchronization frame events.
Owner:AIRBNB
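
The pairing of text packets with synchronization frame events, as described above, can be sketched with plain dictionaries. The video encoder and decoder are not modeled; the sketch assumes each side simply reports the same sequence of unique sync IDs, and the class and function names are hypothetical.

```python
class TextRecorder:
    """Recording side: bundles text typed between sync events into packets."""

    def __init__(self):
        self.pending = []  # text received since the last sync event
        self.packets = {}  # sync ID -> text packet

    def add_text(self, text):
        self.pending.append(text)

    def on_sync_event(self, sync_id):
        # Associate everything typed so far with this sync frame's unique ID.
        self.packets[sync_id] = " ".join(self.pending)
        self.pending = []


def captions_for(sync_ids, packets):
    """Display side: look up the packet for each sync ID seen while decoding."""
    return [packets[sync_id] for sync_id in sync_ids if packets.get(sync_id)]
```

Because the text is keyed by sync ID rather than by wall-clock time, the text stream can travel separately from the compressed video and still be shown at the matching point during playback.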

System and method for direct speech translation system

Pending | US20200226327A1 | Simplifies speech recognition | Simplifies translation | Natural language translation | Mathematical models | Encoder decoder | Speech translation
A system for translating speech from at least two source languages into another target language provides direct speech to target language translation. The target text is converted to speech in the target language through a TTS system. The system simplifies the speech recognition and translation process by providing direct translation, and includes mechanisms described herein that facilitate mixed language source speech translation and the punctuation of output text streams in the target language. In some embodiments it also allows translation of speech into the target language to reflect the voice of the speaker of the source speech based on characteristics of the source language speech and speaker's voice and to produce subtitled data in the target language corresponding to the source speech. The system uses models having been trained using (i) encoder-decoder architectures with attention mechanisms and training data using TTS and (ii) parallel text training data in more than two different languages.
Owner:APPL TECH APPTEK

Method and system for displaying PDF (Portable Document Format) document adaptively to window size and mobile terminal

The invention is suitable for the technical field of application of electronic books and provides a method and system for displaying a PDF (Portable Document Format) document adaptively to a window size, and a mobile terminal. The method comprises the following steps of: A, extracting a locally-stored PDF document and selecting the range of pages to be resolved from the locally-stored PDF document; B, resolving information in the selected page range according to a preset resolving object type to obtain attribute information of each resolving object, wherein the attribute information of each resolving object comprises the position information of each resolving object; C, typesetting each resolving object according to corresponding position information of each resolving object; and D, writing the typeset resolving objects into a document which can support word wrap in a text stream write-in mode, and displaying information in the document. By adopting a PDF document displaying technology provided by the invention, a PDF page can be read without adjusting a display window leftwards or rightwards, and better reading experience and great convenience can be brought to a user.
Owner:WONDERSHARE TECH CO LTD
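
Steps C and D above, ordering the extracted objects by position and writing them out as a text flow that wraps to the window, can be sketched for text objects only. Extraction from an actual PDF is not shown; the `(x, y, text)` tuple layout, with `y` increasing down the page, is an assumption, and `textwrap.fill` stands in for the document format that supports word wrap.

```python
import textwrap


def reflow(objects, width):
    """Order text objects top to bottom, left to right, then wrap to width.

    objects: iterable of (x, y, text) tuples (assumed layout).
    width:   window width in characters.
    """
    ordered = sorted(objects, key=lambda obj: (obj[1], obj[0]))
    flowed = " ".join(text for _, _, text in ordered)
    return textwrap.fill(flowed, width=width)


page = [(50, 10, "stream"), (0, 10, "Text"), (0, 20, "reflows here")]
narrow = reflow(page, width=12)
```

Calling `reflow` again with a different `width` re-wraps the same content, which is what removes the need to pan the window left and right.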