Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

732results about "Video data querying" patented technology

Systems and methods for interacting with advanced displays provided by an interactive media guidance application

An interactive television application is used to provide search results to a user. The user is provided with an opportunity to indicate a desire to search for programs. In response, the interactive television application generates search criteria and searches for programs. The search results that are displayed to the user include mosaic listings associated with programs that match the search criteria. In some embodiments, the interactive television application displays the mosaic listings in a manner that accentuates the different levels of relevance of the search results to the search criteria.
Owner:ALL MEDIA GUIDE +10

Intelligent automated assistant for TV user interactions

Systems and processes are disclosed for controlling television user interactions using a virtual assistant. A virtual assistant can interact with a television set-top box to control content shown on a television. Speech input for the virtual assistant can be received from a device with a microphone. User intent can be determined from the speech input, and the virtual assistant can execute tasks according to the user's intent, including causing playback of media on the television. Virtual assistant interactions can be shown on the television in interfaces that expand or contract to occupy a minimal amount of space while conveying desired information. Multiple devices associated with multiple displays can be used to determine user intent from speech input as well as to convey information to users. In some examples, virtual assistant query suggestions can be provided to the user based on media content shown on a display.
Owner:APPLE INC

Client-server electronic program guide

A client-server interactive television program guide system is provided. An interactive television program guide client is implemented on user television equipment. The interactive television program guide provides users with an opportunity to define expressions that are processed by the program guide server. The program guide server may provide program guide data, schedules reminders, schedules program recordings, and parentally locks programs based on the expressions. Users' viewing histories may be tracked. The program guide server may analyze the viewing histories and generates viewing recommendations, targets advertising, and collects program ratings information based on the viewing histories.
Owner:ROVI GUIDES INC

Video surveillance system

A video surveillance system is set up, calibrated, tasked, and operated. The system extracts video primitives and extracts event occurrences from the video primitives using event discriminators. The system can undertake a response, such as an alarm, based on extracted event occurrences.
Owner:AVIGILON FORTRESS

Method and user interface for forensic video search

A forensic video search user interface is disclosed that accesses databases of stored video event metadata from multiple camera streams and facilitates the workflow of search of complex global events that are composed of a number of simpler, low complexity events.
Owner:SIEMENS AG

Image retrieving and delivering system and image retrieving and delivering method

In an image retrieving and delivering system and an image retrieving and delivering method, a feature descriptor is retrieved from a data base, in which each of a plurality of images including a moving picture and a static picture is registered with a feature descriptor describing the feature of the image, according to a retrieval condition input by a user, a retrieval result satisfying the retrieval condition is obtained, and the retrieval result is edited and processed according to a delivery condition obtained from a user terminal in which the retrieval result is to be received.
Owner:MITSUBISHI ELECTRIC CORP

Identifying works, using a sub-linear time search, such as an approximate nearest neighbor search, for initiating a work-based action, such as an action on the internet

A media work may be associated with an action by (a) extracting features from the media work, (b) determining an identification of the media work, based on the features extracted, using a sub-linear time search, such as an approximate nearest neighbor search for example, and (c) determining an action based on the identification of the media work determined. The media work may be an audio work. The features extracted from the work may include (A) a frequency decomposition of a signal of the audio work, (B) information samples of the audio work, (C) average intensities of sampled windows of the audio work, and / or (D) information from frequencies of the audio work.
Owner:NETWORK 1 TECH

Matching and recommending relevant videos and media to individual search engine results

A computer-implemented system and process for generating video search engine results page is disclosed. The system provides a query term and retrieves a collection of search results. Tags are generated for each search result and used to match media objects to each search result. The search results and video objects related to each search result are returned as a video search engine results page.
Owner:INTERTRUST TECH CORP

Speech recognition for internet video search and navigation

Speech representing a desired video site or video subject is detected and digitized at a TV remote, and then sent to a TV. The TV or in some embodiments an Internet server communicating with the TV use speech recognition principles to recognize the speech, enter a database using the recognized speech as entering argument, and return a link to an Internet site hosting the desired video. The link can be displayed on the TV for selection thereof by a user to retrieve the video.
Owner:SATURN LICENSING LLC

Identifying works, using a sub-linear time search, such as an approximate nearest neighbor search, for initiating a work-based action, such as an action on the internet

A media work may be associated with an action by (a) extracting features from the media work, (b) determining an identification of the media work, based on the features extracted, using a sub-linear time search, such as an approximate nearest neighbor search for example, and (c) determining an action based on the identification of the media work determined. The media work may be an audio work. The features extracted from the work may include (A) a frequency decomposition of a signal of the audio work, (B) information samples of the audio work, (C) average intensities of sampled windows of the audio work, and / or (D) information from frequencies of the audio work.
Owner:NETWORK 1 TECH

System and method for presentation of media related to a context

A system and method for presentation of media related to a context. A request is received over a network from a requesting device for media related to a context, wherein the request comprises at least one criteria. A query is formulated based on the context criteria so as to search, via the network, for user profile data, social network data, spatial data, temporal data and topical data that is available via the network and relates to the context and to media files so as to identify at least one media file that is relevant to the context criteria. A playlist is assembled via the network containing a reference to the media files. The media files on the playlist are transmitted over the network to the requesting device.
Owner:GOOGLE LLC

A combined video description method based on multi-modal features and multi-layer attention mechanism

The invention discloses a combined video description method based on multi-modal features and multi-layer attention mechanism. Firstly, the invention counts the words appearing in the description sentence to form a vocabulary, and numbers each word to facilitate vector representation. Then three kinds of feature data are extracted, including semantic attribute feature, Image information features extracted by 2D-CNN and video motion information features extracted by 3D-CNN, and then multi-modal data dynamic fusion through the multi-layer attention mechanism to obtain visual information, and then according to the current context, adjust the use of visual information; Finally, according to the current context and visual information, the words described in the video are generated. After the multi-modal features of the video are fused through the multi-layer attention mechanism, the invention generates the semantic description of the video based on the multi-modal features of the video, thereby effectively improving the accuracy of the video description.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Ranking Search Results

Content items and other entities may be ranked or organized according to a relevance to a user. Relevance may take into consideration recency, proximity, popularity, air time (e.g., of television shows) and the like. In one example, the popularity and age of a movie may be used to determine a relevance ranking. Popularity (i.e., entity rank) may be determined based on a variety of factors. In the movie example, popularity may be based on gross earnings, awards, nominations, votes and the like. According to one or more embodiments, entities may initially be categorized into relevance groupings based on popularity and / or other factors. Once categorized, the entities may be sorted within each grouping and later combined into a single ranked list.
Owner:TIVO CORP

Mobile terminal and method for controlling the same

A mobile terminal includes a display and controller configured to cause the display to display a playback screen of a first multimedia file and cause the display to display a first retrieval screen in response to receiving a first user input during the displaying of the playback screen of the first multimedia file, such that the first retrieval screen includes a plurality of thumbnail images respectively corresponding to one of a plurality of playback points in time on a per first time interval basis of the first multimedia file. The controller also causes the display to play the first multimedia file beginning at a playback point in time that corresponds to a selected one of the plurality of thumbnail images.
Owner:LG ELECTRONICS INC

Interactive content generation

Generation of interactive content. In an embodiment, a representation of candidate object(s) in content of a digital media asset are received. For each of the candidate object(s), feature(s) of the candidate object are compared to corresponding feature(s) of a plurality of reference objects to identify reference object(s) that match the candidate object. For each of the matched candidate object(s), a hotspot package is generated. The hotspot package may comprise a visual overlay which comprises information associated with the reference object(s) matched to the respective candidate object.
Owner:OIM SQUARED

Methods and systems for object-recognition and link integration in a composite video stream

Disclosed herein are methods and systems for object recognition and link integration in a composite video stream. One embodiment takes the form of a process that includes detecting an object of interest in a set of video frames. The process also includes tracking the movements of the detected object of interest across a subset of the video frames in the set of video frames. The process further includes generating a composite video stream from the video frames in the subset. The composite video stream shows the tracked movements of the detected object of interest without showing background data from the video frames in the subset. The process also includes outputting the generated composite video stream.
Owner:MOTOROLA SOLUTIONS INC

Video image processing method and video image processing device

The invention provides a video image processing method and a video image processing device. The method includes: acquiring a video image sequence; recognizing a target object from a video image frame in the video image sequence; tracking the target object, and determining a motion trail of the target object; acquiring video structural information on the basis of the target object and the motion trail of the target object; performing target object retrieval and / or video compression on the video image sequence on the basis of the video structural information. On the basis of the video image processing method and the video image processing device, quickness in target investigation is realized, target investigation is accelerated, and case cracking speed is increased.
Owner:NEUSOFT CORP

Intelligent automated assistant for tv user interactions

Systems and processes are disclosed for controlling television user interactions using a virtual assistant. A virtual assistant can interact with a television set-top box to control content shown on a television. Speech input for the virtual assistant can be received from a device with a microphone. User intent can be determined from the speech input, and the virtual assistant can execute tasks according to the user's intent, including causing playback of media on the television. Virtual assistant interactions can be shown on the television in interfaces that expand or contract to occupy a minimal amount of space while conveying desired information. Multiple devices associated with multiple displays can be used to determine user intent from speech input as well as to convey information to users. In some examples, virtual assistant query suggestions can be provided to the user based on media content shown on a display.
Owner:APPLE INC

Video clip playing method and device

ActiveCN107071542AQuickly understand the development of the plotEasy to useVideo data queryingSpeech recognitionUser needsTime efficient
The invention provides a video clip playing method and a video clip playing device. The method comprises the steps of acquiring voice search information sent by a user, and resolving the voice search information to acquire corresponding text information; using a pre-trained deep neural network model to extract a search field, a search intention and a search intention satisfaction condition from the text information; if a video clip queried by the user is learned according to the search intention, querying a preset label library corresponding to the search field, and acquiring a video label successfully matched with the search intention satisfaction condition; and playing the target video clip corresponding to the video label to the user according to a pre-stored video playing parameter corresponding to the video label. Therefore, skipping to the target video clip can be accurately achieved via voice search, so that the operation is simple and convenient, the time is saved, the user can rapidly learn plot development of the whole video, utilization of the user is facilitated, and the user requirement is met.
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD +1

Browsing and Retrieval of Full Broadcast-Quality Video

A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on a content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on a search criteria. The step of browsing the set of candidate program segments selects a segment for viewing.
Owner:AMERICAN TELEPHONE & TELEGRAPH CO

Systems and methods for extracting meaning from speech-to-text data

Systems and methods are provided for simulating an interactive conversation with a recorded subject. In accordance with an implementation, a server receives a text string corresponding to a query spoken by a user during the interactive conversation, and subsequently obtains information associated with a plurality of candidate queries posed to the recorded subject. The obtained information may include, for corresponding ones of the candidate queries, a primary keyword, at least one of a contextual keyword or a qualifier keyword associated with the primary keyword, and synonym data. The server may generate scores for the candidate queries based on the text string and at least one of the keyword data or the synonym data. Based on the candidate query scores, the server may select one of the candidate queries that corresponds to the text string and video content that responds to the spoken query.
Owner:HARLESS WILLIAM G +2

Method and subsystem for searching media content within a content-search-service system

Various embodiments of the present invention include concept-service components of content-search-service systems which employ ontologies and vocabularies prepared for particular categories of content at particular times in order to score transcripts prepared from content items to enable a search-service component of a content-search-service system to assign estimates of the relatedness of portions of a content item to search criteria in order to render search results to clients of the content-search-service system. The concept-service component processes a search request to generate lists of related terms, and then employs the lists of related terms to process transcripts in order to score transcripts based on information contained in the ontologies.
Owner:LIMELIGHT NETWORKS

Information display method and device and computer storage medium

The invention provides an information display method and device and a computer storage medium, and the method comprises the steps: displaying a search box in a first preset region of a page, and playing a first multimedia resource in a second preset region, wherein first recommended search information related to the first multimedia resource is displayed in the search box; and in response to the trigger operation, switching the first multimedia resource into a second multimedia resource in a second preset area and playing the second multimedia resource, and correspondingly displaying second recommended search information related to the second multimedia resource in the search box. The search box containing the recommended search information related to the multimedia resource can be synchronously displayed on the video playing page, and the recommended search information displayed in the search box can be synchronously switched when the video resource played on the current page is switched, so that the information search efficiency can be improved.
Owner:BEIJING BYTEDANCE NETWORK TECH CO LTD

Generating method and system of video scene database and method and system for searching video scenes

The invention discloses generating method and system of a video scene database and a method and a system for searching video scene segments based on the video scene database generated by the former method. The generating method of the video scene database comprises the following steps of: (A) marking time anchor points and annotating subtitles in a video scene in a video file in a data source; (B) extracting the annotated subtitles into a subtitle database; (C) carrying out redundance cutting on the corresponding video file according to the marked time anchor points, intercepting a video scene segment corresponding to the subtitles and storing in a video scene segment database; and (D) establishing the corresponding relation between the subtitle segments in the subtitle database and the video scene segment in the video scene database. The invention provides data support for a user to conveniently and rapidly find a target video scene segment.
Owner:李平辉

Key frame extraction method and device and storage medium

The embodiment of the invention discloses a key frame extraction method and device and a storage mediumThe embodiment of the invention includes: acquiring the video frame set corresponding to the video,wherein the video frame set comprises a plurality of video frames; determining a current reference video frame in the video frame set, extracting a corresponding video frame from the video frame setaccording to the reference video frame to serve as a target video frame, acquiring similarity information between the target video frame and the reference video frame, and determining the target video frame as a key frame when the similarity information meets a preset condition. According to the scheme, the video key frames can be extracted based on the similarity between the video frames;, the effective video key frame can be quickly extracted from the video, the speed of extracting the video key frame is increased, the scheme does not depend on the frame rate of the video, the scheme can beapplied to videos of various frame rates, and the accuracy and flexibility of extracting the video key frame are improved.
Owner:TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products