95 results about "Audio segmentation" patented technology

Video music matching method and device, electronic equipment and computer readable medium

The invention provides a video music matching method and device, electronic equipment and a computer readable storage medium, and relates to the technical field of audio processing. The method comprises the following steps: acquiring a video to be matched with music and an audio for matching with music; respectively obtaining a video segmentation point of the video and an audio segmentation point of the audio; dividing the video into video clips according to the video segmentation points; dividing the audio into audio clips of which the number is the same as that of the video clips according to the audio segmentation points; adjusting the playing speed of each video clip or the playing speed of each audio clip, so that the playing durations of each video clip and each audio clip are the same in a one-to-one correspondence manner according to the playing sequence; and connecting the adjusted video clips according to a playing sequence to obtain a target video, connecting the adjusted audio clips according to the playing sequence to obtain a target audio, and jointly playing the target video and the target audio. Video picture features and music rhythm features in audios can be automatically and effectively combined, and the watching immersion of users is improved.
Owner:BEIJING BYTEDANCE NETWORK TECH CO LTD
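
The speed adjustment step in the abstract above can be illustrated with a minimal Python sketch. It assumes the segmentation points are already known as timestamps in seconds and that the video clips (rather than the audio clips) are the ones being re-timed; the function names `clip_durations` and `video_speed_factors` are hypothetical and do not come from the patent.

```python
def clip_durations(cut_points, total):
    """Durations of the clips obtained by cutting [0, total] at cut_points."""
    bounds = [0.0] + sorted(cut_points) + [total]
    return [end - start for start, end in zip(bounds, bounds[1:])]


def video_speed_factors(video_cuts, video_total, audio_cuts, audio_total):
    """Playback speed for each video clip so that it lasts exactly as long
    as the audio clip at the same position in the playing sequence."""
    video = clip_durations(video_cuts, video_total)
    audio = clip_durations(audio_cuts, audio_total)
    if len(video) != len(audio):
        raise ValueError("video and audio must be split into the same number of clips")
    if any(d <= 0 for d in audio):
        raise ValueError("every audio clip must have a positive duration")
    # A factor above 1 speeds the clip up, a factor below 1 slows it down.
    return [v / a for v, a in zip(video, audio)]


# Hypothetical input: a 12 s video cut at 4 s and 10 s, an 8 s audio cut at 2 s and 5 s.
factors = video_speed_factors([4.0, 10.0], 12.0, [2.0, 5.0], 8.0)
```

After re-timing with these factors, the concatenated video has the same total duration as the audio, so both can be played together clip by clip.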

Video processing method and device, electronic equipment and storage medium

Active | CN110213670A | Avoid too long questions | Avoid long questions | Selective content distribution | Feature vector | Scene segmentation
The invention provides a video processing method and device, electronic equipment and a storage medium. The video processing method comprises the steps of obtaining a to-be-processed video, and dividing the to-be-processed video into a plurality of units of to-be-processed videos; obtaining a scene feature vector and an audio feature vector corresponding to each unit of to-be-processed video; determining scene pre-segmentation points according to the scene feature vectors corresponding to every two adjacent units of to-be-processed videos, and determining audio pre-segmentation points according to the audio feature vectors corresponding to every two adjacent units of to-be-processed videos; performing scene segmentation on the to-be-processed video according to the scene pre-segmentation point, and searching a video clip of which the duration exceeds a set maximum duration threshold from video clips obtained by scene segmentation to serve as a to-be-segmented video clip; and carrying out audio segmentation on the to-be-segmented video clip according to the audio pre-segmentation point to obtain a segmented video clip. According to the invention, the accuracy of splitting is improved, and the requirements of users are better met.
Owner:BEIJING QIYI CENTURY SCI & TECH CO LTD
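
The two-stage splitting described in the abstract above (scene segmentation first, then audio segmentation only for clips that are still too long) can be sketched as follows. This is a rough illustration under the assumption that the scene and audio pre-segmentation points are already available as timestamps in seconds; the function name `split_video` and its parameters are hypothetical, and the feature-vector comparison that produces the points is not shown.

```python
def split_video(total, scene_points, audio_points, max_duration):
    """Cut [0, total] at the scene points, then cut any clip longer than
    max_duration again at the audio points that fall inside it."""
    bounds = [0.0] + sorted(scene_points) + [total]
    clips = []
    for start, end in zip(bounds, bounds[1:]):
        if end - start > max_duration:
            # Only audio pre-segmentation points strictly inside the clip are used.
            inner = [p for p in sorted(audio_points) if start < p < end]
            sub_bounds = [start] + inner + [end]
            clips.extend(zip(sub_bounds, sub_bounds[1:]))
        else:
            clips.append((start, end))
    return clips


# Hypothetical input: a 100 s video, scene cuts at 30 s and 40 s,
# audio cuts at 10 s, 20 s, 35 s and 70 s, maximum clip duration 25 s.
clips = split_video(100.0, [30.0, 40.0], [10.0, 20.0, 35.0, 70.0], 25.0)
```

In this example the audio point at 35 s is ignored, because the scene clip from 30 s to 40 s is already shorter than the threshold.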

Segmentation clustering method and system for multi-person voice in complex environment

The invention discloses a segmentation clustering method and system for multi-person voice in a complex environment. The method comprises the following steps: acquiring multiple continuous multi-person speaking voice segment audios according to multi-person speaking audios; normalizing the multi-person speaking voice segment audios according to acoustic features to obtain normalized audios; acquiring multiple sections of to-be-processed audios; extracting voiceprint information characteristics of the multiple sections of to-be-processed audios; acquiring similarity scores among all the to-be-processed audio segments by setting scoring criteria; according to the similarity scores among all the to-be-processed audio segments, acquiring category labels of a plurality of persons through a multi-stage redundant clustering algorithm; and segmenting and clustering the multi-person speaking audios according to the category labels of the plurality of persons. With the redundant clustering method, the cluster centers of target speakers become more dispersed and therefore easier to distinguish. Unclear voice segments of a target speaker in a complex environment are also discriminated better, which reduces speaker classification errors in segmentation clustering tasks in complex environments.
Owner:AISPEECH CO LTD
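
The general idea of turning pairwise similarity scores into speaker labels can be sketched in Python. The abstract does not specify the multi-stage redundant clustering algorithm, so the sketch below swaps in a much simpler technique: cosine similarity between voiceprint vectors followed by single-linkage grouping with a fixed threshold. The vectors, the threshold, and the names `cosine` and `cluster_segments` are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 for a zero vector)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def cluster_segments(embeddings, threshold=0.8):
    """Assign a label to each segment; segments whose similarity reaches the
    threshold end up in the same group (single linkage via union-find)."""
    parent = list(range(len(embeddings)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(len(embeddings)):
        for j in range(i + 1, len(embeddings)):
            if cosine(embeddings[i], embeddings[j]) >= threshold:
                parent[find(j)] = find(i)

    label_of_root = {}
    return [label_of_root.setdefault(find(i), len(label_of_root))
            for i in range(len(embeddings))]


# Hypothetical 2-D "voiceprints": segments 0 and 1 are close, as are segments 2 and 3.
labels = cluster_segments([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.95]])
```

A fixed threshold is the weakest part of this sketch; the patent's stated aim is precisely to separate speakers more reliably than such a simple rule can in noisy conditions.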

Paragraph association rule evaluation method based on multi-dimensional element video segmentation

The invention mainly provides a paragraph association rule evaluation method based on multi-dimensional element video segmentation. The paragraph association rule evaluation method specifically comprises: step 1, carrying out video analysis; step 2, extracting a key frame in scene segmentation; step 3, carrying out scene segmentation based on the key frame; step 4, carrying out video audio segmentation; step 5, performing semantic segmentation on the video; step 6, judging the paragraph association rule of the segmented video of the GNN network; and step 7, constructing an association network. According to the method, after the same video segment is subjected to multi-dimensional segmentation, a paragraph association rule construction mode is adopted to carry out matching on corresponding multi-dimensional elements. Compared with other paragraph association rule evaluation methods for video segmentation, the video is well segmented in the image dimension by combining the change of the pixels in the image sequence in the time domain and the correlation between the adjacent frames, the key information of the video is retained, and an effective multi-dimensional element video segmentation paragraph association rule judgment method can be provided.
Owner:BEIJING UNIV OF POSTS & TELECOMM

In-vehicle safety monitoring and help seeking system based on automobile data recorder and crying recognition

The invention belongs to the technical field of safety monitoring, and discloses an in-vehicle safety monitoring and help seeking system based on an automobile data recorder and crying recognition. The system comprises an audio input module which is used for collecting a basic cry sample, wherein when the engine of the automobile is switched off, the automobile data recorder triggers a parking monitoring state, a child safety monitoring help seeking system is started, and a built-in audio receiver receives a sound signal, carries out audio segmentation processing and stores audio clips, and a voice recognition module carries out next-step recognition; the voice recognition module which is used for confirming and recognizing the acquired audio signal; and a signal sending module which is used for sending a help seeking message to a vehicle owner through a wireless network of the automobile data recorder so as to complete the external output function of the help seeking signal. According to the invention, the automobile data recorder widely used in existing automobiles and school buses is fully utilized, a mobile phone dialing system and a crying recognition system are integrated, and the system can be integrated with the Internet of Vehicles, so that the corresponding responsible persons are notified promptly when crying occurs and infants trapped in the automobile are discovered in time.
Owner:WUHAN UNIV OF SCI & TECH

Power equipment environment noise identification method based on time domain and frequency domain self-similarity

The invention discloses a power equipment environment noise identification method based on time domain and frequency domain self-similarity. The method comprises the following steps: firstly, acquiring an operation sound signal of power equipment to be monitored; segmenting the collected audio into minute-level recording samples, setting a proper frame length, framing each sample, and extracting time domain and frequency domain features of each frame; and performing similarity analysis on the features by utilizing a clustering-based similarity analysis method, and considering that the samples which can only be clustered into one class cluster have time domain and frequency domain self-similarity characteristics, otherwise, the samples do not have similarity. When the recording sample has time domain and frequency domain self-similarity, the recording sample is reserved; otherwise, the recording sample is rejected. According to the method, the recording samples without time domain and frequency domain self-similarity noise interference can be effectively recognized and eliminated, effective samples are screened out, and support is provided for subsequent recognition of the operation state of the power equipment based on sound signals.
Owner:CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY
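
The screening rule in the abstract above (keep a recording sample only if all of its frames fall into a single cluster) can be sketched in Python. The abstract does not name the features or the clustering method, so this sketch assumes one time-domain feature (RMS amplitude), one frequency-domain feature (normalized spectral centroid from a naive DFT), and a simple leader clustering with a fixed radius; all names and parameter values are hypothetical.

```python
import cmath
import math


def frame_features(frame):
    """Return (RMS amplitude, spectral centroid normalized to the range 0..1)."""
    n = len(frame)
    rms = math.sqrt(sum(x * x for x in frame) / n)
    # Naive DFT magnitudes for bins 0..n/2; fine for short illustrative frames.
    mags = [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame)))
            for k in range(n // 2 + 1)]
    total = sum(mags)
    centroid = sum(k * m for k, m in enumerate(mags)) / total / (n / 2) if total else 0.0
    return (rms, centroid)


def count_clusters(points, radius):
    """Leader clustering: a point opens a new cluster unless it lies within
    radius of an existing cluster leader."""
    leaders = []
    for p in points:
        if not any(math.dist(p, q) <= radius for q in leaders):
            leaders.append(p)
    return len(leaders)


def is_self_similar(sample, frame_len, radius=0.2):
    """True if every frame of the sample falls into one single cluster."""
    frames = [sample[i:i + frame_len]
              for i in range(0, len(sample) - frame_len + 1, frame_len)]
    return count_clusters([frame_features(f) for f in frames], radius) == 1


# Hypothetical signals: a steady hum, and the same hum followed by a loud, higher-pitched burst.
hum = [math.sin(2 * math.pi * 4 * t / 32) for t in range(128)]
burst = [3 * math.sin(2 * math.pi * 12 * t / 32) for t in range(32)]
keep_hum = is_self_similar(hum, frame_len=32)
keep_disturbed = is_self_similar(hum[:96] + burst, frame_len=32)
```

Under these assumptions the steady hum is retained and the disturbed recording is rejected, which mirrors the keep-or-reject decision described in the abstract.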