Methods, devices, electronic equipment and storage media for matching light effects in music playback
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- SHEN ZHEN NEEWER TECH CO LTD
- Filing Date
- 2026-02-10
- Publication Date
- 2026-06-30
AI Technical Summary
In existing technologies, music lighting effects are disconnected from the semantic content of lyrics, the dynamic expression of lighting effects is limited by the beat, lacks depth and narrative, and has a low level of intelligence.
By acquiring the time information of the lyrics text, performing semantic analysis, extracting dynamic semantic features, determining the dynamic lighting scheme from the preset lighting effect mode knowledge base, and controlling the dynamic changes of the lights independently of the beat rules.
It achieves a close match between lighting effects and the artistic conception of lyrics, enhancing the immersiveness and artistic appeal of music playback, and making light an artistic expression medium with independent narrative capabilities.
Smart Images

Figure CN122308776A_ABST
Abstract
Description
Technical Field
[0001] This disclosure relates to the field of music playback technology, and more specifically, to a method, apparatus, electronic device, and storage medium for matching light effects during music playback. Background Technology
[0002] Currently, in the field of music visualization and intelligent lighting control, existing technologies typically control lighting effects based on the low-level audio characteristics of music (such as physical parameters like beat, loudness, and spectrum). For example, a flash is triggered when a strong beat is detected, and the light flow speed or peak brightness is increased when the loudness increases.
[0003] The existing technical solutions have the following fundamental defects:
[0004] First, the dynamic lighting effects are completely disconnected from the semantic content of the lyrics. This results in a serious disconnect between the visual dynamic presentation and the user's understanding and emotional resonance with the lyrics, making the visual experience superficial and lacking depth and narrative.
[0005] Secondly, lighting effects are usually tightly coupled with beat response logic, which restricts the expression of lighting effects to the rhythmic framework of the beat, resulting in a low level of intelligence. Summary of the Invention
[0006] This disclosure provides at least one method, apparatus, electronic device, and storage medium for matching light effects during music playback to solve the aforementioned technical problems.
[0007] In a first aspect, embodiments of this disclosure provide a method for matching light effects during music playback, including: In response to a music playback command, obtain the lyrics of the currently playing music and the time information of the lyrics on the playback timeline; Perform semantic analysis on the lyrics text and extract at least one corresponding dynamic semantic feature; Based on the at least one dynamic semantic feature, a corresponding dynamic light effect scheme is determined from a preset light effect mode knowledge base. The dynamic light effect scheme is used to define the dynamic change mode of the light. During music playback, the dynamic lighting scheme is output based on the timing information of the lyrics to control the lights.
[0008] In one possible implementation, the method further includes, prior to performing semantic analysis on the lyrics text: The lyrics text is sliced based on the time information to form multiple lyrics slices; wherein, the time information is the start time point and end time point corresponding to each lyrics slice.
[0009] In one possible implementation, the slicing of the lyrics text specifically includes at least one of the following methods: Slicing is performed according to a preset fixed time interval; Segment the lyrics based on punctuation marks; Segmentation is performed based on semantically identified changes in themes or scenes.
[0010] In one possible implementation, the semantic analysis of the lyrics text includes: For each lyric slice in the multiple lyric slices, semantic analysis is performed on each lyric slice to extract the dynamic semantic features of each lyric slice.
[0011] In one possible implementation, the semantic analysis of each lyric slice includes: Semantic analysis is performed on each lyric slice using a pre-trained large language model, and at least one of the action features, emotion intensity features, or scene dynamic features described in each lyric slice is extracted as the dynamic semantic feature.
[0012] In one possible implementation, during music playback, for the playback time segment corresponding to the lyric-free segment, the control strategy for the output light effect dynamic scheme includes at least one of the following: Continue with the dynamic lighting effects scheme corresponding to the previous lyric slice; A gradual transition is created between the light effect dynamics of two adjacent lyric slices; Randomly select one of several lighting effect dynamic schemes that match the overall lyrics for output.
[0013] In one possible implementation, the dynamic change mode defined by the light effect dynamic scheme includes at least one of the following: light flow, flashing, jumping, and breathing modes.
[0014] In one possible implementation, the preset light effect pattern knowledge base is independent of the beat response rule base used to directly trigger the basic changes in the rhythm of the light effect based on the music beat.
[0015] Secondly, this disclosure also provides a light effect matching device for music playback, including: The acquisition module is used to acquire the lyrics text of the currently playing music and the time information of the lyrics text on the playback timeline in response to the music playback command; The extraction module is used to perform semantic analysis on the lyrics text and extract at least one corresponding dynamic semantic feature; The determination module is used to determine the corresponding dynamic light effect scheme from a preset light effect mode knowledge base based on the at least one dynamic semantic feature, wherein the dynamic light effect scheme is used to define the dynamic change mode of the light. The control module is used to output the dynamic lighting scheme to control the lights based on the timing information of the lyrics text during music playback.
[0016] Thirdly, this disclosure also provides an electronic device, including: a processor, a memory, and a bus, wherein the memory stores machine-readable instructions executable by the processor, and when the electronic device is running, the processor communicates with the memory via the bus, and when the machine-readable instructions are executed by the processor, they perform a light effect matching method for music playback as described in any one of the first aspects and various embodiments thereof.
[0017] Fourthly, this disclosure also provides a computer-readable storage medium storing a computer program that, when executed by a processor, performs the light effect matching method in music playback as described in any one of the first aspects and its various embodiments.
[0018] The aforementioned method, apparatus, electronic device, and storage medium for matching light effects in music playback respond to music playback commands by acquiring the lyrics of the currently playing music and their temporal information on the playback timeline. Semantic analysis is then performed on the lyrics, and based on the extracted dynamic semantic features, a corresponding dynamic light effect scheme is determined from a pre-defined light effect pattern knowledge base. Thus, during music playback, the dynamic light effect scheme is output to control the lighting based on the temporal information of the lyrics. This disclosure establishes an independent path from lyric understanding to light effect pattern matching, enabling light effects to break free from a simple dependence on audio physical features and become a dynamic visual language that interprets the meaning of the lyrics. This not only solves the fundamental problem of the disconnect between light effects and the artistic conception of the lyrics but also elevates lighting from its rhythmic nature to an artistic expression medium with independent narrative capabilities, significantly enhancing the immersiveness and artistic appeal of music playback.
[0019] Other advantages of this disclosure will be explained in more detail in conjunction with the following description and accompanying drawings.
[0020] It should be understood that the above description is merely an overview of the technical solution of this disclosure, so as to enable a general understanding of the technical means of this disclosure and to implement it in accordance with the contents of the specification. In order to make the above-mentioned objects, features and advantages of this disclosure more apparent and understandable, preferred embodiments are described in detail below with reference to the accompanying drawings. Attached Figure Description
[0021] To more clearly illustrate the technical solutions of the embodiments of this disclosure, the accompanying drawings used in the embodiments will be briefly described below. The accompanying drawings are incorporated in and constitute a part of this specification. These drawings illustrate embodiments conforming to this disclosure and, together with the specification, serve to illustrate the technical solutions of this disclosure. It should be understood that the drawings only illustrate certain embodiments of this disclosure and should not be considered as a limitation on the scope of protection. For those skilled in the art, other related drawings can be obtained based on these drawings without creative effort. Furthermore, the same reference numerals denote the same components throughout the drawings. In the drawings: Figure 1 A flowchart of a light effect matching method in music playback provided by an embodiment of this disclosure is shown; Figure 2 A schematic diagram of a light effect matching device for music playback provided in an embodiment of this disclosure is shown; Figure 3 A schematic diagram of an electronic device provided in an embodiment of the present disclosure is shown. Detailed Implementation
[0022] Exemplary embodiments of the present disclosure will now be described in more detail with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be implemented in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
[0023] In the description of embodiments disclosed herein, it should be understood that terms such as “comprising” or “having” are intended to indicate the presence of the disclosed features, figures, steps, behaviors, components, portions or combinations thereof in this specification, and do not exclude the possibility of the presence of one or more other features, figures, steps, behaviors, components, portions or combinations thereof.
[0024] Unless otherwise stated, " / " means "or". For example, A / B can mean A or B. In this article, "and / or" is merely a way of describing the relationship between related objects, indicating that there can be three relationships. For example, A and / or B can mean: A alone, A and B at the same time, and B alone.
[0025] The terms "first," "second," etc., are used only for ease of description to distinguish identical or similar technical features and should not be construed as indicating or implying the relative importance or number of these technical features. Therefore, a feature defined by "first," "second," etc., may explicitly or implicitly include one or more of that feature. In the description of embodiments of this disclosure, unless otherwise stated, the term "multiple" means two or more.
[0026] Research has revealed that current mainstream solutions rely on real-time analysis of the physical characteristics of audio signals. The system detects low-level features of the music, such as beat, loudness, and spectral energy, and uses preset rules to drive corresponding dynamic changes in the lights, such as flashing with the beat, changing brightness according to loudness, or making the light patterns flow.
[0027] However, existing solutions completely ignore the rich semantic information carried by music as an expressive art, especially the actions, scene dynamics, and changes in emotional intensity directly described in the lyrics. For example, when lyrics depict intense dynamic scenes such as "lightning and thunder" or "galloping horses," if the music rhythm is slow, the existing system's lighting effects may only present gentle breathing or slow flow, failing to match the lyrics' mood through rapid flashing or scanning. This results in a visual presentation that does not match the user's understanding of the lyrics and emotional expectations, leading to a superficial experience.
[0028] Furthermore, in existing systems, the set of knowledge rules that determines "how" the light effects "move" (such as choosing between a "bouncing" or "flowing" mode) is usually tightly coupled and integrated with the beat response logic that determines "when" the light effects "move" (trigger timing). This strongly coupled architecture confines the dynamic expression of the light effects to the rhythmic framework of the beat, losing the artistic freedom to select the most appropriate dynamic mode based on the independent semantic intelligence of the lyrics, and failing to achieve the deep expression of "telling the story of the lyrics with dynamic language".
[0029] In order to at least partially solve one or more of the above-mentioned problems and other potential problems, this disclosure provides a method, apparatus, device and storage medium for matching light effects in music playback, so as to at least solve the problems of the disconnect between the dynamic light effects and the semantic content of the lyrics and the limitation of the dynamic mode selection by the beat in the prior art.
[0030] To facilitate understanding of this embodiment, a detailed description of the light effect matching method for music playback disclosed in this disclosure embodiment will be provided first. The executing entity of the light effect matching method provided in this disclosure embodiment is generally an electronic device with a certain computing capability. This electronic device may include, for example, a terminal device, a server, or other processing devices. The terminal device may be a user equipment (UE), mobile device, user terminal, cellular phone, personal digital assistant (PDA), handheld device, computing device, in-vehicle device, wearable device, etc. Other devices may be stage lighting, smart home ambient lighting, and devices in entertainment interaction systems. In some possible implementations, the light effect matching method can be implemented by a processor calling computer-readable instructions stored in memory.
[0031] See Figure 1 The diagram illustrates a flowchart of a light effect matching method provided in an embodiment of this disclosure, the method comprising the following steps S101-S104: S101: In response to a music playback command, obtain the lyrics of the currently playing music and the time information of the lyrics on the playback timeline; S102: Perform semantic analysis on the lyrics text and extract at least one corresponding dynamic semantic feature; S103: Based on at least one dynamic semantic feature, determine the corresponding dynamic lighting scheme from the preset lighting effect mode knowledge base. The dynamic lighting scheme is used to define the dynamic change mode of the light. S104: During music playback, output a dynamic lighting scheme based on the timing information of the lyrics to control the lights.
[0032] The music playback command here can be issued by the user through voice, touch, or by selecting a song (such as "Storm") via the application interface. The system responds to the command by retrieving the audio file of the song and its corresponding lyrics file, such as a lyrics (Lyric, LRC) format file, from the local music library or online music service.
[0033] The lyrics file contains not only text, but also precise timestamps (i.e., time information) for each word or line of lyrics. The time information can be the start and end time sequence of each word, or the start and end time points of each line of lyrics obtained through parsing.
[0034] By analyzing the semantics of the lyrics, dynamic semantic features strongly correlated with dynamic expressions can be extracted. These dynamic semantic features include, for example, action features, emotional intensity features, or scene dynamic features. In practical implementation, the lyrics can be sliced first, and then the dynamic semantic features corresponding to each slice can be determined, thereby enabling a more granular presentation of lighting effects.
[0035] The system maintains a "lighting effect pattern knowledge base" independent of any beat rules. This knowledge base stores the mapping relationship from "dynamic semantic features" to "lighting effect dynamic schemes". A lighting effect dynamic scheme specifically defines the dynamic mode of the light (such as flowing, flashing, jumping) and its parameters (such as speed, frequency, amplitude, waveform).
[0036] During song playback, the system performs control along the timeline, specifically by determining the dynamic lighting scheme for each slice.
[0037] In order to achieve a more granular presentation of light effects, this embodiment of the present disclosure slices the lyrics text according to time information before performing semantic analysis on the lyrics text, forming multiple lyrics slices; wherein, the time information is the start time point and end time point corresponding to each lyrics slice.
[0038] Specifically, the system segments the complete lyrics text based on the acquired time information, forming multiple temporally consecutive lyrics segments. This embodiment of the disclosure can employ a sentence segmentation method based on punctuation marks (such as periods). For example, the first verse can be segmented into three segments: Slice 1: “Dark clouds press down on the city, threatening to crush it,” Slice 2: “Lightning tears through the sky,” and Slice 3: “The wind howls, my heart pounds like thunder.” Each segment is associated with its start time (T_s) and end time (T_e).
[0039] When segmenting lyrics based on punctuation, the primary method is segmentation by periods. That is, the text content between the time sequence of a period and the time sequence of the previous period is considered a single lyric slice. In other cases, if semicolons, question marks, exclamation marks, etc., appear, they can also be treated the same as periods.
[0040] In this embodiment of the disclosure, in addition to segmenting lyrics based on punctuation marks, slicing can also be performed based on preset fixed time intervals, or segmented based on semantically recognized theme or scene changes, or other segmentation methods, without specific limitations.
[0041] By slicing, the continuous stream of lyrics is divided into discrete, clearly defined semantic processing units along the timeline. This provides a structural foundation for subsequent precise dynamic feature analysis and lighting effect matching for each independent time segment, and is a key preprocessing step for realizing dynamic lighting and shadow storytelling "line by line" or "segment by segment." Furthermore, it provides various slicing strategies to adapt to different application needs and lyric structures, greatly enhancing the method's flexibility and versatility. Time interval slicing is simple and efficient; punctuation slicing conforms to natural language habits and can effectively capture the grammatical and emotional pauses in lyrics; semantic segmentation enables more intelligent and logically consistent thematic division, creating conditions for accurate matching of complex narrative lyrics.
[0042] Subsequently, semantic analysis is performed independently for each lyric slice. For each of the multiple lyric slices, semantic analysis is conducted separately to extract the dynamic semantic features of each lyric slice. In the specific implementation, the system calls a pre-trained large language model for deep semantic analysis.
[0043] For example, for Slice 1 (“Dark clouds loom over the city, threatening to destroy it”), the model might extract: {“Scene dynamics”: “gathering, oppression”, “Emotional intensity”: “tension, increasing”}; for Slice 2 (“Lightning tears through the sky and falls”), the model extracts: {“Action features”: “sudden, tearing, falling”, “Dynamic intensity”: “extremely high”}; for Slice 3 (“Howling wind and thunderous heart”), the model extracts: {“Action features”: “howling, vibration”, “Scene dynamics”: “chaos, continuous”, “Emotional intensity”: “excitement”}.
[0044] Based on the above slicing method, each lyric slice is guaranteed to obtain its independent semantic interpretation. Action features (such as "running" and "spinning") can directly correspond to the movement patterns of the light effects; emotional intensity features (such as "excited" and "relaxed") affect the intensity of the dynamics; and scene dynamic features (such as "rainstorm" and "stream") inspire the overall form of the dynamics. The extraction of these dimensions provides clear and interpretable semantic basis for subsequent light effect matching, greatly improving the accuracy and rationality of the matching. This allows for the customization of the most suitable dynamic light effect scheme, achieving refined light effect changes and high semantic fidelity, and avoiding the problem of rough dynamic expression caused by general matching of the entire song.
[0045] Different dynamic semantic features can be used to determine different dynamic lighting effect schemes. The lighting effect mode knowledge base provided in this embodiment stores the mapping relationship from dynamic semantic features to dynamic lighting effect schemes. As mentioned above, a dynamic lighting effect scheme specifically defines the dynamic mode of the light (such as flowing, flashing, jumping) and its parameters (such as speed, frequency, amplitude, waveform).
[0046] Taking the three slices mentioned above as examples, the features of Slice 1 ("gathering, compression") can match a "contraction breathing" scheme where the brightness of the light gradually gathers and increases from the periphery to the center, and the color (if adjustable) tends to darken. The features of Slice 2 ("sudden, tearing, extremely high intensity") can match a "random flash" scheme where all light units flicker at extremely high frequencies and irregularly, simulating lightning. The features of Slice 3 ("howling, chaos, excitement") can match a "violent flow" scheme where the light flows at high speed and in a wave-like manner along a specific direction, accompanied by random fluctuations in intensity.
[0047] In practical applications, another completely independent "beat response rule base" may be in operation at this time. It is only responsible for determining whether to trigger a basic "luminance pulse" or "position micro-shift" at a certain beat point. Its rules are simple and semantically unrelated. The complex dynamic scheme determined in this application is logically unrelated to this set of beat rules.
[0048] During song playback, the system executes control along the timeline. When playback reaches time point T_s1 (the starting point of Slice 1), the system instructs the lights to execute a matching "contraction breathing" pattern. Simultaneously, the underlying beat pulses may be superimposed on this dynamic, but the two originate independently. For Slice 2 and Slice 3, at their starting points T_s2 and T_s3, the system switches to the "random strobe" and "violent flow" patterns, respectively.
[0049] For short intervals between lyric slices or interludes or inserts without lyrics, the system can use preset strategies to handle them. The control strategies for the output color scheme include at least one of the following: continuing the color scheme corresponding to the previous lyric slice; performing a gradual transition between the color schemes of two adjacent lyric slices; or randomly selecting one of multiple color schemes that match the overall lyrics for output.
[0050] For example, by using a "gradual transition", the dynamic parameters of the lighting effect can be smoothly transitioned from the "violent flow" of Slice 3 to the soft dynamic scheme matched by the next lyric slice (such as "calm seas" of Slice 4) within the T_interlude time period, so as to achieve a seamless connection of visual narrative.
[0051] As can be seen, the embodiments of this disclosure provide intelligent light effect transition strategies for purely musical sections such as intros, interludes, and outros. These strategies take into account the light effect context of preceding and following lyric segments, making the dynamic changes in lyric-free periods a natural and coherent part of the overall visual narrative, rather than abrupt pauses or jumps, effectively ensuring the smoothness and integrity of the user's visual experience throughout the entire music appreciation process.
[0052] Considering the crucial role of the light effect pattern knowledge base in implementing the light effect matching method provided in this disclosure, the process of establishing the light effect pattern knowledge base will be described in detail below: Multi-source data acquisition: Collect a large number of movie special effects clips, dance videos, natural phenomenon videos (such as storms and flowing water) and their text descriptions to form a "dynamic visual-semantic description" pair.
[0053] Feature association learning: This approach utilizes computer vision techniques to analyze the core patterns (speed, rhythm, regularity) of light and shadow motion in video clips, and employs Natural Language Processing (NLP) techniques to understand the dynamic semantics within the textual descriptions. Machine learning models (such as cross-modal neural networks) are used to learn the mapping relationship from dynamic semantic vectors to light effect motion parameter vectors.
[0054] In practical implementation, expert rules can be integrated. For example, the classic lighting effect design experience of domain experts (such as lighting designers) for specific emotions or scenes (such as "romantic," "mysterious," and "outburst") can be encoded into rules and incorporated into the knowledge base. Furthermore, the knowledge base can be optimized through user feedback. The system allows users to rate or adjust the lighting effect expression of a particular lyric after experiencing it. This feedback data is used to fine-tune the mapping relationships in the knowledge base online, enabling the system to adapt to the aesthetic preferences of users or groups and achieve personalized evolution.
[0055] It is important to emphasize that this light effect mode knowledge base is independent of the beat response rule base used to directly trigger basic rhythmic changes in light effects based on the music's beat. This independent light effect mode knowledge base ensures that the selection of dynamic modes is entirely determined by the semantics of the lyrics, completely eliminating the interference and constraints of beat rules. This is the fundamental technical guarantee for achieving "semantics-driven dynamics" rather than "beat-driven dynamics."
[0056] In the description of this specification, references to terms such as "some possible implementations," "some implementations," "example," "specific example," or "some examples" indicate that a specific feature, structure, material, or characteristic described in connection with that implementation or example is included in at least one implementation or example of this disclosure, and the aforementioned terms do not necessarily refer to the same implementation or example. Furthermore, the described specific features, structures, materials, or characteristics can be combined in a suitable manner in any one or more implementations or examples. Moreover, without contradiction, those skilled in the art can combine and integrate the different implementations or examples described in this specification, as well as the features of different implementations or examples.
[0057] Regarding the method flowcharts of embodiments of this disclosure, certain operations are described as different steps performed in a certain order. Such flowcharts are illustrative and not restrictive. Some steps described herein may be grouped together and performed in a single operation, or some steps may be divided into multiple sub-steps, and some steps may be performed in an order different from that shown herein. The various steps shown in the flowcharts may be implemented in any way by any circuit structure and / or tangible mechanism (e.g., by software running on a computer device, hardware (e.g., logic functions implemented by a processor or chip), and / or any combination thereof).
[0058] Those skilled in the art will understand that in the methods described in the above specific embodiments, the order in which the steps are written does not imply a strict execution order, and the specific execution order of each step should be determined by its function and possible internal logic.
[0059] Based on the same inventive concept, this disclosure also provides a light effect matching device corresponding to the light effect matching method in music playback. Since the principle of the device in this disclosure is similar to the light effect matching method described above in this disclosure, the implementation of the device can refer to the implementation of the method, and the repeated parts will not be described again.
[0060] Reference Figure 2 The diagram shown is a schematic of a light effect matching device provided in an embodiment of this disclosure. The device includes: an acquisition module 201, an extraction module 202, a determination module 203, and a control module 204; wherein, The acquisition module 201 is used to respond to music playback commands and acquire the lyrics text of the currently playing music and the time information of the lyrics text on the playback timeline. Extraction module 202 is used to perform semantic analysis on the lyrics text and extract at least one corresponding dynamic semantic feature; The determination module 203 is used to determine the corresponding dynamic light effect scheme from a preset light effect mode knowledge base based on at least one dynamic semantic feature. The dynamic light effect scheme is used to define the dynamic change mode of the light. The control module 204 is used to output a dynamic lighting scheme to control the lights based on the timing information of the lyrics during music playback.
[0061] The aforementioned light effect matching device for music playback responds to music playback commands by acquiring the lyrics of the currently playing music and their timing information on the playback timeline. It then performs semantic analysis on the lyrics and, based on the extracted dynamic semantic features, determines the corresponding dynamic light effect scheme from a pre-defined light effect pattern knowledge base. Thus, during music playback, the dynamic light effect scheme is output to control the lighting based on the timing information of the lyrics. This disclosure establishes an independent path from lyric understanding to light effect pattern matching, enabling light effects to break free from a simple dependence on audio physical features and become a dynamic visual language that interprets the meaning of the lyrics. This not only solves the fundamental problem of the disconnect between light effects and the artistic conception of the lyrics but also elevates lighting from its basic function of rhythm to an artistic expression medium with independent narrative capabilities, significantly enhancing the immersiveness and artistic appeal of music playback.
[0062] In one possible implementation, the above-described apparatus further includes: The slicing module 205 is used to slice the lyrics text according to time information before performing semantic analysis on the lyrics text, forming multiple lyrics slices; wherein, the time information is the start time point and end time point corresponding to each lyrics slice.
[0063] In one possible implementation, the slicing module 205 is specifically configured to slice the lyrics text by at least one of the following methods: Slicing is performed according to a preset fixed time interval; Segment the lyrics based on punctuation marks; Segmentation is performed based on semantically identified changes in themes or scenes.
[0064] In one possible implementation, the extraction module 202 is specifically used to perform semantic analysis on the lyrics text according to the following steps: For each lyric slice in the multiple lyric slices, semantic analysis is performed on each lyric slice separately to extract the dynamic semantic features of each lyric slice.
[0065] In one possible implementation, the extraction module 202 is specifically used to perform semantic analysis on each lyric slice according to the following steps: Semantic analysis is performed on each lyric slice using a pre-trained large language model, and at least one of the action features, emotional intensity features, or scene dynamic features described in each lyric slice is extracted as dynamic semantic features.
[0066] In one possible implementation, the control module 204 is further configured to: During music playback, for the playback time segment corresponding to the lyric-free segment, the control strategy for the output light effect dynamic scheme includes at least one of the following: Continue with the dynamic lighting effects scheme corresponding to the previous lyric slice; A gradual transition is created between the light effect dynamics of two adjacent lyric slices; Randomly select one of several lighting effect dynamic schemes that match the overall lyrics for output.
[0067] In one possible implementation, the dynamic change modes defined by the light effect dynamic scheme include at least one of the following: light flow, flashing, jumping, and breathing modes.
[0068] In one possible implementation, the preset light effect pattern knowledge base is independent of the beat response rule base used to directly trigger the basic changes in the rhythm of the light effect based on the music beat.
[0069] It should be noted that the apparatus in this embodiment can implement the various processes of the aforementioned method and achieve the same effects and functions, which will not be elaborated here.
[0070] This disclosure also provides an electronic device, such as... Figure 3 The diagram shown is a schematic representation of an electronic device structure provided in this embodiment of the present disclosure, including: a processor 301, a memory 302, and a bus 303. The memory 302 stores machine-readable instructions executable by the processor 301 (e.g., ...). Figure 2The device includes modules 201 for acquisition, 202 for extraction, 203 for determination, and 204 for control (and corresponding execution instructions, etc.). When the electronic device is running, the processor 301 and the memory 302 communicate via the bus 303. When a machine-readable instruction is executed by the processor 301, the following processing is performed: In response to a music playback command, obtain the lyrics of the currently playing music and the time information of the lyrics on the playback timeline; Perform semantic analysis on the lyrics text and extract at least one corresponding dynamic semantic feature; Based on at least one dynamic semantic feature, the corresponding dynamic lighting scheme is determined from the preset lighting effect mode knowledge base. The dynamic lighting scheme is used to define the dynamic change mode of the light. During music playback, a dynamic lighting scheme is output based on the timing information of the lyrics to control the lights.
[0071] In one possible implementation, the instructions executed by the processor 301, before performing semantic analysis on the lyrics text, further include: The lyrics text is sliced based on time information to form multiple lyrics slices; the time information is the start time and end time of each lyrics slice.
[0072] In one possible implementation, the instructions executed by the processor 301 include slicing the lyrics text, specifically including at least one of the following methods: Slicing is performed according to a preset fixed time interval; Segment the lyrics based on punctuation marks; Segmentation is performed based on semantically identified changes in themes or scenes.
[0073] In one possible implementation, the instructions executed by the processor 301 include semantic analysis of the lyrics text, including: For each lyric slice in the multiple lyric slices, semantic analysis is performed on each lyric slice separately to extract the dynamic semantic features of each lyric slice.
[0074] In one possible implementation, the instructions executed by the processor 301 include semantic analysis of each lyrics slice, including: Semantic analysis is performed on each lyric slice using a pre-trained large language model, and at least one of the action features, emotional intensity features, or scene dynamic features described in each lyric slice is extracted as dynamic semantic features.
[0075] In one possible implementation, the control strategy for the dynamic light effect scheme output during music playback, for the playback time period corresponding to the lyric-free segment, includes at least one of the following: Continue with the dynamic lighting effects scheme corresponding to the previous lyric slice; A gradual transition is created between the light effect dynamics of two adjacent lyric slices; Randomly select one of several lighting effect dynamic schemes that match the overall lyrics for output.
[0076] In one possible implementation, the instructions executed by the processor 301 include at least one of the following dynamic change modes defined by the dynamic lighting scheme: light flow, flashing, jumping, and breathing.
[0077] In one possible implementation, the preset light effect mode knowledge base in the instructions executed by the processor 301 is independent of the beat response rule base used to directly trigger the basic change rhythm of the light effect according to the music beat.
[0078] This disclosure also provides a computer-readable storage medium storing a computer program that, when executed by a processor, performs the steps of the light effect matching method for music playback described in the above method embodiments. The storage medium can be a volatile or non-volatile computer-readable storage medium.
[0079] This disclosure also provides a computer program product carrying program code. The program code includes instructions that can be used to execute the steps of the light effect matching method in music playback described in the above method embodiments. For details, please refer to the above method embodiments, which will not be repeated here.
[0080] The aforementioned computer program product can be implemented through hardware, software, or a combination thereof. In one optional embodiment, the computer program product is specifically embodied in a computer storage medium; in another optional embodiment, the computer program product is specifically embodied in a software product, such as a software development kit (SDK), etc.
[0081] The various embodiments in this disclosure are described in a progressive manner, and the same or similar parts between the various embodiments can be referred to each other. Each embodiment focuses on describing the differences from other embodiments. In particular, the description of the apparatus, device, and computer-readable storage medium embodiments is simplified because they are basically similar to the method embodiments, and the relevant parts can be referred to the description of the method embodiments.
[0082] The apparatus, device, and computer-readable storage medium provided in this disclosure correspond one-to-one with the method. Therefore, the apparatus, device, and computer-readable storage medium also have similar beneficial technical effects as their corresponding methods. Since the beneficial technical effects of the method have been described in detail above, the beneficial technical effects of the apparatus, device, and computer-readable storage medium will not be repeated here.
[0083] Those skilled in the art will understand that embodiments of this disclosure can be implemented as methods and apparatus (devices or systems), or as computer-readable storage media. Therefore, this disclosure can be implemented entirely in hardware, entirely in software, or in a combination of software and hardware. Furthermore, this disclosure can be implemented as a computer-readable storage medium on one or more computer-readable storage media containing computer-usable program code (including, but not limited to, disk storage, read-only optical disc storage (CD-ROM), optical storage, etc.).
[0084] This disclosure is described with reference to flowchart illustrations and / or block diagrams of methods, apparatus (devices or systems), and computer-readable storage media according to embodiments of this disclosure. It should be understood that each block of the flowchart illustrations and / or block diagrams, and combinations of blocks in the flowchart illustrations and / or block diagrams, can be implemented by computer program instructions. These computer program instructions can be provided to a processor of a general-purpose computer, special-purpose computer, embedded processor, or other programmable data processing apparatus to create a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in one or more blocks of the flowchart illustrations and / or block diagrams.
[0085] These computer program instructions may also be stored in a computer-readable storage medium that can direct a computer or other programmable data processing device to function in a particular manner, such that the instructions stored in the computer-readable storage medium produce an article including instruction means, wherein the instruction means implement the functions specified in one or more flowcharts and / or one or more blocks in a block diagram.
[0086] These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer-implemented process, such that the instructions, which execute on the computer or other programmable apparatus, provide steps for implementing the functions specified in one or more processes in the flowchart and / or one or more blocks in the block diagram.
[0087] In a typical configuration, a computing device includes one or more processors (CPU), input / output interfaces, network interfaces, and memory.
[0088] Memory can include non-persistent storage in computer-readable media, such as random access memory (RAM) and / or non-volatile memory, such as read-only memory (ROM) or flash RAM. Memory is an example of computer-readable media.
[0089] Computer-readable media include permanent and non-permanent, removable and non-removable media, which can store information by any method or technology. Information can be computer-readable instructions, data structures, modules of programs, or other data. Examples of computer-readable storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory, read-only memory, electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, CD-ROM, digital versatile optical disc (DVD) or other optical storage, magnetic tape, magnetic magnetic disk storage or other magnetic storage devices, or any other non-transfer medium that can be used to store information accessible by a computing device. Furthermore, although the operations of the methods of this disclosure are described in a specific order in the accompanying drawings, this does not require or imply that these operations must be performed in that specific order, or that all of the operations shown must be performed to achieve the desired result. Additionally, certain steps may be omitted, multiple steps may be combined into one step, and / or a step may be broken down into multiple sub-steps.
[0090] While the spirit and principles of this disclosure have been described above with reference to several specific embodiments, it should be understood that this disclosure is not limited to the disclosed specific embodiments, and the division of aspects does not imply that features in these aspects cannot be combined. This disclosure is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.
Claims
1. A method for matching light effects in music play, characterized in that, include: In response to a music playback command, obtain the lyrics of the currently playing music and the time information of the lyrics on the playback timeline; Perform semantic analysis on the lyrics text and extract at least one corresponding dynamic semantic feature; Based on the at least one dynamic semantic feature, a corresponding dynamic light effect scheme is determined from a preset light effect mode knowledge base. The dynamic light effect scheme is used to define the dynamic change mode of the light. During music playback, the dynamic lighting scheme is output based on the timing information of the lyrics to control the lights.
2. The method of claim 1, wherein, Before performing semantic analysis on the lyrics text, the method further includes: The lyrics text is sliced based on the time information to form multiple lyrics slices; wherein, the time information is the start time point and end time point corresponding to each lyrics slice.
3. The method of claim 2, wherein, The slicing of the lyrics text specifically includes at least one of the following methods: Slicing is performed according to a preset fixed time interval; Segment the lyrics based on punctuation marks; Segmentation is performed based on semantically identified changes in themes or scenes.
4. The method of claim 2, wherein, The semantic analysis of the lyrics text includes: For each lyric slice in the multiple lyric slices, semantic analysis is performed on each lyric slice to extract the dynamic semantic features of each lyric slice.
5. The method according to claim 4, characterized in that, The semantic analysis of each lyric slice includes: Semantic analysis is performed on each lyric slice using a pre-trained large language model, and at least one of the action features, emotion intensity features, or scene dynamic features described in each lyric slice is extracted as the dynamic semantic feature.
6. The method according to any one of claims 2 to 5, characterized in that, During music playback, for the playback time segment corresponding to the lyric-free segment, the control strategy for the output light effect dynamic scheme includes at least one of the following: Continue with the dynamic lighting effects scheme corresponding to the previous lyric slice; A gradual transition is created between the light effect dynamics of two adjacent lyric slices; Randomly select one of several lighting effect dynamic schemes that match the overall lyrics for output.
7. The method according to any one of claims 1 to 5, characterized in that, The dynamic change modes defined by the light effect dynamic scheme include at least one of the following: light flow, flashing, jumping, and breathing modes.
8. The method according to any one of claims 1 to 5, characterized in that, The preset light effect mode knowledge base is independent of the beat response rule base used to directly trigger the basic changes in the rhythm of light effects according to the music beat.
9. A light effect matching device for music playback, characterized in that, include: The acquisition module is used to acquire the lyrics text of the currently playing music and the time information of the lyrics text on the playback timeline in response to the music playback command; The extraction module is used to perform semantic analysis on the lyrics text and extract at least one corresponding dynamic semantic feature; The determination module is used to determine the corresponding dynamic light effect scheme from a preset light effect mode knowledge base based on the at least one dynamic semantic feature, wherein the dynamic light effect scheme is used to define the dynamic change mode of the light. The control module is used to output the dynamic lighting scheme to control the lights based on the timing information of the lyrics text during music playback.
10. An electronic device, characterized in that, include: The device includes a processor, a memory, and a bus. The memory stores machine-readable instructions executable by the processor. When the electronic device is running, the processor communicates with the memory via the bus. When the machine-readable instructions are executed by the processor, they perform the light effect matching method in music playback as described in any one of claims 1 to 8.
11. A computer-readable storage medium, characterized in that, The computer-readable storage medium stores a computer program that, when executed by a processor, performs the light effect matching method in music playback as described in any one of claims 1 to 8.