A method and device for extracting and analyzing image information based on voice input
A technology of image information and information extraction, which is applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem of heavy user interaction burden and achieve the effect of reducing interaction burden
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0038] figure 1 It is a flow chart of the voice input-based image information extraction and analysis method provided in Embodiment 1 of the present invention, as shown in figure 1 As shown, the method includes:
[0039] S101. Obtain the information extraction intention of the user according to the voice input by the user.
[0040] Acquire the voice input by the user by using an acoustic sensor, and then convert the acquired voice input by the user into corresponding text information through voice recognition technology, and use the obtained text information as the user's information extraction intention.
[0041] Furthermore, in order to obtain the matching image information extraction scene more accurately, the text information obtained after speech recognition can be further processed, specifically, it can include: word segmentation processing for the text information obtained after speech recognition, and then semantic Analyze and extract the keyword groups, such as "com...
Embodiment 2
[0054] image 3 It is a schematic diagram of an image information extraction and analysis device based on voice input provided in Embodiment 2 of the present invention, as shown in image 3 As shown, the device includes: a preprocessing unit 10 , a matching unit 20 , and an analysis unit 30 .
[0055] The preprocessing unit 10 is configured to obtain the user's information extraction intention according to the voice input by the user.
[0056] The preprocessing unit 10 uses the acoustic sensor to acquire the voice input by the user, and then converts the acquired voice input by the user into corresponding text information through voice recognition technology, and uses the obtained text information as the user's information extraction intention.
[0057] Furthermore, in order to obtain the matching image information extraction scene more accurately, the preprocessing unit 10 may further process the text information obtained after the speech recognition, which may specifically ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


