Multimodal automatic scoring method for college English speech

An automatic, multimodal scoring technology, applied in the field of deep learning, that addresses the problems of single-mode scoring, inaccurate scoring results, and wasted staff time and effort, thereby reducing cost and improving scoring accuracy.

Pending Publication Date: 2022-03-15
XIAMEN UNIV

AI Technical Summary

Problems solved by technology

Single-mode scoring methods rely on only one source of evidence, which makes the final scoring results inaccurate.
Manual scoring is subject to subjective influence, which makes scoring results unstable; it also consumes a lot of time and effort, so its cost is high.



Examples


Detailed Description of the Embodiments

[0019] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0020] In the related art, English speeches are mostly scored either from a single modality or directly by hand, which gives low accuracy and wastes manpower and material resources. In the multimodal automatic scoring method for college English speeches according to the embodiment of the present invention, historical speech data is first obtained, wherein the historical speech data includes speech videos and the manual scoring results corresponding to the speech videos, and the manual scoring results include ...


Abstract

The invention provides a multimodal automatic scoring method, medium and device for college English speeches. The method comprises the following steps: acquiring historical speech data; extracting text features, audio features and video features, and training models on them to obtain a language use evaluation sub-model, a speech expression evaluation sub-model and a non-language evaluation sub-model; generating a fourth data set from the outputs of the three sub-models and the comprehensive score; training on the fourth data set to obtain a multimodal fusion learning model; obtaining a speech video to be scored, extracting its text features, audio features and video features, and outputting the corresponding single scores through the three sub-models; and inputting the single scores into the multimodal fusion learning model, which outputs the final scoring result for the speech video to be scored. In this way, English speeches can be scored multimodally, scoring accuracy and efficiency are improved, and the cost of scoring English speeches is reduced.
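The two-stage flow in the abstract (three per-modality sub-models, then a fusion model trained on their outputs) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `LinearScorer` class is a plain linear regressor standing in for the deep-learning sub-models, features are assumed to be already extracted as short numeric vectors, scores are assumed to be normalised to [0, 1], the sub-models are assumed to be trained against per-dimension manual scores, and the names `train_pipeline`, `score` and `make_synthetic` are hypothetical.

```python
import random
from typing import List, Sequence, Tuple

Vector = Sequence[float]


class LinearScorer:
    """Stand-in regressor (y = w.x + b) fitted by batch gradient descent.

    Assumption: the patent's sub-models are deep networks; a linear model
    is used here only to keep the sketch dependency-free.
    """

    def __init__(self, n_features: int) -> None:
        self.w = [0.0] * n_features
        self.b = 0.0

    def predict(self, x: Vector) -> float:
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.b

    def fit(self, xs: Sequence[Vector], ys: Sequence[float],
            lr: float = 0.3, epochs: int = 3000) -> "LinearScorer":
        n = len(xs)
        for _ in range(epochs):
            # Residuals are computed once per epoch, before any update.
            errs = [self.predict(x) - y for x, y in zip(xs, ys)]
            for j in range(len(self.w)):
                self.w[j] -= lr * sum(e * x[j] for e, x in zip(errs, xs)) / n
            self.b -= lr * sum(errs) / n
        return self


def train_pipeline(text_X, audio_X, video_X, sub_scores, overall):
    """Stage 1: one sub-model per modality. Stage 2: fusion model.

    sub_scores holds hypothetical per-dimension manual scores
    (language use, speech expression, non-language) for each speech.
    """
    subs = [
        LinearScorer(len(text_X[0])).fit(text_X, [s[0] for s in sub_scores]),
        LinearScorer(len(audio_X[0])).fit(audio_X, [s[1] for s in sub_scores]),
        LinearScorer(len(video_X[0])).fit(video_X, [s[2] for s in sub_scores]),
    ]
    # "Fourth data set": sub-model outputs paired with the comprehensive score.
    fourth_X = [[m.predict(x) for m, x in zip(subs, feats)]
                for feats in zip(text_X, audio_X, video_X)]
    fusion = LinearScorer(3).fit(fourth_X, overall)
    return subs, fusion


def score(subs, fusion, text_x, audio_x, video_x) -> Tuple[List[float], float]:
    """Return the three single scores and the fused final score."""
    singles = [m.predict(x) for m, x in zip(subs, (text_x, audio_x, video_x))]
    return singles, fusion.predict(singles)


def make_synthetic(n: int, seed: int = 0):
    """Noise-free toy data; the weights below are arbitrary, not from the patent."""
    rng = random.Random(seed)
    text_X = [[rng.random(), rng.random()] for _ in range(n)]
    audio_X = [[rng.random(), rng.random()] for _ in range(n)]
    video_X = [[rng.random(), rng.random()] for _ in range(n)]
    sub_scores = [(0.5 * t[0] + 0.5 * t[1],
                   0.7 * a[0] + 0.3 * a[1],
                   0.4 * v[0] + 0.6 * v[1])
                  for t, a, v in zip(text_X, audio_X, video_X)]
    overall = [0.5 * s[0] + 0.3 * s[1] + 0.2 * s[2] for s in sub_scores]
    return text_X, audio_X, video_X, sub_scores, overall


if __name__ == "__main__":
    subs, fusion = train_pipeline(*make_synthetic(40))
    singles, final = score(subs, fusion, [0.2, 0.8], [0.5, 0.5], [1.0, 0.0])
    print("single scores:", singles, "final score:", final)
```

Training the fusion model on sub-model outputs rather than on raw features keeps each modality's score inspectable on its own, which matches the abstract's description of single scores being produced before the final result.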

Description

Technical field

[0001] The invention relates to the technical field of deep learning, and in particular to a multimodal automatic scoring method for college English speeches.

Background

[0002] A college English speech is a communicative activity characterized by multimodality. During the speech, the speaker needs to use verbal and non-verbal modes in coordination with each other.

[0003] In the related art, English speeches are mostly scored either from a single modality or directly by hand. Unimodal scoring means, for example, extracting the audio of the speech and scoring the speech from that audio alone, or obtaining the speech text and scoring the speech text. However, these methods rely on a single modality, resulting in inaccurate final scoring results. Manual scoring, in turn, is often subject to subjective influence, resulting in unstable scoring results; moreover, manual scoring requires a lot of t...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06V20/40, G06Q10/06, G06Q50/20
CPC: G06Q10/06393, G06Q50/205
Inventor: 黄玲毅, 林和志, 郭洋洋, 姚舜禹, 许智军, 陈勇, 郑超茹, 黄联芬
Owner: XIAMEN UNIV