Model training method and device, corpus processing method and device and computer equipment
A model training and corpus technology, applied in the field of text processing, can solve the problems of low corpus processing efficiency, large labor cost and time cost, inability to screen out abnormal corpus, etc., and achieve the effect of improving processing efficiency and accuracy.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0044] figure 1 It is a flow chart of a model training method provided in Embodiment 1 of the present invention. This embodiment is applicable to the case of using corpus sample data to perform dependency analysis training to obtain a dependency analysis model. This method can be executed by a model training device , the device can be implemented by software and / or hardware, and generally can be integrated in computer equipment, such as figure 1 As shown, the method includes the following operations:
[0045] S110. Acquire corpus sample data in which there is a dependency relationship between word segmentation samples.
[0046] Wherein, the corpus sample data may be standard corpus data, which is used as sample data for model training. The word segmentation samples may be word segmentation data obtained by performing word segmentation processing on each corpus sample in the corpus sample data. There is a normal and reasonable dependency relationship among the word segmentat...
Embodiment 2
[0075] image 3 It is a flow chart of a corpus processing method provided in Embodiment 2 of the present invention. This embodiment is applicable to the situation of screening out abnormal corpus from corpus to be processed. The method can be executed by a corpus processing device, which can be implemented by software and / or hardware, and generally can be integrated in computer equipment, such as image 3 As shown, the method includes the following operations:
[0076] S210. Acquire the corpus to be processed, and input the corpus to be processed into the dependency analysis model.
[0077] Among them, the corpus to be processed may need to filter out the original corpus data of the abnormal corpus, and the corpus data may be voice data collected in real time and the like. It can be understood that the corpus to be processed may include multiple sentences.
[0078] In the embodiment of the present invention, the obtained corpus to be processed may be input into the dependen...
Embodiment 3
[0101] Figure 9 is a schematic diagram of a model training device provided in Embodiment 3 of the present invention, such as Figure 9 As shown, the device includes: a corpus sample data acquisition module 310 and a dependency analysis training module 320, wherein:
[0102] The corpus sample data acquisition module 310 is used to obtain the corpus sample data with dependency between the word segmentation samples;
[0103] The dependency analysis training module 320 is used to input the sample data of the corpus into a preset machine learning model to perform dependency analysis training to obtain a dependency analysis model; the dependency analysis model is used to perform abnormal screening processing on abnormal corpus .
[0104] In the technical solution of this embodiment, the dependency analysis model can be obtained by performing dependency analysis training on the word segmentation sample data in the corpus sample data, and the obtained dependency analysis model can ...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com