Generation method, optimization method and device of deep learning model, equipment and medium
A deep learning and model technology, applied in the computer field, can solve a lot of labor costs, learning costs, trial and error costs, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0040] figure 1 It is a flowchart of a deep learning model generation method provided by Embodiment 1 of the present invention. This embodiment is applicable to the case where the deep learning model supports model parallelism. This method can be generated by the deep learning model provided by the embodiment of the present invention device, which can be implemented in the form of software and / or hardware, and can generally be integrated into electronic equipment.
[0041] Such as figure 1 As shown, the generation method of the deep learning model provided by this embodiment includes:
[0042] S110. Obtain the parallel configuration parameters of the business model code and the target model.
[0043] The business model code refers to the model code used to set the business in the setting scenario, for example, it can be the model code used to set the business (such as natural language processing, etc.) in the artificial intelligence scenario. Exemplarily, the business model...
Embodiment 2
[0088] image 3 It is a flowchart of a method for generating a deep learning model provided by Embodiment 2 of the present invention. This embodiment is embodied on the basis of the foregoing embodiments, wherein the initial abstract syntax tree is constructed according to the business model code, and the initial abstract syntax tree is updated to the target abstract syntax tree according to the parallel configuration parameters of the target model, which may be specifically as follows:
[0089] Construct the module dependency graph of the business model code, and the initial abstract syntax tree corresponding to each module on the module dependency graph; based on the PASS mechanism, update the initial abstract syntax tree corresponding to each module to the corresponding target abstraction according to the parallel configuration parameters of the target model syntax tree.
[0090] Such as image 3 As shown, the generation method of the deep learning model provided by this ...
Embodiment 3
[0149] Figure 7 It is a flowchart of a method for generating a deep learning model provided by Embodiment 3 of the present invention. This embodiment provides a specific implementation manner on the basis of the foregoing embodiments.
[0150] Such as Figure 7 As shown, the generation method of the deep learning model provided by this embodiment includes:
[0151] S310. Obtain a business model code.
[0152] S320. Determine whether the business model code uses model parallelism. If yes, execute S330. If not, execute S3100.
[0153] S330. Initialize and acquire parallel configuration parameters of the target model.
[0154] Optionally, the parallel configuration parameters of the target model include: configuration parameters corresponding to the fragmentation mode, and / or configuration parameters corresponding to the pipeline mode, which may specifically include fragmentation size, fragmentation dimension, number of pipeline segments, number of pipeline aggregations, equ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


