Task-oriented training method and use method of generative large language model
By constructing functional description document samples of tool applications and a controllable operating environment, and combining reinforcement learning to train a generative large language model, the problem of isolation between LLM and tool applications is solved, and the training efficiency and accuracy of the model are improved.
CN116756564BActive Publication Date: 2026-06-19APOLLO INTELLIGENT CONNECTIVITY (BEIJING) TECH CO LTD
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- APOLLO INTELLIGENT CONNECTIVITY (BEIJING) TECH CO LTD
- Filing Date
- 2023-05-29
- Publication Date
- 2026-06-19
Smart Images

Figure CN116756564B_ABST
Abstract
This disclosure provides a method for training and using a generative large language model oriented towards task solving, relating to artificial intelligence technologies such as generative large language models and unsupervised training. The method includes: training a general generative large language model based on a first training sample constructed from different tool applications and corresponding functional description documents, obtaining a first model; in a controllable operating environment integrating various tool applications, controlling the first model to automatically explore and invoke multiple tool applications to complete the user needs represented by the preset user input, obtaining the actual execution results output by the first model; updating the first model using reinforcement learning based on the similarity between the standard execution results and the actual execution results, obtaining a second model; training the second model using reinforcement learning based on human feedback, thereby obtaining a target generative large language model that can accurately invoke tool applications to solve user tasks.
Need to check novelty before this filing date? Find Prior Art