Text generation device and text generation model training method through removal of target word noise
A three-stage training process for Large Language Models enhances performance by learning domain and task knowledge through targeted noise removal, addressing underperformance issues in existing LLMs.
Patent Information
- Authority / Receiving Office
- WO · WO
- Patent Type
- Applications
- Current Assignee / Owner
- UNIVERSITY INDUSTRY COOPERATION GROUP OF KYUNG HEE UNIVERSITY
- Filing Date
- 2025-11-26
- Publication Date
- 2026-06-18
AI Technical Summary
Existing Large Language Models (LLMs) face underperformance in specific tasks and domains due to a mismatch between pre-training and fine-tuning objectives, lacking sufficient domain and task knowledge.
A three-stage training process involving pre-training, target word denoising (TWD), and fine-tuning is employed to enhance the text generation model's performance by learning domain and task knowledge through post-training on a given dataset, using input and output text with targeted noise removal.
The method reduces training costs and improves model performance by focusing on domain and task-specific knowledge acquisition, outperforming existing models in various text generation tasks.
Smart Images

Figure KR2025019773_18062026_PF_FP_ABST