Systems and methods for improved optimization of machine-learned models
A machine learning model and learning rate technology, applied in the field of machine learning, can solve the problems of reducing generalization ability and speeding up
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0046] 1 Overview
[0047] Generally, the present disclosure relates to systems and methods for improving the optimization of machine learning models. Specifically, the present disclosure provides a stochastic optimization algorithm that is faster than a widely used algorithm for a fixed amount of calculation, and can also scale significantly better as more computing resources become available. Random optimization algorithms can be used with large batch sizes. As an example, in some embodiments, the system and method of the present disclosure can implicitly calculate the inverse Hessian of each mini-batch training data to generate the descent direction. This can be done without an explicit approximation to the Hessian or Hessian vector product. An example experiment is provided to train large image net models (for example, Inception-V3, Resnet-50, Resnet-101, and Inception-Resnet-V2) by successfully using small batch sizes of up to 32,000, which are not compared to the current...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



