FLEXIBLE AND ADAPTABLE LLM’s WITH CONTINUOUS SELF TRAINING

post by Escaque 66 (escaque-66) · 2024-06-19T14:17:32.109Z · LW · GW · 0 comments

Contents

  CURRENT UNFLEXIBLE MODELS
  PROPOSAL FOR DYNAMIC KNOWLEDGE ACQUISITION
  Automated identification of limitations or errors 
  Automated generation of training data to correct identified issues
  Short training cycles to integrate new data to the model
  BENEFITS AND DRAWBACKS OF THIS APPROACH
  Benefits:
  Drawbacks:
None
No comments

CURRENT UNFLEXIBLE MODELS

Current LLMs have serious limitations in adaptability, flexibility, and continuous learning. The knowledge and world model contained in the LLM’s parameters is fixed at the time of training completion. Interactions during inference time do not influence the model's knowledge:

In all these cases, an incorrect answer from the model will repeat time and again because the model is fixed. 

These limitations cause incorrect answers to be repeated since the model is static. Current solutions attempt to address these issues but have drawbacks:

 

PROPOSAL FOR DYNAMIC KNOWLEDGE ACQUISITION

We propose a more dynamic approach to knowledge acquisition, automating the incorporation of new pieces of knowledge through three main processes.

  1. Automated identification of limitations or errors.
  2. Automated generation of training data to correct identified issues.
  3. Short training cycles to integrate new data to the model.

 

Automated identification of limitations or errors 

The model can identify limitations or errors through various methods:

When the model realizes that there is a new piece of knowledge that should be incorporated into the model, it is stored for the next process.

 

Automated generation of training data to correct identified issues

We have already seen that a model can be trained to produce good training data for other models, whether it is for a general purpose LLM or for a physical device (Eureka).

Given the identified piece of knowledge to incorporate, the training data to fix that knowledge into the model can be automatically generated.

The atomization of knowledge elements to incorporate will limit the training dataset size.

 

Short training cycles to integrate new data to the model

With the training data automatically generated, periodic and frequent training cycles will update the main model to incorporate new knowledge efficiently. 

 

 

BENEFITS AND DRAWBACKS OF THIS APPROACH

Benefits:

In short, the models will learn from their interactions, as well as any other intelligent creature does in nature. 

 

Drawbacks:

 

In summary, the changing nature of the model will require continuous checks to guarantee its security and alignment. 

0 comments

Comments sorted by top scores.