Desgin: The memory system for octogen

## Background and Why

Even GPT-4, a well-performing model, can generate outdated code due to outdated training data. However, LLMs have a strong language understanding ability, which can be used to correct these errors through prompts.

1. **To enable LLM to use command-line tools and libraries that are not covered by the training data**. This includes two cases:
    * Tools and code repositories don't being included  in the training data at all.
    * The model is trained on data from outdated tools or libraries, which means that the model cannot use them right
2. **To prevent LLM from repeating the same mistakes**. When a LLM uses an incorrect code usage or tool, it will always repeat the incorrect code and then recall the correct result through long-term memory before executing the code. However, this can be improved by using short-term memory to store the correct tool or code usage  in the instructions.

## Desgin

![image](https://github.com/dbpunk-labs/octogen/assets/8623385/88c2765f-5a08-4885-b490-c5e5cf24694c)


## Other Memory Desgin

1. https://arxiv.org/pdf/2310.08560.pdf this paper can provide some data proof for the desgin
2. https://arxiv.org/abs/1909.09436

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Desgin: The memory system for octogen #165

Background and Why

Desgin

Other Memory Desgin

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Desgin: The memory system for octogen #165

Description

Background and Why

Desgin

Other Memory Desgin

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions