This repository contains the code for reproducing the results reported in the following paper:
Orhan AE (2023) Recognition, recall, and retention of few-shot memories in large language models. arXiv:2303.17557.
The repository contains three Python files, `train.py`, `test.py`, and `generate.py` (all modified from the Huggingface causal language modeling example here), which are used to train (or finetune) a model, run a recognition test, and run a recall test, respectively.
Some usage examples for these files are given below.
Finetune the gpt-j-6B model with the study sentences in `seen_data_0.json` for 1 epoch (1 exposure) on 4 GPUs (with a total batch size of 4x4=16 sentences) using the Huggingface Accelerate framework (see the example config file here):

```sh
accelerate launch --config_file accelerate_config.yaml --num_cpu_threads_per_process 4 train.py \
    --model_name_or_path "EleutherAI/gpt-j-6B" \
    --train_file "data/llm-experiment-data/expt1/seen_data_0.json" \
    --per_device_train_batch_size 4 \
    --learning_rate 0.00001 \
    --output_dir OUTPUT_DIR \
    --save_prefix INFORMATIVE_SAVE_PREFIX \
    --block_size 128 \
    --num_train_epochs 1 \
    --overwrite_cache
```
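The easiest way to see the schema of these data files is to open one, but since `train.py` is adapted from the Huggingface causal language modeling example, a reasonable assumption is one JSON record per line with a `text` field. Here is a minimal Python sketch for preparing a study file of your own under that assumption (the file name and sentences below are placeholders, not files from this repository):

```python
import json

# Hypothetical study sentences; the actual study files live under
# data/llm-experiment-data/. We assume the JSON-record format with a
# "text" field used by the Huggingface causal LM examples.
sentences = [
    "The first study sentence goes here.",
    "The second study sentence goes here.",
]

# Write one JSON record per line (a format the `datasets` json loader reads).
with open("my_seen_data.json", "w") as f:
    for s in sentences:
        f.write(json.dumps({"text": s}) + "\n")
```

The resulting file could then be passed to `--train_file` in the command above.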
Run a recognition test with the study sentences in `seen_data_0.json` and the foils in `unseen_data_0.json`:

```sh
python -u test.py \
    --model_name_or_path MODEL_PATH \
    --seen_file "data/llm-experiment-data/expt1/seen_data_0.json" \
    --unseen_file "data/llm-experiment-data/expt1/unseen_data_0.json" \
    --per_device_eval_batch_size 1 \
    --output_dir OUTPUT_DIR \
    --save_prefix INFORMATIVE_SAVE_PREFIX \
    --block_size 128 \
    --overwrite_cache
```
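For intuition, the recognition test boils down to comparing how probable the model finds a studied sentence versus a matched foil. Below is a minimal sketch of that idea, not the actual `test.py` implementation; the `gpt2` checkpoint and the two sentences are placeholders:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder checkpoint: swap in your finetuned model directory.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def sentence_loss(sentence):
    # Mean negative log-likelihood the model assigns to the sentence.
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        out = model(**inputs, labels=inputs["input_ids"])
    return out.loss.item()

seen = sentence_loss("A sentence the model was finetuned on.")
unseen = sentence_loss("A matched foil the model never saw.")
# A lower loss on the studied sentence counts as recognizing it as "old".
print(seen < unseen)
```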
Run a recall test with the study sentences in `seen_data_0.json`:

```sh
python -u generate.py \
    --model_name_or_path MODEL_PATH \
    --seen_file "data/llm-experiment-data/expt1/seen_data_0.json" \
    --per_device_eval_batch_size 1 \
    --output_dir OUTPUT_DIR \
    --save_prefix INFORMATIVE_SAVE_PREFIX \
    --block_size 128 \
    --overwrite_cache
```
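Similarly, the recall test amounts to cueing the model with the beginning of a studied sentence and checking whether it regenerates the rest. A minimal sketch of the idea (again not the actual `generate.py` implementation; the checkpoint and sentence are placeholders):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder checkpoint: swap in your finetuned model directory.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

studied = "A sentence the model was finetuned on during the study phase."
prompt = studied[: len(studied) // 2]  # first half serves as the retrieval cue

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    output_ids = model.generate(
        **inputs,
        max_new_tokens=20,
        do_sample=False,  # greedy decoding
        pad_token_id=tokenizer.eos_token_id,
    )

# Recall succeeds if the generated continuation matches the studied one.
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```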
The `scripts` folder contains SLURM scripts for reproducing all experiments reported in the paper using these three files. The `data` folder contains all of the experimental data, and the `utils` folder contains the utility functions that were used to generate these data. The results of all recognition, recall, and retention experiments reported in the paper are available from this Huggingface dataset repository.