Code for the paper "Unit Scaling: Out-of-the-Box Low-Precision Training".
We'd like weights, activations & gradients to all have unit variance at initialisation. To achieve this, we introduce separate scaling factors for activations in the forward pass and for gradients in the backward pass.
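For example, separate forward and backward scales can be attached to an op via a custom gradient. The following is a minimal, illustrative sketch in TensorFlow (the `scaled` helper and its `fwd_scale`/`bwd_scale` arguments are hypothetical names, not the code used in this repository):

```python
import tensorflow as tf


def scaled(x, fwd_scale=1.0, bwd_scale=1.0):
    """Multiply `x` by `fwd_scale` in the forward pass and scale the
    incoming gradient by `bwd_scale` in the backward pass."""

    @tf.custom_gradient
    def _scaled(inner_x):
        def grad(dy):
            # Gradient w.r.t. inner_x, rescaled independently of the forward scale.
            return dy * bwd_scale

        return inner_x * fwd_scale, grad

    return _scaled(x)


# Illustrative use on a projection; the scale values here are examples only.
x = tf.random.normal([32, 256])
w = tf.random.normal([256, 512])
y = scaled(tf.matmul(x, w), fwd_scale=1 / 256 ** 0.5, bwd_scale=1 / 512 ** 0.5)
```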
This repository contains our code for experiments on character-level language modelling, along with a demo notebook.
This code has been tested on Poplar SDK 3.1.0+1205.
```bash
python3 -m venv .venv
# Append to .venv/bin/activate:
# source PATH/TO/POPLAR_SDK/enable
source .venv/bin/activate
pip install wheel
pip install $POPLAR_SDK_ENABLED/../tensorflow-2.6.3+gc3.1.0+246224+2b7af067dae+amd_znver1-cp38-cp38-linux_x86_64.whl
pip install $POPLAR_SDK_ENABLED/../keras-2.6.0+gc3.1.0+246230+88e2debf-py2.py3-none-any.whl
pip install -r requirements.txt
```
To run a single experiment:

```bash
python run_experiment.py
```

Our test result sweeps are described by run_sweep.py. By default, this assumes the data is under /home/research-datasets/wikitext103_raw (train.txt, valid.txt, test.txt) and that the user is logged in to WandB.
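Before launching a sweep, you may wish to check that this layout is in place. The snippet below is a hypothetical sanity check, not part of the repository; it only uses the default path and filenames mentioned above:

```python
from pathlib import Path

# Default dataset location assumed by run_sweep.py (see above).
data_dir = Path("/home/research-datasets/wikitext103_raw")
missing = [f for f in ("train.txt", "valid.txt", "test.txt") if not (data_dir / f).is_file()]
if missing:
    raise FileNotFoundError(f"Expected WikiText-103 raw files not found in {data_dir}: {missing}")
```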
To launch a sweep:

```bash
python run_sweep.py
```

Copyright (c) 2023 Graphcore Ltd. Licensed under the MIT License.
The included code is released under an MIT license (see LICENSE).
Our dependencies are:
| Component | About | License |
|---|---|---|
| WandB | Weights and Biases client library, for optional logging to wandb servers | MIT |
We use additional Python dependencies for development and testing (see requirements-dev.txt).
The WikiText-103 dataset is licensed under the Creative Commons Attribution-ShareAlike License.