Scaling Laws for Language Transfer
1.0.0
Code and models from the blog post Scaling Laws for Language Transfer Learning.
Building on Scaling Laws for Transfer (Hernandez et al., 2021), my experiments explore fine-tuning on non-English languages and try to answer the question: how much does pre-training on English help when transferring across different languages as we vary the dataset size and model size?
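For context, a minimal sketch of the kind of power-law fit used in the Hernandez et al. framing of "effective data transferred", D_T = k * (D_F)^alpha * (N)^beta, where D_F is the fine-tuning dataset size and N is the model size. This is not the repo's actual fitting code, and the data points below are placeholders for illustration only:

```python
# Illustrative sketch: fit D_T = k * D_F**alpha * N**beta in log space.
# The measurements below are placeholders, not results from this repo.
import numpy as np
from scipy.optimize import curve_fit

def log_power_law(x, log_k, alpha, beta):
    """log(D_T) as a linear function of log(D_F) and log(N)."""
    log_df, log_n = x
    return log_k + alpha * log_df + beta * log_n

# Placeholder data: (fine-tuning tokens, model parameters, effective data transferred)
d_f = np.array([1e6, 1e7, 1e8, 1e6, 1e7, 1e8])
n   = np.array([1e7, 1e7, 1e7, 1e8, 1e8, 1e8])
d_t = np.array([2e6, 9e6, 4e7, 5e6, 2e7, 1e8])

params, _ = curve_fit(log_power_law, (np.log(d_f), np.log(n)), np.log(d_t))
log_k, alpha, beta = params
print(f"k={np.exp(log_k):.3g}, alpha={alpha:.2f}, beta={beta:.2f}")
```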
This repo contains the code for:
All English pre-trained models were trained for 26 billion tokens with no repeats: