gpt 2 tensorflow2.0 ดาวน์โหลด - gpt 2 tensorflow2.0 ซอร์สโค้ดดาวน์โหลดดาวน์โหลด

gpt 2 tensorflow2.0

ซอร์สโค้ดอื่น ๆ

1.0.0

ดาวน์โหลด

GPT-2 การฝึกอบรมก่อนและการสร้างข้อความนำไปใช้ใน TensorFlow 2.0

ดำเนินการครั้งแรกใน TensorFlow 1.14 โดย Oapenai:- "OpenAI/GPT-2" Openai GPT-2 Paper:-"โมเดลภาษาเป็นผู้เรียนมัลติทาสก์ที่ไม่ได้รับการดูแล"

** ที่เก็บนี้มีการใช้งานการฝึกอบรมก่อนการฝึกอบรมและการสร้างลำดับใน TensorFlow 2.0, **

ความต้องการ

Python> = 3.6
setuptools == 41.0.1
ftfy == 5.6
tqdm == 4.32.1
คลิก == 7.0
sentencepiece == 0.1.83
tensorflow-gpu == 2.3.0
numpy == 1.16.4

การตั้งค่า

 $ git clone https://github.com/akanyaani/gpt-2-tensorflow2.0
$ cd gpt-2-tensorflow2.0
$ pip install -r requirements.txt

คุณสามารถฝึกอบรมรุ่นก่อนโดยใช้ข้อมูลตัวอย่างที่มีอยู่ในที่เก็บหรือคุณสามารถดาวน์โหลดข้อมูลโดยใช้ gitHub repo https://github.com/eukaryote31/openwebtext

รูปแบบการฝึกอบรมล่วงหน้าเกี่ยวกับข้อมูลตัวอย่างที่มีอยู่ในที่เก็บข้อมูล

 $ python pre_process.py --help

Options:
  --data-dir TEXT        training data path  [default: /data/scraped]
  --vocab-size INTEGER   byte pair vocab size  [default: 24512]
  --min-seq-len INTEGER  minimum sequence length  [default: 15]
  --max-seq-len INTEGER  maximum sequence length  [default: 512]
  --help                 Show this message and exit.
  
  
>> python pre_process.py

รูปแบบการฝึกอบรมล่วงหน้าบน OpenWebText หรือข้อมูลอื่น ๆ

 >> python pre_process.py --data-dir=data_directory --vocab-size=32000

 $ python train_gpt2.py --help

Options:
  --num-layers INTEGER      No. of decoder layers  [default: 8]
  --embedding-size INTEGER  Embedding size  [default: 768]
  --num-heads INTEGER       Number of heads  [default: 8]
  --dff INTEGER             Filter Size  [default: 3072]
  --max-seq-len INTEGER     Seq length  [default: 515]
  --vocab-size INTEGER      Vocab size  [default: 24512]
  --optimizer TEXT          optimizer type  [default: adam]
  --batch-size INTEGER      batch size  [default: 8]
  --learning-rate FLOAT     learning rate  [default: 0.001]
  --graph-mode BOOLEAN      TF run mode  [default: False]
  --distributed BOOLEAN     distributed training  [default: False]
  --help                    Show this message and exit.
  
  
>> python train_gpt2.py 
  --num-layers=8 
  --num-heads=8 
  --dff=3072 
  --embedding-size=768 
  --batch-size=32 
  --learning-rate=5e-5
  --graph-mode=True

การฝึกอบรมแบบกระจายใน GPU หลายตัว

 >> python train_gpt2.py 
  --num-layers=8 
  --num-heads=8 
  --dff=3072 
  --embedding-size=768 
  --batch-size=32 
  --learning-rate=5e-5 
  --distributed=True 
  --graph-mode=True

เริ่ม Tensorboard ผ่านบรรทัดคำสั่ง

 $ tensorboard --logdir /log

หลังจากเตรียมโมเดลของคุณแล้วคุณสามารถสร้างลำดับโดยให้บริบทบางอย่างกับโมเดล เปิดสมุดบันทึกนี้และโหลดโมเดลที่ผ่านการฝึกอบรมและผ่านบริบทเพื่อสร้างแบบจำลองมันจะส่งคืนลำดับที่สร้างขึ้น

 $ sequence_generator.ipynb

สิ่งที่ต้องทำ

 1. Parallel Preprocessing.
2. Shared weights across layers.
3. Factorized embedding.
4. Fine-Tuning wrapper.

ข้อมูลอ้างอิง:

"openai/gpt-2"
"HuggingFace Pytorch-Transformers"
"Tensorflow Transformers"
"The Illustrated GPT-2"

ผลงาน

ปัญหาและ PRS ของคุณยินดีต้อนรับเสมอ

ผู้เขียน

Abhay Kumar
อีเมลผู้แต่ง: [email protected]
ติดตามฉันบน Twitter

ใบอนุญาต

มิกซ์

กราฟการคำนวณของรุ่น GPT-2

ขยาย

ข้อมูลเพิ่มเติม

เวอร์ชัน 1.0.0
ประเภท ซอร์สโค้ดอื่น ๆ
เวลาอัปเดต 2025-03-11
ขนาด 1.79MB
มาจาก Github

แอปที่เกี่ยวข้อง

iHuzu CMS v1.0 เบต้า 12

2024-11-14
Horror Ice Cream เวอร์ชั่น 82.0 เมนูที่ถูกแฮ็ก

2024-06-21
ภาพยนตร์และโทรทัศน์ GPT เวอร์ชันล่าสุด

2023-10-30
เวิร์ดเพรส v5.0 เบต้า 3

2022-07-05
จินบล็อก เวอร์ชั่น 2.0

2022-06-01
Oauth2.0 แพลตฟอร์มการเข้าสู่ระบบแบบรวม v1.2

2022-05-28

แนะนำสำหรับคุณ

chat.petals.dev

ซอร์สโค้ดอื่น ๆ

1.0.0
GPT Prompt Templates

ซอร์สโค้ดอื่น ๆ

1.0.0
GPTyped

ซอร์สโค้ดอื่น ๆ

GPTyped 1.0.5
Google Dorks

ซอร์สโค้ดอื่น ๆ

1.0
shepherd

ซอร์สโค้ดอื่น ๆ

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

ซอร์สโค้ดอื่น ๆ

v1.1.0-rc-3
Google Dorks

ซอร์สโค้ดอื่น ๆ

1.0
shepherd

ซอร์สโค้ดอื่น ๆ

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

ซอร์สโค้ดอื่น ๆ

v1.1.0-rc-3

ข้อมูลที่เกี่ยวข้อง ทั้งหมด