mixture of experts
1.0.0
This repository contains a PyTorch re-implementation of the sparsely-gated Mixture-of-Experts (MoE) layer described in the paper "Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer" (Shazeer et al., 2017). Example usage:
from moe import MoE
import torch

# instantiate the MoE layer
model = MoE(input_size=1000, output_size=20, num_experts=10, hidden_size=66, k=4, noisy_gating=True)
X = torch.rand(32, 1000)

# train
model.train()
# forward
y_hat, aux_loss = model(X)

# evaluation
model.eval()
y_hat, aux_loss = model(X)
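The forward pass returns both the model output and an auxiliary load-balancing loss, which should be added to the task loss during training. Below is a minimal sketch of one training step on the dummy batch above; the cross-entropy criterion and Adam optimizer are illustrative assumptions, not necessarily what example.py uses:

import torch
from moe import MoE

model = MoE(input_size=1000, output_size=20, num_experts=10, hidden_size=66, k=4, noisy_gating=True)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # illustrative optimizer choice
X = torch.rand(32, 1000)
target = torch.randint(0, 20, (32,))  # dummy class labels

model.train()
y_hat, aux_loss = model(X)
# add the auxiliary loss so the gate learns to balance load across experts;
# cross-entropy here assumes y_hat are raw scores -- adjust if the experts already apply a softmax
loss = torch.nn.functional.cross_entropy(y_hat, target) + aux_loss
optimizer.zero_grad()
loss.backward()
optimizer.step()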
To install the requirements, run:
pip install -r requirements.txt
The file example.py contains a minimal working example that illustrates how to train and evaluate the MoE layer with dummy inputs and targets. To run the example:
python example.py
The file cifar10_example.py contains a minimal working example on the CIFAR-10 dataset. With arbitrary hyper-parameters and without full convergence, it reaches 39% accuracy. To run the example:
python cifar10_example.py
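The sketch below illustrates one way the MoE layer could be plugged into a CIFAR-10 training loop, by flattening each 3x32x32 image into a 3072-dimensional vector; the data pipeline, hyper-parameters, and loss are illustrative assumptions and may differ from what cifar10_example.py does:

import torch
import torchvision
import torchvision.transforms as transforms
from moe import MoE

# CIFAR-10 images are 3x32x32, i.e. 3072 features once flattened
trainset = torchvision.datasets.CIFAR10(root='./data', train=True, download=True, transform=transforms.ToTensor())
trainloader = torch.utils.data.DataLoader(trainset, batch_size=64, shuffle=True)

net = MoE(input_size=3072, output_size=10, num_experts=10, hidden_size=256, k=4, noisy_gating=True)
optimizer = torch.optim.Adam(net.parameters(), lr=1e-3)  # illustrative optimizer choice

net.train()
for inputs, labels in trainloader:
    inputs = inputs.view(inputs.size(0), -1)  # flatten images to (batch, 3072)
    y_hat, aux_loss = net(inputs)
    # combine the task loss with the auxiliary load-balancing loss
    loss = torch.nn.functional.cross_entropy(y_hat, labels) + aux_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()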
This implementation was used as the reference PyTorch implementation for single-GPU training in FastMoE: A Fast Mixture-of-Expert Training System.
The code is based on the TensorFlow implementation, which can be found here. To cite this repository:
@misc{rau2019moe,
title={Sparsely-gated Mixture-of-Experts PyTorch implementation},
author={Rau, David},
journal={https://github.com/davidmrau/mixture-of-experts},
year={2019}
}