torchdistill下載torchdistill源代碼下載

torchdistill

其他源碼

PyTorch 2.5 support, model migrations, end of Python 3.8 support

下載

Torchdistill：用於知識蒸餾的模塊化，配置驅動的框架

Torchdistill （以前是KDKIT ）提供了各種最先進的知識蒸餾方法，並使您可以通過編輯聲明性YAML配置文件而不是Python代碼來設計（新）實驗。即使您需要在教師/學生模型中提取中間表示形式，您也無需重新實現模型，而這些模型通常會更改向前的界面，而是指定YAML文件中的模塊路徑。有關更多詳細信息，請參閱這些論文。

除了知識蒸餾外，該框架還可以幫助您設計和執行一般的深度學習實驗（無需編碼），以進行可再現的深度學習研究。即，它使您可以通過將教師條目排除在聲明性的YAML配置文件中，而無需老師培訓模型。您可以在下面和configs/sample/中找到此類示例。

當您在論文中提到Torchdistill時，請引用這些論文而不是此GitHub存儲庫。
如果您將Torchdistill用作工作的一部分，那麼您的引文將受到讚賞，並激勵我維護和升級此框架！

文件

您可以在https://yoshitomo-matsubara.net/torchdistill/上找到利用Torchdistill的API文檔和研究項目

前向鉤管理器

使用ForwardHookManager ，您可以在模型中提取中間表示，而無需修改其正向函數的接口。
本示例筆記本將使您更好地了解使用諸如知識蒸餾和中間表示的分析。

1個實驗→1個聲明的PYYAML配置文件

在Torchdistill中，許多組件和Pytorch模塊都是抽象的，例如，模型，數據集，優化器，損失等等！您可以在聲明的PYYAML配置文件中定義它們，以便可以將其視為實驗的摘要，在許多情況下，您將根本不需要編寫Python代碼。查看Configs/中可用的一些配置。您會看到哪些模塊抽象了，以及如何在聲明的PYYAML配置文件中定義它們以設計實驗。

如果您想使用此框架使用自己的模塊（模型，損失功能，數據集等），則可以在本地軟件包torchdistill/中編輯代碼/。
有關更多詳細信息，請參見官方文檔和討論。

基準

ILSVRC 2012（Imagenet）的TOP-1驗證精度

例子

可執行代碼可以在示例/中找到

圖像分類：Imagenet（ILSVRC 2012），CIFAR-10，CIFAR-100等
對象檢測：可可2017等
語義細分：Coco 2017，Pascal VOC等
文本分類：膠水等

對於CIFAR-10和CIFAR-100，某些模型將重新實現，並在Torchdistill中作為驗證模型可用。可以在此處找到更多詳細信息。

Hugging Face Model Hub可以使用一些由Torchdistill微調用於膠水任務的變壓器模型。樣品膠基準的結果和詳細信息可以在此處找到。

Google Colab示例

以下示例在演示中可用。請注意，這些示例適用於Google Colab用戶，並且與Amazon Sagemaker Studio Lab兼容。通常，如果您擁有自己的GPU，則示例/將是更好的參考。

CIFAR-10和CIFAR-100

沒有老師模型的培訓
知識蒸餾

膠水

沒有老師模型的微調
知識蒸餾

這些示例寫出了測試預測文件，供您查看膠排行榜系統中的測試性能。

Pytorch樞紐

如果您在支持Pytorch Hub的Pytorch Hub或GitHub存儲庫上找到模型，則只需編輯聲明性的YAML配置文件即可將它們導入教師/學生模型。

例如，如果您使用huggingface/pytorch-image-models（aka timm ）中可用的Resnest-50作為Imagenet數據集的教師模型，則可以通過Pytorch Hub在您的聲明性YAML配置文件中使用以下條目導入該模型。

 models :
  teacher_model :
    key : ' resnest50d '
    repo_or_dir : ' huggingface/pytorch-image-models '
    kwargs :
      num_classes : 1000
      pretrained : True

如何設置

Python> = 3.9
PIPENV（可選）

通過PIP/PIPENV安裝

 pip3 install torchdistill
# or use pipenv
pipenv install torchdistill

從此存儲庫中安裝（不建議）

 git clone https://github.com/yoshitomo-matsubara/torchdistill.git
cd torchdistill/
pip3 install -e .
# or use pipenv
pipenv install "-e ."

問題 /問題 /請求 /拉請求

如果找到錯誤，請隨時創建問題。
如果您有問題或功能請求，請在此處開始新的討論。請搜索問題和討論，並確保尚未解決您的問題/問題/請求。

歡迎拉動請求。請從問題開始，並與我討論解決方案，而不是從拉動請求開始。

引用

如果您在研究中使用Torchdistill ，請引用以下論文：
[紙] [preprint]

 @inproceedings { matsubara2021torchdistill ,
  title = { {torchdistill: A Modular, Configuration-Driven Framework for Knowledge Distillation} } ,
  author = { Matsubara, Yoshitomo } ,
  booktitle = { International Workshop on Reproducible Research in Pattern Recognition } ,
  pages = { 24--44 } ,
  year = { 2021 } ,
  organization = { Springer }
}

[Paper] [OpenReview] [Preprint]

 @inproceedings { matsubara2023torchdistill ,
  title = { {torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP} } ,
  author = { Matsubara, Yoshitomo } ,
  booktitle = { Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023) } ,
  publisher = { Empirical Methods in Natural Language Processing } ,
  pages = { 153--164 } ,
  year = { 2023 }
}

致謝

自2021年11月和2022年6月以來，Travis CI的OSS信貸和Jetbrain的免費許可計劃（開源）已支持該項目。

參考

？ pytorch/vision/參考/分類/
？ Pytorch/Vision/參考/檢測/
？ Pytorch/Vision/參考/分段/
？擁抱面/變壓器/示例/pytorch/文本分類
？ Geoffrey Hinton，Oriol Vinyals，Jeff Dean。 “在神經網絡中提取知識”（深度學習和表示學習研討會：Neurips 2014）
？ Adriana Romero，Nicolas Ballas，Samira Ebrahimi Kahou，Antoine Chassang，Carlo Gatta，Yoshua Bengio。 “ Fitnets：薄深網的提示”（ICLR 2015）
？ Junho Yim，Donggyu Joo，Jihoon Bae，Junmo Kim。 “知識蒸餾的禮物：快速優化，網絡最小化和轉移學習”（CVPR 2017）
？ Sergey Zagoruyko，Nikos Komodakis。 “更多地關注關注：通過注意轉移提高卷積神經網絡的表現”（ICLR 2017）
？ Nikolaos Passalis，Anastasios Tefas。 “通過概率知識轉移學習深層表示”（ECCV 2018）
？ Jangho Kim，Seonguk Park，Nojun Kwak。 “釋義復雜網絡：通過因子傳輸的網絡壓縮”（Neurips 2018）
？ Byeongho Heo，Minsik Lee，Sangdoo Yun，Jin Young Choi。 “通過隱藏神經元形成的激活邊界蒸餾的知識轉移”（AAAI 2019）
？他，Chunhua Shen，Zhi Tian，Dong Gong，Changming Sun，Youliang Yan。 “知識適應有效的語義細分”（CVPR 2019）
？ Wonpyo Park，Dongju Kim，Yan Lu，Minsu Cho。 “關係知識蒸餾”（CVPR 2019）
？ Sungsoo Ahn，Shell Xu Hu，Andreas Damianou，Neil D. Lawrence，Zhenwen Dai。 “知識轉移的變分信息蒸餾”（CVPR 2019）
？ Yoshitomo Matsubara，Sabur Baidya，Davide Callegaro，Marco Levorato，Sameer Singh。 “用於邊緣輔助實時系統的蒸餾拆分深神網絡”（視頻分析和智能邊緣的熱門話題的講習班：Mobicom 2019）
？ Baoyun Peng，Xiao Jin，Jiaheng Liu，Dongsheng Li，Yichao Wu，Yu Liu，Shunfeng Zhou，Zhaoning Zhang Zhang。 “知識蒸餾的相關一致性”（ICCV 2019）
？弗雷德里克·鄧（Frederick Tung），格雷格·莫里（Greg Mori）。 “具有相似性的知識蒸餾”（ICCV 2019）
？永隆，迪利普·克里希南（Dilip Krishnan），菲利普（Phillip） “對比表示蒸餾”（ICLR 2020）
？ Yoshitomo Matsubara，Marco Levorato。 “在挑戰網絡中進行邊緣輔助實時對象檢測的神經壓縮和過濾”（ICPR 2020）
？ Li Yuan，Francis Ehtay，Guilin Li，Tao Wang，Jiashi Feng。 “通過標籤平滑正規化重新訪問知識蒸餾”（CVPR 2020）
？ Guodong Xu，Ziwei Liu，Xiaoxiao Li，Chen Change Loy。 “知識蒸餾符合自學意義”（ECCV 2020）
？ Youcai Zhang，Zhonghao lan，Yuchen Dai，Fangao Zeng，Yan Bai，Jie Chang，Yichen Wei。 “ Prime Awawaweawaptive蒸餾”（ECCV 2020）
？ Pengguang Chen，Shu Liu，Hengshuang Zhao，Jiaya Jia。 “通過知識審查提取知識”（CVPR 2021）
？ Li Liu，Qingle Huang，Sihao Lin，Hongwei Xie，Bing Wang，Xiaojun Chang，Xiaodan Liang。 “探索多樣性保留的知識蒸餾的通道間相關性”（ICCV 2021）
？陶黃（Tao Huang），Shan You，Fei Wang，Chen Qian，Chang Xu。 “從更強大的老師那裡蒸餾”（神經2022）
？ Roy Miles，Krystian Mikolajczyk。 “了解投影儀在知識蒸餾中的作用”（AAAI 2024）
？ Shangquan Sun，Wenqi Ren，Jingzhi Li，Rui Wang，Xioochun Cao。 “知識蒸餾中的logit標準化”（CVPR 2024）

展開

附加信息

版本 PyTorch 2.5 support, model migrations, end of Python 3.8 support
類型其他源碼
更新時間 2025-04-18
大小 3.26MB
來自於 Github

相關應用

Google Dorks

2025-03-10
shepherd

2025-06-04
mongo express

2025-06-04
hidusbf

2025-02-14
Free Algorithms Books

2025-05-29
markdownpedia

2025-04-22

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3

相關資訊全部