HierarchyTransformers
v0.1.1 - Refactor code and add customised HiT trainer
Project | Hugging Face | arXiv | Zenodo
Embedding hierarchies with language models.
News (Changelog)
v0.1.0 - Significant development to align with sentence-transformers>=3.4.0.dev0.
v0.0.3 - Initial release (with sentence-transformers<3.0.0) and bug fixes.

Hierarchy Transformer (HiT) is a framework that enables transformer encoder-based language models (LMs) to learn hierarchies in hyperbolic space. The main idea is to construct a Poincaré ball that directly circumscribes the output embedding space of the LMs, leveraging the exponential expansion of hyperbolic space to organise entities hierarchically. In addition to presenting this framework (see the code on GitHub), we are committed to training and releasing HiT models for various hierarchies. Models and datasets will be accessible on HuggingFace.
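As a minimal sketch of this geometric setup (assuming curvature c = 1/d, which gives a ball of radius √d circumscribing a d-dimensional embedding space; see the released code for the exact configuration), such a manifold can be constructed with geoopt:

import geoopt
import torch

embed_dim = 384  # illustrative; e.g. the output dimension of a MiniLM encoder
# a Poincaré ball of radius sqrt(embed_dim) that circumscribes the
# LM's output embedding space (a sketch, not the released implementation)
manifold = geoopt.PoincareBall(c=1.0 / embed_dim)

# hyperbolic distance grows rapidly towards the boundary, which is what
# lets deeper (more specific) entities fan out without crowding
x = manifold.projx(torch.randn(embed_dim))
y = manifold.projx(torch.randn(embed_dim))
print(manifold.dist(x, y))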
This repository follows a layout similar to the sentence-transformers library, and the main model class directly extends the Sentence Transformer architecture. We also use deeponto for extracting hierarchies from source data and constructing datasets from hierarchies, and geoopt for arithmetic in hyperbolic space.
sentence-transformers==3.3.1 contains bugs in evaluation that are fixed in its GitHub dev version sentence-transformers==3.4.0.dev0; please update this dependency manually until the official 3.4.0 release.
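Until then, the dev version can be installed directly from GitHub with a standard pip-from-git command, e.g.:

pip install git+https://github.com/UKPLab/sentence-transformers.git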
# requiring Python>=3.9
pip install hierarchy_transformers

Or install from the GitHub repository:

pip install git+https://github.com/KRR-Oxford/HierarchyTransformers.git

Our HiT models and datasets are released on the HuggingFace Hub.
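Datasets can be loaded with the HuggingFace datasets library. A minimal sketch, assuming a hypothetical repository name (check the Hierarchy-Transformers organisation on the Hub for the exact datasets):

from datasets import load_dataset

# hypothetical dataset repository name, for illustration only
dataset = load_dataset("Hierarchy-Transformers/WordNetNoun")

To get started with a released HiT model: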
from hierarchy_transformers import HierarchyTransformer
# load the model
model = HierarchyTransformer.from_pretrained('Hierarchy-Transformers/HiT-MiniLM-L12-WordNetNoun')
# entity names to be encoded.
entity_names = ["computer", "personal computer", "fruit", "berry"]
# get the entity embeddings
entity_embeddings = model.encode(entity_names)

Use the entity embeddings to predict the subsumption relationships between them.
# suppose we want to compare "personal computer" and "computer", "berry" and "fruit"
child_entity_embeddings = model.encode(["personal computer", "berry"], convert_to_tensor=True)
parent_entity_embeddings = model.encode(["computer", "fruit"], convert_to_tensor=True)
# compute the hyperbolic distances and norms of entity embeddings
dists = model.manifold.dist(child_entity_embeddings, parent_entity_embeddings)
child_norms = model.manifold.dist0(child_entity_embeddings)
parent_norms = model.manifold.dist0(parent_entity_embeddings)
# use the empirical function for subsumption prediction proposed in the paper
# `centri_score_weight` and the overall threshold are determined on the validation set
subsumption_scores = - (dists + centri_score_weight * (parent_norms - child_norms))

Use the example scripts in our repository to reproduce existing models and to train and evaluate your own.
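To make the probing step above concrete, here is a minimal sketch of turning the scores into binary predictions; the weight and threshold values below are placeholders, not values from the paper (both are determined on the validation set):

# placeholder values; tune both on the validation set
centri_score_weight = 1.0
threshold = -5.0

subsumption_scores = -(dists + centri_score_weight * (parent_norms - child_norms))
# scores above the threshold are predicted as valid subsumptions
predictions = subsumption_scores > threshold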
Copyright 2023 Yuan He.
All rights reserved.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
If you find this repository or the released models useful, please cite our publication:
Yuan He, Zhangdie Yuan, Jiaoyan Chen, and Ian Horrocks. Language Models as Hierarchy Encoders. To appear at NeurIPS 2024. [arXiv] [NeurIPS]
@article{he2024language,
title={Language Models as Hierarchy Encoders},
author={He, Yuan and Yuan, Zhangdie and Chen, Jiaoyan and Horrocks, Ian},
journal={arXiv preprint arXiv:2401.11374},
year={2024}
}