PyTorch NLP下載PyTorch NLP源代碼下載

PyTorch NLP

其他源碼

Python 3.5 Support, Sampler Pipelining,

下載

？現在存檔了嗎？

隨著Pytorch工具鏈的成熟，是時候像這樣歸檔存儲庫了。您將能夠找到此工具包的每個部分的更多開發選項：

擁抱面部數據集（數據集）
擁抱面象kenizer（編碼器）
擁抱面孔指標（指標）
pytorch datapipes（下載＆採樣器）
擁抱臉部嵌入（單詞向量）
Pytorch NN（NN）
pytorch torchtext（多合一）

開發快樂！

如果有人想不構建此回購併繼續開發它，請隨時與我聯繫。您可以通過“ petrochukm [at] gmail.com”與我聯繫。

Pytorch自然語言處理（NLP）的基本實用程序

pytorch-nlp或用於簡稱的torchnlp ，是Pytorch NLP的基本實用程序庫。 torchnlp擴展了Pytorch，為您提供基本的文本數據處理功能。

Chloe Yeo徽標，Wellsaid Labs公司贊助

安裝？

確保您的Python 3.6+和Pytorch 1.0+。然後，您可以使用PIP安裝pytorch-nlp ：

 pip install pytorch - nlp

或通過以下方式安裝最新代碼：

 pip install git + https : // github . com / PetrochukM / PyTorch - NLP . git

文件

Pytorch-NLP的完整文檔可通過我們的ReadThedocs網站獲得。

開始

在NLP數據管道中，您需要實現以下基本步驟：

1。加載數據？

加載IMDB數據集，例如：

 from torchnlp . datasets import imdb_dataset

# Load the imdb training dataset
train = imdb_dataset ( train = True )
train [ 0 ]  # RETURNS: {'text': 'For a movie that gets..', 'sentiment': 'pos'}

加載自定義數據集，例如：

 from pathlib import Path

from torchnlp . download import download_file_maybe_extract

directory_path = Path ( 'data/' )
train_file_path = Path ( 'trees/train.txt' )

download_file_maybe_extract (
    url = 'http://nlp.stanford.edu/sentiment/trainDevTestTrees_PTB.zip' ,
    directory = directory_path ,
    check_files = [ train_file_path ])

open ( directory_path / train_file_path )

不用擔心，我們會為您處理緩存！

2。張量的文字

令牌化並將文本編碼為張量。

例如，每當遇到一個空格字符時， WhitespaceEncoder都會將文本分解為令牌。

 from torchnlp . encoders . text import WhitespaceEncoder

loaded_data = [ "now this ain't funny" , "so don't you dare laugh" ]
encoder = WhitespaceEncoder ( loaded_data )
encoded_data = [ encoder . encode ( example ) for example in loaded_data ]

3。張量

掌握了加載和編碼的數據，您需要批量數據集。

 import torch
from torchnlp . samplers import BucketBatchSampler
from torchnlp . utils import collate_tensors
from torchnlp . encoders . text import stack_and_pad_tensors

encoded_data = [ torch . randn ( 2 ), torch . randn ( 3 ), torch . randn ( 4 ), torch . randn ( 5 )]

train_sampler = torch . utils . data . sampler . SequentialSampler ( encoded_data )
train_batch_sampler = BucketBatchSampler (
    train_sampler , batch_size = 2 , drop_last = False , sort_key = lambda i : encoded_data [ i ]. shape [ 0 ])

batches = [[ encoded_data [ i ] for i in batch ] for batch in train_batch_sampler ]
batches = [ collate_tensors ( batch , stack_tensors = stack_and_pad_tensors ) for batch in batches ]

Pytorch-NLP在Pytorch現有的torch.utils.data.sampler ， torch.stack和default_collate e上構建，以支持長度不同的順序輸入！

4。培訓和推理

借助批次，您可以使用Pytorch使用梯度下降來開發和訓練模型。例如，查看此示例代碼，以培訓斯坦福大學自然語言推斷（SNLI）語料庫。

最後但並非最不重要的

Pytorch-NLP還有更多專注於NLP的實用程序軟件包，以支持您！？

確定性功能

現在，您已經設置了管道，您可能需要確保某些功能確定運行。用fork_rng包裝任何隨機的代碼，您會很好，就像這樣：

 import random
import numpy
import torch

from torchnlp . random import fork_rng

with fork_rng ( seed = 123 ):  # Ensure determinism
    print ( 'Random:' , random . randint ( 1 , 2 ** 31 ))
    print ( 'Numpy:' , numpy . random . randint ( 1 , 2 ** 31 ))
    print ( 'Torch:' , int ( torch . randint ( 1 , 2 ** 31 , ( 1 ,))))

這將始終打印：

 Random: 224899943
Numpy: 843828735
Torch: 843828736

預訓練的單詞向量

現在您已經計算出詞彙，您可能需要使用預訓練的單詞向量來設置嵌入，例如：

 import torch
from torchnlp . encoders . text import WhitespaceEncoder
from torchnlp . word_to_vector import GloVe

encoder = WhitespaceEncoder ([ "now this ain't funny" , "so don't you dare laugh" ])

vocab_set = set ( encoder . vocab )
pretrained_embedding = GloVe ( name = '6B' , dim = 100 , is_include = lambda w : w in vocab_set )
embedding_weights = torch . Tensor ( encoder . vocab_size , pretrained_embedding . dim )
for i , token in enumerate ( encoder . vocab ):
    embedding_weights [ i ] = pretrained_embedding [ token ]

神經網絡層

例如，從神經網絡軟件包中，應用最新的LockedDropout ：

 import torch
from torchnlp . nn import LockedDropout

input_ = torch . randn ( 6 , 3 , 10 )
dropout = LockedDropout ( 0.5 )

# Apply a LockedDropout to `input_`
dropout ( input_ ) # RETURNS: torch.FloatTensor (6x3x10)

指標

計算常見的NLP指標，例如BLEU評分。

 from torchnlp . metrics import get_moses_multi_bleu

hypotheses = [ "The brown fox jumps over the dog 笑" ]
references = [ "The quick brown fox jumps over the lazy dog 笑" ]

# Compute BLEU score with the official BLEU perl script
get_moses_multi_bleu ( hypotheses , references , lowercase = True )  # RETURNS: 47.9

幫助❓

也許查看較長的示例可能會對您有幫助examples/

需要更多幫助嗎？我們很高興通過吉特聊天回答您的問題

貢獻

我們發布了Pytorch-NLP，因為我們發現Pytorch中NLP缺乏基本工具包。我們希望其他組織可以從該項目中受益。我們感謝社區的任何貢獻。

貢獻指南

閱讀我們的貢獻指南，以了解我們的開發過程，如何提出錯誤的文件和改進以及如何構建和測試您對Pytorch-NLP的更改。

作者

邁克爾·佩特羅古克（Michael Petrochuk） - 開發人員
Chloe Yeo - 徽標設計

引用

如果您發現Pytorch-NLP對學術出版物有用，請使用以下Bibtex引用：

 @misc{pytorch-nlp,
  author = {Petrochuk, Michael},
  title = {PyTorch-NLP: Rapid Prototyping with PyTorch Natural Language Processing (NLP) Tools},
  year = {2018},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {url{https://github.com/PetrochukM/PyTorch-NLP}},
}

展開

附加信息

版本 Python 3.5 Support, Sampler Pipelining,
類型其他源碼
更新時間 2025-04-18
大小 980.17KB
來自於 Github

相關應用

GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

2024-11-03
Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-03
pytorch image models

2024-11-03
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3

相關資訊全部

PyTorch NLP

？現在存檔了嗎？

Pytorch自然語言處理（NLP）的基本實用程序

安裝？

文件

開始

1。加載數據？

2。張量的文字

3。張量

4。培訓和推理

最後但並非最不重要的

確定性功能

預訓練的單詞向量

神經網絡層

指標

幫助❓

貢獻

貢獻指南

相關工作

火把

Allennlp

作者

引用

GitHub sgrebnov/cordova plugin background download

Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

pytorch image models

Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

chat.petals.dev

GPT Prompt Templates

GPTyped

Google Dorks

shepherd

mongo express

Google Dorks

shepherd

mongo express

PyTorch NLP

？現在存檔了嗎？

Pytorch自然語言處理（NLP）的基本實用程序

安裝 ？

文件

開始

1。加載數據？

2。張量的文字

3。張量

4。培訓和推理

最後但並非最不重要的

確定性功能

預訓練的單詞向量

神經網絡層

指標

幫助❓

貢獻

貢獻指南

相關工作

火把

Allennlp

作者

引用

安裝？