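negspacy: negation for spaCy

A spaCy pipeline object for negating concepts in text. Based on the NegEx algorithm.

NegEx - A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries. Chapman, Bridewell, Hanbury, Cooper, Buchanan. https://doi.org/10.1006/jbin.2001.1029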

Installation and usage

Install the library.

pip install negspacy

Import the library and spaCy.

import spacy
from negspacy.negation import Negex

Load a spaCy language model and add the negspacy pipeline object. Filtering on entity types is optional.

nlp = spacy.load("en_core_web_sm")
nlp.add_pipe("negex", config={"ent_types": ["PERSON", "ORG"]})

View negations.

doc = nlp("She does not like Steve Jobs but likes Apple products.")

for e in doc.ents:
    print(e.text, e._.negex)

Steve Jobs True
Apple False
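Once each entity carries the `negex` flag, a common next step is to separate negated mentions from affirmed ones. The sketch below is only an illustration: `split_by_negation` is a hypothetical helper, not part of negspacy, and the `SimpleNamespace` objects are stand-ins that mimic the `.text` and `._.negex` attributes so the snippet runs without a language model. With a real pipeline you would pass `doc.ents` instead.

```python
from types import SimpleNamespace


def split_by_negation(ents, extension_name="negex"):
    """Partition entities into (negated, affirmed) lists of their text.

    `ents` is any iterable of objects exposing `.text` and a `._` namespace
    holding the boolean flag, as spaCy spans do after the negex component runs.
    """
    negated, affirmed = [], []
    for ent in ents:
        # Read the flag from the span's custom-attribute namespace.
        flag = getattr(ent._, extension_name)
        (negated if flag else affirmed).append(ent.text)
    return negated, affirmed


# Stand-ins for doc.ents (hypothetical objects, for illustration only).
fake_ents = [
    SimpleNamespace(text="Steve Jobs", _=SimpleNamespace(negex=True)),
    SimpleNamespace(text="Apple", _=SimpleNamespace(negex=False)),
]

negated, affirmed = split_by_negation(fake_ents)
print(negated)   # ['Steve Jobs']
print(affirmed)  # ['Apple']
```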

Consider pairing with scispacy to find UMLS concepts in text and process negations.

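NegEx patterns

pseudo_negations - phrases that are false triggers, ambiguous negations, or double negatives
preceding_negations - negation phrases that precede an entity
following_negations - negation phrases that follow an entity
termination - phrases that cut a sentence into parts for the purposes of negation detection (e.g., "but")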
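To make the roles of the four pattern categories concrete, here is a deliberately simplified, pure-Python sketch of NegEx-style scoping. It is not negspacy's implementation: the tiny `TERMSET`, the regex tokenizer, and the `is_negated` function are all assumptions made for illustration. It shows the general idea that a preceding or following trigger only reaches an entity inside the same termination-bounded segment, and that triggers inside a pseudo-negation are ignored.

```python
import re

# Hypothetical mini-termset for illustration; real termsets are much longer.
TERMSET = {
    "pseudo_negations": ["not only", "no increase"],
    "preceding_negations": ["no", "not", "denies"],
    "following_negations": ["was ruled out", "unlikely"],
    "termination": ["but", "however"],
}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def find_phrases(tokens, phrases):
    """Return (start, end) token spans where any of the phrases occurs."""
    spans = []
    for phrase in phrases:
        words = tokenize(phrase)
        n = len(words)
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] == words:
                spans.append((i, i + n))
    return spans


def is_negated(text, entity, termset=TERMSET):
    """Toy NegEx check for the first occurrence of `entity` in `text`."""
    tokens = tokenize(text)
    ent_spans = find_phrases(tokens, [entity])
    if not ent_spans:
        raise ValueError(f"entity {entity!r} not found in text")
    start, end = ent_spans[0]

    # Triggers that overlap a pseudo-negation are treated as false triggers.
    pseudo = find_phrases(tokens, termset["pseudo_negations"])

    def masked(span):
        return any(span[0] < p_end and p_start < span[1] for p_start, p_end in pseudo)

    # Termination phrases bound the segment in which triggers can apply.
    terms = find_phrases(tokens, termset["termination"])
    left = max([t_end for _, t_end in terms if t_end <= start], default=0)
    right = min([t_start for t_start, _ in terms if t_start >= end], default=len(tokens))

    for s, e in find_phrases(tokens, termset["preceding_negations"]):
        if left <= s and e <= start and not masked((s, e)):
            return True
    for s, e in find_phrases(tokens, termset["following_negations"]):
        if end <= s and e <= right and not masked((s, e)):
            return True
    return False


sentence = "She does not like Steve Jobs but likes Apple products."
print(is_negated(sentence, "Steve Jobs"))  # True
print(is_negated(sentence, "Apple"))       # False
```

Here "but" ends the scope of "not", so "Apple" falls outside the negated segment.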

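Termsets

Designate a termset to use; en_clinical is used by default.

en = phrases for general English language text
en_clinical (default) = adds phrases specific to the clinical domain to general English
en_clinical_sensitive = adds additional phrases to help rule out historical and possibly irrelevant entities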

To set:

import spacy
from negspacy.negation import Negex
from negspacy.termsets import termset

# Use the general English termset instead of the default en_clinical
ts = termset("en")

nlp = spacy.load("en_core_web_sm")
nlp.add_pipe(
    "negex",
    config={
        "neg_termset": ts.get_patterns()
    }
)

Additional functionality

Change patterns or view patterns in use

Replace all patterns with your own set

import spacy
from negspacy.negation import Negex

nlp = spacy.load("en_core_web_sm")
nlp.add_pipe(
    "negex",
    config={
        # Supplying all four keys replaces the built-in patterns
        "neg_termset": {
            "pseudo_negations": ["might not"],
            "preceding_negations": ["not"],
            "following_negations": ["declined"],
            "termination": ["but", "however"]
        }
    }
)

Add and remove individual patterns on the fly from built-in termsets

from negspacy.termsets import termset

ts = termset("en")
ts.add_patterns({
    "pseudo_negations": ["my favorite pattern"],
    "termination": ["these are", "great patterns", "but"],
    "preceding_negations": ["wow a negation"],
    "following_negations": ["extra negation"],
})
# OR
ts.remove_patterns({
    "termination": ["these are", "great patterns"],
    "pseudo_negations": ["my favorite pattern"],
    "preceding_negations": ["denied", "wow a negation"],
    "following_negations": ["unlikely", "extra negation"],
})

View patterns in use

from negspacy.termsets import termset

ts = termset("en_clinical")
print(ts.get_patterns())
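Printing a full termset can produce a long dictionary. As a rough sketch, the hypothetical helper below (`summarize_patterns`, not part of negspacy) counts the phrases in each category. It assumes the pattern dictionary has the same shape as the `neg_termset` configuration shown earlier, that is, category names mapped to lists of phrases. The example input is the custom set from that configuration, so the snippet runs without the library installed.

```python
def summarize_patterns(patterns):
    """Return the number of phrases per pattern category."""
    return {category: len(phrases) for category, phrases in patterns.items()}


# Same shape as the "neg_termset" config; with negspacy installed you could
# pass the result of ts.get_patterns() instead (assumed to share this shape).
custom = {
    "pseudo_negations": ["might not"],
    "preceding_negations": ["not"],
    "following_negations": ["declined"],
    "termination": ["but", "however"],
}

print(summarize_patterns(custom))
# {'pseudo_negations': 1, 'preceding_negations': 1, 'following_negations': 1, 'termination': 2}
```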

Negations in noun chunks

Depending on the named entity recognition model you are using, you may have negations "chunked together" with nouns. For example:

import spacy

nlp = spacy.load("en_core_sci_sm")
doc = nlp("There is no headache.")
for e in doc.ents:
    print(e.text)

# no headache

This would cause the NegEx algorithm to miss the preceding negation. To account for this, you can add a chunk_prefix:

import spacy
from negspacy.negation import Negex

nlp = spacy.load("en_core_sci_sm")
# The en_clinical termset is used by default, so no neg_termset is passed here
nlp.add_pipe(
    "negex",
    config={
        "chunk_prefix": ["no"],
    },
    last=True,
)
doc = nlp("There is no headache.")
for e in doc.ents:
    print(e.text, e._.negex)

# no headache True
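The idea behind a chunk prefix can be sketched in a few lines of plain Python: if the entity text itself begins with a listed negation word, the entity is treated as negated even though no trigger precedes it. This is an illustration only, not negspacy's internal code. The function name `negated_by_chunk_prefix` and the choice to compare whole tokens (so that "nose bleed" is not matched by the prefix "no") are assumptions made here, and the library's own matching may behave differently.

```python
def negated_by_chunk_prefix(entity_text, chunk_prefix):
    """Return True if the entity's leading tokens equal one of the prefixes."""
    ent_tokens = entity_text.lower().split()
    for prefix in chunk_prefix:
        prefix_tokens = prefix.lower().split()
        # Compare token by token so partial-word matches are not counted.
        if prefix_tokens and ent_tokens[:len(prefix_tokens)] == prefix_tokens:
            return True
    return False


print(negated_by_chunk_prefix("no headache", ["no"]))  # True
print(negated_by_chunk_prefix("nose bleed", ["no"]))   # False
print(negated_by_chunk_prefix("headache", ["no"]))     # False
```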

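Contributing

Authors

Jeno Pizarro

License

Other libraries

This library is featured in the spaCy Universe. Check it out for other useful libraries and inspiration.

If you're looking for a spaCy pipeline object to extract values that correspond to a named entity (e.g., birth date, account number, or laboratory results), take a look at extractacy.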

Additional information

Version: Spacy 3.3 support
Type: Other source code
Updated: 2025-04-16
Size: 200.96KB
Source: GitHub