thesaurus
1.0.0
동의어/시소러스 오프라인 데이터베이스
jsonl 형식을 따릅니다. 즉, 각 줄은 별도의 json 문서임을 의미합니다. 그것은 포함한다 :
word: (String) Actual word
wordnet_id: (String) internal wordnet reference
key: (String) Some words can have multiple meanings. Each meaning will have same word, but different key.
pos: (String) part of speech tag, eg. `noun`, `verb`
synonyms: (Array of String) synonyms related to this key
desc: (Array of String) description of word
파일 : en_thesaurus.jsonl
Wordnet의 방출입니다. 사용에 대해서는 WordNet 라이센스를 참조하십시오. 오늘부터 :
License and Commercial Use of WordNet
WordNet® is unencumbered, and may be used in commercial applications in accordance with the following license agreement. An attorney representing the commercial interest should review this WordNet license with respect to the intended use.
WordNet License
This license is available as the file LICENSE in any downloaded version of WordNet.
WordNet 3.0 license: (Download)
WordNet Release 3.0 This software and database is being provided to you, the LICENSEE, by Princeton University under the following license. By obtaining, using and/or copying this software and database, you agree that you have read, understood, and will comply with these terms and conditions.: Permission to use, copy, modify and distribute this software and database and its documentation for any purpose and without fee or royalty is hereby granted, provided that you agree to comply with the following copyright notice and statements, including the disclaimer, and that the same appear on ALL copies of the software, database and documentation, including modifications that you make for internal use or for distribution. WordNet 3.0 Copyright 2006 by Princeton University. All rights reserved. THIS SOFTWARE AND DATABASE IS PROVIDED "AS IS" AND PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES OF MERCHANT- ABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE LICENSED SOFTWARE, DATABASE OR DOCUMENTATION WILL NOT INFRINGE ANY THIRD PARTY PATENTS, COPYRIGHTS, TRADEMARKS OR OTHER RIGHTS. The name of Princeton University or Princeton may not be used in advertising or publicity pertaining to distribution of the software and/or database. Title to copyright in this software, database and any associated documentation shall at all times remain with Princeton University and LICENSEE agrees to preserve same.
wordnet_extract.py 추가 종속성없이 실행할 수 있습니다. Python3.6+는이를 실행할 수 있어야합니다.
WordNet 데이터베이스를 구문 분석하고 .jsonl 파일을 생성합니다.
영어 데이터베이스를 찾을 수 있습니다 : WordNet
다른 언어가 엄격하게 같은 형식을 따르는 경우에도 효과가 있습니다.
용법:
usage: wordnet_extract.py [-h] [--db_path DB_PATH] output
positional arguments:
output Output file, jsonl extension
optional arguments:
-h, --help show this help message and exit
--db_path DB_PATH Directory where wordnet data files are located