hate speech and offensive language下載 - hate speech and offensive language源代碼下載

hate speech and offensive language

其他源碼

1.0.0

下載

自動仇恨言論檢測和令人反感的語言問題

托馬斯·戴維森（Thomas Davidson），達娜·沃斯利（Dana Warmsley），邁克爾·梅西（Michael Macy）和英格瑪·韋伯（Ingmar Weber）的存儲庫。 2017年。 “自動仇恨言論檢測和令人反感的語言問題。” ICWSM。您在這裡閱讀論文。

注意：此存儲庫不再積極維護。請不要發布有關現有代碼與新版本的Python或使用的軟件包的兼容性的問題。我不會接受任何拉的請求。如果您計劃在研究中使用此數據或代碼，請查看問題，因為一些GitHub用戶建議對代碼庫進行更改或改進。

2019新聞

我們在此數據集中有關於種族偏見的新論文，您可以在這裡閱讀

警告：數據，詞典和筆記本都包含種族主義，性別歧視，同性戀和進攻性的內容。

您可以在data目錄中找到我們的標記數據。我們將它們作為泡菜文件（Python 2.7）和CSV包括在內。您還將在包含Python 2.7代碼的src目錄中找到一個筆記本，以在論文中復制我們的分析，並在lexicons目錄中的詞典中進行分析，以試圖更準確地對仇恨言論進行分類。 classifier目錄包含一個腳本，指令和必要的文件，以便在新數據上運行我們的分類器，並提供了一個測試用例。

請在使用任何這些資源中的任何已發表的工作中引用我們的論文。

 @inproceedings{hateoffensive,
  title = {Automated Hate Speech Detection and the Problem of Offensive Language},
  author = {Davidson, Thomas and Warmsley, Dana and Macy, Michael and Weber, Ingmar}, 
  booktitle = {Proceedings of the 11th International AAAI Conference on Web and Social Media},
  series = {ICWSM '17},
  year = {2017},
  location = {Montreal, Canada},
  pages = {512-515}
  }

如果您有興趣使用我們的數據，請聯繫我們也會感謝它，如果您可以填寫此簡短表格，以便我們可以跟踪這些數據的使用方式並與處理類似問題的研究人員聯繫。

如果您有任何疑問，請thomas dot davidson at rutgers dot edu聯繫。

展開

附加信息

版本 1.0.0
類型其他源碼
更新時間 2025-04-16
大小 4.29MB
來自於 Github

相關應用

language tools

2024-11-11
efficient language detector

2024-11-06
scene language

2024-11-03
洞和吉

2023-12-31
勇氣與榮耀

2022-09-04
真正的仇恨

2022-08-11

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3

相關資訊全部