Persian (Farsi) Swear Words + .json Dataset
DOI: 10.34740/kaggle/dsv/2094967
CONTRIBUTING.md: Guidelines for contributing to the repository.
CODE_OF_CONDUCT.md: Code of conduct for participants in the repository.
NOTE: This is an exhaustive list of Persian swear words that you can use in production to filter unwanted content. The word list is available in JSON format.
Note:
This database contains words that may need to be filtered in some cases. Users should tailor the data to their specific use. Anyone interested in completing this data is encouraged to use it so that their projects have clean, appropriate text. Prefer substantial contributions over small PRs. You can also use this database to add classes or functions for other programming languages.
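As a rough illustration of how a class could be built on top of the JSON word list, here is a minimal Python sketch. The JSON layout (an object with a `word` key holding an array), the class name `SimpleSwearFilter`, and its method names are assumptions made for this example, not the API of any package in this repository. The sample words are harmless placeholders.

```python
import json


class SimpleSwearFilter:
    """Hypothetical word filter backed by a JSON word list."""

    def __init__(self, json_text: str, key: str = "word"):
        # Assumed layout: {"word": ["w1", "w2", ...]}
        self.words = set(json.loads(json_text)[key])

    def is_bad(self, word: str) -> bool:
        # Exact match against the list, ignoring surrounding whitespace.
        return word.strip() in self.words

    def has_swear(self, text: str) -> bool:
        # True if any whitespace-separated token is in the list.
        return any(token in self.words for token in text.split())

    def filter_words(self, text: str, symbol: str = "*") -> str:
        # Replace each listed token with the symbol; other tokens are kept.
        return " ".join(symbol if t in self.words else t for t in text.split())


sample = '{"word": ["foo", "bar"]}'  # placeholder data, not the real list
f = SimpleSwearFilter(sample)
print(f.is_bad("foo"))              # True
print(f.has_swear("hello world"))   # False
print(f.filter_words("hello foo"))  # hello *
```

Splitting on whitespace keeps the sketch short, but it collapses repeated spaces and will not catch a listed word that is attached to punctuation.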
Functions are currently available for the following languages:
Packages can be installed with Composer (PHP) or npm (JavaScript):
composer require amirshnll/persian-swear-words
npm i persian-swear-words
Java (class)
var persianSwear = new PersianSwear();
// add word(s) to DataSet
persianSwear.addWord("word");
persianSwear.addWords(new String[]{"word1", "word2"});
// remove word(s) from DataSet
persianSwear.removeWord("word");
persianSwear.removeWords(new String[]{"word1", "word2"});
// check single word
persianSwear.isBad("الا.غ "); // true
persianSwear.isBad("امروز"); // false
// check existing bad word in text
persianSwear.hasSwear("تو هیز هستی"); // true
persianSwear.hasSwear("تو دوست من هستی"); // false
// replace bad words in text
persianSwear.filterWords("تو هیز هستی"); // تو * هستی
persianSwear.filterWords("تو هیز هستی", "&"); // تو & هستی
Go (functions)
persianswear := NewPersianSwear()
// add word(s) to DataSet
persianswear.AddWord("word")
persianswear.AddWords([]string{"word1", "word2"})
// remove word(s) from DataSet
persianswear.RemoveWord("word")
persianswear.RemoveWords([]string{"word1", "word2"})
// check single word
persianswear.IsBad("الا.غ ") // true
persianswear.IsBad("امروز") // false
// check existing bad word in text
persianswear.HasSwear("تو هیز هستی") // true
persianswear.HasSwear("تو دوست من هستی") // false
// replace bad words in text
persianswear.FilterWords("تو هیز هستی") // تو * هستی
persianswear.FilterWords("تو هیز هستی", "&") // تو & هستی
PHP (class)
require('PersianSwear.php');
$persianswear = new PersianSwear();
// is bad
if ($persianswear->is_bad('خر'))
    echo 'is bad';
else
    echo 'not bad';
// not bad
if ($persianswear->is_bad('امروز'))
    echo 'is bad';
else
    echo 'not bad';
// not bad
if ($persianswear->is_bad('چرت و پرت'))
    echo 'is bad';
else
    echo 'not bad';
$persianswear->add_word('چرت و پرت');
// is bad
if ($persianswear->is_bad('چرت و پرت'))
    echo 'is bad';
else
    echo 'not bad';
// is bad
if ($persianswear->is_bad('گاو'))
    echo 'is bad';
else
    echo 'not bad';
$persianswear->remove_word('گاو');
// not bad
if ($persianswear->is_bad('گاو'))
    echo 'is bad';
else
    echo 'not bad';
// not bad
if ($persianswear->has_swear('تو دوست من هستی'))
    echo 'is bad';
else
    echo 'not bad';
// is bad
if ($persianswear->has_swear('تو هیز هستی'))
    echo 'is bad';
else
    echo 'not bad';
echo $persianswear->filter_words('تو دوست من هستی'); // تو دوست من هستی
echo $persianswear->filter_words('تو هیز هستی'); // تو * هستی
echo $persianswear->filter_words('تو هیز هستی', "&"); // تو & هستی
echo $persianswear->tostring(); // show all swear words
Python (class)
persianswear = PersianSwear()
print(persianswear.is_bad('خر', ignoreOT=False))  # True
print(persianswear.is_bad('امروز', ignoreOT=False))  # False
print(persianswear.is_bad('چرت و پرت', ignoreOT=False))  # False
persianswear.add_word('چرت و پرت')
print(persianswear.is_bad('چرت و پرت', ignoreOT=False))  # True
print(persianswear.has_swear('تو دوست من هستی', ignoreOT=False))  # False
print(persianswear.has_swear('تو هیز هستی', ignoreOT=False))  # True
print(persianswear.filter_words('تو دوست من هستی', ignoreOT=False))  # تو دوست من هستی
print(persianswear.filter_words('تو هیز هستی', ignoreOT=False))  # تو * هستی
print(persianswear.filter_words('تو هیز هستی', '&', ignoreOT=False))  # تو & هستی
print(persianswear.is_bad('خ.ر', ignoreOT=True))  # True
print(persianswear.is_bad('ام.روز', ignoreOT=True))  # False
print(persianswear.has_swear('تو دو.ست من هستی', ignoreOT=True))  # False
print(persianswear.has_swear('تو اسک.ل هستی', ignoreOT=True))  # True
print(persianswear.filter_words('تو دو.ست من هستی', ignoreOT=True))  # تو دو.ست من هستی
print(persianswear.filter_words('تو هی.ز هستی', ignoreOT=True))  # تو * هستی
print(persianswear.filter_words('تو هی.ز هس.تی', ignoreOT=True))  # تو * هس.تی
print(persianswear.tostring())  # show all swear words
JavaScript (functions)
import PersianSwear from './PersianSwear.js';
const text1 = 'تو دوست من هستی';
console.log(PersianSwear.hasSwear(text1)); // Output: false
const text2 = 'تو هیز هستی';
console.log(PersianSwear.hasSwear(text2)); // Output: true
const text3 = 'تو هیز هستی';
const filteredText = PersianSwear.filterWords(text3);
console.log(filteredText); // Output: تو * هستی
const word = 'خر';
console.log(PersianSwear.isBad(word)); // Output: true
JavaScript/TypeScript, Jest tests (functions)
import { test, expect } from "@jest/globals";
import { PersianSwear } from "../src";
test("test check bad word", () => {
  expect(PersianSwear.isBad("آشغال")).toBe(true);
});
test("test check not bad word", () => {
  expect(PersianSwear.isBad("سلام")).toBe(false);
});
test("test check text no has bad word", () => {
  expect(PersianSwear.hasSwear("سلام عزیزم")).toBe(false);
});
test("test check text has bad word", () => {
  expect(PersianSwear.hasSwear("سلام کصافت")).toBe(true);
});
test("test check text has bad word and replace with symbol", () => {
  expect(PersianSwear.filterWords("سلام کصافت خوبی", "*")).toBe("سلام * خوبی");
});
C# (helper)
First, create an instance of FilterPersianWords:
var filter = new FilterPersianWords();
If you have your own JSON file, you can optionally pass its path to the constructor.
Is a single word bad?
var isBadWord = filter.IsBadWord("yourWord");
Is a multi-line string bad?
var isBadSentence = filter.IsBadSentence("your long sentence");
Get all bad words inside a string:
var badList = filter.GetBadWords("your long sentence");
Remove the bad words from a string:
var clearedString = filter.RemoveBadWords("your bad sentence");
This method changes nothing in the string except the bad words.
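The idea that only the bad words change, while spacing and punctuation stay as they were, can be sketched in Python with a regular expression substitution. This is an illustration only: `BAD_WORDS`, `get_bad_words`, and `remove_bad_words` are hypothetical names with placeholder data, and the sketch does not describe how the C# helper is implemented.

```python
import re
from typing import List

# Hypothetical stand-in for the real word list.
BAD_WORDS = {"foo", "bar"}

# \w+ matches runs of letters and digits in any script,
# because str patterns are Unicode-aware in Python 3.
TOKEN = re.compile(r"\w+")


def get_bad_words(text: str) -> List[str]:
    """Return every listed word found in the text, in order of appearance."""
    return [m.group() for m in TOKEN.finditer(text) if m.group() in BAD_WORDS]


def remove_bad_words(text: str) -> str:
    """Delete listed words; everything between tokens is left untouched."""
    return TOKEN.sub(lambda m: "" if m.group() in BAD_WORDS else m.group(), text)


print(get_bad_words("say foo, then bar!"))     # ['foo', 'bar']
print(remove_bad_words("say foo, then bar!"))  # say , then !
```

One known limitation of this tokenization: the zero-width non-joiner used inside some Persian words is not matched by `\w`, so such words would be split in two. A real implementation would need to account for that.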
Swift (class and protocol)
PersianSwear is the class on which the methods are implemented:
// add word(s) to DataSet
PersianSwear.shared.addWord("bad-word")
PersianSwear.shared.addWords(["bad-word-1", "bad-word-2"])
// remove word(s) from DataSet
PersianSwear.shared.removeWord("bad-word")
PersianSwear.shared.removeWords(["bad-word-1", "bad-word-2"])
// check single word
let isBadWord = PersianSwear.shared.isBadWord("single word")
// check existing bad word in text
let hasBadWord = PersianSwear.shared.hasBadWord("long text")
// existing bad word in text
let badWords = PersianSwear.shared.badWords(in: "long text")
// replace bad words in text
let newText = PersianSwear.shared.replaceBadWords(in: "long text", with: "****")
There is a protocol named PersianSwearDataLoader whose job is to load the words:
protocol PersianSwearDataLoader {
    func loadWords(
        _ completion: @escaping (Result<PersianSwear.Words, Error>) -> Void
    )
}
For example, loading the words from GitHub is already implemented. It is used as follows:
let loader = GithubPersianSwearDataLoader()
PersianSwear.shared.loadWords(using: loader) { result in
    switch result {
    case .failure(let error):
        print("Error:", error.localizedDescription)
    case .success(let words):
        print("Words:", words.count)
    }
}
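The loader protocol separates where the words come from and how they are used. A rough Python analogue of that design is sketched below. It is synchronous and reports failures through exceptions, whereas the Swift version uses a completion handler with a Result value. All names here (`WordLoader`, `JsonStringLoader`, `WordStore`) and the JSON layout are assumptions for illustration.

```python
import json
from typing import List, Protocol, Set


class WordLoader(Protocol):
    """Anything that can produce a list of words."""

    def load_words(self) -> List[str]: ...


class JsonStringLoader:
    """Hypothetical loader that reads words from an in-memory JSON string."""

    def __init__(self, json_text: str, key: str = "word"):
        self.json_text = json_text
        self.key = key

    def load_words(self) -> List[str]:
        # Assumed layout: {"word": ["w1", "w2", ...]}
        return list(json.loads(self.json_text)[self.key])


class WordStore:
    """Holds the word set and accepts any object matching WordLoader."""

    def __init__(self) -> None:
        self.words: Set[str] = set()

    def load_words(self, loader: WordLoader) -> int:
        # Replace the stored words with the loader's output; return the count.
        self.words = set(loader.load_words())
        return len(self.words)


store = WordStore()
try:
    count = store.load_words(JsonStringLoader('{"word": ["foo", "bar"]}'))
    print("Words:", count)  # Words: 2
except (ValueError, KeyError) as error:
    # Malformed JSON raises ValueError; a missing key raises KeyError.
    print("Error:", error)
```

Because `WordStore` depends only on the `load_words` method, a loader that reads from a file or a network source could be swapped in without changing the store.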