Persian (Farsi) Swear Words + .json Datasets
10.34740/kaggle/dsv/2094967CONTRIBUTING.md : Guidelines for Contributing to The Repository.CODE_OF_CONDUCT.md : The Code of Conduct for Participants in the Repository.Note: This is a to-a -plete list of Persian Swears You can use in your production to filter unwanted content. Wordlist is availability in json format.
Notes:
This database contains words that may in some cases require filtering. Users must customize the data tailor -made for specific uses. Those interested in completing this data are encouraged to use it for their projects to have clean and appropriate texts. Make more valuable partnerships instead of sending small PRs. It is also possible to add Class or Function to different programming languages using this database.
Currently the functions of the following languages are available:
More:
CONTRIBUTING.md : Instructions for participating in the reservoir of this projectCODE_OF_CONDUCT.md : Ethical Code for Participants in the Reservoir composer require amirshnll/persian-swear-words
npm i persian-swear-words
Code Link:? Class
var persianSwear = new PersianSwear ();
// add word(s) to DataSet
persianSwear . addWord ( "word" );
persianSwear . addWords ( new String []{ "word1" , "word2" });
// remove word(s) from DataSet
persianSwear . removeWord ( "word" );
persianSwear . removeWords ( new String []{ "word1" , "word2" });
// check single word
persianSwear . isBad ( "الا.غ " ); // true
persianSwear . isBad ( "امروز" ); // false
// check existing bad word in text
persianSwear . hasSwear ( "تو هیز هستی" ); // true
persianSwear . hasSwear ( "تو دوست من هستی" ); // false
// replace bad words in text
persianSwear . filterWords ( "تو هیز هستی" ); // تو * هستی
persianSwear . filterWords ( "تو هیز هستی" , "&" ); // تو & هستی Code Link:? Function
persianswear := NewPersianSwear ()
// add word(s) to DataSet
persianswear . AddWord ( "word" )
persianswear . AddWords ([] string { "word1" , "word2" })
// remove word(s) from DataSet
persianswear . RemoveWord ( "word" )
persianswear . RemoveWords ([] string { "word1" , "word2" })
// check single word
persianswear . IsBad ( "الا.غ " ) // true
persianswear . IsBad ( "امروز" ) // false
// check existing bad word in text
persianswear . HasSwear ( "تو هیز هستی" ) // true
persianswear . HasSwear ( "تو دوست من هستی" ) // false
// replace bad words in text
persianswear . FilterWords ( "تو هیز هستی" ) // تو * هستی
persianswear . FilterWords ( "تو هیز هستی" , "&" ) // تو & هستی Code Link:? Class
require ( ' PersianSwear.php ' );
$ persianswear = new PersianSwear ();
// is bad
if ( $ persianswear -> is_bad ( ' خر ' ))
echo ' is bad ' ;
else
echo ' not bad ' ;
// not bad
if ( $ persianswear -> is_bad ( ' امروز ' ))
echo ' is bad ' ;
else
echo ' not bad ' ;
// not bad
if ( $ persianswear -> is_bad ( ' چرت و پرت ' ))
echo ' is bad ' ;
else
echo ' not bad ' ;
$ persianswear -> add_word ( ' چرت و پرت ' );
// is bad
if ( $ persianswear -> is_bad ( ' چرت و پرت ' ))
echo ' is bad ' ;
else
echo ' not bad ' ;
// is bad
if ( $ persianswear -> is_bad ( ' گاو ' ))
echo ' is bad ' ;
else
echo ' not bad ' ;
$ persianswear -> remove_word ( ' گاو ' );
// not bad
if ( $ persianswear -> is_bad ( ' گاو ' ))
echo ' is bad ' ;
else
echo ' not bad ' ;
// not bad
if ( $ persianswear -> has_swear ( ' تو دوست من هستی ' ))
echo ' is bad ' ;
else
echo ' not bad ' ;
// is bad
if ( $ persianswear -> has_swear ( ' تو هیز هستی ' ))
echo ' is bad ' ;
else
echo ' not bad ' ;
echo $ persianswear -> filter_words ( ' تو دوست من هستی ' ); // تو دوست من هستی
echo $ persianswear -> filter_words ( ' تو هیز هستی ' ); // تو * هستی
echo $ persianswear -> filter_words ( ' تو هیز هستی ' , " & " ); // تو & هستی
echo $ persianswear -> tostring (); // show all swear words Code Link:? Class
persianswear = PersianSwear ()
print ( persianswear . is_bad ( 'خر' , ignoreOT = False )) # True
print ( persianswear . is_bad ( 'امروز' , ignoreOT = False )) # False
print ( persianswear . is_bad ( 'چرت و پرت' , ignoreOT = False )) # False
persianswear . add_word ( 'چرت و پرت' )
print ( persianswear . is_bad ( 'چرت و پرت' , ignoreOT = False )) # True
print ( persianswear . has_swear ( 'تو دوست من هستی' , ignoreOT = False )) # False
print ( persianswear . has_swear ( 'تو هیز هستی' , ignoreOT = False )) # True
print ( persianswear . filter_words ( 'تو دوست من هستی' , ignoreOT = False )) # تو دوست من هستی
print ( persianswear . filter_words ( 'تو هیز هستی' , ignoreOT = False )) # تو * هستی
print ( persianswear . filter_words ( 'تو هیز هستی' , '&' , ignoreOT = False )) # تو & هستی
print ( persianswear . is_bad ( 'خ.ر' , ignoreOT = True )) # True
print ( persianswear . is_bad ( 'ام.روز' , ignoreOT = True )) # False
print ( persianswear . has_swear ( 'تو دو.ست من هستی' , ignoreOT = True )) # False
print ( persianswear . has_swear ( 'تو اسک.ل هستی' , ignoreOT = True )) # True
print ( persianswear . filter_words ( 'تو دو.ست من هستی' , ignoreOT = True )) # تو دو.ست من هستی
print ( persianswear . filter_words ( 'تو هی.ز هستی' , ignoreOT = True )) # تو * هستی
print ( persianswear . filter_words ( 'تو هی.ز هس.تی' , ignoreOT = True )) # تو * هس.تی
print ( persianswear . tostring ()) # show all swear words Code Link:? Function
import PersianSwear from './PersianSwear.js' ;
const text1 = 'تو دوست من هستی' ;
console . log ( PersianSwear . hasSwear ( text1 ) ) ; // Output: false
const text2 = 'تو هیز هستی' ;
console . log ( PersianSwear . hasSwear ( text2 ) ) ; // Output: true
const text3 = 'تو هیز هستی' ;
const filteredText = PersianSwear . filterWords ( text3 ) ;
console . log ( filteredText ) ; // Output: تو * هستی
const word = 'خر' ;
console . log ( PersianSwear . isBad ( word ) ) ; // Output: true Code Link:? Function
import { test , expect } from "@jest/globals" ;
import { PersianSwear } from "../src" ;
test ( "test check bad word" , ( ) => {
expect ( PersianSwear . isBad ( "آشغال" ) ) . toBe ( true ) ;
} ) ;
test ( "test check not bad word" , ( ) => {
expect ( PersianSwear . isBad ( "سلام" ) ) . toBe ( false ) ;
} ) ;
test ( "test check text no has bad word" , ( ) => {
expect ( PersianSwear . hasSwear ( "سلام عزیزم" ) ) . toBe ( false ) ;
} ) ;
test ( "test check text has bad word" , ( ) => {
expect ( PersianSwear . hasSwear ( "سلام کصافت" ) ) . toBe ( true ) ;
} ) ;
test ( "test check text has bad word and replace with symbol" , ( ) => {
expect ( PersianSwear . filterWords ( "سلام کصافت خوبی" , "*" ) ) . toBe ( "سلام * خوبی" ) ;
} ) ; Code Link:? Helper
First of all you need to create instance of filterpersianwords
var filter = new FilterPersianWords();
If you have an optional json file path you can pass down to convention.
Is a Single Word Bad? var isBadWord = filter.IsBadWord("yourWord");
Is a multiin line string ? var isBadSentence = filter.IsBadSentence("your long sentence");
Get All Bad Words Inside of String
var badList = filter.GetBadWords("your long sentence");
var clearedString = filter.RemoveBadWords("your bad sentence");This Method Will Not Change Any From String Except the Bad Words.
Code Link:? Classes and Protocol
PersianSwear , where the methods are implemented: // add word(s) to DataSet
PersianSwear . shared . addWord ( " bad-word " )
PersianSwear . shared . addWords ( [ " bad-word-1 " , " bad-word-2 " ] )
// remove word(s) from DataSet
PersianSwear . shared . removeWord ( " bad-word " )
PersianSwear . shared . removeWords ( [ " bad-word-1 " , " bad-word-2 " ] )
// check single word
let isBadWord = PersianSwear . shared . isBadWord ( " single word " )
// check existing bad word in text
let hasBadWord = PersianSwear . shared . hasBadWord ( " long text " )
// existing bad word in text
let badWords = PersianSwear . shared . badWords ( in : " long text " )
// replace bad words in text
let newText = PersianSwear . shared . replaceBadWords ( in : " long text " , with : " **** " ) There is a protocol with the name of PersianSwearDataLoader whose performance is to load words:
protocol PersianSwearDataLoader {
func loadWords (
_ completion : @escaping ( Result < PersianSwear . Words , Error > ) -> Void
)
}For example, typing the words is implemented from the gate. The example is as follows:
let loader = GithubPersianSwearDataLoader ( )
PersianSwear . shared . loadWords ( using : loader ) { result in
switch result {
case . failure ( let error ) :
print ( " Error: " , error . localizedDescription )
case . success ( let words ) :
print ( " Words: " , words . count )
}
}