pip install git+https://github.com/Maitreyapatel/reliability-checklist
python -m spacy download en_core_web_sm
python -c "import nltk;nltk.download('wordnet')"Evaluate example model/data with default configuration
# eval on CPU
recheck
# eval on GPU
recheck trainer=gpu +trainer.gpus=[1,2,3]Evaluate model with chosen dataset-specific experiment configuration from reliability_checklist/configs/task/
recheck tasl=<task_name>Specify the custom model_name as shown in following MNLI example
# if model_name is used for tokenizer as well.
recheck task=mnli custom_model="bert-base-uncased-mnli"
# if model_name is different for tokenizer then
recheck task=mnli custom_model="bert-base-uncased-mnli" custom_model.tokenizer.model_name="ishan/bert-base-uncased-mnli"# create config folder structure similar to reliability_checklist/configs/
mkdir ./configs/
mkdir ./configs/custom_model/
# run following command after creating new config file inside ./configs/custom_model/<your-config>.yaml
recheck task=mnli custom_model=<your-config>reliability-checklist supports the wide range of visualization tools. One can decide to go with default wandb online visualizer. It also generates plots that are highly informative which will be stored into logs directory.

reliability-checklistAny kind of positive contribution is welcome! Please help us to grow by contributing to the project.
If you wish to contribute, you can work on any features/issues listed here or create one on your own. After adding your code, please send us a Pull Request.
Please read
CONTRIBUTINGfor details on ourCODE OF CONDUCT, and the process for submitting pull requests to us.