Téléchargement lama - téléchargement de code source lama

? LAMA: Résolution Robust Large Mask Inpainting avec des convolutions de Fourier

Par Roman Suvorov, Elizaveta Logacheva, Anton Mashikhin, Anastasia Remizova, Arsenii Ashukha, Aleksei Silvestrov, Naejin Kong, Harshith Goka, Kiwoong Park, Victor Lempitsky.

LAMA se généralise étonnamment à des résolutions beaucoup plus élevées (~ 2K❗️) que ce qu'elle a vu pendant la formation (256x256), et atteint l'excellente performance même dans des scénarios difficiles, par exemple l'achèvement des structures périodiques.

[Page du projet] [ArXIV] [Supplémentaire] [Bibtex] [Résumé des documents de Gan décontractés]

Essayez dans Google Colab

Développement lama

(N'hésitez pas à partager votre article en créant un problème)

https://github.com/geekyutao/inpaint-anything --- tout: tout: segment tout répond à l'image de l'image

Raffinement des fonctionnalités pour améliorer l'image à haute résolution INTÉNÉRATION / VIDEO / CODE # 112 / par Geomagical Labs (Geomagical.com)

Applications non officielles de tiers:

(N'hésitez pas à partager votre application / implémentation / démo en créant un problème)

https://github.com/enesmsahin/simple-mama-painting - un package PIP simple pour la déainte de lama.
https://github.com/mallman/coremlama - Format de modèle de base d'Apple
https://cleanup.pictures - Un simple outil de suppression d'objets interactifs par @cyrilagne
- Lama-Cleaner by @sanster est une version auto-hôte de https://cleanup.pictures
Intégré aux espaces étreintes avec Gradio. Voir démo: par @ ak391
Telegram Bot @magiceraserbot par @moldoteck, code
Auto-lama = DE: TR OBJET DÉTECTION + LAMA INSEPAINTION PAR @ Andy971022
Lama-Magic-Eraser-Local = Une application de déception autonome construite avec PYQT5 par @ zhaoyun0071
HAMA - Élimination des objets avec un pinceau intelligent qui simplifie le dessin du masque.
Modelscope = la plus grande communauté modèle en chinois par @ Chenbinghui1.
LAMA avec maskdino = détection d'objets maskdino + lama dans le raffinement avec le raffinement par @ qwopqwop200.
Coremlama - Un scénario pour convertir le port de Lama Cleaner au format de modèle ML de base d'Apple.

Configuration de l'environnement

Clone The Repo: git clone https://github.com/advimman/lama.git

Il existe trois options d'un environnement:

Python virtualenv:

 virtualenv inpenv --python=/usr/bin/python3
source inpenv/bin/activate
pip install torch==1.8.0 torchvision==0.9.0

cd lama
pip install -r requirements.txt

Conda

 % Install conda for Linux, for other OS download miniconda at https://docs.conda.io/en/latest/miniconda.html
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh -b -p $HOME/miniconda
$HOME/miniconda/bin/conda init bash

cd lama
conda env create -f conda_env.yml
conda activate lama
conda install pytorch torchvision torchaudio cudatoolkit=10.2 -c pytorch -y
pip install pytorch-lightning==1.2.9

Docker: Aucune action n'est nécessaire ?.

Inférence

Courir

 cd lama
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

1. Télécharger les modèles pré-formés

Le meilleur modèle (Places2, Places Challenge):

 curl -LJO https://huggingface.co/smartywu/big-lama/resolve/main/big-lama.zip
unzip big-lama.zip

Tous les modèles (lieux et Celeba-HQ):

 download [https://drive.google.com/drive/folders/1B2x7eQDgecTL0oh3LSIBDGj0fTxs6Ips?usp=drive_link]
unzip lama-models.zip

2. Préparer des images et des masques

Télécharger les images de test:

 unzip LaMa_test_images.zip

Ou préparez vos données:

1) Créer des masques nommés `[images_name] _maskxxx [image_suffix]`, mettez des images et des masques dans le même dossier.

Vous pouvez utiliser le script pour la génération de masques aléatoires.

Vérifiez le format des fichiers:

 image1_mask001.png
image1.png
image2_mask001.png
image2.png

Spécifiez image_suffix , par exemple .png ou .jpg ou _input.jpg dans configs/prediction/default.yaml .

3. Prédire

Sur la machine hôte:

 python3 bin/predict.py model.path=$(pwd)/big-lama indir=$(pwd)/LaMa_test_images outdir=$(pwd)/output

Ou dans le docker

La commande suivante tirera l'image docker de Docker Hub et exécutera le script de prédiction

 bash docker/2_predict.sh $(pwd)/big-lama $(pwd)/LaMa_test_images $(pwd)/output device=cpu

Docker Cuda:

 bash docker/2_predict_with_gpu.sh $(pwd)/big-lama $(pwd)/LaMa_test_images $(pwd)/output

4. Prédire avec raffinement

Sur la machine hôte:

 python3 bin/predict.py refine=True model.path=$(pwd)/big-lama indir=$(pwd)/LaMa_test_images outdir=$(pwd)/output

S'entraîner et évaluer

Assurez-vous de courir:

 cd lama
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

Puis téléchargez des modèles pour la perte perceptuelle :

 mkdir -p ade20k/ade20k-resnet50dilated-ppm_deepsup/
wget -P ade20k/ade20k-resnet50dilated-ppm_deepsup/ http://sceneparsing.csail.mit.edu/model/pytorch/ade20k-resnet50dilated-ppm_deepsup/encoder_epoch_20.pth

Lieux

️ NB: Valeurs métriques FID / SSIM / LPIPS pour les endroits que nous voyons dans le papier LAMA est calculée sur 30000 images que nous produisons dans la section d'évaluation ci-dessous. Pour plus de détails sur la vérification des données d'évaluation [section 3. Dataset se divise en supplémentaire] ️

Sur la machine hôte:

 # Download data from http://places2.csail.mit.edu/download.html
# Places365-Standard: Train(105GB)/Test(19GB)/Val(2.1GB) from High-resolution images section
wget http://data.csail.mit.edu/places/places365/train_large_places365standard.tar
wget http://data.csail.mit.edu/places/places365/val_large.tar
wget http://data.csail.mit.edu/places/places365/test_large.tar

# Unpack train/test/val data and create .yaml config for it
bash fetch_data/places_standard_train_prepare.sh
bash fetch_data/places_standard_test_val_prepare.sh

# Sample images for test and viz at the end of epoch
bash fetch_data/places_standard_test_val_sample.sh
bash fetch_data/places_standard_test_val_gen_masks.sh

# Run training
python3 bin/train.py -cn lama-fourier location=places_standard

# To evaluate trained model and report metrics as in our paper
# we need to sample previously unseen 30k images and generate masks for them
bash fetch_data/places_standard_evaluation_prepare_data.sh

# Infer model on thick/thin/medium masks in 256 and 512 and run evaluation 
# like this:
python3 bin/predict.py 
model.path=$(pwd)/experiments/<user>_<date:time>_lama-fourier_/ 
indir=$(pwd)/places_standard_dataset/evaluation/random_thick_512/ 
outdir=$(pwd)/inference/random_thick_512 model.checkpoint=last.ckpt

python3 bin/evaluate_predicts.py 
$(pwd)/configs/eval2_gpu.yaml 
$(pwd)/places_standard_dataset/evaluation/random_thick_512/ 
$(pwd)/inference/random_thick_512 
$(pwd)/inference/random_thick_512_metrics.csv

Docker: Todo

Célèbre

Sur la machine hôte:

 # Make shure you are in lama folder
cd lama
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

# Download CelebA-HQ dataset
# Download data256x256.zip from https://drive.google.com/drive/folders/11Vz0fqHS2rXDb5pprgTjpD7S2BAJhi1P

# unzip & split into train/test/visualization & create config for it
bash fetch_data/celebahq_dataset_prepare.sh

# generate masks for test and visual_test at the end of epoch
bash fetch_data/celebahq_gen_masks.sh

# Run training
python3 bin/train.py -cn lama-fourier-celeba data.batch_size=10

# Infer model on thick/thin/medium masks in 256 and run evaluation 
# like this:
python3 bin/predict.py 
model.path=$(pwd)/experiments/<user>_<date:time>_lama-fourier-celeba_/ 
indir=$(pwd)/celeba-hq-dataset/visual_test_256/random_thick_256/ 
outdir=$(pwd)/inference/celeba_random_thick_256 model.checkpoint=last.ckpt

Docker: Todo

Défi des lieux

Sur la machine hôte:

 # This script downloads multiple .tar files in parallel and unpacks them
# Places365-Challenge: Train(476GB) from High-resolution images (to train Big-Lama) 
bash places_challenge_train_download.sh

TODO: prepare
TODO: train 
TODO: eval

Docker: Todo

Créez vos données

Veuillez vérifier les scripts bash pour la préparation des données et la génération de masques à partir de la section Celebahq, si vous êtes resté à l'une des étapes suivantes.

Sur la machine hôte:

 # Make shure you are in lama folder
cd lama
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

# You need to prepare following image folders:
$ ls my_dataset
train
val_source # 2000 or more images
visual_test_source # 100 or more images
eval_source # 2000 or more images

# LaMa generates random masks for the train data on the flight,
# but needs fixed masks for test and visual_test for consistency of evaluation.

# Suppose, we want to evaluate and pick best models 
# on 512x512 val dataset  with thick/thin/medium masks 
# And your images have .jpg extention:

python3 bin/gen_mask_dataset.py 
$(pwd)/configs/data_gen/random_<size>_512.yaml  # thick, thin, medium
my_dataset/val_source/ 
my_dataset/val/random_<size>_512.yaml # thick, thin, medium
--ext jpg

# So the mask generator will: 
# 1. resize and crop val images and save them as .png
# 2. generate masks

ls my_dataset/val/random_medium_512/
image1_crop000_mask000.png
image1_crop000.png
image2_crop000_mask000.png
image2_crop000.png
...

# Generate thick, thin, medium masks for visual_test folder:

python3 bin/gen_mask_dataset.py 
$(pwd)/configs/data_gen/random_<size>_512.yaml   #thick, thin, medium
my_dataset/visual_test_source/ 
my_dataset/visual_test/random_<size>_512/  #thick, thin, medium
--ext jpg


ls my_dataset/visual_test/random_thick_512/
image1_crop000_mask000.png
image1_crop000.png
image2_crop000_mask000.png
image2_crop000.png
...

# Same process for eval_source image folder:

python3 bin/gen_mask_dataset.py 
$(pwd)/configs/data_gen/random_<size>_512.yaml   #thick, thin, medium
my_dataset/eval_source/ 
my_dataset/eval/random_<size>_512/  #thick, thin, medium
--ext jpg



# Generate location config file which locate these folders:

touch my_dataset.yaml
echo "data_root_dir: $(pwd)/my_dataset/" >> my_dataset.yaml
echo "out_root_dir: $(pwd)/experiments/" >> my_dataset.yaml
echo "tb_dir: $(pwd)/tb_logs/" >> my_dataset.yaml
mv my_dataset.yaml ${PWD}/configs/training/location/


# Check data config for consistency with my_dataset folder structure:
$ cat ${PWD}/configs/training/data/abl-04-256-mh-dist
...
train:
  indir: ${location.data_root_dir}/train
  ...
val:
  indir: ${location.data_root_dir}/val
  img_suffix: .png
visual_test:
  indir: ${location.data_root_dir}/visual_test
  img_suffix: .png


# Run training
python3 bin/train.py -cn lama-fourier location=my_dataset data.batch_size=10

# Evaluation: LaMa training procedure picks best few models according to 
# scores on my_dataset/val/ 

# To evaluate one of your best models (i.e. at epoch=32) 
# on previously unseen my_dataset/eval do the following 
# for thin, thick and medium:

# infer:
python3 bin/predict.py 
model.path=$(pwd)/experiments/<user>_<date:time>_lama-fourier_/ 
indir=$(pwd)/my_dataset/eval/random_<size>_512/ 
outdir=$(pwd)/inference/my_dataset/random_<size>_512 
model.checkpoint=epoch32.ckpt

# metrics calculation:
python3 bin/evaluate_predicts.py 
$(pwd)/configs/eval2_gpu.yaml 
$(pwd)/my_dataset/eval/random_<size>_512/ 
$(pwd)/inference/my_dataset/random_<size>_512 
$(pwd)/inference/my_dataset/random_<size>_512_metrics.csv

Ou dans le docker:

 TODO: train
TODO: eval

Indices

Générer différents types de masques

La commande suivante exécutera un script qui génère des masques aléatoires.

 bash docker/1_generate_masks_from_raw_images.sh 
    configs/data_gen/random_medium_512.yaml 
    /directory_with_input_images 
    /directory_where_to_store_images_and_masks 
    --ext png

La commande de génération de données de test stocke des images dans le format, qui convient à la prédiction.

Le tableau ci-dessous décrit les configurations que nous avons utilisées pour générer différents ensembles de tests à partir du papier. Notez que nous ne réparons pas une graine aléatoire , les résultats seront donc légèrement différents à chaque fois.

	Places 512x512	Celeba 256x256
Étroit	random_thin_512.yaml	random_thin_256.yaml
Moyen	random_medium_512.yaml	random_medium_256.yaml
Large	random_thick_512.yaml	random_thick_256.yaml

N'hésitez pas à modifier le chemin de configuration (argument # 1) en toute autre configuration dans configs/data_gen ou ajustez les fichiers configurants eux-mêmes.

Remplacer les paramètres dans les configurations

Vous pouvez également remplacer les paramètres de configuration comme ceci:

 python3 bin/train.py -cn <config> data.batch_size=10 run_title=my-title

Où l'extension de fichier .yaml est omise

Modèles Options

Noms de configuration pour les modèles de papier (Supply dans la commande de formation):

 * big-lama
* big-lama-regular
* lama-fourier
* lama-regular
* lama_small_train_masks

Qui sont assis dans des configurations / formation / dossier

Temps de formation et ressources

FAIRE

Remerciements

Code et modèles de segmentation si forme CSAILVISION.
La métrique LPIPS est de Richzhang
Ssim est de po-hsun-su
FID est de Mseitzer

Citation

Si vous avez trouvé ce code utile, veuillez envisager de citer:

 @article{suvorov2021resolution,
  title={Resolution-robust Large Mask Inpainting with Fourier Convolutions},
  author={Suvorov, Roman and Logacheva, Elizaveta and Mashikhin, Anton and Remizova, Anastasia and Ashukha, Arsenii and Silvestrov, Aleksei and Kong, Naejin and Goka, Harshith and Park, Kiwoong and Lempitsky, Victor},
  journal={arXiv preprint arXiv:2109.07161},
  year={2021}
}

Développer