lamaダウンロード - lamaソースコードのダウンロード

？ラマ：フーリエの畳み込みでパインティングする解像度 - ロボスト大規模マスク

ローマ・スヴォロフ、エリザヴェタ・ロガチェバ、アントン・マシキン、アナスタシア・レミズバ、アルセニ・アシュカ、アレクセイ・シルベストロフ、ナージン・コング、ハーシス・ゴーカ、キウォンパーク、ビクター・レンピツキー。

ラマは、トレーニング中に見られたよりもはるかに高い解像度（〜2K❗️）に驚くほどよく一般化し（256x256）、挑戦的なシナリオ、例えば周期構造の完了でも優れたパフォーマンスを達成します。

[プロジェクトページ] [arxiv] [補足] [bibtex] [カジュアルガンペーパーの概要]

Google Colabでお試しください

ラマ開発

（問題を作成して、お気軽にお気軽にお気軽にお問い合わせください）

https://github.com/geekyutao/inpaint-anything --- Inpaint Anything：セグメントでは、画像が入力します

高解像度の画像を改善するための機能の改善

非公式のサードパーティアプリ：

（問題を作成して、お気軽にアプリ/実装/デモを共有してください）

https://github.com/enesmsahin/simple-lama-inpainting-ラマの発酵用のシンプルなPIPパッケージ。
https://github.com/mallman/coremlama-AppleのコアMLモデル形式
https://cleanup.pictures- @cyrildiagneによるシンプルなインタラクティブオブジェクト削除ツール
- @SansterのLama-Cleanerは、https：//cleanup.picturesの自己ホストバージョンです
グラデーションを使用してハグするフェイススペースに統合されています。デモ： @ak391を参照してください
Telegram bot @magiceraserbot by @moldoteck、コード
auto-lama = de：trオブジェクト検出 + @andy971022によるラマインペインティング
lama-magic-eraser-local = @zhaoyun0071によってpyqt5を使用して構築されたスタンドアロンインペインティングアプリケーション
HAMA-マスク図面を簡素化するスマートブラシを使用したオブジェクトの削除。
ModelScope = @chenbinghui1による中国の最大のモデルコミュニティ。
lama with maskdino = maskdinoオブジェクトの検出 + @qwopqwop200による改良によるラマ。
Coremlama-ラマクリーナーのラマ港をAppleのCore MLモデル形式に変換するスクリプト。

環境のセットアップ

レポをクローンする： git clone https://github.com/advimman/lama.git

環境には3つのオプションがあります。

Python Virtualenv：

 virtualenv inpenv --python=/usr/bin/python3
source inpenv/bin/activate
pip install torch==1.8.0 torchvision==0.9.0

cd lama
pip install -r requirements.txt

コマンド

 % Install conda for Linux, for other OS download miniconda at https://docs.conda.io/en/latest/miniconda.html
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh -b -p $HOME/miniconda
$HOME/miniconda/bin/conda init bash

cd lama
conda env create -f conda_env.yml
conda activate lama
conda install pytorch torchvision torchaudio cudatoolkit=10.2 -c pytorch -y
pip install pytorch-lightning==1.2.9

Docker：アクションは必要ありませんか？

推論

走る

 cd lama
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

1.事前に訓練されたモデルをダウンロードします

最高のモデル（Places2、Places Challenge）：

 curl -LJO https://huggingface.co/smartywu/big-lama/resolve/main/big-lama.zip
unzip big-lama.zip

すべてのモデル（場所とセレバ-HQ）：

 download [https://drive.google.com/drive/folders/1B2x7eQDgecTL0oh3LSIBDGj0fTxs6Ips?usp=drive_link]
unzip lama-models.zip

2。画像とマスクを準備します

テスト画像のダウンロード：

 unzip LaMa_test_images.zip

またはデータを準備してください。

1）「[Images_name] _maskxxx [image_suffix] `という名前のマスクを作成し、同じフォルダーに画像とマスクを入れます。

ランダムマスク生成にはスクリプトを使用できます。

ファイルの形式を確認してください。

 image1_mask001.png
image1.png
image2_mask001.png
image2.png

configs/prediction/default.yamlでimage_suffix 、例えば.pngまたは.jpgまたは_input.jpgを指定します。

3。予測します

ホストマシンで：

 python3 bin/predict.py model.path=$(pwd)/big-lama indir=$(pwd)/LaMa_test_images outdir=$(pwd)/output

またはDockerで

次のコマンドは、Docker HubからDocker画像を引き出し、予測スクリプトを実行します

 bash docker/2_predict.sh $(pwd)/big-lama $(pwd)/LaMa_test_images $(pwd)/output device=cpu

Docker Cuda：

 bash docker/2_predict_with_gpu.sh $(pwd)/big-lama $(pwd)/LaMa_test_images $(pwd)/output

4.洗練されて予測します

ホストマシンで：

 python3 bin/predict.py refine=True model.path=$(pwd)/big-lama indir=$(pwd)/LaMa_test_images outdir=$(pwd)/output

トレーニングと評価

実行することを確認してください：

 cd lama
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

次に、知覚損失のためにモデルをダウンロードします。

 mkdir -p ade20k/ade20k-resnet50dilated-ppm_deepsup/
wget -P ade20k/ade20k-resnet50dilated-ppm_deepsup/ http://sceneparsing.csail.mit.edu/model/pytorch/ade20k-resnet50dilated-ppm_deepsup/encoder_epoch_20.pth

場所

ショ和NB：Lama Paperに表示される場所のFID/SSIM/LPIPSメトリック値は、以下の評価セクションで作成する30000の画像で計算されます。評価データの詳細についてショ和

ホストマシンで：

 # Download data from http://places2.csail.mit.edu/download.html
# Places365-Standard: Train(105GB)/Test(19GB)/Val(2.1GB) from High-resolution images section
wget http://data.csail.mit.edu/places/places365/train_large_places365standard.tar
wget http://data.csail.mit.edu/places/places365/val_large.tar
wget http://data.csail.mit.edu/places/places365/test_large.tar

# Unpack train/test/val data and create .yaml config for it
bash fetch_data/places_standard_train_prepare.sh
bash fetch_data/places_standard_test_val_prepare.sh

# Sample images for test and viz at the end of epoch
bash fetch_data/places_standard_test_val_sample.sh
bash fetch_data/places_standard_test_val_gen_masks.sh

# Run training
python3 bin/train.py -cn lama-fourier location=places_standard

# To evaluate trained model and report metrics as in our paper
# we need to sample previously unseen 30k images and generate masks for them
bash fetch_data/places_standard_evaluation_prepare_data.sh

# Infer model on thick/thin/medium masks in 256 and 512 and run evaluation 
# like this:
python3 bin/predict.py 
model.path=$(pwd)/experiments/<user>_<date:time>_lama-fourier_/ 
indir=$(pwd)/places_standard_dataset/evaluation/random_thick_512/ 
outdir=$(pwd)/inference/random_thick_512 model.checkpoint=last.ckpt

python3 bin/evaluate_predicts.py 
$(pwd)/configs/eval2_gpu.yaml 
$(pwd)/places_standard_dataset/evaluation/random_thick_512/ 
$(pwd)/inference/random_thick_512 
$(pwd)/inference/random_thick_512_metrics.csv

Docker：Todo

セレバ

ホストマシンで：

 # Make shure you are in lama folder
cd lama
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

# Download CelebA-HQ dataset
# Download data256x256.zip from https://drive.google.com/drive/folders/11Vz0fqHS2rXDb5pprgTjpD7S2BAJhi1P

# unzip & split into train/test/visualization & create config for it
bash fetch_data/celebahq_dataset_prepare.sh

# generate masks for test and visual_test at the end of epoch
bash fetch_data/celebahq_gen_masks.sh

# Run training
python3 bin/train.py -cn lama-fourier-celeba data.batch_size=10

# Infer model on thick/thin/medium masks in 256 and run evaluation 
# like this:
python3 bin/predict.py 
model.path=$(pwd)/experiments/<user>_<date:time>_lama-fourier-celeba_/ 
indir=$(pwd)/celeba-hq-dataset/visual_test_256/random_thick_256/ 
outdir=$(pwd)/inference/celeba_random_thick_256 model.checkpoint=last.ckpt

Docker：Todo

場所は挑戦します

ホストマシンで：

 # This script downloads multiple .tar files in parallel and unpacks them
# Places365-Challenge: Train(476GB) from High-resolution images (to train Big-Lama) 
bash places_challenge_train_download.sh

TODO: prepare
TODO: train 
TODO: eval

Docker：Todo

データを作成します

次の手順のいずれかで立ち往生している場合は、Celebahqセクションからのデータ準備とマスク生成については、Bashスクリプトを確認してください。

ホストマシンで：

 # Make shure you are in lama folder
cd lama
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

# You need to prepare following image folders:
$ ls my_dataset
train
val_source # 2000 or more images
visual_test_source # 100 or more images
eval_source # 2000 or more images

# LaMa generates random masks for the train data on the flight,
# but needs fixed masks for test and visual_test for consistency of evaluation.

# Suppose, we want to evaluate and pick best models 
# on 512x512 val dataset  with thick/thin/medium masks 
# And your images have .jpg extention:

python3 bin/gen_mask_dataset.py 
$(pwd)/configs/data_gen/random_<size>_512.yaml  # thick, thin, medium
my_dataset/val_source/ 
my_dataset/val/random_<size>_512.yaml # thick, thin, medium
--ext jpg

# So the mask generator will: 
# 1. resize and crop val images and save them as .png
# 2. generate masks

ls my_dataset/val/random_medium_512/
image1_crop000_mask000.png
image1_crop000.png
image2_crop000_mask000.png
image2_crop000.png
...

# Generate thick, thin, medium masks for visual_test folder:

python3 bin/gen_mask_dataset.py 
$(pwd)/configs/data_gen/random_<size>_512.yaml   #thick, thin, medium
my_dataset/visual_test_source/ 
my_dataset/visual_test/random_<size>_512/  #thick, thin, medium
--ext jpg


ls my_dataset/visual_test/random_thick_512/
image1_crop000_mask000.png
image1_crop000.png
image2_crop000_mask000.png
image2_crop000.png
...

# Same process for eval_source image folder:

python3 bin/gen_mask_dataset.py 
$(pwd)/configs/data_gen/random_<size>_512.yaml   #thick, thin, medium
my_dataset/eval_source/ 
my_dataset/eval/random_<size>_512/  #thick, thin, medium
--ext jpg



# Generate location config file which locate these folders:

touch my_dataset.yaml
echo "data_root_dir: $(pwd)/my_dataset/" >> my_dataset.yaml
echo "out_root_dir: $(pwd)/experiments/" >> my_dataset.yaml
echo "tb_dir: $(pwd)/tb_logs/" >> my_dataset.yaml
mv my_dataset.yaml ${PWD}/configs/training/location/


# Check data config for consistency with my_dataset folder structure:
$ cat ${PWD}/configs/training/data/abl-04-256-mh-dist
...
train:
  indir: ${location.data_root_dir}/train
  ...
val:
  indir: ${location.data_root_dir}/val
  img_suffix: .png
visual_test:
  indir: ${location.data_root_dir}/visual_test
  img_suffix: .png


# Run training
python3 bin/train.py -cn lama-fourier location=my_dataset data.batch_size=10

# Evaluation: LaMa training procedure picks best few models according to 
# scores on my_dataset/val/ 

# To evaluate one of your best models (i.e. at epoch=32) 
# on previously unseen my_dataset/eval do the following 
# for thin, thick and medium:

# infer:
python3 bin/predict.py 
model.path=$(pwd)/experiments/<user>_<date:time>_lama-fourier_/ 
indir=$(pwd)/my_dataset/eval/random_<size>_512/ 
outdir=$(pwd)/inference/my_dataset/random_<size>_512 
model.checkpoint=epoch32.ckpt

# metrics calculation:
python3 bin/evaluate_predicts.py 
$(pwd)/configs/eval2_gpu.yaml 
$(pwd)/my_dataset/eval/random_<size>_512/ 
$(pwd)/inference/my_dataset/random_<size>_512 
$(pwd)/inference/my_dataset/random_<size>_512_metrics.csv

またはDockerで：

 TODO: train
TODO: eval

ヒント

さまざまな種類のマスクを生成します

次のコマンドは、ランダムマスクを生成するスクリプトを実行します。

 bash docker/1_generate_masks_from_raw_images.sh 
    configs/data_gen/random_medium_512.yaml 
    /directory_with_input_images 
    /directory_where_to_store_images_and_masks 
    --ext png

テストデータ生成コマンドは、予測に適した形式で画像を保存します。

以下の表は、論文からさまざまなテストセットを生成するために使用した構成を説明しています。ランダムシードを修正しないため、結果は毎回わずかに異なることに注意してください。

	場所512x512	セレバ256x256
狭い	random_thin_512.yaml	random_thin_256.yaml
中くらい	random_medium_512.yaml	random_medium_256.yaml
広い	random_thick_512.yaml	random_thick_256.yaml

構成パス（引数＃1）をconfigs/data_genの他の任意に変更するか、構成ファイル自体を調整してください。

構成内のパラメーターをオーバーライドします

また、このような構成のパラメーターをオーバーライドすることもできます。

 python3 bin/train.py -cn <config> data.batch_size=10 run_title=my-title

ここで、.yamlファイル拡張子は省略されています

モデルオプション

紙からのモデルの構成名（トレーニングコマンドへの代用）：

 * big-lama
* big-lama-regular
* lama-fourier
* lama-regular
* lama_small_train_masks

構成/トレーニング/フォルダーに座っています

リンク

すべてのデータ（モデル、テスト画像など）https://disk.yandex.ru/d/amdeg-bijmvsug
論文https://disk.yandex.ru/d/xkqjzevrk5vllqからの画像をテストします
事前に訓練されたモデルhttps://disk.yandex.ru/d/egqasnlohjuzag
知覚損失のモデルhttps://disk.yandex.ru/d/ncvmqlmt_ktemq
当社のトレーニングログは、https：//disk.yandex.ru/d/9bt1wnsds4jdkqで入手できます

トレーニング時間とリソース

トト

謝辞

Form CsailVisionの場合、セグメンテーションコードとモデル。
LPIPSメトリックはRichzhangからのものです
SSIMはPo-Hsun-Suからです
fidはMseitzerからのものです

引用

このコードが役立つ場合は、引用を検討してください。

 @article{suvorov2021resolution,
  title={Resolution-robust Large Mask Inpainting with Fourier Convolutions},
  author={Suvorov, Roman and Logacheva, Elizaveta and Mashikhin, Anton and Remizova, Anastasia and Ashukha, Arsenii and Silvestrov, Aleksei and Kong, Naejin and Goka, Harshith and Park, Kiwoong and Lempitsky, Victor},
  journal={arXiv preprint arXiv:2109.07161},
  year={2021}
}

拡大する