Japanese CLIP

This repository contains the code for Japanese CLIP (Contrastive Language-Image Pre-training) variants by rinna Co., Ltd.

Table of contents
News
Pretrained models
Usage
Citation
License

News

July 2022

v0.2.0 was released!

Both the CLIP and CLOOB models were improved! rinna/japanese-cloob-vit-b-16 now reaches a top-1 accuracy of 54.64 on zero-shot ImageNet classification.
Released our Japanese prompt templates and example code (see scripts/example.py) for zero-shot ImageNet classification. The templates were cleaned for Japanese, based on OpenAI's 80 templates.
Changed the citation.
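
Prompt templates are commonly used in zero-shot classification by filling each template with a class name, encoding every resulting prompt, and averaging the normalized embeddings into one vector per class. The sketch below illustrates that idea only; it is not taken from scripts/example.py. The names `build_class_embeddings` and `encode_text` are hypothetical, and the two templates in the usage note are placeholders, not the released ones.

```python
import math
from typing import Callable, Dict, List, Sequence

Vector = List[float]


def l2_normalize(vec: Sequence[float]) -> Vector:
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def build_class_embeddings(
    class_names: Sequence[str],
    templates: Sequence[str],
    encode_text: Callable[[str], Sequence[float]],  # hypothetical text encoder
) -> Dict[str, Vector]:
    """Average the normalized embeddings of all filled-in templates per class."""
    class_embeddings: Dict[str, Vector] = {}
    for name in class_names:
        prompts = [template.format(name) for template in templates]
        vectors = [l2_normalize(encode_text(prompt)) for prompt in prompts]
        # Element-wise mean over the prompt embeddings, then re-normalize
        mean = [sum(column) / len(vectors) for column in zip(*vectors)]
        class_embeddings[name] = l2_normalize(mean)
    return class_embeddings
```

With a real model, `encode_text` would wrap tokenization and the text encoder; a call might look like `build_class_embeddings(["犬", "猫"], ["{}の写真", "{}の絵"], encode_text)`, where both templates are illustrative assumptions.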

Pretrained models

Model name	Top-1*	Top-5*
rinna/japanese-cloob-vit-b-16	54.64	72.86
rinna/japanese-clip-vit-b-16	50.69	72.35
sonoisa/clip-vit-b-32-japanese-v1	38.88	60.71
multilingual-CLIP	14.36	27.28

*Top-k accuracy of zero-shot classification on the ImageNet validation set.
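
For readers unfamiliar with the metric in the footnote, top-k accuracy counts a prediction as correct when the true label is among the k highest-scoring classes. The following is a minimal sketch under that standard definition; `top_k_accuracy` is a hypothetical helper, not part of this repository, and it returns a fraction between 0 and 1, whereas the table values appear to be percentages.

```python
from typing import Sequence


def top_k_accuracy(
    scores: Sequence[Sequence[float]],  # one row of class scores per sample
    labels: Sequence[int],              # index of the true class per sample
    k: int,
) -> float:
    """Fraction of samples whose true label is among the k top-scoring classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score
        ranked = sorted(range(len(row)), key=lambda i: row[i], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)
```

Top-1 is the special case k = 1, where only the single best-scoring class counts.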

Usage

Install the package

$ pip install git+https://github.com/rinnakk/japanese-clip.git

Run

from PIL import Image
import torch
import japanese_clip as ja_clip

device = "cuda" if torch.cuda.is_available() else "cpu"
# ja_clip.available_models()
# ['rinna/japanese-clip-vit-b-16', 'rinna/japanese-cloob-vit-b-16']
# If you want v0.1.0 models, set `revision='v0.1.0'`
model, preprocess = ja_clip.load("rinna/japanese-clip-vit-b-16", cache_dir="/tmp/japanese_clip", device=device)
tokenizer = ja_clip.load_tokenizer()

# Preprocess the image and add a batch dimension
image = preprocess(Image.open("./data/dog.jpeg")).unsqueeze(0).to(device)
# Tokenize the candidate labels: "犬" (dog), "猫" (cat), "象" (elephant)
encodings = ja_clip.tokenize(
    texts=["犬", "猫", "象"],
    max_seq_len=77,
    device=device,
    tokenizer=tokenizer,  # optional; if omitted, the tokenizer is loaded on every call
)

with torch.no_grad():
    image_features = model.get_image_features(image)
    text_features = model.get_text_features(**encodings)

    # Scaled image-text similarities turned into probabilities over the labels
    text_probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

print("Label probs:", text_probs)  # prints: [[1.0, 0.0, 0.0]]
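
The last step of the example multiplies the image-text similarities by 100 and applies a softmax over the labels. The pure-Python sketch below reproduces that arithmetic on toy vectors to show why the output is so close to one-hot. The function names and the vectors are hypothetical, and the vectors are assumed to be unit-length; whether the model's features are already normalized is not stated here.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax: subtract the maximum before exponentiating."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def label_probs(
    image_vec: Sequence[float],
    text_vecs: Sequence[Sequence[float]],
    scale: float = 100.0,  # same factor as in the usage example
) -> List[float]:
    """Softmax over scaled dot products between one image and each text vector."""
    logits = [scale * sum(a * b for a, b in zip(image_vec, text)) for text in text_vecs]
    return softmax(logits)


# Toy unit vectors (assumed values): the first text matches the image exactly
probs = label_probs([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]])
```

Because the similarities are multiplied by 100 before the softmax, even a modest gap between the best and second-best label pushes nearly all of the probability mass onto the best one.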

Citation

To cite this repository:

@inproceedings{japanese-clip,
  author = {シーン 誠, 趙 天雨, 沢田 慶},
  title = {日本語における言語画像事前学習モデルの構築と公開},
  booktitle= {The 25th Meeting on Image Recognition and Understanding},
  year = 2022,
  month = July,
}