ScreenAI Download - ScreenAI Source Code Download

ScreenAI


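Implementation of the ScreenAI model from the paper "ScreenAI: A Vision-Language Model for UI and Infographics Understanding". The flow is: img + text -> patch sizes -> vit -> embed + concat -> attn + ffn -> cross attn + ffn + ffn + self attn -> to out. Paper link: arXiv:2402.04615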
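The flow above can be sketched with stock PyTorch layers. This is a minimal illustration, not the screenai package's code: the class name FlowSketch, the strided-convolution patchifier, and the use of nn.TransformerEncoderLayer and nn.TransformerDecoderLayer as stand-ins for each stage are all assumptions. The decoder layer also orders its sub-blocks as self attn, cross attn, ffn, which may differ from the order listed in the flow.

```python
import torch
from torch import nn


class FlowSketch(nn.Module):
    """Hypothetical, simplified stand-in for the described flow (not the package's code)."""

    def __init__(self, patch_size=16, dim=512, heads=8, ff_mult=4):
        super().__init__()
        # img -> patches -> patch embeddings (a strided conv is one common patchifier)
        self.to_patches = nn.Conv2d(3, dim, kernel_size=patch_size, stride=patch_size)
        # ViT stage: a single attn + ffn layer
        self.vit = nn.TransformerEncoderLayer(dim, heads, dim * ff_mult, batch_first=True)
        # Multimodal encoder over the concatenated image and text tokens
        self.mm_encoder = nn.TransformerEncoderLayer(dim, heads, dim * ff_mult, batch_first=True)
        # Decoder stage: self attn + cross attn + ffn
        self.decoder = nn.TransformerDecoderLayer(dim, heads, dim * ff_mult, batch_first=True)
        self.to_out = nn.Linear(dim, dim)

    def forward(self, text, image):
        patches = self.to_patches(image)              # (B, dim, H/p, W/p)
        patches = patches.flatten(2).transpose(1, 2)  # (B, num_patches, dim)
        img_tokens = self.vit(patches)
        fused = torch.cat([img_tokens, text], dim=1)  # embed + concat
        memory = self.mm_encoder(fused)
        decoded = self.decoder(text, memory)          # text tokens attend to fused tokens
        return self.to_out(decoded)


model = FlowSketch().eval()  # eval() disables dropout
with torch.no_grad():
    out = model(torch.randn(1, 1, 512), torch.rand(1, 3, 224, 224))
print(out.shape)
```

In this sketch the output keeps the shape of the text input, since the text tokens act as the decoder queries. The real model may shape its output differently.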

Install

pip3 install screenai

Usage

import torch
from screenai.main import ScreenAI

# Create a tensor for the image: (batch, channels, height, width)
image = torch.rand(1, 3, 224, 224)

# Create a tensor for the text: (batch, sequence length, dim)
text = torch.randn(1, 1, 512)

# Create an instance of the ScreenAI model with specified parameters
model = ScreenAI(
    patch_size=16,
    image_size=224,
    dim=512,
    depth=6,
    heads=8,
    vit_depth=4,
    multi_modal_encoder_depth=4,
    llm_decoder_depth=4,
    mm_encoder_ff_mult=4,
)

# Perform forward pass of the model with the given text and image tensors
out = model(text, image)

# Print the output tensor
print(out)
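As a quick check on the parameters used above, the following sketch works out how many patches a 224x224 image yields at patch_size=16. It assumes standard non-overlapping ViT-style patching, which the source does not spell out; the variable names are illustrative.

```python
image_size = 224
patch_size = 16
channels = 3

# Non-overlapping patches require the image side to be a multiple of the patch side
assert image_size % patch_size == 0

patches_per_side = image_size // patch_size  # patches along one side
num_patches = patches_per_side ** 2          # total patches per image
patch_dim = channels * patch_size ** 2       # raw values in one flattened patch

print(num_patches, patch_dim)  # 196 768
```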

License

MIT

Citation

@misc{baechler2024screenai,
    title={ScreenAI: A Vision-Language Model for UI and Infographics Understanding},
    author={Gilles Baechler and Srinivas Sunkara and Maria Wang and Fedir Zubach and Hassan Mansoor and Vincent Etter and Victor Cărbune and Jason Lin and Jindong Chen and Abhanshu Sharma},
    year={2024},
    eprint={2402.04615},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Todo

Implement nn.ModuleList([]) in the encoder and decoder
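One way the todo item above might look is a stack of layers held in nn.ModuleList, so that each layer's parameters are registered with the parent module. This is a sketch under assumptions: the class name StackedEncoder and the choice of nn.TransformerEncoderLayer as the repeated block are hypothetical and not taken from the screenai code.

```python
import torch
from torch import nn


class StackedEncoder(nn.Module):
    """Hypothetical encoder that keeps its layers in an nn.ModuleList."""

    def __init__(self, dim=512, heads=8, depth=4, ff_mult=4):
        super().__init__()
        # nn.ModuleList registers every layer, unlike a plain Python list
        self.layers = nn.ModuleList(
            [
                nn.TransformerEncoderLayer(dim, heads, dim * ff_mult, batch_first=True)
                for _ in range(depth)
            ]
        )

    def forward(self, x):
        # Apply the layers one after another
        for layer in self.layers:
            x = layer(x)
        return x


encoder = StackedEncoder().eval()  # eval() disables dropout
with torch.no_grad():
    tokens = encoder(torch.randn(1, 10, 512))
print(len(encoder.layers), tokens.shape)
```

A plain Python list would hide the layers from model.parameters() and from .to(device), which is the usual reason for preferring nn.ModuleList here.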


Additional information

Version: 1.0.0
Type: Other source code
Updated: 2025-03-08
Size: 215.5KB
Source: Github
